Our Technology Stack

We build high-performance data infrastructure by combining cutting-edge open source and partner technologies.

Open Source Technologies

AI Agents
LangChain icon
LangChain
Standard framework for LLM applications integrating RAG/agents/tools in Python/JS
LlamaIndex icon
LlamaIndex
RAG-focused data connection, indexing, and query processing
CrewAI icon
CrewAI
Workflow-based multi-agent framework
Big Data Framework
Apache Spark icon
Apache Spark
Unified analytics engine for large-scale data processing
MLflow icon
MLflow
Platform for managing the ML lifecycle
Presto icon
Presto
Distributed SQL query engine for big data
Data AI Governance
Unity Catalog icon
Unity Catalog
Unified governance and discovery platform for data and AI assets across clouds
Apache Polaris icon
Apache Polaris
Open-source catalog for Apache Iceberg with REST API for multi-engine interoperability
Open Data Format
Delta Lake icon
Delta Lake
Open-source storage layer for reliable data lakes
Apache Iceberg icon
Apache Iceberg
Open table format for huge analytic datasets
Apache Hudi icon
Apache Hudi
Transactional data lake platform
Apache Xtable icon
Apache Xtable
Cross-platform table format specification

Partner Technologies

Databricks icon
Databricks
Unified analytics Data Intelligence Platform for AI/ML development
Snowflake icon
Snowflake
Traditional data platform for business users (DWH to BI)
OneHouse icon
OneHouse
Modern data lakehouse platform