- Distributed systems · clustering · fault tolerance
- Search infra: Lucene, embeddings, ranking pipelines
- High-performance backend services (Java, Go)
- Data infra: Spark, Airflow, Hive, Hadoop, GCP, AWS
- ML-assisted retrieval (vector DBs, CLIP, embeddings)
Pinned Loading
-
dsearch
dsearch PublicA distributed search engine supporting BM25, vector search, and hybrid ranking over sharded Lucene indices.
Java
-
kvDB
kvDB PublicA distributed key-value database with shard routing, replication, and a dedicated control plane, built in Java.
Java
-
local-data-platform
local-data-platform PublicLocal Hadoop (HDFS/YARN) + Hive + Spark dev environment manager with profile-based config overlays.
Go 4
-
local-agent
local-agent PublicAn experimental, local-first AI agent runtime built to explore safety-first tooling, RAG, identity models, and local persistence.
Python 3
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



