Skip to content
View danieljhkim's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@coinStatData

Block or report danieljhkim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
danieljhkim/README.md

Hi there, I'm Daniel 👋

🔍 Projects

🛠️ Skills

  • Distributed systems · clustering · fault tolerance
  • Search infra: Lucene, embeddings, ranking pipelines
  • High-performance backend services (Java, Go)
  • Data infra: Spark, Airflow, Hive, Hadoop, GCP, AWS
  • ML-assisted retrieval (vector DBs, CLIP, embeddings)

Pinned Loading

  1. dsearch dsearch Public

    A distributed search engine supporting BM25, vector search, and hybrid ranking over sharded Lucene indices.

    Java

  2. kvDB kvDB Public

    A distributed key-value database with shard routing, replication, and a dedicated control plane, built in Java.

    Java

  3. DevBox DevBox Public template

    DevBox is a minimal, language-agnostic contract that standardizes how local systems are started, validated, observed, and safely operated by humans and AI agents.

    Shell 1

  4. local-data-platform local-data-platform Public

    Local Hadoop (HDFS/YARN) + Hive + Spark dev environment manager with profile-based config overlays.

    Go

  5. DataStructures-Algorithms DataStructures-Algorithms Public

    Data Structures and Algorithms

    Python 1

  6. hive-duck hive-duck Public

    Hive-compatible CLI (-e, -f) backed by DuckDB for fast local SQL development without Hadoop or Hive.

    Go