Skip to content
View NasitSony's full-sized avatar

Block or report NasitSony

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nasitsony/README.md

Hi, I'm Nasit Sony πŸ‘‹

πŸ’‘ AI Infrastructure Engineer | Distributed Systems | Consensus Protocols (BFT)
πŸ“ Building production-grade backend systems for AI and fault-tolerant distributed computing


🧠 About Me

I build distributed backend systems where correctness matters β€” even under failures, partitions, and adversarial conditions.

My work sits at the intersection of:

  • AI infrastructure
  • Distributed systems engineering
  • Byzantine fault-tolerant consensus

I focus on treating AI systems not as simple APIs, but as fault-tolerant distributed systems with strong correctness guarantees.


⚑ Experience Snapshot

πŸ’° Production Systems (Fintech)

  • Built international money transfer systems handling $600M+ annual volume
  • Focus: correctness, consistency, and performance under real-world constraints

πŸ”¬ Distributed Systems & BFT Research

  • Published work in Springer journals and international conferences
  • Designed and implemented Byzantine fault-tolerant protocols
  • Bridging theoretical guarantees with practical system design

πŸš€ Current Focus β€” AI Infrastructure

  • Building production-grade AI pipelines (SmartSearch)
  • Applying distributed systems thinking to RAG and ML systems
  • Exploring decentralized and fault-tolerant ML infrastructure

πŸ”₯ Featured Projects

🚧 SmartSearch β€” AI-Powered Semantic Search & RAG Backend

Production-oriented AI system built like real backend infrastructure.

Key Features:

  • Asynchronous ingestion (Kafka β†’ Workers β†’ PostgreSQL)
  • Embedding-based semantic search (pgvector + OpenAI)
  • Retrieval-Augmented Generation (RAG)

Reliability Engineering:

  • Idempotent processing (no duplicate chunks)
  • Retry + DLQ handling
  • Worker crash recovery (offset-safe Kafka processing)
  • Explicit lifecycle tracking (PENDING β†’ PROCESSING β†’ READY/FAILED)
  • Eventual consistency guarantees

πŸ‘‰ Focus: AI systems as distributed systems


🌐 Cloud-Native API Gateway

Backend infrastructure system focused on security, scalability, and reliability.

  • JWT & OAuth2 authentication
  • Request routing and service orchestration
  • Observability and failure handling

🚧 Upcoming Work

βš™οΈ LSM-based KV Store (C++)

Storage engine inspired by LSM-tree designs (RocksDB-style).

πŸ‘‰ Focus: durability, compaction, and performance trade-offs


βš™οΈ Consensus Protocol Implementation (Rust)

Fault-tolerant distributed system inspired by Raft and BFT protocols.

πŸ‘‰ Focus: safety, liveness, and failure handling


🧰 Tech Stack

Languages:
Java, Rust (learning), Python

Backend & Infra:
Spring Boot, Kafka, PostgreSQL, pgvector

AI Stack:
Embeddings, RAG pipelines, OpenAI APIs

Distributed Systems:
Consensus, fault tolerance, idempotency, retries


πŸ“š Research


✍️ Writing


🎯 Current Focus

  • AI infrastructure & reliability
  • Fault-tolerant distributed systems
  • Consensus protocol engineering

πŸ“¬ Connect


⭐ I believe systems should be designed for failure β€” not just success.

Pinned Loading

  1. AI-Powered-Recommendation-Engine-For-Documents AI-Powered-Recommendation-Engine-For-Documents Public

    Production-grade semantic search and RAG backend built as a distributed system. Features async ingestion (Kafka), embedding pipelines, pgvector search, and strong reliability guarantees β€” including…

    Java

  2. Cloud-Native-API-Gateway Cloud-Native-API-Gateway Public

    Production-style cloud-native API gateway engineered as a core infrastructure layer for secure, reliable, and scalable backend systems. Implements JWT/OAuth2 authentication, request routing, and ob…

    Java

  3. NFS-Workflow-Engine NFS-Workflow-Engine Public

    NFS-safe workflow engine for coordinating distributed jobs using filesystem semantics (atomic rename, idempotency, and crash recovery) without databases or containers.

    Go

  4. kv-engine-cpp kv-engine-cpp Public

    C++ key-value store built as a foundation for a correctness-focused storage engine (WAL, LSM, and distributed replication roadmap).

    C++

  5. raft-simulator raft-simulator Public

    Raft consensus implementation in Rust with a deterministic simulator: leader election, log replication, partitions/crashes, and correctness-focused tests.

    Rust

  6. async-bft-suite async-bft-suite Public

    Prototype framework implementing three asynchronous BFT agreement protocols (Cachin MVBA, VABA, pMVBA) with a unified simulation harness and comparable metrics.

    Python