Reliability-first AI infrastructure on Elixir/BEAM
46 public repos | 18 stars
The Elixir/BEAM platform provides technical capabilities essential for production AI systems that other runtimes cannot match: lightweight process isolation enables thousands of concurrent model calls without shared state corruption, supervisor trees provide automatic failure recovery without human intervention, and hot code upgrades allow model updates without service interruption. These are not conveniences but prerequisites for systems targeting 99%+ reliability in adversarial production environments.
North Shore AI builds the ML reliability research ecosystem that Elixir deserves. From statistical testing and ensemble voting to adversarial robustness and explainable AI, the stack addresses the full lifecycle of getting LLMs to behave predictably. The goal is world-class tooling that leverages BEAM's unique properties while remaining accessible to researchers and practitioners.
Two industrial metaphors unify the ecosystem. The Kitchen/Cookbook pattern treats ML training as a culinary operation: the kitchen provides infrastructure, cookbooks contain recipes, and practitioners are chefs executing them. The Metalworking pattern treats data labeling as manufacturing: raw data enters the forge, receives careful refinement on the anvil, and emerges as polished UI components (ingots) ready for crucible experimentation.
| Tier | Layer | Components |
|---|---|---|
| 1 | Public Interface | nsai_sites (Cloudflare Workers) |
| 2 | Gateway & Orchestration | nsai_gateway, nsai_registry |
| 3 | Processing Domains | Kitchen (training), Forge (data), CNS (reasoning) |
| 4 | Core Framework | crucible_framework, crucible_ir, crucible_bench |
| 5 | MLOps Assembly | crucible_train, crucible_model_registry, crucible_deployment, crucible_feedback |
| 6 | Reliability & Safety | LlmGuard, crucible_ensemble, crucible_hedging, crucible_xai, crucible_adversary |
| 7 | Foundational Utilities | tiktoken_ex, embed_ex, hf_hub_ex, hf_datasets_ex |
The training stack uses culinary terminology to separate concerns:
- Kitchen (
crucible_kitchen): The infrastructure and orchestration engine. Backend-agnostic, handles compute provisioning, job scheduling, and resource management. - Cookbook (
tinkex_cookbook): Training recipes and configurations. Declarative specifications for training runs that can be versioned and shared. - Chef: The ML practitioner who uses the kitchen and follows recipes to produce trained models.
The data labeling pipeline uses metalworking terminology to describe data transformation:
- Forge (
forge): Raw data processing factory. Takes unstructured inputs and shapes them into structured samples. - Anvil (
anvil): Human-in-the-loop labeling and governance. Where careful refinement and quality control occur. - Ingot (
ingot): Polished UI components. Production-ready Phoenix LiveView modules for labeling interfaces. - Crucible: ML experimentation. Where refined data undergoes testing and transformation under pressure.
Open research platform targeting 99%+ LLM reliability through ensembles, hedging, and statistical testing.
| Repository | Description |
|---|---|
| ExDataCheck | Data validation and quality library for ML pipelines in Elixir |
| ExFairness | Fairness and bias detection library for Elixir AI/ML systems |
| LlmGuard | AI Firewall and guardrails for LLM-based Elixir applications |
| cns_crucible | |
| crucible_adversary | Adversarial testing and robustness evaluation for the Crucible fram... |
| crucible_bench | Statistical testing and analysis framework for AI research |
| crucible_datasets | Dataset management and caching for AI research benchmarks |
| crucible_deployment | ML model deployment for the Crucible ecosystem. vLLM and Ollama int... |
| crucible_ensemble | Multi-model ensemble voting strategies for LLM reliability |
| crucible_examples | Interactive Phoenix LiveView demonstrations of the Crucible Framewo... |
| crucible_feedback | ML feedback loop management for the Crucible ecosystem. Quality mon... |
| crucible_framework | CrucibleFramework: A scientific platform for LLM reliability resear... |
| crucible_harness | Experimental research framework for running AI benchmarks at scale |
| crucible_hedging | Request hedging for tail latency reduction in distributed systems |
| crucible_ir | Intermediate Representation for the Crucible ML reliability ecosystem |
| crucible_kitchen | Industrial ML training orchestration - backend-agnostic workflow en... |
| crucible_model_registry | ML model registry for the Crucible ecosystem. Artifact storage, mod... |
| crucible_telemetry | Advanced telemetry collection and analysis for AI research |
| crucible_trace | Structured causal reasoning chain logging for LLM transparency |
| crucible_train | ML training orchestration for the Crucible ecosystem. Distributed t... |
| crucible_ui | Phoenix LiveView dashboard for the Crucible ML reliability stack |
| crucible_xai | Explainable AI (XAI) tools for the Crucible framework |
| datasets_ex | Dataset management library for ML experiments—loaders for SciFact, ... |
| eval_ex | Model evaluation harness for standardized benchmarking—comprehensiv... |
| hf_datasets_ex | HuggingFace Datasets for Elixir - A native Elixir port of the popul... |
| metrics_ex | Metrics aggregation and alerting for ML experiments—multi-backend e... |
| Repository | Description |
|---|---|
| anvil | Labeling queue library for managing human labeling workflows |
| forge | Sample factory library for generating, transforming, and computing ... |
| ingot | Phoenix LiveView interface for sample generation and human labeling... |
| labeling_ir | Shared IR structs for the North Shore labeling stack (Forge/Anvil/I... |
Critic-Network Synthesis: structured argumentation for AI reasoning transparency.
| Repository | Description |
|---|---|
| cns | Chiral Narrative Synthesis - Dialectical reasoning framework for au... |
| cns_ui | Phoenix LiveView interface for CNS dialectical reasoning experiments |
CNS Dialectical Flow:
Proposer (thesis) -> Antagonist (antithesis) -> Synthesizer (synthesis)
| | |
Extract SNOs Flag contradictions Resolve with evidence
(claims+evidence) (B1 gaps, chirality) (critic-guided)
Critics: Grounding, Logic, Novelty, Bias, Causal
| Repository | Description |
|---|---|
| ExDataCheck | Data validation and quality library for ML pipelines in Elixir |
| ExFairness | Fairness and bias detection library for Elixir AI/ML systems |
| LlmGuard | AI Firewall and guardrails for LLM-based Elixir applications |
| datasets_ex | Dataset management library for ML experiments—loaders for SciFact, ... |
| eval_ex | Model evaluation harness for standardized benchmarking—comprehensiv... |
| hf_datasets_ex | HuggingFace Datasets for Elixir - A native Elixir port of the popul... |
| metrics_ex | Metrics aggregation and alerting for ML experiments—multi-backend e... |
| Repository | Description |
|---|---|
| hf_hub_ex | Elixir client for HuggingFace Hub—dataset/model metadata, file down... |
| hf_peft_ex | Elixir port of HuggingFace's PEFT (Parameter-Efficient Fine-Tuning)... |
| nsai_gateway | Unified API gateway for the NSAI ecosystem—authentication (JWT, API... |
| nsai_registry | Service discovery and registry for the NSAI ecosystem—distributed r... |
| pilot | Interactive CLI and REPL for the NSAI ecosystem—unified interface t... |
| tiktoken_ex | Pure Elixir TikToken-style byte-level BPE tokenizer (Kimi K2 compat... |
| tinkerer | Chiral Narrative Synthesis workspace for Thinker/Tinker LoRA pipeli... |
| tinkex | Elixir SDK for the Tinker ML platform—LoRA training, sampling, and ... |
| tinkex_cookbook | Elixir port of tinker-cookbook: training and evaluation recipes for... |
| work | NSAI.Work - Unified job scheduler for North-Shore-AI platform |
| Repository | Description |
|---|---|
| chz_ex | Elixir port of OpenAI's chz library - a powerful configuration mana... |
| embed_ex | Vector embeddings service for Elixir—multi-provider support (OpenAI... |
| ex_topology | Pure Elixir library for graph topology, TDA, and computational topo... |
| nx_penalties | Composable regularization penalties for Elixir Nx. L1/L2/Elastic Ne... |
| Site | Purpose |
|---|---|
| nsai.online | Corporate landing, architecture overview |
| nsai.store | Open source packages catalog |
| nsai.space | Research lab, CNS experiments |
BEAM Native | OTP supervision, telemetry, distributed resilience Research Backed | Every feature tied to reliability research Production Ready | Hex packages, docs, test suites
Updated 2026-01-03
