Unified Dataset Vocab provides formal vocabulary definitions and taxonomies for describing datasets across the hyperpolymath ecosystem. It bridges schema design, data engineering, and formal verification through a type-safe vocabulary layer backed by Idris2 proofs and a Zig FFI for cross-language consumption.
-
Formal Definitions — Vocabulary terms defined with dependent types in Idris2
-
Taxonomy Logic — Hierarchical classification of dataset attributes
-
A2ML Integration — Markup hooks for AI-readable metadata annotations
-
Cross-language FFI — Zig-based C ABI so any language can consume the vocabulary
-
Compile-time Proofs — Memory layout and ABI correctness verified at compile time
All repos with foreign function interfaces follow this standard:
-
ABI (Application Binary Interface) — Idris2 (
src/abi/*.idr) -
FFI (Foreign Function Interface) — Zig (
ffi/zig/src/*.zig) -
Generated C Headers — Auto-generated from Idris2 ABI (
generated/abi/*.h)
See ABI-FFI-README.md for complete documentation.
unified-dataset-vocab/
├── src/abi/ # Idris2 ABI definitions (formal proofs)
│ ├── Types.idr # Core type definitions
│ ├── Layout.idr # Memory layout verification
│ └── Foreign.idr # FFI function declarations
├── ffi/zig/ # Zig FFI implementation
│ ├── build.zig # Build configuration
│ ├── src/main.zig # C-compatible implementation
│ └── test/ # Integration tests
├── examples/ # Usage examples (ReScript, Deno)
├── contractiles/ # Operational contracts (K9, Must, Trust, Dust, Lust)
├── .machine_readable/ # SCM files (STATE, META, ECOSYSTEM, etc.)
├── .well-known/ # security.txt, humans.txt, ai.txt
└── docs/ # Documentation and citationsSee TOPOLOGY.md for a visual architecture map and completion dashboard.
SPDX-License-Identifier: MPL-2.0
Copyright (c) 2026 Jonathan D.A. Jewell (hyperpolymath) <j.d.a.jewell@open.ac.uk>
Licensed under the Palimpsest-MPL License Version 1.0 or any later version.