Skip to content

hyperpolymath/unified-dataset-vocab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

177 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Unified Dataset Vocab

1. Overview

Unified Dataset Vocab provides formal vocabulary definitions and taxonomies for describing datasets across the hyperpolymath ecosystem. It bridges schema design, data engineering, and formal verification through a type-safe vocabulary layer backed by Idris2 proofs and a Zig FFI for cross-language consumption.

2. Key Features

  • Formal Definitions — Vocabulary terms defined with dependent types in Idris2

  • Taxonomy Logic — Hierarchical classification of dataset attributes

  • A2ML Integration — Markup hooks for AI-readable metadata annotations

  • Cross-language FFI — Zig-based C ABI so any language can consume the vocabulary

  • Compile-time Proofs — Memory layout and ABI correctness verified at compile time

3. ABI/FFI Standards (Hyperpolymath Universal Standard)

All repos with foreign function interfaces follow this standard:

  • ABI (Application Binary Interface) — Idris2 (src/abi/*.idr)

  • FFI (Foreign Function Interface) — Zig (ffi/zig/src/*.zig)

  • Generated C Headers — Auto-generated from Idris2 ABI (generated/abi/*.h)

See ABI-FFI-README.md for complete documentation.

4. Directory Structure

unified-dataset-vocab/
├── src/abi/              # Idris2 ABI definitions (formal proofs)
│   ├── Types.idr         # Core type definitions
│   ├── Layout.idr        # Memory layout verification
│   └── Foreign.idr       # FFI function declarations
├── ffi/zig/              # Zig FFI implementation
│   ├── build.zig         # Build configuration
│   ├── src/main.zig      # C-compatible implementation
│   └── test/             # Integration tests
├── examples/             # Usage examples (ReScript, Deno)
├── contractiles/         # Operational contracts (K9, Must, Trust, Dust, Lust)
├── .machine_readable/    # SCM files (STATE, META, ECOSYSTEM, etc.)
├── .well-known/          # security.txt, humans.txt, ai.txt
└── docs/                 # Documentation and citations

5. Architecture

See TOPOLOGY.md for a visual architecture map and completion dashboard.

6. License

SPDX-License-Identifier: MPL-2.0

Copyright (c) 2026 Jonathan D.A. Jewell (hyperpolymath) <j.d.a.jewell@open.ac.uk>

Licensed under the Palimpsest-MPL License Version 1.0 or any later version.

About

Formal vocabulary definitions and taxonomies for datasets — Idris2-typed proofs with Zig FFI for cross-language consumption.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Sponsor this project

  •  

Packages

 
 
 

Contributors