Transform BrightQuery / OpenData.org data into Beneficial Ownership Data Standard (BODS) v0.4 format.
Part of the BODS Interoperability Toolkit.
This pipeline ingests BrightQuery Senzing JSONL data (organization and people-business records) and produces BODS v0.4-compliant statements, including:
- Entity statements for organizations
- Person statements for individuals linked to businesses
- Ownership-or-control statements linking persons to entities
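
As a rough illustration of how the three statement types relate, here is a minimal sketch in Python. It is not the toolkit's own code: the `make_statement` helper, the record IDs, and the names are made up for the example, and the statements are heavily simplified. The field names (`recordId`, `recordType`, `recordDetails`, `subject`, `interestedParty`) follow a reading of the BODS v0.4 schema and should be checked against the published schema. Many fields a valid statement would carry are omitted.

```python
import json
import uuid


def make_statement(record_id, record_type, details):
    """Wrap record details in a simplified BODS-style statement (hypothetical helper)."""
    return {
        "statementId": str(uuid.uuid4()),
        "recordId": record_id,
        "recordType": record_type,  # "entity", "person", or "relationship"
        "recordDetails": details,
    }


# Placeholder data, invented for illustration only.
entity = make_statement("org-1", "entity", {
    "entityType": {"type": "registeredEntity"},
    "name": "Example Holdings LLC",
})
person = make_statement("person-1", "person", {
    "personType": "knownPerson",
    "names": [{"fullName": "Jane Example"}],
})
# The relationship statement points at the other two records by their record IDs.
relationship = make_statement("rel-1", "relationship", {
    "subject": entity["recordId"],
    "interestedParty": person["recordId"],
})

statements = [entity, person, relationship]
for statement in statements:
    print(json.dumps(statement))
```

The relationship statement carries no names or addresses of its own; it only links the person record to the entity record.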

Install:

    pip install .

For development (includes pytest):

    pip install ".[dev]"

Example usage:

    bods-bq transform --organizations org_file.json -o output.jsonl
    bods-bq transform --organizations ~/data/Organization --people ~/data/PeopleBusiness -o output.jsonl
    bods-bq transform --organizations org.json --people people.json -o output.json -f json

| Flag | Description |
|---|---|
| `--organizations` | Path to organizations JSONL file or directory |
| `--people` | Path to people-business JSONL file or directory |
| `-o`, `--output` | Output file path (default: `output.jsonl`) |
| `-f`, `--format` | Output format: `json` or `jsonl` |
| `--publisher-name` | Publisher name for BODS metadata |
| `-v`, `--verbose` | Enable verbose logging |
| `-q`, `--quiet` | Suppress all output except errors |
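
Once a JSONL output file exists, a quick sanity check is to count statements by type. The sketch below is not part of the toolkit. It assumes each output line is one JSON statement with a `recordType` field, as in BODS v0.4, and the `count_record_types` helper and sample lines are hypothetical.

```python
import json
from collections import Counter


def count_record_types(lines):
    """Tally statements by their recordType, skipping blank lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        statement = json.loads(line)
        # Statements without a recordType are grouped under "unknown".
        counts[statement.get("recordType", "unknown")] += 1
    return counts


# Stand-in for the lines of an output file; invented for illustration.
sample = [
    '{"recordType": "entity", "recordId": "org-1"}',
    '{"recordType": "person", "recordId": "person-1"}',
    '{"recordType": "relationship", "recordId": "rel-1"}',
    '',
]
print(count_record_types(sample))
```

To run it against a real file, pass an open file handle instead of `sample`, since iterating over a text file yields its lines.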
src/bods_brightquery/
├── ingestion/ # JSONL reading and data models
├── transform/ # BODS statement generation (entities, persons, relationships, identifiers)
├── output/ # Statement serialisation (JSON/JSONL)
├── utils/ # Date handling and statement helpers
├── pipeline.py # Orchestrates ingestion -> transform -> output
└── cli.py # Click CLI entry point
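
The ingestion -> transform -> output flow that `pipeline.py` orchestrates can be pictured roughly as follows. This is a sketch under stated assumptions, not the package's actual code: every function name here is hypothetical, the input keys `id` and `name` are placeholders rather than real BrightQuery or Senzing field names, and the statement is reduced to a few fields.

```python
import io
import json


def read_jsonl(stream):
    """Ingestion: yield one parsed record per non-empty line."""
    for line in stream:
        if line.strip():
            yield json.loads(line)


def to_entity_statement(record):
    """Transform: map a source record to a simplified entity statement."""
    return {
        "recordId": str(record["id"]),  # "id" is a placeholder key
        "recordType": "entity",
        "recordDetails": {"name": record.get("name")},
    }


def write_jsonl(statements, stream):
    """Output: write one JSON statement per line and return the count."""
    count = 0
    for statement in statements:
        stream.write(json.dumps(statement) + "\n")
        count += 1
    return count


def run_pipeline(source, sink):
    """Chain the three stages lazily so records stream through one at a time."""
    records = read_jsonl(source)
    statements = (to_entity_statement(record) for record in records)
    return write_jsonl(statements, sink)


# In-memory stand-ins for the input and output files; data is invented.
source = io.StringIO(
    '{"id": "1", "name": "Example Holdings LLC"}\n'
    '\n'
    '{"id": "2", "name": "Sample Corp"}\n'
)
sink = io.StringIO()
written = run_pipeline(source, sink)
print(written)
```

Using generators keeps memory use flat for large inputs, which suits line-oriented JSONL data.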

Run the tests:

    pytest

License: MIT