Skip to content

StephenAbbott/bods-brightquery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bods-brightquery

Transform BrightQuery / OpenData.org data into Beneficial Ownership Data Standard (BODS) v0.4 format.

Part of the BODS Interoperability Toolkit.

Overview

This pipeline ingests BrightQuery Senzing JSONL data (organizations and people-business records) and produces BODS v0.4 compliant statements, including:

  • Entity statements for organizations
  • Person statements for individuals linked to businesses
  • Ownership-or-control statements linking persons to entities

Installation

pip install .

For development (includes pytest):

pip install ".[dev]"

Usage

Transform organization data

bods-bq transform --organizations org_file.json -o output.jsonl

Transform organizations and people from directories

bods-bq transform --organizations ~/data/Organization --people ~/data/PeopleBusiness -o output.jsonl

Output as JSON array (for smaller datasets)

bods-bq transform --organizations org.json --people people.json -o output.json -f json

Options

Flag Description
--organizations Path to organizations JSONL file or directory
--people Path to people-business JSONL file or directory
-o, --output Output file path (default: output.jsonl)
-f, --format Output format: json or jsonl
--publisher-name Publisher name for BODS metadata
-v, --verbose Enable verbose logging
-q, --quiet Suppress all output except errors

Project Structure

src/bods_brightquery/
├── ingestion/       # JSONL reading and data models
├── transform/       # BODS statement generation (entities, persons, relationships, identifiers)
├── output/          # Statement serialisation (JSON/JSONL)
├── utils/           # Date handling and statement helpers
├── pipeline.py      # Orchestrates ingestion -> transform -> output
└── cli.py           # Click CLI entry point

Testing

pytest

License

MIT

About

Transform BrightQuery's corporate ownership data from opendata.org into Beneficial Ownership Data Standard (BODS) version 0.4 data format

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages