Nabu is a CLI program for:
- crawling remote JSON-LD documents from a remote sitemap and storing them into an S3 bucket
- synchronizing JSON-LD documents from the S3 bucket into a triplestore by:
- removing old triples from the triplestore that are no longer present in S3
- inserting triples into the triplestore which are present in S3 but not in the triplestore
See the examples directory for example CLI usage.
docker run internetofwater/nabu:latest
git clone https://github.com/internetofwater/nabu
cd nabu
go build ./cmd/nabu
./nabu --help
For technical details and design rationale see the docs folder.
This repo is a heavily modified fork of the gleanerio Nabu repo:
It was edited to:
- add unit tests, integration tests, code coverage, and linting to the entire project
- add crawl support from Gleaner into the Nabu binary
- add shacl validation
- refactor code to make it easier to maintain and extend
- improve concurrency
- add observability and tracing output using OpenTelemetry