Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
Updated Oct 1, 2025 - Python
Intake is a lightweight package for finding, investigating, loading and disseminating data.
🐳 The stupidly simple CLI workspace for your data warehouse.
Work with your web service, database, and streaming schemas in a single format.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and DataHub.
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Sample code with integration between Data Catalog and RDBMS data sources.
End-to-end DataOps platform deployed by Terraform.
Data catalog for everything in your company
Registry of data portals, catalogs, and data repositories, including a dataset and catalog description standard for data catalogs.
Open-source metadata collector based on ODD Specification
Sample code with integration between Data Catalog and BI data sources.
A data lineage tool that detects table dependencies from rendered SQL statements.
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
Polar Earth Observation Database of satellite sensors
Data Catalogs Made Easy
Update a Google Data Catalog tag with dbt Cloud run metadata
articat: data artifact catalog
Build a data catalog by running a single line of code