Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
-
Updated
Feb 7, 2026 - Python
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
SQL Lineage Analysis Tool powered by Python
🐳 The stupidly simple CLI workspace for your data warehouse.
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
Work with your web service, database, and streaming schemas in a single format.
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
Open-source metadata collector based on ODD Specification
A data lineage tool detects table dependencies from rendered SQL statements.
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
Data Catalogs Made Easy
WG3 Metadata Specification
Protegrity AI Developer Edition – Data Discovery and Protection Tools
LLM-Powered Data Discovery System for Tabular Data
articat: data artifact catalog
Toolkit for discovering and aggregating data for whole-cell modeling
Valentine scalable deployment for VLDB demo
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
A fast and accurate index for distribution-aware dataset search.
An analytics engineering sandbox focusing on real estates prices in Cook County, IL
Add a description, image, and links to the data-discovery topic page so that developers can more easily learn about it.
To associate your repository with the data-discovery topic, visit your repo's landing page and select "manage topics."