A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
-
Updated
May 28, 2025 - Python
A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
🌮 Table-based KB Completer
Master thesis - reproducing state-of-the-art schema matching algorithms
Valentine scalable deployment for VLDB demo
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching
CLI tool for inserting SELECT query results into ClickHouse with automatic schema matching and type-safe casting. Ideal for ETL pipelines and SQL-driven data flows.
Python client for the Serene Data Integration software
[Information System] SMUTF: Schema Matching Using Generative Tags and Hybrid Features
Master thesis: Holistic Schema Matching at Scale
Projects for the course Data Engineering held by professor Paolo Merialdo at Roma Tre University.
The PyDI framework provides methods for end-to-end data integration. The framework covers all steps of the integration process, including schema matching, data translation, entity matching, and data fusion. The framework offers traditional string-based methods as well as modern LLM- and embedding-based techniques for these tasks.
Benchmark to evaluate schema matching approaches
The Master Project of Aldi Doanta Kurnia - Master Computer Science student at the University of Twente.
Add a description, image, and links to the schema-matching topic page so that developers can more easily learn about it.
To associate your repository with the schema-matching topic, visit your repo's landing page and select "manage topics."