skrub-data / skrub Star 1.6k Code Issues Pull requests Discussions Machine learning with dataframes data-science data machine-learning data-analysis data-wrangling data-preprocessing dataframe dataframes data-preparation data-cleaning dirty-data Updated Feb 6, 2026 Python
dirty-data-science / python Star 61 Code Issues Pull requests Tutorial material on machine learning with dirty data in Python data-science machine-learning dirty-data Updated Jul 7, 2024 Python
raamana / missingdata Star 18 Code Issues Pull requests missing data handing: visualize and impute visualization data-science machine-learning neuroscience biostatistics imputation epidemiology missing-data dirty-data missing-values Updated Jul 31, 2019 Python
jbn / vaquero Star 0 Code Issues Pull requests A Python library for iterative and interactive data wrangling at laptop-scale. data data-mining etl data-analysis elt data-cleaning dirty-data etl-framework Updated Feb 8, 2018 Python