The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
Updated
Dec 25, 2024 - Python
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
A light-weight, flexible, and expressive statistical data testing library
Lightweight, extensible data validation library for Python
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Automatically find issues in image datasets and practice data-centric computer vision.
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
A dead simple Python string validation library.
Typical: Fast, simple, & correct data-validation using Python 3 typing.
Open Source Data Quality Monitoring.
A collaborative framework for annotating medical datasets using crowdsourcing.
pydantic --> zod data models
Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake Source Export, Snowflake cloning, and data tieout tool
Data validation library for PySpark 3.0.0
Gere e valide dados randômicos com fordev 🎲
With this tool, users can easily add credit card details, view saved cards, check card balances, and retrieve OTPs for verification
Another library for defensive data analysis.
Client interface for all things Cleanlab Studio
Data validator for the zen of python
Rule based data validation library for python 3.
Add a description, image, and links to the data-validation topic page so that developers can more easily learn about it.
To associate your repository with the data-validation topic, visit your repo's landing page and select "manage topics."