The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
-
Updated
Sep 27, 2025 - Python
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 2.5 𝘩𝘰𝘶𝘳𝘴 𝘰𝘧 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 & 𝘷𝘪𝘥𝘦𝘰 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
Distributed version-control for geospatial and tabular data
Metadata store for Production ML
Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
A demonstration of how DVC and MLFlow can be used in the task of data relabeling
Python Data as Code core implementation
A CKAN extension for data versioning.
Deprecated. See https://github.com/datopian/ckanext-versions. ⏰ CKAN extension providing data versioning (metadata and files) based on git and github.
Automatic data change tracking for Django
Automatic data change tracking for SQLAlchemy
Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.
This project demonstrates a complete workflow for analyzing sales data with missing values. It includes data cleaning, feature engineering, aggregation, and visualizations using Python libraries such as Pandas, NumPy, and Matplotlib.
The provided demo project demonstrates the practical implementation and advantages of using DVC. It showcases how DVC simplifies data versioning and model versioning while working in tandem with Git to create a cohesive version control system tailored for data science projects.
Production-ready ML pipeline solving reproducibility challenges with DVC, docker, MLflow & DagHub
Testing and implementations with ClearML
Deploying a ML Model to Cloud Platform with FastAPI applying CI/CD practices
Project with tabular data versioned with Artifacts.
A template for building governed and reproducible machine learning projects, enabling transparent tracking of data, models, and deployments across various platforms.
Add a description, image, and links to the data-versioning topic page so that developers can more easily learn about it.
To associate your repository with the data-versioning topic, visit your repo's landing page and select "manage topics."