Code for Machine Learning for Trading, 3rd edition — from data sourcing to live execution.
-
Updated
Jun 23, 2026 - Jupyter Notebook
Code for Machine Learning for Trading, 3rd edition — from data sourcing to live execution.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
Mimesis is a fast Python library for generating fake data in multiple languages.
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
A procedural Blender pipeline for photorealistic training image generation
Synthetic data generation for tabular data
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Python Toolkit for Causal and Probabilistic Reasoning
Synthetic Patient Population Simulator
SDG is a specialized framework designed to generate high-quality structured tabular data.
UnrealCV: Connecting Computer Vision to Unreal Engine
🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.
Database anonymization and test data management
Synthetic data curation for post-training and structured data extraction
Synthetic data generators for tabular and time-series data
Build, enrich, and transform datasets using AI models with no code
Conditional GAN for generating synthetic tabular data.
The Declarative Data Generator
A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
Add a description, image, and links to the synthetic-data topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-data topic, visit your repo's landing page and select "manage topics."