A tool that uses advanced Monte Carlo simulations and Turbit parallel processing to create possible Bitcoin prediction scenarios.
-
Updated
Aug 4, 2024 - JavaScript
A tool that uses advanced Monte Carlo simulations and Turbit parallel processing to create possible Bitcoin prediction scenarios.
Where the digital world comes to life!
Descentralized Optimization Network (DON) is a REST API that allows extending existing evolutionary algorithms to a decentralized architecture to provide collaboration, scalability, event log and fault-tolerance in an optimization process. It also allows external clients to make remote evaluations of data using the most optimized models.
Interactive demonstration of synthetic data generation using GANs and VAEs with statistical comparison
Code for Roboflow's How to Create a Synthetic Dataset tutorial.
Cultural Learning-Based Culture Adaptation of Language Models (https://aclanthology.org/2025.acl-long.156/)
A full-stack Machine Learning web application for generating synthetic epidemiological data, training multiple ML models, and predicting disease outbreak risk through an interactive dashboard.
Node.js CLI wrapper for Synthea synthetic patient generator with advanced parameter management
This project provides a sample dataset of recipes to be used with MongoDB. It uses Faker JS package for generating Synthatic Data.
Data generator designed specifically for turboMaker. Generates random text, hashtags, words, dates, emails, id, url, arrays, booleans, etc.
Official webpage for the paper: Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning [ECCV24]
Official academic page for the paper: "Diffusing DeBias: Synthetic Bias Amplification for Model Debiasing" (NeurIPS 2025)
Generate realistic AI-powered phone conversations for testing Voice Intelligence and analytics pipelines
Project Page (ObjectDR)
AuldLangSynth is an open-source data-centric language synthesis platform designed to generate, analyze, and curate high-quality instruction datasets for modern AI systems. It provides an end-to-end workflow for producing structured language samples, auditing their quality, and transforming them into embeddings and datasets ready for training.
Superfast, multithreaded document generator for MongoDB, operating through CLI. Generates millions of documents at maximum speed, utilizing all CPU threads.
A carefully curated collection of tools, datasets, papers, and open-source implementations focused on Synthetic Data Generation (SDG) and its realism-driven extension, Realistic Artificial Data (RAD) — a paradigm that moves beyond superficial resemblance toward causally grounded, physically consistent, and privacy-safe data generation.
Add a description, image, and links to the synthetic-data topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-data topic, visit your repo's landing page and select "manage topics."