Skip to content
#

synthetic-data

Here are 17 public repositories matching this topic...

Descentralized Optimization Network (DON) is a REST API that allows extending existing evolutionary algorithms to a decentralized architecture to provide collaboration, scalability, event log and fault-tolerance in an optimization process. It also allows external clients to make remote evaluations of data using the most optimized models.

  • Updated Feb 27, 2025
  • JavaScript

AuldLangSynth is an open-source data-centric language synthesis platform designed to generate, analyze, and curate high-quality instruction datasets for modern AI systems. It provides an end-to-end workflow for producing structured language samples, auditing their quality, and transforming them into embeddings and datasets ready for training.

  • Updated Dec 13, 2025
  • JavaScript

A carefully curated collection of tools, datasets, papers, and open-source implementations focused on Synthetic Data Generation (SDG) and its realism-driven extension, Realistic Artificial Data (RAD) — a paradigm that moves beyond superficial resemblance toward causally grounded, physically consistent, and privacy-safe data generation.

  • Updated Mar 16, 2026
  • JavaScript

Improve this page

Add a description, image, and links to the synthetic-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the synthetic-data topic, visit your repo's landing page and select "manage topics."

Learn more