Code and data of our paper:
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
Vaishnavh Nagarajan*1, Chen Henry Wu*2, Charles Ding2, Aditi Raghunathan2
1Google Research, 2Carnegie Mellon University
ICML 2025 (spotlight)
Paper | Data examples
Note: the paper includes experiments with both Gemma 2B and GPT-2/SEDD, but this repo contains only the GPT-2/SEDD code.
```
sibling-discovery       # code for Sibling Discovery
├── ntp                 # with next-token training
├── teacherless         # with teacherless training
└── diffusion           # with diffusion training
triangle-discovery      # code for Triangle Discovery
├── ...
circle-construction     # code for Circle Construction
├── ...
line-construction       # code for Line Construction
├── ...
simpletransformers      # helper code for Transformer models
```
We use simpletransformers to train and test the Transformer models (for NTP and teacherless training). To set up, please follow the installation instructions in simpletransformers/README.md.
For diffusion model training and inference, we use Score-Entropy-Discrete-Diffusion. We provide self-contained copies under {task}/diffusion/, so there is no need to clone that repo. Please follow the dependency installation instructions in their README.
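A typical environment setup might look like the sketch below. The environment name and Python version are assumptions, and the two READMEs above remain the authoritative instructions:

```shell
# Hypothetical setup sketch -- defer to simpletransformers/README.md and the
# SEDD README for the exact dependency lists.
conda create -n next-token python=3.10 -y
conda activate next-token
pip install -e ./simpletransformers  # install the bundled helper package in editable mode
```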
All experiments can be run on a single A6000 GPU. Batch sizes are tuned on this device.
We provide Jupyter notebooks to replicate the data generation process. Paths in the notebooks need to be adjusted to your local environment.
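If you prefer to run a notebook headlessly instead of through the Jupyter UI, nbconvert can execute all of its cells from the command line (assuming jupyter/nbconvert is installed in your environment); for example:

```shell
# Execute every cell of a notebook and write the outputs back into the same file.
jupyter nbconvert --to notebook --execute --inplace sibling-discovery/ntp/sibling.ipynb
```

The same invocation works for any of the notebooks listed below.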
To get data with hash-conditioning, run all blocks in:
sibling-discovery/ntp/sibling.ipynb
Here is an example of what the data would look like: sibling example
To get data without hash-conditioning, run all blocks in:
sibling-discovery/ntp/sibling_no_hash.ipynb
To get data with hash-conditioning, run all blocks in:
triangle-discovery/ntp/triangle.ipynb
Here is an example of what the data would look like: triangle example
To get data without hash-conditioning, run all blocks in:
triangle-discovery/ntp/triangle_no_hash.ipynb
To get both data with and without hash-conditioning, run all blocks in:
circle-construction/ntp/circle.ipynb
Here is an example of what the data would look like: circle example
To get both data with and without hash-conditioning, run all blocks in:
line-construction/ntp/line.ipynb
Here is an example of what the data would look like: line example
To run the experiments, replace {task} with sibling-discovery, triangle-discovery, circle-construction, or line-construction.
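As a sketch of the substitution, the four NTP working directories it produces are:

```shell
# Enumerate the NTP working directories generated by the {task} substitution.
for task in sibling-discovery triangle-discovery circle-construction line-construction; do
  echo "${task}/ntp"
done
```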
Working directory for NTP is {task}/ntp:
cd {task}/ntp
Run the training script:
bash run_train.sh
Run the evaluation script:
bash run_eval.sh
The evaluation script will print the scores for each saved checkpoint.
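Putting the NTP steps together for one concrete task, e.g. Sibling Discovery:

```shell
cd sibling-discovery/ntp   # substitute any of the four tasks here
bash run_train.sh          # train with next-token prediction
bash run_eval.sh           # prints scores for each saved checkpoint
```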
Working directory for teacherless training is {task}/teacherless:
cd {task}/teacherless
We provide Jupyter notebooks to preprocess the dataset for teacherless training. Adjust the paths in the notebook to your local environment, then run all blocks in:
{task}_hybrid.ipynb
Run the training script:
bash run_train.sh
Run the evaluation script:
bash run_eval.sh
The evaluation script will print the scores for each saved checkpoint.
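Concretely, for Circle Construction the teacherless pipeline runs the preprocessing notebook first, then training and evaluation. The notebook can be run in the Jupyter UI; the nbconvert invocation below is just one way to execute it headlessly, and the filename assumes the literal {task}_hybrid.ipynb substitution described above:

```shell
cd circle-construction/teacherless
# 1. Preprocess: run all blocks of the hybrid notebook
#    (filename follows the {task}_hybrid.ipynb pattern from this README).
jupyter nbconvert --to notebook --execute --inplace circle-construction_hybrid.ipynb
# 2. Train, then evaluate (scores printed per saved checkpoint).
bash run_train.sh
bash run_eval.sh
```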
Working directory for diffusion models is {task}/diffusion:
cd {task}/diffusion
Run the training script:
bash run_train.sh
Run the evaluation script:
bash run_eval.sh
The evaluation script will print the scores for each saved checkpoint.
If you find this code useful, please consider citing our paper:
```
@misc{nagarajan2025roll,
  title={Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction},
  author={Vaishnavh Nagarajan and Chen Henry Wu and Charles Ding and Aditi Raghunathan},
  year={2025},
  eprint={2504.15266},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}
```