Love jax? Tired of not having good networks pre-implemented, and of the infrastructure being less robust than PyTorch's? Bummed that HuggingFace Diffusers just let go of Flax?
To help with all of that, this repository contains an implementation of NVIDIA's latest and greatest EDM2 UNet architecture in jax. For the official PyTorch implementation, see https://github.com/NVlabs/edm2.
As described in example.py, usage of the network is simple:
```python
import jax
import jax.numpy as jnp
from jax.flatten_util import ravel_pytree

import edm2_net

## initialize the model
model = edm2_net.PrecondUNet(
    img_resolution=32,
    img_channels=3,
    label_dim=10,
    sigma_data=0.5,
    logvar_channels=128,
    use_bfloat16=True,
    unet_kwargs={
        "model_channels": 128,
        "channel_mult": [2, 2, 2],
        "num_blocks": 3,
        "attn_resolutions": [16, 8],
        "use_fourier": False,
        "block_kwargs": {"dropout": 0.13},
    },
)

## note that we use the pytorch (NCHW) convention
prng_key = jax.random.PRNGKey(42)
ex_input = jax.random.normal(prng_key, (1, 3, 32, 32))
ex_t = jnp.array([0.0])
ex_label = jax.nn.one_hot(0, num_classes=10).reshape((1, -1))

## initialize the model parameters
params = model.init(
    {"params": prng_key},
    ex_t,
    ex_input,
    ex_label,
    train=False,
    calc_weight=True,
)
print(f"Number of parameters: {ravel_pytree(params)[0].size}")

## note need to project to sphere due to jax functional style
## this also needs to happen after every gradient step in a training loop!
params = edm2_net.project_to_sphere(params)
```

The EDM2 network architecture relies on careful normalization and projection of the model's convolutional weights. In PyTorch, this can happen inside the network itself; because jax is functional, it needs to happen in the training loop instead. As shown in the snippet above, we've included a function project_to_sphere that performs this projection for you. Just apply it after every gradient step.
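The repository doesn't prescribe a particular training loop, so here is a minimal sketch of how the projection slots into a training step with optax. The optimizer choice and loss_fn below are placeholders rather than this repo's actual training code (a real objective would call model.apply, mirroring the init signature above); the one essential line is edm2_net.project_to_sphere applied immediately after optax.apply_updates.

```python
import jax
import jax.numpy as jnp
import optax

## minimal training-step sketch (an assumption, not code from this repo)
optimizer = optax.adam(1e-3)
opt_state = optimizer.init(params)

def loss_fn(params, batch):
    ## placeholder objective -- in practice you would build the EDM2 loss
    ## from model.apply(params, t, x, label, train=True, calc_weight=True),
    ## mirroring the init() call above
    return sum(jnp.sum(p**2) for p in jax.tree_util.tree_leaves(params))

@jax.jit
def train_step(params, opt_state, batch):
    loss, grads = jax.value_and_grad(loss_fn)(params, batch)
    updates, opt_state = optimizer.update(grads, opt_state, params)
    params = optax.apply_updates(params, updates)
    params = edm2_net.project_to_sphere(params)  ## re-project after the gradient step
    return params, opt_state, loss
```

Whether you fold the projection into a jitted step like this or apply it outside, the requirement is the same: every parameter update should be followed by a projection back to the sphere.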
The following references were useful in the development of this code. The first is the original paper by NVIDIA introducing the network architecture. The second is a paper by yours truly (along with the great & powerful Michael Albergo and Eric Vanden-Eijnden) that leverages this code to learn flow map-based generative models.
```bibtex
@article{karras_analyzing_2024,
  title = {Analyzing and {Improving} the {Training} {Dynamics} of {Diffusion} {Models}},
  url = {http://arxiv.org/abs/2312.02696},
  author = {Karras, Tero and Aittala, Miika and Lehtinen, Jaakko and Hellsten, Janne and Aila, Timo and Laine, Samuli},
  month = mar,
  year = {2024},
  journal = {arXiv:2312.02696},
}

@article{boffi_how_2025,
  title = {How to build a consistency model: {Learning} flow maps via self-distillation},
  shorttitle = {How to build a consistency model},
  url = {https://www.arxiv.org/abs/2505.18825},
  author = {Boffi, Nicholas M. and Albergo, Michael S. and Vanden-Eijnden, Eric},
  month = may,
  year = {2025},
  journal = {arXiv:2505.18825},
}
```