oolong-dev/julax

julax

2025-09-26

  • Rethink "Why Flax NNX?"
    • Inspection: doesn't work for us.
    • Running computation: worth rethinking whether we should support it.
    • State handling: doesn't work for us. State is handled explicitly here. A dedicated class was created before 02_mnist/v5.py, but a dict-like object seems to be enough.
    • Model surgery: the real benefit is unclear. Keeping params and state in sync might be an issue, but the operation should be easy since model/param/state are mirrored trees.
    • Transforms: need to revisit in the future.
  • Lux.jl seems to be mainly inspired by the Linen style in Flax, while the NNX style is closer to PyTorch. The existing implementation in this repo is closer to Lux and axlearn.
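
The "mirrored trees" idea in the notes above can be sketched as follows. This is a minimal, dependency-free illustration rather than julax's actual API: `Dense`, `Chain`, `init`, and `replace_layer` are hypothetical names, and plain lists stand in for arrays. In JAX the same nested dicts would be valid pytrees.

```python
from dataclasses import dataclass


# Hypothetical layer descriptions; these are NOT julax's real classes.
@dataclass(frozen=True)
class Dense:
    in_dim: int
    out_dim: int

    def init(self):
        # Plain lists stand in for arrays to keep the sketch dependency-free.
        params = {
            "w": [[0.0] * self.out_dim for _ in range(self.in_dim)],
            "b": [0.0] * self.out_dim,
        }
        state = {}  # a stateless layer still gets an (empty) mirrored entry
        return params, state


@dataclass(frozen=True)
class Chain:
    layers: dict  # name -> sub-layer

    def init(self):
        # Params and state are built with the same keys as the model tree.
        params, state = {}, {}
        for name, layer in self.layers.items():
            params[name], state[name] = layer.init()
        return params, state


def replace_layer(model, params, state, name, new_layer):
    """Model surgery: swap one sub-layer and re-initialise only its entries."""
    new_params, new_state = new_layer.init()
    new_model = Chain({**model.layers, name: new_layer})
    return new_model, {**params, name: new_params}, {**state, name: new_state}


model = Chain({"hidden": Dense(4, 8), "head": Dense(8, 10)})
params, state = model.init()

# Replace the 10-way head with a 2-way head; "hidden" is left untouched.
model, params, state = replace_layer(model, params, state, "head", Dense(8, 2))
```

Because the three trees share keys, surgery on one path touches the same path in all of them, which is why keeping params and state in sync stays a local operation in this sketch.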

2025-09-15

2025-08-21

2025-08-20

2025-08-07

2025-08-06

2025-08-03

  • penzai v2 background
    • Parameters and state variables becoming mutable, shareable variable objects
      • This seems to be aligned with the current design, where a general dict is used. Maybe I should also introduce a dedicated class for params and states.
    • all variable objects must have a unique label, which can either be specified manually or generated automatically.
      • Hmm, I find it difficult to search for a specific Param/State. Maybe a unique label would help here?
    • Eager parameter initialization
      • In the current design, params and states are separated from models. So is it closer to lazy initialization?
    • The built-in Transformer implementation also supports loading Llama, Mistral, and GPT-NeoX / Pythia models.
      • TODO: this is a good feature to have.
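
One possible way to get a unique label without a dedicated variable class is to use each leaf's path in the nested dict. The sketch below assumes params are stored as plain nested dicts. `flatten_with_paths` and `find` are hypothetical helpers, not part of julax or penzai. JAX offers a comparable built-in, `jax.tree_util.tree_flatten_with_path`.

```python
def flatten_with_paths(tree, prefix=()):
    """Yield (path, leaf) pairs from a nested dict; the path is a unique label."""
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            yield from flatten_with_paths(value, path)
        else:
            yield path, value


def find(tree, predicate):
    """Return {label: leaf} for every leaf whose path satisfies `predicate`."""
    return {
        "/".join(path): leaf
        for path, leaf in flatten_with_paths(tree)
        if predicate(path)
    }


# Floats stand in for arrays in this illustration.
params = {
    "encoder": {"dense": {"w": 1.0, "b": 0.0}},
    "head": {"w": 2.0},
}

# Look up every weight by the last component of its path.
weights = find(params, lambda path: path[-1] == "w")
```

With path-derived labels, searching for a specific Param/State becomes a filter over the flattened tree, at the cost of labels changing whenever the model structure is renamed.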

2025-07-28

2025-07-26

2025-07-25

2025-07-24

2025-07-20

Investigations
