A sparse autoencoder for mechanistic interpretability research.
pip install sparse_autoencoder
Check out the demo notebook for a guide to using this library.
This project uses Poetry for dependency management, and PoeThePoet for scripts. After checking out the repo, install all dependencies with:
poetry install --with dev,demos
Then for a full list of available commands (e.g. test
or typecheck
):
poe