Experimental faster training for dictionary learning on single-GPU machines.
Install the package using pip:

```bash
pip install dl-quick-train
```
You can run the default pipeline from the command line:

```bash
dl-quick-train
```
Alternatively, you can import the library and call `run_pipeline` yourself:

```python
from dl_quick_train import run_pipeline

run_pipeline([...], activation_cache_dir="/tmp/activations")
```
Set `use_transformer_lens=True` to collect activations with TransformerLens instead of nnsight.
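For example, a minimal sketch (the positional argument is the same placeholder list as in the example above):

```python
from dl_quick_train import run_pipeline

# Collect activations with TransformerLens instead of the default
# nnsight backend.
run_pipeline([...], use_transformer_lens=True)
```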
Setting `activation_cache_dir` enables caching of model activations on disk. Caches are stored in subdirectories determined by the model name, dataset, layer, activation dimension, submodule, and sequence length. If the directory already contains cached activations for the requested configuration, they are loaded instead of recomputed.
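A sketch of the caching behavior (again with placeholder arguments):

```python
from dl_quick_train import run_pipeline

cache_dir = "/tmp/activations"

# The first run computes activations and writes them to a subdirectory
# of cache_dir keyed by model name, dataset, layer, activation
# dimension, submodule and sequence length.
run_pipeline([...], activation_cache_dir=cache_dir)

# A repeat run with the same configuration finds that subdirectory and
# loads the cached activations instead of recomputing them.
run_pipeline([...], activation_cache_dir=cache_dir)
```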
`run_pipeline` accepts a `start_method` argument controlling the multiprocessing start method (default: `"forkserver"`). Crash reporting is improved by enabling Python's `faulthandler` in worker processes.
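For example, a sketch assuming `start_method` accepts the standard multiprocessing start-method names:

```python
from dl_quick_train import run_pipeline

# Use "spawn" instead of the default "forkserver", e.g. when forking is
# unsafe because a CUDA context already exists in the parent process.
# Worker processes enable faulthandler, so hard crashes still produce
# Python tracebacks.
run_pipeline([...], start_method="spawn")
```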