Tied/Untied Mechanism #7

fel-thomas · 2025-11-04T05:42:04Z

Tied/Untied Mechanism Weight Encoder

Weight tying aligns encoder and decoder at training start, leading to faster convergence and reduced parameters.
The goal was to have a simple API: call .tied() or .untied() on any SAE in Overcomplete and you're done.

Implementation

Introduces TieableEncoder - a lightweight linear encoder that either:

Uses dictionary transpose D^T when tied
Maintains independent weights when untied

Works seamlessly with all SAE variants (TopK, Jump, Batch, MP, OMP, etc.)

Usage

sae = SAE(input_shape=768, nb_concepts=2048)
sae.tied()  # encoder now uses D^T

# ... train with tied weights ...

sae.untied(copy_from_dictionary=True)  # switch to independent weights

# ... continue training ...

~50% fewer parameters when tied, natural weight initialization when transitioning.

fel-thomas added 4 commits November 4, 2025 00:45

deps: ensure scipy for nnls & hugarian matching

92a338c

kernels: remove unused import

ea27d50

deps: introduce numkdocs for numpy autodoc

ea196dc

sae: introduce tied/untied mechanism

8241b6d

fel-thomas force-pushed the tied_encoder branch from 7f0becf to 8241b6d Compare November 4, 2025 05:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tied/Untied Mechanism #7

Tied/Untied Mechanism #7

Uh oh!

fel-thomas commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Tied/Untied Mechanism #7

Are you sure you want to change the base?

Tied/Untied Mechanism #7

Uh oh!

Conversation

fel-thomas commented Nov 4, 2025

Tied/Untied Mechanism Weight Encoder

Implementation

Usage

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants