This is a replication of the Anthropic paper "Toy Models of Superposition" by Elhage et al. (2022). Best viewed on Colab:
Here's my plot:

Here's Anthropic's plot:

Here's my plot:

Here's Anthropic's plot:

To do:
- (Tentative) Replicate the sections on feature geometry and computation in superposition.