Skip to content
View attentionmech's full-sized avatar

Block or report attentionmech

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 449 41 Updated Dec 30, 2024

A roadmap for "generative AI" learning resources

CSS 158 25 Updated Sep 23, 2024

Fast parallel LLM inference for MLX

Jupyter Notebook 153 5 Updated Jul 7, 2024

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,502 357 Updated Dec 20, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,605 5,866 Updated Aug 24, 2024

The most cited deep learning papers

TeX 25,594 4,478 Updated Jan 18, 2024

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 11,566 3,366 Updated Sep 12, 2024

Sparse autoencoders

Python 394 51 Updated Dec 18, 2024

Implementation of papers in 100 lines of code.

Python 1,368 148 Updated Dec 2, 2024
Jupyter Notebook 394 236 Updated Jan 1, 2025

Training Sparse Autoencoders on Language Models

Jupyter Notebook 553 132 Updated Dec 29, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,694 315 Updated Dec 31, 2024

Mechanistic Interpretability in Transformers: This repository explores advanced techniques like Induction Head Detection and QK Circuit Analysis to uncover the inner workings of transformer-based m…

Jupyter Notebook 18 3 Updated Sep 27, 2024

Sparse Autoencoder for Mechanistic Interpretability

Python 204 40 Updated Jul 20, 2024