OpenMOSS
OpenMOSS shares a collection of our research in large language models. The team is affiliated to the FudanNLP lab.
Popular repositories Loading
-
Say-I-Dont-Know
Say-I-Dont-Know Public[ICML'2024] Can AI Assistants Know What They Don't Know?
-
Language-Model-SAEs
Language-Model-SAEs PublicFor OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
Repositories
Showing 8 of 8 repositories
- Language-Model-SAEs Public
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
OpenMOSS/Language-Model-SAEs’s past year of commit activity - TransformerLens Public Forked from TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
OpenMOSS/TransformerLens’s past year of commit activity