A web interface for viewing attention layers in language models: load any Hugging Face model (or your own) and adjust the layer and head sliders to view the attention pattern at a particular layer and head. When using a different model, adjust the slider limits for layer and head in app.js, as in the sketch below.
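The actual constants live in app.js; this is only a minimal sketch of the kind of change involved, assuming the bounds are plain constants and that the sliders have the element IDs shown here (the names and IDs are hypothetical):

```js
// Hypothetical slider-limit constants -- set these to match the loaded model.
// GPT-2 small has 12 layers and 12 heads; GPT-2 medium has 24 layers and 16 heads.
const NUM_LAYERS = 12;
const NUM_HEADS = 12;

// Sliders are 0-indexed, so the maximum value is count - 1.
document.getElementById("layer-slider").max = NUM_LAYERS - 1;
document.getElementById("head-slider").max = NUM_HEADS - 1;
```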
Above example: in GPT-2 small, head 2 in layer 6 is an induction head. The token 'y' is selected, and the tokens that follow earlier occurrences of 'y' are highlighted; because the same character follows 'y' everywhere in the prompt, the attention scores are high. Induction heads are hypothesized to be the main driver of in-context learning in large language models.
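Under the hood, the attention pattern for one layer and head is a `[seq_len x seq_len]` matrix whose row `q` says how strongly the query token at position `q` attends to each key position. A minimal sketch of the selection-to-highlight mapping under that assumption (the function name and the `attentions` layout are illustrative, not taken from app.js):

```js
// attentions: nested array [layer][head][query][key] of attention weights.
// Selecting a token in the UI corresponds to picking one query row.
function highlightScores(attentions, layer, head, selectedPos) {
  const row = attentions[layer][head][selectedPos];
  // Normalize so the most-attended token gets full highlight opacity;
  // the || 1 guards against an all-zero row.
  const maxScore = Math.max(...row) || 1;
  return row.map(score => score / maxScore);
}
```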
This project was inspired by Anthropic's *A Mathematical Framework for Transformer Circuits* and *In-context Learning and Induction Heads* posts, and by Neel Nanda's induction mosaic.