Generalized Optimal Transport Attention with Trainable Priors (GOAT), available as a PyTorch multi-head attention module.
Install name: `goat-attention` (PyPI) · Import name: `goat`
- From PyPI (recommended): `uv add goat-attention`
- pip: `pip install goat-attention`
- From source (editable): `uv pip install -e .`
- From source (editable, pip): `pip install -e .`

Quickstart:

```python
import torch
from goat import GoatAttention
B, L, S, E, H = 2, 5, 7, 64, 8  # batch, query length, key/value length, embed dim, heads
xq = torch.randn(B, L, E)
xk = torch.randn(B, S, E)
xv = torch.randn(B, S, E)
attn = GoatAttention(
    embed_dim=E,
    num_heads=H,
    batch_first=True,
    pos_rank=2,
    abs_rank=4,
    enable_key_bias=True,
)
out, weights = attn(xq, xk, xv, is_causal=False, need_weights=True)
print(out.shape, None if weights is None else weights.shape)
```
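Continuing from the quickstart above, a minimal sketch of causal self-attention (query, key, and value all taken from the same tensor). It uses only the arguments already shown in the quickstart call; how `is_causal=True` interacts with `need_weights=True` is not documented here, so the sketch requests no weights.

```python
# Causal self-attention: reuse the `attn` module and shapes from the quickstart.
x = torch.randn(B, L, E)  # (batch, sequence length, embed dim) with batch_first=True
out, _ = attn(x, x, x, is_causal=True, need_weights=False)
print(out.shape)  # expected (B, L, E)
```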
GOAT uses spectral (Fourier) features to model positional relationships. The two main hyperparameters are:

| Parameter | Description | Typical Range |
|---|---|---|
| `pos_rank` | Number of Fourier frequencies for relative position encoding. Controls how finely the model can distinguish between different relative distances; higher values give more expressive distance modeling. | 2–16 |
| `abs_rank` | Number of Fourier frequencies for absolute position encoding, used by the learned "sink" bias u(j). Controls position-dependent attention patterns (e.g., attending to the sequence start). | 2–8 |
Guidelines:

- For language models (GPT-style): `pos_rank=4, abs_rank=4` is a good starting point.
- For vision transformers (ViT): `pos_rank=16, abs_rank=2` works well with 2D positional encoding.
- For long-context tasks: higher `pos_rank` may help capture fine-grained positional patterns.
- Set `enable_key_bias=False` to disable the sink term entirely (removes the `abs_rank` dependency); see the sketch after this list.
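The presets above, written out as a sketch. It uses only the constructor arguments shown in the quickstart (`embed_dim`, `num_heads`, `batch_first`, `pos_rank`, `abs_rank`, `enable_key_bias`); the values simply mirror the guidelines, not library defaults.

```python
from goat import GoatAttention

E, H = 64, 8  # embed dim and head count, as in the quickstart

# Language model (GPT-style) starting point.
lm_attn = GoatAttention(embed_dim=E, num_heads=H, batch_first=True,
                        pos_rank=4, abs_rank=4)

# Vision transformer (ViT) setting, paired with 2D positional encoding.
vit_attn = GoatAttention(embed_dim=E, num_heads=H, batch_first=True,
                         pos_rank=16, abs_rank=2)

# Sink term disabled; abs_rank is not used in this configuration.
no_sink = GoatAttention(embed_dim=E, num_heads=H, batch_first=True,
                        pos_rank=4, abs_rank=4, enable_key_bias=False)
```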
After installation:

```
goat info
goat smoke
```

See `docs/`.
To set up a development environment and run the tests:

```
uv pip install -e ".[dev]"
pytest
```

License: MIT (see LICENSE).
If you find GOAT useful, please cite:
```bibtex
@misc{goat,
  title         = {You Need Better Attention Priors},
  author        = {Elon Litman and Gabe Guo},
  year          = {2026},
  eprint        = {2601.15380},
  archivePrefix = {arXiv},
  primaryClass  = {cs.LG}
}
```