Skip to content

Commit

Permalink
README
Browse files Browse the repository at this point in the history
  • Loading branch information
albertfgu committed Dec 4, 2023
1 parent 1eeb1c8 commit 6dbfc45
Showing 1 changed file with 7 additions and 1 deletion.
8 changes: 7 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,15 @@
> Albert Gu*, Tri Dao*\
> Paper: https://arxiv.org/abs/2312.00752
## About

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers.
It is based on the line of progress on [structured state space models](https://github.com/state-spaces/s4),
with an efficient hardware-aware design and implementation in the spirit of [FlashAttention](https://github.com/Dao-AILab/flash-attention).

## Installation

- `pip install causal-conv1d`: an efficient implemention of a simple causal Conv1d layer used inside the Mamba block.
- `pip install causal-conv1d`: an efficient implementation of a simple causal Conv1d layer used inside the Mamba block.
- `pip install mamba-ssm`: the core Mamba package.

It can also be built from source with `pip install .` from this repository.
Expand Down

0 comments on commit 6dbfc45

Please sign in to comment.