-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Pull requests: state-spaces/mamba
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix custom fwd and bwd for older PyTorch versions
#608
opened Oct 26, 2024 by
KokeCacao
Loading…
updated Oct 26, 2024
[Feature] Support variable-length sequences for mamba block
#244
opened Mar 14, 2024 by
zigzagcai
Loading…
updated Oct 25, 2024
feat: Initial state support for Mamba SSM (1)
#488
opened Jul 24, 2024 by
mzusman
Loading…
updated Sep 3, 2024
Feat: Add the support for non-learnable RMS norm for large-scale training in
mamba_inner_fn
#543
opened Aug 27, 2024 by
younesbelkada
Loading…
updated Aug 29, 2024
Clarifying no build isolation instructions
#542
opened Aug 26, 2024 by
amoskvic
Loading…
updated Aug 26, 2024
Fix Incorrect Gradients and Illegal Memory Access Error in Mamba2
#537
opened Aug 24, 2024 by
Hprairie
Loading…
updated Aug 25, 2024
Change interface to selective_state_update for continuous batching
#521
opened Aug 12, 2024 by
tlrmchlsmth
Loading…
updated Aug 12, 2024
Support variable-length sequences for mamba block with position indices
#434
opened Jul 1, 2024 by
ptxu78
Loading…
updated Jul 25, 2024
Better HF integration for
MambaLMHeadModel
#471
opened Jul 16, 2024 by
Wauplin
Loading…
updated Jul 16, 2024
Implement bi-directionality
#52
opened Dec 13, 2023 by
yair-schiff
Loading…
updated Jul 15, 2024
2 tasks done
Fixes bug in SelectiveScanFn.forward for when B is not variable and last_state is returned
#371
opened Jun 7, 2024 by
vidavakil
Loading…
updated Jun 7, 2024
Allow the model to be trained with most frameworks.
#188
opened Feb 23, 2024 by
deroholic
Loading…
updated Jun 3, 2024
Add inputs_embeds as alternative for input_ids
#158
opened Feb 3, 2024 by
Maykeye
Loading…
updated Jun 3, 2024
Error due to missing to_json_string method in MambaConfig class
#109
opened Jan 16, 2024 by
teticio
Loading…
updated Jun 3, 2024
Add support for left padding and masking in forward() and generate()
#70
opened Dec 20, 2023 by
normster
Loading…
updated Jun 3, 2024
Subclass from transformers for PEFT support and overall wider adoption
#227
opened Mar 7, 2024 by
markrogersjr
Loading…
updated Jun 3, 2024
ProTip!
Updated in the last three days: updated:>2024-10-23.