A Survey on Visual Mamba

Authors: Hanwei Zhang, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Ziyang Wang and Zi Ye.

A curated list of awesome Mamba for Computer Vision, inspired by the other awesome-initiatives. We intend to regularly update the relevant latest papers and their open-source implementations on this page.

We strongly encourage the researchers that want to promote their fantastic work to the community to make pull request, remind us on issue, or contact with email to update their paper's information!

Citation

If you find the listing and survey useful for your work, please cite the paper:

Zhang, Hanwei, et al. "A Survey on Visual Mamba." Applied Sciences 13.14 (2024): 5683.

@article{zhang2024survey,
  title={A Survey on Visual Mamba},
  author={Zhang, Hanwei and Zhu, Ying and Wang, Dan and Zhang, Lijun and Chen, Tianxiang and Wang, Ziyang and Ye, Zi},
  journal={Applied Sciences},
  volume={14},
  number={13},
  pages={5683},
  year={2024},
  publisher={MDPI}
}

Overview

Survey Papers
Mamba Backbone
Image Classification
Object Detection
Image Segmentation
Video Classification
Video Understanding
Multi-Modal Understanding
Video Prediction
Image Registration
Image Super-Resolution
Image Restoration
Image Dehazing
Image Derain
Image Deblurring
Visual Generation
Point Cloud
Depth Estimation
3D Reconstruction
Video Generation
Others

Survey Papers

A Survey on Visual Mamba. [24th April., 2024].
Zhang, Hanwei, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Ziyang Wang, and Zi Ye.
[PDF]

A survey on vision mamba: Models, applications and challenges. [29th April., 2024].
Xu, Rui, Shu Yang, Yihui Wang, Bo Du, and Hao Chen.
[PDF]

Vision Mamba: A Comprehensive Survey and Taxonomy [7th May., 2024].
Liu, Xiao, Chenxu Zhang, and Lei Zhang.
[PDF]

Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study. [14th May., 2024].
Zhu, Qinfeng, Yuan Fang, Yuanzhi Cai, Cheng Chen, and Lei Fan.
[PDF]

Mamba-360: Survey of state space models as transformer alternative for long sequence modelling: Methods, applications, and challenges. [24th April., 2024].
Patro, Badri Narayana, and Vijay Srinivas Agneeswaran.
[PDF]

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis. [5th Jun., 2024].
Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu.
[PDF]

Mamba Backbone

Vision mamba: Efficient visual representation learning with bidirectional state space mode. [17th Jan., 2024].
Zhu, Lianghui, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, and Xinggang Wang.
[PDF]

Vmamba: Visual state space model. [18th Jan., 2024].
Liu, Yue, Yunjie Tian, Yuzhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang, Qixiang Ye, and Yunfan Liu.
[PDF]

Plainmamba: Improving non-hierarchical mamba in visual recognition. [26th Mar., 2024].
Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley.
[PDF]

Localmamba: Visual state space model with windowed selective scan. [14th Mar., 2024].
Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu.
[PDF]

Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data. [8th Feb., 2024].
Shufan Li, Harkanwar Singh, Aditya Grover.
[PDF]

SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series. [22nd Mar., 2024].
Badri N. Patro, Vijay S. Agneeswaran.
[PDF]

Scalable Visual State Space Model with Fractal Scanning. [23rd Mar., 2024].
Lv Tang, HaoKe Xiao, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li.
[PDF]

Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model. [23rd Mar., 2024].
Yuheng Shi, Minjing Dong, Chang Xu.
[PDF], [Code]

Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model. [28th May., 2024].
Wenbing Li, Hang Zhou, Zikai Song, Wei Yang.
[PDF]

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality. [31st May., 2024].
Tri Dao, Albert Gu.
[PDF]

Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain. [29th May., 2024].
Juntao Zhang, Kun Bian, Peng Cheng, Wenbo An, Jianning Liu, Jun Zhou.
[PDF], [Code]

Autoregressive Pretraining with Mamba in Vision. [11th Jan., 2024].
Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan Yuille, Cihang Xie.
[PDF], [Code]

Image Classification

Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. [24th Feb., 2024].
Chen, Chi-Sheng, Guan-Ying Chen, Dong Zhou, Di Jiang, and Dai-Shi Chen.
[PDF]

Medmamba: Vision mamba for medical image classification. [6th Mar., 2024].
Yue, Yubiao, and Zhenzhang Li.
[PDF]

CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification. [25th Mar., 2024].
Guangqian Yang, Kangrui Du, Zhihan Yang, Ye Du, Yongping Zheng, Shujun Wang.
[PDF]

DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification. [11th Jan., 2024].
Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan.
[PDF]

Object Detection

State Space Models for Event Cameras. [23th Feb., 2024].
Nikola Zubić, Mathias Gehrig, Davide Scaramuzza.
[PDF]

MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection. [3rd Apr., 2024].
Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Jieping Ye, Nenghai Yu.
[PDF] [Code]

CDMamba: Remote Sensing Image Change Detection with Mamba. [6th JUn., 2024].
Haotian Zhang, Keyan Chen, Chenyang Liu, Hao Chen, Zhengxia Zou, Zhenwei Shi.
[PDF] [Code]

Image Segmentation

ReMamber: Referring Image Segmentation with Mamba Twister. [26th Mar., 2024].
Yuhuan Yang, Chaofan Ma, Jiangchao Yao, Zhun Zhong, Ya Zhang, Yanfeng Wang.
[PDF]

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation. [7th Feb., 2024].
Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li.
[PDF]

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation. [5th Apr., 2024].
Zifu Wan, Yuhao Wang, Silong Yong, Pingping Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie.
[PDF]

Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model. [11th Apr., 2024].
Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen.
[PDF] [Code]

nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model. [5th Feb., 2024].
Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li.
[PDF] [Code]

T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation. [1st Apr., 2024].
Jing Hao, Lei He, Kuo Feng Hung.
[PDF] [Code]

Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention. [12th Mar., 2024].
Jinhong Wang, Jintai Chen, Danny Chen, Jian Wu.
[PDF] [Code]

RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation. [3rd Apr., 2024].
Xianping Ma, Xiaokang Zhang, Man-On Pun.
[PDF] [Code]

VM-UNet: Vision Mamba UNet for Medical Image Segmentation. [4th Feb., 2024].
Jiacheng Ruan, Suncheng Xiang.
[PDF] [Code]

UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image Segmentation. [25th May., 2024].
Ting Yu Tsai, Li Lin, Shu Hu, Ming-Ching, Hongtu Zhu, Xin Wang.
[PDF]

MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba. [10th Jan., 2024].
Zhongping Ji.
[PDF]

Segmentation in X-ray Fluoroscopy Utilizing Virtual Simulations of Cardiovascular Procedures. [2024].
Andersson Rasmus, Ekerstedt Martin.
[PDF]

Rotate to scan: Unet-like mamba with triplet ssm module for medical image segmentation. [2024].
Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu.
[PDF]

Video Classification

Long Movie Clip Classification with State-Space Video Models. [14th Nov., 2022].
Islam, Md Mohaiminul, and Gedas Bertasius.
[PDF]

Video Understanding

VideoMamba: State Space Model for Efficient Video Understanding. [11th Mar., 2024].
Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao.
[PDF]

SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding. [1st Apr., 2024].
Wenrui Li, Xiaopeng Hong, Xiaopeng Fan.
[PDF]

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. [14th Mar., 2024].
Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang.
[PDF]

Image Registration

MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration. [25th Jan., 2024].
Tao Guo, Yinuo Wang, Shihao Shu, Diansheng Chen, Zhouping Tang, Cai Meng, Xiangzhi Bai.
[PDF] [Code]

VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module. [7th Apr., 2024].
Ziyang Wang, Jian-Qing Zheng, Chao Ma, Tao Guo.
[PDF]

Multi-Modal Understanding

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. [21th Mar., 2024].
Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang.
[PDF]

Video Prediction

VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting. [25th Mar., 2024].
Yujin Tang, Peijie Dong, Zhenheng Tang, Xiaowen Chu, Junwei Liang.
[PDF]

Image Super-Resolution

Activating Wider Areas in Image Super-Resolution. [13th Mar., 2024].
Cheng Cheng, Hang Wang, Hongbin Sun.
[PDF]

Image Restoration

MambaIR: A Simple Baseline for Image Restoration with State-Space Model. [23th Feb., 2024].
Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia.
[PDF]

Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models. [26th Mar., 2024].
Mohammad Shahab Sepehri, Zalan Fabian, Mahdi Soltanolkotabi.
[PDF]

MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space. [25th May., 2024].
Jiangwei Weng, Zhiqiang Yan, Ying Tai, Jianjun Qian, Jian Yang, Jun Li.
[PDF], [Code]

LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network. [3rd Jun., 2024].
Xuanqi Zhang, Haijin Zeng, Jinwang Pan, Qiangqiang Shen, Yongyong Chen.
[PDF]

Image Dehazing

U-shaped Vision Mamba for Single Image Dehazing. [6th Feb., 2024].
Zhuoran Zheng, Chen Wu.
[PDF]

Image Derain

FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining. [15th Apr., 2024].
Zou Zhen, Yu Hu, Zhao Feng.
[PDF]

FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining. [29th May., 2024].
Dong Li, Yidi Liu, Xueyang Fu, Senyan Xu, Zheng-Jun Zha.
[PDF]

Image Deblurring

Learning Enriched Features via Selective State Spaces Model for Efficient Image Deblurring. [29th Mar., 2024].
Hu Gao, Depeng Dang.
[PDF]

Efficient Visual State Space Model for Image Deblurring. [23rd May., 2024].
Lingshun Kong, Jiangxin Dong, Ming-Hsuan Yang, Jinshan Pan.
[PDF]

Visual Generation

MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models. [14th Mar., 2024].
Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li.
[PDF]

Scalable Diffusion Models with State Space Backbone. [8th Feb., 2024].
Zhengcong Fei, Mingyuan Fan, Changqian Yu, Junshi Huang.
[PDF]

ZigMa: A DiT-style Zigzag Mamba Diffusion Model. [20th Mar., 2024].
Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Fischer, Björn Ommer.
[PDF]

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM. [12th Mar., 2024].
Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang.
[PDF]

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling. [22nd May., 2024].
Omer F. Atli, Bilal Kabas, Fuat Arslan, Mahmut Yurt, Onat Dalmaz, Tolga Çukur.
[PDF]

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis. [23rd May., 2024].
Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu.
[PDF], [Code]

Soft Masked Mamba Diffusion Model for CT to MRI Conversion. [22nd Jun., 2024].
Zhenbin Wang, Lei Zhang, Lituan Wang, Zhenwei Zhang.
[PDF], [Code]

Point Cloud

Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy. [11th Mar., 2024].
Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang.
[PDF]

3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion. [10th Apr., 2024].
Yixuan Li, Weidong Yang, Ben Fei.
[PDF]

3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering. [8th Apr., 2024].
Qingyuan Zhou, Weidong Yang, Ben Fei, Jingyi Xu, Rui Zhang, Keyi Liu, Yeqi Luo, Ying He.
[PDF]

Point Cloud Mamba: Point Cloud Learning via State Space Model. [1th Mar., 2024].
Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan.
[PDF]

MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Model. [23rd May., 2024].
Jiuming Liu, Jinru Han, Lihao Liu, Angelica I. Aviles-Rivero, Chaokang Jiang, Zhe Liu, Hesheng Wang.
[PDF]

Depth Estimation

MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation. [6th Jun., 2024].
Ionuţ Grigore, Călin-Adrian Popa.
[PDF]

3D Reconstruction

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction. [27th Mar., 2024].
Qiuhong Shen, Xuanyu Yi, Zike Wu, Pan Zhou, Hanwang Zhang, Shuicheng Yan, Xinchao Wang.
[PDF]

MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion. [27th Jun., 2024].
Jing Zou, Lanqing Liu, Qi Chen, Shujun Wang, Xiaohan Xing, Jing Qin.
[PDF]

Video Generation

SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces. [12th Mar., 2024].
Yuta Oshima, Shohei Taniguchi, Masahiro Suzuki, Yutaka Matsuo.
[PDF]

Others

Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces. [1st Feb., 2024].
Chloe Wang, Oleksii Tsepa, Jun Ma, Bo Wang.
[PDF], [Code]

HeteGraph-Mamba: Heterogeneous Graph Learning via Selective State Space Model. [22nd May., 2024].
Zhenyu Pan, Yoonsung Jeong, Xiaoda Liu, Han Liu.
[PDF]

Mamba-R: Vision Mamba ALSO Needs Registers. [23rd May., 2024].
Feng Wang, Jiahao Wang, Sucheng Ren, Guoyizhe Wei, Jieru Mei, Wei Shao, Yuyin Zhou, Alan Yuille, Cihang Xie.
[PDF]

MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging. [2nd Jun., 2024].
Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin.
[PDF]

Zamba: A Compact 7B SSM Hybrid Model. [26th May., 2024].
Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington, Jonathan Pilault, Adam Ibrahim, Beren Millidge.
[PDF]

MambaLRP: Explaining Selective State Space Sequence Models. [11th Jun., 2024].
Farnoush Rezaei Jafari, Grégoire Montavon, Klaus-Robert Müller, Oliver Eberle.
[PDF]

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
num.png		num.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Survey on Visual Mamba

Citation

Overview

Survey Papers

Mamba Backbone

Image Classification

Object Detection

Image Segmentation

Video Classification

Video Understanding

Image Registration

Multi-Modal Understanding

Video Prediction

Image Super-Resolution

Image Restoration

Image Dehazing

Image Derain

Image Deblurring

Visual Generation

Point Cloud

Depth Estimation

3D Reconstruction

Video Generation

Others

About

Releases

Packages

License

ziyangwang007/Awesome-Visual-Mamba

Folders and files

Latest commit

History

Repository files navigation

A Survey on Visual Mamba

Citation

Overview

Survey Papers

Mamba Backbone

Image Classification

Object Detection

Image Segmentation

Video Classification

Video Understanding

Image Registration

Multi-Modal Understanding

Video Prediction

Image Super-Resolution

Image Restoration

Image Dehazing

Image Derain

Image Deblurring

Visual Generation

Point Cloud

Depth Estimation

3D Reconstruction

Video Generation

Others

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages