This repository includes all the referenced papers, along with the sections covered in the survey titled A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration, and Beyond. If you encounter any issues with the links or notice any errors, please feel free to reach out via email at 21204028@rmd.ac.in
While some papers may be repeated, our objective is to highlight the papers available in each specific sections. For the best experience, we recommend keeping this repository open alongside the topics you are currently reading. If you wish to explore further, this resource can serve as a useful tool.
Version 1 of the paper is out on arxiv (3/10/2024) - Paper
(click to expand)
Figure 1 - General Illustration of Mamba models
Recurrent Networks such as LSTM,RNN GRU
(NIPS 2017) Attention is all you need - [Paper] [Code]
(arXiv:2111.00396) Efficiently Modeling Long Sequences with Structured State Spaces (S4)- [Paper] [Code]
(arXiv:2208.04933) Simplified State Space Layers for Sequence Modeling (S5) - [Paper] [Code]
(arXiv:2008.07669) HiPPO: Recurrent Memory with Optimal Polynomial Projections - [Paper] [Code]
(arXiv:2212.14052) Hungry Hungry Hippos: Towards Language Modeling with State Space Models - [Paper] [Code]
(NIPS 2022) On the Parameterization and Initialization of Diagonal State Space Models (S4D) -[Paper]
(arXiv:2312.00752) Mamba: Linear-Time Sequence Modeling with Selective State Spaces - [Paper] [Code]
(arXiv:2302.10866) Hyena Hierarchy: Towards Larger Convolutional Language Models - [Paper]
Figure 2- Time Line of Mamba architecture
We have mapped the Time Line of Mamba Architectures on Medical as well as on non medical domain starting from December 2023 to September 2024
In this section, the papers are cited according to the survey. The survey provides an overview of these models, beginning with their core architectures. The models are classified into three categories: Pure Mamba which consists solely of Mamba blocks; Variants of Unet which includes models that adapt a U-Net-based architecture; Hybrid architectures which incorporate Mamba blocks with attention mechanisms, convolution, or RNNs.
Figure 4- Illustration of different Mamba architectures such as ViM, VMamba, PlainMamba, EfficientVMamba, Mamba-ND, MHS-VM
(arXiv:2401.09417) Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model (ViM) - [Paper] [Code]
(arXiv:2401.10166) VMamba: Visual State Space Model (VSS) - [Paper] [Code]
(arXiv:2403.17695) PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition - [Paper] [Code]
(arXiv:2403.09977) EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba (EVSS) - [Paper] [Code]
(arXiv:2402.05892) Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data - [Paper] [Code]
(arXiv:2406.05992) MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba - [Paper] [Code]
Figure 3 - Traditional UNet vs Mamba U Net
(arXiv:2402.05079) Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation -[Paper] [Code]
(arXiv:2404.07705) ViM-UNet: Vision Mamba for Biomedical Segmentation - [Paper] [Code]
Figure 5 - Architecture diagram of LKM-UNet
(arXiv:2403.07332) LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
Figure 6 - Architecture diagram of VM-UNET
(arXiv:2402.02491) VM-UNet: Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
Figure 7 - Architecture diagram of VM-UNET-V2
(arXiv:2403.09157) VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
Figure 8 - Architecture diagram of H-vmunet
(arXiv:2403.13642) H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
Figure 9 - Illustration of Bidirectional Scan,Continous 2D Scanning,Local Scan, Cross Scan,Multi head Scan
Figure 10 - Illustration of Selective 2D Scan, Efficient 2D Scan, Omnidirection Selective Scan, 3D Bidirectional Scan
Figure 11 - Illustration of Hierarchical Scan, Zigzag Scan, Spatiotemporal Scan,Multi-Path Scan
Figure 12 - Illustration of Three Directional Scan, BSS Scan, Pixelwise and Patchwise Scan
-
Bidirectional Scan (Forward and backward Scan)
Initally proposed by:
(arXiv:2401.09417) Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model (ViM) - [Paper] [Code]
Other paper which incorporates it:
(arXiv:2403.05146)Motion-Guided Dual-Camera Tracker for Endoscope Tracking and Motion Analysis in a Mechanical Gastric Simulator[Paper]
(arXiv:2406.05786) CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation - [Paper]
(arXiv:2403.05160) MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models - [Paper]
-
Selective Scan 2D
Initally proposed by:
(arXiv:2401.10166) VMamba: Visual State Space Model (VSS) - [Paper] [Code]
Other paper which incorporates it:
(arXiv:2403.03849) MedMamba: Vision Mamba for Medical Image Classification - [Paper] [Code]
(arXiv:2402.10887) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation - [Paper] [Code]
(arXiv:2402.08506v2) P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation - [Paper]
(arXiv:2403.05246) LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2405.05007) HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation - [Paper]
(arXiv:2403.17432)Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion - [Paper]
(arXiv:2403.20035) UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation - [Paper] [Code]
(arXiv:2402.02491) VM-UNet: Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.09157) VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2404.05105) VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module [Paper] [Code]
(arXiv:2402.18451) MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation [Paper] [Code]
(arXiv:2402.07245) Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation - [Paper] [Code]
(arXiv:2402.03302) Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining - [Paper] [Code]
(arXiv:2402.05079) Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.13642) H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2405.05667) VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis [Paper]
-
Continous 2D Scanning
Initally proposed by:
(arXiv:2403.17695) PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition - [Paper] [Code]
-
Spatiotemporal Selective Scan
(arXiv:2401.14168) Vivim: a Video Vision Mamba for Medical Video Segmentation - [Paper] [Code]
-
Local Scanning Initally proposed by:
(arXiv:2403.09338) LocalMamba: Visual State Space Model with Windowed Selective Scan - [Paper] [Code]
Other paper which incorporates it: (arXiv:2404.09476) FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining - [Paper] [Code]
-
Efficient 2D Scan
EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba - [Paper] [Code]
(arXiv:2404.09498) FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba - [Paper] [Code]
-
Multi-Path Scan
(arXiv:2403.19654) RSMamba: Remote Sensing Image Classification with State Space Model - [Paper]
-
Omnidirectional Selective Scan
(arXiv:2403.11423)VmambaIR: Visual State Space Model for Image Restoration - [Paper] [Code]
-
Hierarchical Scan
(ECCV 24) Motion Mamba: Efficient and Long Sequence Motion Generation[Paper] [Code]
-
Multi-Head Scan
(arXiv:2406.05992) MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba - [Paper] [Code]
-
Zigzag Scan (ECCV 2024) ZigMa: A DiT-style Zigzag Mamba Diffusion Model - [Paper] [Code]
-
Other Scanning Mechanisms
TOM(Tri-orientated Spatial Mamba) - Three directional scanning - (arXiv:2401.13560) SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation - [Paper] [Code]
Pixelwise and patchwise Scan - (arXiv:2403.07332) LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
Slice Scan - (arXiv:2407.08481) SliceMamba with Neural Architecture Search for Medical Image Segmentation [Paper]
Figure 13 - Architecture Diagram of Swin-UMamba
(arXiv:2402.03302) Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining - [Paper] [Code]
(arXiv:2402.03526) nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model - [Paper] [Code]
(DOI:10.48550/arXiv.2401.04722) U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation - [Paper] [Code]
(arXiv:2402.05079) Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation - [Paper] [Code]
(arXiv:2405.05007) HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation - [Paper]
(arXiv:2401.13560) SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation - [Paper] [Code]
Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images -[Paper] [Code]
(arXiv:2402.10887) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation - [Paper] [Code]
(arXiv:2408.11289) HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.16536) VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting - [Paper] [Code]
(arXiv:2406.04377) Combining Graph Neural Network and Mamba to Capture Local and Global Tissue Spatial Relationships in Whole Slide Images - [Paper] [Code]
(arXiv:2404.13222) Vim4Path: Self-Supervised Vision Mamba for Histopathology Images - [Paper] [Code]
(arXiv:2403.06800) MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology - [Paper][Code]
(arXiv:2403.05246) LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.20035) UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation - [Paper] [Code]
(arXiv:2405.15925) MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation - [Paper] [Code]
LightCF-Net: A Lightweight Long-Range Context Fusion Network for Real-Time Polyp Segmentation - [Paper]
Balancing High-performance and Lightweight: HL-UNet for 3D Cardiac Medical Image Segmentation - [Paper]
(arXiv:2403.02148) MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection - [Paper] [Code]
(arXiv:2402.10887) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation - [Paper] [Code]
(arXiv:2402.07245) Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation - [Paper] [Code]
(arXiv:2404.13222) Vim4Path: Self-Supervised Vision Mamba for Histopathology Images - [Paper] [Code]
(arXiv:2408.08070) MambaMIM: Pre-training Mamba with State Space Token-interpolation - [Paper] [Code]
(arXiv:2403.16520) CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification - [Paper]
Figure 14 - Architecture Diagram of Fusion Mamba
(arXiv:2404.09498) FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba - [Paper] [Code]
(arXiv:2406.00631) MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging - [Paper]
(arXiv:2407.15719) GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI - [Paper] [Code]
Figure 15 - General Flow of Segmentation
(arXiv:2401.13560) SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.13642) H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.07332) LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2402.05079) Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation - [Paper] [Code]
(arXiv:2402.10887) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.05246) LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation - [Paper] [Code]
(arXiv:2403.20035) UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation - [Paper] [Code]
(arXiv:2404.01065) T-Mamba: A unified framework with Long-Range Dependency in dual-domain for 2D & 3D Tooth Segmentation - [Paper] [Code]
(arXiv:2405.05007) HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation - [Paper]
(arXiv:2402.07245) Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation - [Paper] [Code]
(arXiv:2404.07705) ViM-UNet: Vision Mamba for Biomedical Segmentation - [Paper] [Code]
(arXiv:2401.04722) U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation - [Paper] [Code]
(arXiv:2403.17432) Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion - [Paper]
(arXiv:2405.17496) UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image Segmentation - [Paper] [Code]
(arXiv:2406.05786) CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation - [Paper]
(arXiv:2405.15925) MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation - [Paper] [Code]
Figure 16 - General Flow of Classification
(arXiv:2402.03526) nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model - [Paper] [Code]
(arXiv:2403.03849) MedMamba: Vision Mamba for Medical Image Classification - [Paper] [Code]
(arXiv:2404.13222) Vim4Path: Self-Supervised Vision Mamba for Histopathology Images - [Paper] [Code]
(arXiv:2409.07896) Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters - [Paper] [Code]
Figure 17 - General Flow of Restoration
(arXiv:2402.18451) MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation - [Paper] [Code]
(arXiv:2404.08406) MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion - [Paper]
(arXiv:2402.06378) FD-Vision Mamba for Endoscopic Exposure Correction - [Paper]
(arXiv:2404.09498) FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba - [Paper] [Code]
https://github.com/zzr-idam/FDVM-Net
Figure 18 - General Flow of Registration
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration - [Paper] [Code]
(arXiv:2404.05105) VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module- [Paper] [Code]
(arXiv:2403.13660) ProMamba: Prompt-Mamba for polyp segmentation -[Paper]
(arXiv:2402.08506v2) P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation - [Paper]
(arXiv:2403.05795) ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes - [Paper]
(arXiv:2401.14168) Vivim: a Video Vision Mamba for Medical Video Segmentation - [Paper] [Code]
(arXiv:2403.05146 ) Motion-Guided Dual-Camera Tracker for Endoscope Tracking and Motion Analysis in a Mechanical Gastric Simulator - [Paper]
(arXiv:2405.05667) VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis - [Paper]
(arXiv:2403.08479) MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction - [Paper] [Code]
Dataset | Task Performed | Targeted Area | Paper/Dataset |
---|---|---|---|
BraTS2023 | Segmentation | Brain | [Paper] |
AIIB2023 | Segmentation | Lungs | [Paper] |
CRC-500 datase | Segmentation | Colon, Rectum | [Paper] |
ISIC2017 | Segmentation | Skin | [Download] |
ISIC2018 | Segmentation | Skin | [Paper] |
MICCAI-2023 | Segmentation | Heart | [Paper] |
Spleen | Segmentation | Spleen | [Paper] |
CVC-ClinicDB | Segmentation | Intestine | [Paper] |
3D Abdomen CT dataset | Segmentation | Liver, Kidney, Spleen, etc. | [Paper] |
2D Abdomen MRI dataset | Segmentation | Kidney, Gallbladder, Esophagus, Liver, Pancreas, Adrenal gland, etc | [Paper] |
Endoscopy dataset | Segmentation | Surgical Instruments | [Paper] |
Microscopy dataset | Segmentation | Cell | [Paper] |
Synapse Multi-Organ Abdominal CT Dataset | Segmentation | Pancreas, Spleen , Liver | [Paper] |
𝑃𝐻2 Database | Segmentation | Skin | [Paper] |
LiTS dataset | Segmentation | Liver | [Paper] |
Montgomery and Shenzhen dataset | Segmentation | Lungs | [Paper] |
Brain MRI Multiple Sclerosis Dataset | Segmentation | Brain | [Paper] |
Alzheimer’s Disease Neuroimaging (ADNI) Dataset | Classification | Brain | [Paper] |
Otoscopy | Classification | Ear | [Paper] |
PathMNIST | Classification | Colon Pathology | [Paper] |
Camelyon | Classification | Lymph node | [Paper] |
Colorectal Cancer Histopathology Dataset | Classification | Colon, Rectum | [Paper] |
SR-Reg(SynthRAD Registration) dataset | Registration | Brain | [Paper] |
FastMRI knee dataset | Restoration/Reconstruction | Knee | [Paper] |
Low-Dose CT Image and Projection Datasets | Restoration/Reconstruction | Knee | [Paper] |
As the authors of this paper, we believe that the following are potential future directions for discovering more medical papers:
-
Mamba-2: Mamba-2 reduces the gap between parallel nature of attention in transformers and recurrent nature of SSM. Mamba-2 parallelizes the scanning process by constructing a 1SS-Matrix on the parameter A, represented by a lower triangular matrix.
-
xLSTM: The emergence of recurrent architectures like xLSTM, an extension of LSTM, has also gained popularity due to the emergence of Mamba. In certain smaller studies, xLSTM has shown better performance compared to Mamba in specific cases
Figure 19 - Architecture diagram of Mamba v2 compared to Mamba v1
(arXiv:2405.21060) Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality - [Paper] [Code]
(arXiv:2405.04517 ) xLSTM: Extended Long Short-Term Memory - [Paper] [Code]
Vision-LSTM: xLSTM as Generic Vision Backbone - [Paper] [Code]
Papers of xLSTM in Medical Domain
(arXiv:2406.16993) Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? - [Paper] [Code]
(arXiv:2407.01530) xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart - [Paper] [Code]
If you found this repo / paper, cite them as given below. Thank you
@misc{bansal2024comprehensivesurveymambaarchitectures,
title={A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond},
author={Shubhi Bansal and Sreeharish A and Madhava Prasath J and Manikandan S and Sreekanth Madisetty and Mohammad Zia Ur Rehman and Chandravardhan Singh Raghaw and Gaurav Duggal and Nagendra Kumar},
year={2024},
eprint={2410.02362},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2410.02362},
}