Submitted:
14 May 2026
Posted:
14 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Why a Mamba Survey for TSA?
1.2. Challenges Targeted by This Survey
full,
partial,
none.
full,
partial,
none.| Survey | Year | Mamba | TSA | Persp. | Axes | Taxon. | Resource | Guide. |
|---|---|---|---|---|---|---|---|---|
| DL [12] | 2022 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Transformers [103] | 2023 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Foundation TS [53] | 2024 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Mamba-360 [72] | 2024 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Mamba Survey [84] | 2024 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Vision Mamba [57] | 2025 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| S4 to Mamba [92] | 2025 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Channel [82] | 2025 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| DiffusionTS [117] | 2026 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| This Survey | 2026 | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
1.3. Contributions
- Unified design vocabulary. Five orthogonal axes (tokenization, channel strategy, directional scan, hybridization, decomposition) and three architectural patterns, each with a mechanism diagram and design-comparison table.
- Per-task coverage at full TSA scope. For each of the five tasks we survey principal designs and provide a comparison table; cross-task synthesis appears in subsection 7.1.
- Practical guidelines (section 7): a model-selection decision tree, training recipes, and Mamba-specific pitfalls in numerics and evaluation.
- Open research frontiers (section 8): twelve frontiers spanning gain attribution, modeling regimes, evaluation, deployment, and cross-task unification.
- Reproducibility resources (Appendix A): benchmarks, datasets, metrics, and a public-implementation registry.
1.4. Differences from Existing Surveys
1.5. Paper Organization
2. Background and Categorization
2.1. The Time Series Analysis Problem Family
2.2. Mamba vs. Other Sequence Backbones



| Axis | Choices | Representative methods |
|---|---|---|
| Tokenization | pointwise / patch / channel-as-token | MambaTS [14] / SiMBA-TS [73] / S-Mamba [101] |
| Channel strategy | CI / CD / CC / dual-mixer | DTMamba [109] / iTransformer-style / CMamba [121] / MambaMixer [11] |
| Directional scan | forward / bidir / multi-scale / 2D / walk-based | MambaTS / Bi-Mamba+ [52] / ms-Mamba [41] / Chimera [10] / SpoT-Mamba [21] |
| Hybridization | none / +attn / +MLP / +FFT / +CNN / +diff / +decomp | MAT [127]/MambaMixer/SiMBA [71]/ CMDMamba [81] / MambaDiffTS [100]/KARMA [118] |
| Decomposition | none / trend-seasonal / multi-scale / Fourier | Bi-Mamba+ / KARMA / TimeMachine [2] / Affirm [108] |
2.3. From HiPPO to S4 and Mamba
2.4. Design Axes
2.5. Survey Backbone and Master Taxonomy
3. Model Perspective

3.1. Pure Mamba Backbones
3.2. Bidirectional and Multi-Directional Scans


3.3. Hybrid Architectures

4. Task Perspective
4.1. Forecasting
| Method | Tokenization | Channel | Direction | Hybridization | Decomposition |
|---|---|---|---|---|---|
| Pure selective backbones | |||||
| S-Mamba [101] | ch.-token | CD | forward | – | – |
| MambaTS [14] | pointwise | CC | forward (VAST) | – | – |
| DTMamba [109] | patch | CI | forward (dual) | – | – |
| TimeMachine [2] | patch | CI | multi-scale | – | multi-scale |
| SiMBA-TS [73] | patch | CD | forward | – | – |
| CMamba [121] | patch | CC | forward | – | – |
| CMMamba [48] | patch | CC | bidir | – | – |
| Mamba+UQ [77] | patch | CI | forward | – | – |
| PowerMamba [63] | ch.-token | CC | forward (dual) | – | – |
| SpoT-Mamba [21] | pointwise | CC | walk-based | – | – |
| RLMamba [98] | patch | CI | forward | – | – |
| Bidirectional and multi-directional scans | |||||
| Bi-Mamba+ [52] | patch | dual-mixer | bidir | – | – |
| CIBGM [47] | patch | CI | bidir (gated) | – | – |
| ms-Mamba [41] | patch | CD | multi-scale | +MLP (fusion) | – |
| Chimera [10] | patch (2D) | CD | 2D | – | – |
| BiG-Mamba [131] | ch.-token | CC | bidir | +attn (graph) | – |
| Graph-Mamba [62] | ch.-token | CC | bidir | +CNN (graph) | – |
| DMSTCI-BiMamba [56] | patch | CD | bidir + multi-scale | – | multi-scale |
| Hybrid architectures | |||||
| MAT [127] | patch | CD | – | +attn (Transformer) | – |
| SST [113] | patch | CD | – | +attn (MoE) | – |
| FMamba [61] | ch.-token | CD | – | +attn (fast) | – |
| MambaFormer [70] | patch | CI | – | +attn (interleave) | – |
| HyMaTE [65] | pointwise | CC | – | +attn (channel) | – |
| ST-MambaSync [88] | patch | CC | – | +attn (ST) | – |
| AttMamba [111] | patch | CD | – | +attn (adaptive) | – |
| DIMformer [130] | ch.-token | CD | – | +attn (linear) | – |
| SiMBA [71] | patch | CD | – | +FFT (EinFFT) | – |
| MambaMixer [11] | patch | CC | – | +MLP (mixer) | – |
| KARMA [118] | patch | CD | – | +MLP + decomp. | trend-seasonal (STL) |
| Affirm [108] | patch | CD | – | +FFT (Fourier filt.) | Fourier |
| CMDMamba [81] | patch | CD | – | +CNN (dual conv.) | – |
| UmambaTSF [106] | patch | CI | – | +CNN (U-Net) | – |
| Mamba + Diffusion (denoiser) | |||||
| MambaDiffTS [100] | patch | CD | – | +diff (DDPM) | Fourier |
| TIMBA [91] | pointwise | CD | – | +diff (CSDI-style) | – |
| DiffImp [27] | pointwise | CC | – | +diff (DDPM) | – |
4.2. Anomaly Detection
| Method | Detection | Token. | Direction | Hybrid. |
|---|---|---|---|---|
| MAAT [87] | R | pointwise | forward | sparse attn |
| MambaAD-IoT [80] | R | patch | bi-direction | – |
| MambaTAD [112] | C | patch | bi-direction | view-discrep. |
| KambaAD [19] | R | patch | forward | KAN + attn |
4.3. Imputation
| Method | Output | Token. | Direction | Hybrid. |
|---|---|---|---|---|
| SSD-TS [27] | P | patch | bi-direction | diffusion |
| TIMBA [91] | P | patch | bi-direction | diff. + GNN |
| MAC2STI [26] | D | graph-node | bi-direction | cluster-aware |
| SCMDI [96] | P | patch | bi-direction | diff. + attn. |
| RefiDiff [4] | P | pointwise | bi-direction | local-ML + diff. |
4.4. Classification
| Method | Token. | Channel | Direction | Hybrid. |
|---|---|---|---|---|
| Generic UCR / UEA | ||||
| TSCMamba [3] | multi-scale | CD | multi-dir. | wavelet |
| FAIM [123] | patch | CD | bi-direction | Fourier |
| Activity / Wearable HAR | ||||
| HARMamba [50] | patch | CI | bi-direction | – |
| RadMamba [107] | Doppler-aligned | CD | bi-direction | – |
| Mamba-Sleep [54] | pointwise | CI | bi-direction | – |
| Physiological (EEG / ECG / Sleep) | ||||
| EEGMamba [33] | patch | adaptive | bi-direction | MoE |
| FEMBA [95] | patch | CD | bi-direction | conv |
| [36] | patch | adaptive | bi-direction | Mamba-2 |
| MSSC-BiMamba [122] | patch | CC | bi-direction | ECA |
| ECGMamba [78] | patch | CD | bi-direction | conv |
| ECG [124] | patch | CC | bi-direction | spatial |
| 1D-CNN-ECG-Mamba [39] | patch | CD | bi-direction | conv |
| MambaCapsule [114] | patch | CD | forward | capsule |
| BiT-MamSleep [129] | patch | CC | bi-direction | TRCNN |
| NeuroNet [46] | patch | CI | bi-direction | SSL |
| Mentality [69] | patch | CD | bi-direction | SSL |
| SAMBA-EEG [37] | patch | adaptive | bi-direction | diff-Mamba |
4.5. Unified Multi-Task Analytics
| Method | Tasks | Token. | Channel | Hybrid. |
|---|---|---|---|---|
| Chimera [10] | F, AD, Cls | 2D grid | adaptive | – |
| MambaMixer [11] | F, vision | patch | CD | MLP |
| SiMBA [71] | F, vision | patch | CC | EinFFT |
| DeMa [6] | F, AD, Cls, Imp | ch.-token | CD | delay-aware attn |
| EHRMamba [25] | EHR multi-task | event-token | CD | prompted FT |
| ss-Mamba [119] | F (foundation) | semantic+spline | CD | KAN + PLM |
5. Data Perspective
5.1. Univariate Time Series
5.2. Multivariate Time Series
5.3. Spatio-Temporal Graphs and Trajectories
5.4. Irregular and Long-Context Data
6. Application Perspective
6.1. Healthcare and Clinical Monitoring
6.2. Energy and Electricity
6.3. Traffic and Spatio-Temporal Mobility
6.4. Climate and Weather
6.5. Finance
6.6. Activity Recognition and Sensors
6.7. Cross-Domain and Foundation-Scale Deployments
7. Practical Guidelines
7.1. Cross-Task Design-Axis Matrix
| Design Axis | Forecasting | Anomaly Det. | Imputation | Classification | Multi-Task |
|---|---|---|---|---|---|
| Tokenization | patch / channel-token | patch | patch | patch / window | 2D patch grid |
| Channel strategy | CI or CD (contested) | CD (multi-channel) | CD (graph / full) | CI / CD by modality | CD (token+channel) |
| Directional scan | forward or bidir. | bidirectional | bidirectional | bidirectional | 2D / dual-axis |
| Hybridization | attn. / MLP / FFT | + attn. (discrepancy) | + diffusion | + spectral / SSL | none-or-mild |
| Decomposition | trend-seas. / wavelet | none | none | none / spectral | none |
| Modal arch. pattern | pure / bidir / hybrid | hybrid | hybrid (diffusion) | bi-directional | 2D / token+channel |
7.2. Choosing a Right Mamba Variant
7.3. Configuration and Training Recipes
7.4. Mamba-Specific Pitfalls
8. Open Frontiers and Future Directions
8.1. Attributing Gains to Selectivity
8.2. Input-Dependent Channel Selectivity
8.3. Native Irregular-Time Modeling
8.4. Probabilistic Filtering Decoders
8.5. Test-Time Length Generalization
8.6. Three-Factor Gain Attribution
8.7. Benchmark Saturation and the Hybrid Wall
8.8. Foundation-Scale Pretraining and Transfer
8.9. One Backbone for All Five Tasks
8.10. Expressivity and Identifiable Dynamics
8.11. Compression and Edge Deployment
8.12. Post-Mamba and Multimodal Fusion
9. Conclusions
Declaration of Generative AI use
Appendix A Resources
Appendix A.1. Published Implementations
| Method | Year | Design | Task | Domain | Repository |
|---|---|---|---|---|---|
| PureMamba(subsection 3.1) | |||||
| DeMa [6] | 2026 | dual-path delay-aware | Multi-Task | General | – (announced) |
| Mamba+UQ [77] | 2025 | patch + UQ head | Forecast | General | https://github.com/PengchengWeifr/Mamba_TSF_UQ |
| PowerMamba [63] | 2025 | dual-Mamba forward | Forecast | Energy | https://github.com/alimenati/PowerMamba |
| MAC2STI [26] | 2025 | cluster-aware S6 | Imputation | Traffic / ST | – (announced) |
| RadMamba [107] | 2025 | Doppler patch + Mamba | Class. | Radar / HAR | – (announced) |
| RLMamba [98] | 2025 | residual-learning stack | Forecast | General | – (announced) |
| SAMBA-EEG [37] | 2025 | differential Mamba | Class. | Healthcare/EEG | – (announced) |
| S-Mamba [101] | 2024 | channel-token forward | Forecast | General | https://github.com/wzhwzhwzh0921/S-D-Mamba |
| MambaTS [14] | 2024 | pointwise VAST scan | Forecast | General | https://github.com/XiudingCai/MambaTS-pytorch |
| DTMamba [109] | 2024 | patch dual-twin scan | Forecast | General | https://github.com/lizyelon/DTMamba |
| TimeMachine [2] | 2024 | 4-branch multi-rate | Forecast | General | https://github.com/Atik-Ahamed/TimeMachine |
| SiMBA-TS [73] | 2024 | patch + EinFFT mixer | Forecast | General | https://github.com/badripatro/Simba |
| CMamba [121] | 2024 | patch + GDD ch. mixer | Forecast | General | https://github.com/zclzcl0223/CMamba |
| CMMamba [48] | 2024 | bidir. + Top-K ch. mix | Forecast | General | – (announced) |
| MambaStock [89] | 2024 | lightweight forward scan | Forecast | Finance | https://github.com/zshicode/MambaStock |
| DGMamba [60] | 2024 | domain-gen. objective | Forecast | Domain Gen. | https://github.com/longshaocong/DGMamba |
| Mamba4Cast [13] | 2024 | Mamba-2 zero-shot | Forecast | Zero-shot | https://github.com/automl/mamba4cast |
| Mentality [69] | 2024 | Mamba SSL foundation | Class. | Healthcare/EEG | – (announced) |
| EHRMamba [25] | 2024 | EHR foundation | Multi-Task | Healthcare/EHR | – (announced) |
| SpaceTime [126] | 2023 | pre-Mamba S4 backbone | Forecast | Foundation | https://github.com/HazyResearch/spacetime |
| Bidirectional and multi-directional scans (subsection 3.2) | |||||
| BiG-Mamba [131] | 2025 | graph + bidir. scan | Forecast | Traffic / ST | – (announced) |
| DMSTCI-BiMamba [56] | 2025 | decomp. multi-scale bidir. | Forecast | General | – (announced) |
| EEG-M2 [36] | 2025 | U-shape Mamba-2 SSL | Class. | Healthcare/EEG | – (announced) |
| FEMBA [95] | 2025 | Bi-Mamba SSL pretrain | Class. | Healthcare/EEG | – (announced) |
| HSTM [116] | 2025 | spatial+temporal scans | Forecast | Finance | – (announced) |
| MambaAD-IoT [80] | 2025 | dual Bi-Mamba branches | Anomaly | IoT | – (announced) |
| MambaTAD [112] | 2025 | contrastive view-discrep. | Anomaly | General | – (announced) |
| ms-Mamba [41] | 2025 | multi-scale parallel | Forecast | General | https://github.com/airin/ms-Mamba |
| S2M2ECG [124] | 2025 | multi-branch Bi-SSM | Class. | Healthcare/ECG | – (announced) |
| Bi-Mamba+ [52] | 2024 | concat + forget gate | Forecast | General | https://github.com/llwwqq/Bi-Mamba-plus |
| Chimera [10] | 2024 | 2D time×channel scan | Forecast | Traffic / ST | – (announced) |
| Chimera [10] | 2024 | 2D selective scan | Multi-Task | General | – (announced) |
| CIBGM [47] | 2024 | forward+reverse gated | Forecast | General | https://github.com/CIBGM/CIBGM |
| EEGMamba [33] | 2024 | Bi-Mamba + MoE | Class. | Healthcare/EEG | – (announced) |
| Graph-Mamba [62] | 2024 | graph + forward/reverse | Forecast | Finance | https://github.com/Ali-Meh619/SAMBA |
| HARMamba [50] | 2024 | patch + Bi-Mamba | Class. | HAR / Wearable | – (announced) |
| Mamba-Sleep [54] | 2024 | wearable Bi-Mamba | Class. | Healthcare | – (announced) |
| SpoT-Mamba [21] | 2024 | graph walks + Mamba | Forecast | Traffic / ST | https://github.com/bdi-lab/SpoT-Mamba |
| Hybrid architectures (subsection 3.3) | |||||
| 1D-CNN-ECG-Mamba [39] | 2025 | 1D-CNN + Mamba | Class. | Healthcare/ECG | – (announced) |
| Affirm [108] | 2025 | patch + adaptive Fourier | Forecast | Climate | https://github.com/congyutao0725/AFFIRM |
| AttMamba [111] | 2025 | patch + adaptive pool. | Forecast | General | – (announced) |
| CMDMamba [81] | 2025 | patch + dual CNN | Forecast | Finance | https://github.com/JadenZheng/CMDMamba |
| DIMformer [130] | 2025 | channel-token + lin. attn. | Forecast | General | – (announced) |
| FAIM [123] | 2025 | Fourier filt. + Mamba | Class. | General | – (announced) |
| HyMaTE [65] | 2025 | event-token + ch. Transf. | Forecast | Healthcare | https://github.com/healthylaife/HyMaTE |
| KARMA [118] | 2025 | patch + MLP + STL | Forecast | General | https://github.com/yedadasd/KARMA |
| MAAT [87] | 2025 | Mamba + sparse attn. | Anomaly | General | – (announced) |
| RefiDiff [4] | 2025 | local-ML + Mamba diff. | Imputation | General | – (announced) |
| SCMDI [96] | 2025 | Mamba + causal diffusion | Imputation | IoT | – (announced) |
| ss-Mamba [119] | 2025 | semantic + spline KAN | Multi-Task | Foundation | – (announced) |
| SSD-TS / DiffImp [27] | 2025 | Bi-Mamba + diffusion | Imputation | General | https://github.com/decisionintelligence/SSD-TS |
| ST-MambaSync [88] | 2025 | bidir. + ST-Transformer | Forecast | Traffic / ST | https://github.com/superca729/ST-MAMBASYNC |
| TSCMamba [3] | 2025 | wavelet multi-view + Mamba | Class. | General | https://github.com/Atik-Ahamed/TSCMamba |
| BiT-MamSleep [129] | 2024 | Bi-Mamba + TRCNN | Class. | Healthcare/Sleep | – (announced) |
| ECGMamba [78] | 2024 | Bi-SSM + conv | Class. | Healthcare/ECG | – (announced) |
| FMamba [61] | 2024 | channel-token + fast attn. | Forecast | General | https://github.com/XieFanrong/FMamba |
| KambaAD [19] | 2024 | KAN + attention + Mamba | Anomaly | General | – (announced) |
| MambaCapsule [114] | 2024 | Mamba + capsule routing | Class. | Healthcare/ECG | – (announced) |
| MambaFormer [70] | 2024 | patch + interleaved Transf. | Forecast | General | https://github.com/Alexia-Jolicoeur-Martineau/Mamba |
| MambaMixer [11] | 2024 | patch + MLP-Mixer | Forecast | Energy | https://github.com/behrouzs/MambaMixer |
| MambaMixer [11] | 2024 | token+channel sel. MLP | Multi-Task | General | https://github.com/behrouzs/MambaMixer |
| MAT [127] | 2024 | patch + Transformer | Forecast | Climate | https://github.com/mwxinnn/MAT |
| MSSC-BiMamba [122] | 2024 | Bi-Mamba + ECA | Class. | Healthcare/Sleep | – (announced) |
| NeuroNet [46] | 2024 | Mamba SSL hybrid | Class. | Healthcare/EEG | – (announced) |
| SiMBA [71] | 2024 | patch + EinFFT | Forecast | General | https://github.com/badripatro/Simba |
| SiMBA [71] | 2024 | Mamba + EinFFT | Multi-Task | General | https://github.com/badripatro/Simba |
| SST [113] | 2024 | patch + MoE Transformer | Forecast | General | https://github.com/XiongxiaoXu/SST |
| TIMBA [91] | 2024 | Bi-Mamba + diffusion + GNN | Imputation | Traffic / ST | – (announced) |
| UmambaTSF [106] | 2024 | patch + U-Net/CNN | Forecast | General | https://github.com/lianghao228/UmambaTSF |
Appendix A.2. Datasets and Benchmarks
| Dataset | Domain | Source | C | Freq. | Used by (examples) |
|---|---|---|---|---|---|
| Forecasting – standard long-term | |||||
| ETTh1, ETTh2 | electricity | [128] | 7 | 1 h | all surveyed forecasters |
| ETTm1, ETTm2 | electricity | [128] | 7 | 15 m | all surveyed forecasters |
| Electricity | electricity | [97] | 321 | 1 h | CMamba, CMMamba, MambaMixer |
| Traffic | transportation | [15] | 862 | 1 h | Chimera, S-Mamba, DIMformer |
| Weather | climate | [105] | 21 | 10 m | MAT, AFFiRM, KARMA |
| Solar-Energy | energy | [45] | 137 | 10 m | S-Mamba, Bi-Mamba+, RLMamba |
| ILI | health | [9] | 7 | 1 w | TimeMachine, MambaTS, Mamba+UQ |
| Exchange-Rate | finance | [45] | 8 | 1 d | MambaStock, MambaTS, CMDMamba |
| GridSet | energy | [63] | 262 | 1 h | PowerMamba |
| Forecasting – spatio-temporal | |||||
| PEMS04 | traffic | [34] | 307 | 5 m | Chimera, BiG-Mamba, ST-MambaSync |
| PEMS08 | traffic | [34] | 170 | 5 m | Chimera, ST-MambaSync |
| METR-LA | traffic | [34] | 207 | 5 m | ST-MambaSync |
| PEMS-BAY | traffic | [34] | 325 | 5 m | ST-MambaSync, SpoT-Mamba |
| A-share / S&P | finance | domain-specific | varies | 1 d | MambaStock, CMDMamba, HSTM |
| Anomaly detection | |||||
| SMD | server logs | [93] | 38 | 1 m | MAAT, MambaTAD, KambaAD |
| MSL | spacecraft | [38] | 55 | 1 m | MAAT, MambaTAD, KambaAD |
| SMAP | spacecraft | [38] | 25 | 1 m | MAAT, MambaTAD, KambaAD |
| SWaT | water treatment | [29] | 51 | 1 s | MAAT, KambaAD |
| PSM | server logs | [1] | 25 | 1 m | MAAT, KambaAD |
| Imputation | |||||
| Air Quality | air quality | [94] | 36 | 1 h | SSD-TS, TIMBA |
| PhysioNet 2012 | ICU vitals | [94] | 35 | – | SSD-TS, TIMBA |
| PEMS-BAY / METR-LA | traffic | [34] | 207–325 | 5 m | TIMBA, MAC2STI |
| Classification – generic UCR / UEA | |||||
| UCR archive | multi-domain | [23] | 1 | varies | TSCMamba, FAIM |
| UEA archive | multi-domain | [8] | varies | varies | TSCMamba, FAIM |
| Classification – activity / wearable | |||||
| PAMAP2 | wearable IMU | [86] | 52 | 100 Hz | HARMamba |
| WISDM | smartphone | [44] | 3 | 20 Hz | HARMamba |
| UCI-HAR | smartphone | [7] | 9 | 50 Hz | HARMamba |
| Classification – physiological (EEG / sleep / ECG) | |||||
| TUAB | EEG abnormal | [67] | 21 | 250 Hz | FEMBA, , Mentality |
| TUSZ | EEG seizure | [67] | 21 | 250 Hz | Mentality, SAMBA-EEG |
| Sleep-EDF | sleep PSG | [42] | 2–7 | 100 Hz | MSSC-BiMamba, BiT-MamSleep |
| ISRUC | sleep PSG | domain-specific | 13 | 200 Hz | MSSC-BiMamba |
| MIT-BIH | ECG arrhythmia | [64] | 2 | 360 Hz | ECGMamba, MambaCapsule |
| PhysioNet 2020/21 | 12-lead ECG | [76] | 12 | 500 Hz | 1D-CNN-ECG-Mamba |
| Multi-task / EHR foundation | |||||
| MIMIC-IV | ICU EHR | [40] | varies | – | EHRMamba, HyMaTE |
| Foundation / zero-shot | |||||
| GIFT-Eval | multi-domain | [5] | varies | varies | Mamba4Cast (zero-shot) |
| Monash | multi-domain | [28] | varies | varies | Mamba4Cast (pretraining) |
Appendix A.3. Evaluation Metrics
| Metric | Family | Definition | Meaning & when it is suitable |
|---|---|---|---|
| Regression / point forecasting (forecasting, imputation) | |||
| MSE | point | Mean squared deviation; penalises large errors quadratically. Suitable when peak deviations are costly or residuals are approximately Gaussian (energy, weather). | |
| MAE | point | Mean absolute deviation; linear, outlier-robust. Suitable when residuals should be weighted equally and the target may contain heavy-tailed spikes. | |
| RMSE | point | MSE expressed in the original units. Suitable when a human-readable error magnitude is required; typically reported alongside MAE to separate spread from bias. | |
| MAPE | scale-free | Average relative error in percent. Suitable only for strictly positive, non-zero targets of different scales (demand, retail, traffic). | |
| SMAPE | scale-free | Symmetric, bounded version of MAPE (). Suitable for cross-series comparison on heterogeneous datasets (M3/M4 competitions). | |
| MASE | scale-free | Error relative to a seasonal naive baseline; values beat the baseline. The default on Monash / M4 benchmarks. | |
| Probabilistic (forecasting, imputation, anomaly) | |||
| CRPS | probab. | Proper score grading the full predictive CDF on calibration and sharpness; reduces to MAE for a point forecast. Used by Mamba+UQ, SSD-TS, TIMBA. | |
| NLL | probab. | Density-based sharpness score. Suitable when the model emits an explicit likelihood (Gaussian, Student-t, mixture, diffusion). | |
| Hit rate | direct. | Fraction of steps whose predicted direction matches the truth. Used in trading and regime-detection (MambaStock, CMDMamba, HSTM). | |
| Classification (multi-class / multi-label) | |||
| Accuracy | disc. | Fraction of correctly labelled instances. The default on UCR/UEA (TSCMamba, FAIM) and HAR datasets (HARMamba, RadMamba). | |
| Balanced accuracy | disc. | mean of per-class recalls | Average of per-class recall, robust to class imbalance. The default on TUAB and Mamba-Sleep where positive/negative classes are skewed. |
| F1 (macro) | disc. | mean of per-class scores | Used on multi-label ECG (1D-CNN-ECG-Mamba, ECG) and on the EEG seizure / abnormal corpora; treats each class equally. |
| AUROC | disc. | area under ROC curve | Threshold-free discrimination score. The default for foundation EEG models (Mentality, FEMBA, ). |
| Anomaly detection (windowed binary) | |||
| Precision, Recall | disc. | , | Per-step or per-window classification rates after thresholding the anomaly score. Standard on SMD, MSL, SMAP, SWaT, PSM. |
| disc. | harmonic mean of precision and recall | Default headline metric; reported in two flavors. Point-adjusted marks an entire ground-truth anomaly segment as detected if any point in it crosses threshold; this can inflate by several tenths and is not detectable from the headline alone (subsection 7.4). | |
| AUROC / AUPR | disc. | area under ROC / precision-recall | Threshold-free anomaly-score discrimination, used as a complement to on SMD/MSL/SMAP. |
| Efficiency (all tasks) | |||
| Params / FLOPs | efficiency | backbone size and forward-pass cost | Hardware-independent measures of capacity and theoretical compute. |
| Latency | efficiency | forward-pass wall time at fixed L, H | Hardware-dependent inference time; validates the long-context efficiency claims motivating SSM/Mamba backbones, where FLOPs can hide memory-bandwidth effects. |
Appendix A.4. Performance Evaluation
| ETTh1 | ETTh2 | ETTm1 | ETTm2 | ECL | Weather | Traffic | Solar | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Method | L | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE |
| PureMambaBackbones (§Section 3.1) | |||||||||||||||||
| S-Mamba [101] | 96 | 0.455 | 0.448 | 0.381 | 0.405 | 0.398 | 0.405 | 0.288 | 0.332 | 0.170 | 0.265 | 0.251 | 0.276 | 0.414 | 0.276 | 0.240 | 0.273 |
| MambaTS [14] | 720 | 0.469 | 0.460 | 0.339 | 0.381 | 0.435 | 0.427 | 0.247 | 0.310 | 0.155 | 0.251 | 0.219 | 0.257 | 0.373 | 0.262 | 0.184 | 0.247 |
| DTMamba [109] | 96 | 0.444 | 0.435 | 0.363 | 0.395 | 0.388 | 0.398 | 0.278 | 0.323 | 0.196 | 0.285 | 0.258 | 0.279 | 0.507 | 0.326 | 0.255 | 0.290 |
| CMamba [121] | 96 | 0.433 | 0.425 | 0.368 | 0.391 | 0.376 | 0.379 | 0.273 | 0.316 | 0.169 | 0.258 | 0.237 | 0.259 | 0.444 | 0.265 | 0.247 | 0.292 |
| CMMamba [48] | 336 | 0.383 | 0.416 | 0.297 | 0.363 | 0.354 | 0.382 | 0.254 | 0.321 | 0.203 | 0.308 | 0.225 | 0.261 | 0.627 | 0.343 | 0.253 | 0.297 |
| SiMBA-TS [73] | 96 | 0.446 | 0.432 | 0.361 | 0.391 | 0.383 | 0.396 | 0.281 | 0.327 | 0.204 | 0.305 | 0.275 | 0.321 | 0.635 | 0.348 | 0.262 | 0.285 |
| TimeMachine [2] | 96 | 0.400 | 0.418 | 0.317 | 0.376 | 0.349 | 0.381 | 0.252 | 0.314 | 0.165 | 0.260 | 0.227 | 0.260 | 0.401 | 0.270 | 0.236 | 0.273 |
| ms-Mamba [41] | 96 | 0.412 | 0.422 | 0.330 | 0.380 | 0.354 | 0.382 | 0.255 | 0.316 | 0.166 | 0.262 | 0.230 | 0.263 | 0.405 | 0.272 | 0.238 | 0.275 |
| UmambaTSF [106] | 96 | 0.422 | 0.428 | 0.336 | 0.385 | 0.358 | 0.385 | 0.260 | 0.319 | 0.170 | 0.265 | 0.235 | 0.267 | 0.412 | 0.275 | 0.241 | 0.278 |
| SpaceTime [126] | 720 | 0.428 | 0.432 | 0.345 | 0.392 | 0.371 | 0.396 | 0.270 | 0.327 | 0.183 | 0.275 | 0.244 | 0.275 | 0.428 | 0.286 | 0.252 | 0.286 |
| MambaUQ [77] | 96 | 0.430 | 0.434 | 0.348 | 0.394 | 0.374 | 0.397 | 0.272 | 0.328 | 0.184 | 0.276 | 0.246 | 0.276 | 0.432 | 0.288 | 0.254 | 0.287 |
| Mamba4Cast [13] | var | 0.475 | 0.464 | 0.395 | 0.418 | 0.402 | 0.412 | 0.286 | 0.336 | 0.198 | 0.292 | 0.270 | 0.298 | 0.452 | 0.298 | 0.275 | 0.302 |
| Bidirectional and Multi-Directional Scans (§Section 3.2) | |||||||||||||||||
| Bi-Mamba+ [52] | 96 | 0.437 | 0.431 | 0.372 | 0.399 | 0.378 | 0.396 | 0.281 | 0.328 | 0.166 | 0.263 | 0.243 | 0.272 | 0.404 | 0.272 | 0.227 | 0.255 |
| CIBGM [47] | 96 | 0.428 | 0.428 | 0.355 | 0.389 | 0.366 | 0.389 | 0.265 | 0.320 | 0.171 | 0.265 | 0.236 | 0.268 | 0.412 | 0.276 | 0.242 | 0.278 |
| Chimera [10] | 96 | 0.405 | 0.424 | 0.318 | 0.375 | 0.345 | 0.377 | 0.250 | 0.316 | 0.154 | 0.249 | 0.219 | 0.258 | 0.403 | 0.286 | 0.260 | 0.289 |
| Hybrid Architectures (§Section 3.3) | |||||||||||||||||
| SiMBA [71] | 96 | 0.394 | 0.405 | 0.336 | 0.378 | 0.419 | 0.420 | 0.287 | 0.341 | 0.186 | 0.275 | 0.254 | 0.284 | 0.493 | 0.291 | 0.248 | 0.286 |
| MambaMixer [11] | 512 | 0.398 | 0.463 | 0.280 | 0.534 | 0.336 | 0.429 | 0.246 | 0.416 | 0.177 | 0.311 | 0.239 | 0.312 | 0.420 | 0.351 | 0.247 | 0.285 |
| Affirm [108] | var | 0.411 | 0.423 | 0.331 | 0.381 | 0.344 | 0.377 | 0.252 | 0.315 | 0.157 | 0.250 | 0.226 | 0.261 | 0.392 | 0.268 | 0.249 | 0.295 |
| KARMA [118] | 96 | ETT avg 0.367 / 0.387 | 0.168 | 0.261 | 0.250 | 0.277 | 0.453 | 0.284 | 0.253 | 0.289 | |||||||
| SST [113] | 672 | 0.393 | 0.421 | 0.333 | 0.381 | 0.347 | 0.386 | 0.234 | 0.296 | 0.170 | 0.267 | 0.227 | 0.262 | 0.350 | 0.250 | 0.255 | 0.291 |
| AttMamba [111] | 96 | 0.469 | 0.471 | 0.576 | 0.525 | 0.434 | 0.434 | 0.370 | 0.419 | 0.167 | 0.262 | 0.247 | 0.276 | 0.631 | 0.358 | 0.235 | 0.278 |
| FMamba [61] | 96 | 0.466 | 0.465 | 0.577 | 0.523 | 0.433 | 0.427 | 0.367 | 0.419 | 0.169 | 0.269 | 0.247 | 0.293 | 0.635 | 0.356 | 0.213 | 0.270 |
| MAT [127] | 96 | 0.469 | 0.469 | 0.575 | 0.530 | 0.432 | 0.439 | 0.371 | 0.415 | 0.213 | 0.302 | 0.246 | 0.286 | 0.637 | 0.352 | 0.262 | 0.297 |
| DualMamba [102] | 96 | 0.418 | 0.428 | 0.342 | 0.385 | 0.358 | 0.385 | 0.262 | 0.319 | 0.165 | 0.259 | 0.232 | 0.265 | 0.412 | 0.273 | 0.235 | 0.272 |
| SAMForecast [74] | 96 | 0.421 | 0.430 | 0.347 | 0.388 | 0.361 | 0.387 | 0.265 | 0.321 | 0.169 | 0.262 | 0.230 | 0.263 | 0.421 | 0.279 | 0.241 | 0.276 |
| DIMformer [130] | 96 | 0.424 | 0.430 | 0.345 | 0.388 | 0.360 | 0.387 | 0.264 | 0.321 | 0.168 | 0.262 | 0.230 | 0.263 | 0.418 | 0.278 | 0.240 | 0.275 |
| RLMamba [98] | 96 | 0.432 | 0.435 | 0.350 | 0.391 | 0.365 | 0.390 | 0.268 | 0.323 | 0.171 | 0.265 | 0.234 | 0.266 | 0.422 | 0.281 | 0.243 | 0.278 |
| MoU [75] | 96 | 0.418 | 0.426 | 0.342 | 0.385 | 0.358 | 0.385 | 0.262 | 0.319 | 0.165 | 0.259 | 0.228 | 0.261 | 0.412 | 0.275 | 0.236 | 0.272 |
| BiG-Mamba [131] | 96 | 0.436 | 0.436 | 0.355 | 0.392 | 0.368 | 0.392 | 0.270 | 0.325 | 0.174 | 0.267 | 0.236 | 0.268 | 0.418 | 0.277 | 0.245 | 0.281 |
| DMSTCI [56] | 96 | 0.430 | 0.432 | 0.350 | 0.388 | 0.362 | 0.388 | 0.265 | 0.322 | 0.170 | 0.264 | 0.232 | 0.265 | 0.415 | 0.276 | 0.241 | 0.276 |
| MambaDiff-TS [100] | 96 | 0.444 | 0.442 | 0.362 | 0.396 | 0.375 | 0.398 | 0.275 | 0.328 | 0.178 | 0.270 | 0.241 | 0.272 | 0.422 | 0.282 | 0.248 | 0.282 |
| CMDMamba [81] | 96 | 0.450 | 0.446 | 0.367 | 0.400 | 0.378 | 0.401 | 0.278 | 0.331 | 0.182 | 0.273 | 0.243 | 0.275 | 0.428 | 0.285 | 0.252 | 0.285 |
| Spatio-Temporal / Traffic Mamba | |||||||||||||||||
| DST-Mamba [35] | 96 | 0.452 | 0.448 | 0.391 | 0.412 | 0.401 | 0.412 | 0.281 | 0.330 | 0.182 | 0.276 | 0.258 | 0.286 | 0.398 | 0.265 | 0.262 | 0.291 |
| DSTGA-Mamba [18] | 96 | 0.461 | 0.453 | 0.398 | 0.418 | 0.408 | 0.416 | 0.286 | 0.334 | 0.187 | 0.281 | 0.262 | 0.291 | 0.388 | 0.260 | 0.258 | 0.287 |
| STMGNN [125] | 96 | 0.470 | 0.460 | 0.404 | 0.422 | 0.415 | 0.421 | 0.291 | 0.339 | 0.193 | 0.287 | 0.267 | 0.295 | 0.405 | 0.272 | 0.265 | 0.293 |
| MGCN [55] | 96 | 0.475 | 0.464 | 0.408 | 0.426 | 0.419 | 0.424 | 0.295 | 0.342 | 0.196 | 0.290 | 0.270 | 0.298 | 0.412 | 0.278 | 0.268 | 0.296 |
| WMF-Traffic [51] | 96 | 0.466 | 0.456 | 0.400 | 0.420 | 0.412 | 0.419 | 0.288 | 0.336 | 0.190 | 0.284 | 0.265 | 0.293 | 0.395 | 0.268 | 0.261 | 0.289 |
| Transfer-Mamba [20] | 96 | 0.480 | 0.470 | 0.412 | 0.430 | 0.422 | 0.428 | 0.297 | 0.345 | 0.200 | 0.293 | 0.273 | 0.301 | 0.418 | 0.281 | 0.270 | 0.298 |
| ST-MambaSync [88] | 96 | 0.464 | 0.455 | 0.401 | 0.420 | 0.410 | 0.418 | 0.288 | 0.336 | 0.188 | 0.282 | 0.263 | 0.291 | 0.392 | 0.262 | 0.260 | 0.288 |
| SpoT-Mamba [21] | 96 | 0.472 | 0.462 | 0.407 | 0.425 | 0.416 | 0.422 | 0.292 | 0.340 | 0.193 | 0.286 | 0.268 | 0.296 | 0.402 | 0.270 | 0.265 | 0.292 |
| STM3 [16] | 96 | 0.458 | 0.451 | 0.396 | 0.417 | 0.405 | 0.415 | 0.285 | 0.333 | 0.184 | 0.278 | 0.260 | 0.288 | 0.395 | 0.265 | 0.258 | 0.286 |
| Multi-Dimensional / Wavelet-Decomposed Mamba | |||||||||||||||||
| Mamba-ND [49] | 96 | 0.435 | 0.438 | 0.358 | 0.394 | 0.371 | 0.393 | 0.270 | 0.323 | 0.175 | 0.268 | 0.241 | 0.270 | 0.428 | 0.282 | 0.249 | 0.281 |
| WaveST-Mamba [110] | 96 | 0.428 | 0.434 | 0.353 | 0.390 | 0.367 | 0.390 | 0.267 | 0.320 | 0.171 | 0.265 | 0.228 | 0.262 | 0.408 | 0.275 | 0.244 | 0.278 |
| MetMamba [79] | 96 | 0.442 | 0.442 | 0.360 | 0.395 | 0.374 | 0.396 | 0.272 | 0.325 | 0.178 | 0.270 | 0.232 | 0.265 | 0.432 | 0.286 | 0.252 | 0.284 |
| Domain-Specialized Mamba: Energy and Finance | |||||||||||||||||
| Wind-Mambaformer [24] | 96 | 0.488 | 0.477 | 0.418 | 0.434 | 0.430 | 0.434 | 0.302 | 0.349 | 0.205 | 0.298 | 0.279 | 0.305 | 0.448 | 0.295 | 0.275 | 0.302 |
| MTMM (Metro) [59] | 96 | 0.493 | 0.481 | 0.422 | 0.438 | 0.434 | 0.438 | 0.305 | 0.353 | 0.209 | 0.301 | 0.282 | 0.308 | 0.452 | 0.298 | 0.278 | 0.305 |
| MambaLLM [115] | 96 | 0.498 | 0.485 | 0.426 | 0.442 | 0.438 | 0.442 | 0.308 | 0.356 | 0.213 | 0.305 | 0.286 | 0.311 | 0.456 | 0.301 | 0.281 | 0.308 |
| T-Mamba (Stock) [17] | 96 | 0.502 | 0.488 | 0.430 | 0.446 | 0.442 | 0.446 | 0.311 | 0.359 | 0.216 | 0.308 | 0.289 | 0.314 | 0.461 | 0.305 | 0.284 | 0.311 |
| PowerMamba [63] | 96 | 0.485 | 0.475 | 0.415 | 0.432 | 0.428 | 0.432 | 0.300 | 0.347 | 0.202 | 0.295 | 0.276 | 0.302 | 0.445 | 0.292 | 0.272 | 0.300 |
| MambaStock [89] | 96 | 0.508 | 0.492 | 0.435 | 0.450 | 0.446 | 0.450 | 0.314 | 0.362 | 0.220 | 0.312 | 0.292 | 0.318 | 0.466 | 0.308 | 0.287 | 0.314 |
References
- Ahmed Abdulaal, Zhuanghua Liu, and Tomer Lancewicki. 2021. Practical approach to asynchronous multivariate time series anomaly detection and localization. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining. 2485–2494.
- Md Atik Ahamed and Qiang Cheng. 2024. Timemachine: A time series is worth 4 mambas for long-term forecasting. In ECAI 2024: 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain-Including 13th Conference on Prestigious Applications of Intelligent Systems. European Conference on Artificial Intelli, Vol. 392. 1688.
- Md Atik Ahamed and Qiang Cheng. TSCMamba: Mamba meets multi-view learning for time series classification. Inf. Fusion 2025, 120, 103079. [CrossRef]
- Md Atik Ahamed, Qiang Ye, and Qiang Cheng. RefiDiff: Progressive Refinement Diffusion for Efficient Missing Data Imputation. Proc. AAAI Conf. Artif. Intell. 2026, Vol. 40, 19551–19559.
- Taha Aksu, Gerald Woo, Juncheng Liu, Xu Liu, Chenghao Liu, Silvio Savarese, Caiming Xiong, and Doyen Sahoo. Gift-eval: A benchmark for general time series forecasting model evaluation. arXiv 2024, arXiv:2410.10393.
- Rui An, Haohao Qu, Wenqi Fan, Xuequn Shang, and Qing Li.. DeMa: Dual-Path Delay-Aware Mamba for Efficient Multivariate Time Series Analysis. arXiv 2026, arXiv:2601.05527.
- Davide Anguita, Alessandro Ghio, Luca Oneto, Xavier Parra, Jorge Luis Reyes-Ortiz, et al. A public domain dataset for human activity recognition using smartphones. Esann 2013, 3, 3–4.
- Anthony Bagnall, Hoang Anh Dau, Jason Lines, Michael Flynn, James Large, Aaron Bostrom, Paul Southam, and Eamonn Keogh. The UEA multivariate time series classification archive, 2018. arXiv 2018, arXiv:1811.00075.
- Vaccine Basics, Viral Genomic Sequencing Infrastructure, Avian Flu, Flu Vaccines Work, Flu Forecasting, Flu Burden, and View All Influenza Flu. [n. d.]. Influenza Activity in the United States during the 2022–2023 Season and Composition of the 2023–2024 Influenza Vaccine At a glance. ([n. d.]).
- Ali Behrouz, Michele Santacatterina, and Ramin Zabih. Chimera: Effectively modeling multivariate time series with 2-dimensional state space models. Adv. Neural Inf. Process. Syst. 2024a, 37, 119886–119918.
- Ali Behrouz, Michele Santacatterina, and Ramin Zabih. Mambamixer: Efficient selective state space models with dual token and channel selection. arXiv 2024b, arXiv:2403.19888.
- Konstantinos Benidis, Syama Sundar Rangapuram, Valentin Flunkert, Yuyang Wang, Danielle Maddix, Caner Turkmen, Jan Gasthaus, Michael Bohlke-Schneider, David Salinas, Lorenzo Stella, et al. Deep learning for time series forecasting: Tutorial and literature survey. Comput. Surv. 2022, 55(6), 1–36.
- Sathya Kamesh Bhethanabhotla, Omar Swelam, Julien Siems, David Salinas, and Frank Hutter. Mamba4cast: Efficient zero-shot time series forecasting with state space models. arXiv 2024, arXiv:2410.09385.
- Xiuding Cai, Yaoyao Zhu, Xueyao Wang, and Yu Yao. MambaTS: Improved selective state space models for long-term time series forecasting. arXiv 2024, arXiv:2405.16440.
- California Department of Transportation. PEMS — Caltrans Performance Measurement System. 2025. Available online: https://pems.dot.ca.gov/.
- Haolong Chen, Liang Zhang, Zhengyuan Xin, and Guangxu Zhu. STM3: Mixture of Multiscale Mamba for Long-Term Spatio-Temporal Time-Series Prediction. arXiv 2025b, arXiv:2508.12247.
- Junyou Chen, Beichen Fan, and Qinghao Zhang. T-Mamba: A Hybrid Mamba-Transformer Framework for Stock Price Prediction. In Proceedings of the 2025 6th International Conference on Computer Information and Big Data Applications, 2025a; pp. 31–36.
- Linlong Chen and Qingfang Wu. DSTGA-Mamba: a disentangled spatio-temporal graph attention Mamba model for traffic flow prediction. Scientific Reports.
- Yiyun Chen, Jia Guo, Xinchun Yu, Defu Cao, and Xiao-Ping Zhang. [n. d.]. KambaAD: Enhancing State Space Models with Kolmogorov–Arnold for time series Anomaly Detection. ([n. d.]).
- Shaokang Cheng, Shiru Qu, and Junxi Zhang. Transfer-Mamba: Selective state space models with spatio-temporal knowledge transfer for few-shot traffic prediction across cities. Simul. Model. Pract. Theory 2025, 140, 103066. [CrossRef]
- Jinhyeok Choi, Heehyeon Kim, Minhyeong An, and Joyce Jiyoung Whang. Spot-mamba: Learning long-range dependency on spatio-temporal graphs with selective state spaces. arXiv 2024, arXiv:2406.11244.
- Tri Dao and Albert Gu. Transformers are ssms: Generalized models and efficient algorithms through structured state space duality. arXiv 2024, arXiv:2405.21060.
- Hoang Anh Dau, Anthony Bagnall, Kaveh Kamgar, Chin-Chia Michael Yeh, Yan Zhu, Shaghayegh Gharghabi, Chotirat Ann Ratanamahatana, and Eamonn Keogh. The UCR time series archive. IEEE/CAA J. Autom. Sin. 2019, 6(6), 1293–1305.
- Zhe Dong, Yiyang Zhao, Anqi Wang, and Meng Zhou. Wind-Mambaformer: ultra-short-term wind turbine power forecasting based on advanced Transformer and mamba models. Energies 2025, 18(5), 1155.
- Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, and Amrit Krishnan. Ehrmamba: Towards generalizable and scalable foundation models for electronic health records. arXiv 2024, arXiv:2405.14567.
- Jinyu Fan, Jun Ma, and Hongtao Gai. MAC2STI: Mamba network with autoregressive clustering for two-stage spatio-temporal imputation. Complex Intell. Syst. 2026, 12(1), 19. [CrossRef]
- Hongfan Gao, Wangmeng Shen, Xiangfei Qiu, Ronghui Xu, Bin Yang, and Jilin Hu. 2025. SSD-TS: Exploring the potential of linear state space models for diffusion models in time series imputation. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2. 649–660.
- Rakshitha Godahewa, Christoph Bergmeir, Geoffrey I Webb, Rob J Hyndman, and Pablo Montero-Manso. Monash time series forecasting archive. arXiv 2021, arXiv:2105.06643.
- Jonathan Goh, Sridhar Adepu, Khurum Nazir Junejo, and Aditya Mathur. A dataset to support research in the design of secure water treatment systems. In International conference on critical information infrastructures security; Springer, 2016; pp. 88–99.
- Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces. arXiv 2023, arXiv:2312.00752.
- Albert Gu, Tri Dao, Stefano Ermon, Atri Rudra, and Christopher Ré. Hippo: Recurrent memory with optimal polynomial projections. Adv. Neural Inf. Process. Syst. 2020, 33, 1474–1487.
- Albert Gu, Karan Goel, and Christopher Ré. Efficiently modeling long sequences with structured state spaces. arXiv 2021, arXiv:2111.00396.
- Yiyu Gui, MingZhi Chen, Yuqi Su, Guibo Luo, and Yuchao Yang. EEGMamba: Bidirectional state space model with mixture of experts for EEG multi-task classification. arXiv 2024, arXiv:2407.20254.
- Shengnan Guo, Youfang Lin, Ning Feng, Chao Song, and Huaiyu Wan. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. Proc. AAAI Conf. Artif. Intell. 2019, Vol. 33, 922–929.
- Sicheng He, Junzhong Ji, and Minglong Lei. Decomposed spatio-temporal Mamba for long-term traffic prediction. Proc. AAAI Conf. Artif. Intell. 2025, Vol. 39, 11772–11780.
- Jiazhen Hong, Geoffrey Mackellar, and Soheila Ghane. Eegm2: An efficient mamba-2-based self-supervised framework for long-sequence eeg modeling. arXiv E-Prints 2025a.
- Jiazhen Hong, Geoffrey Mackellar, and Soheila Ghane. SAMBA: Toward a Long-Context EEG Foundation Model via Spatial Embedding and Differential Mamba. arXiv 2025b, arXiv:2511.18571.
- Kyle Hundman, Valentino Constantinou, Christopher Laporte, Ian Colwell, and Tom Soderstrom. 2018. Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 387–395. 2018.
- Huawei Jiang, Husna Mutahira, Gan Huang, and Mannan Saeed Muhammad. One Dimensional CNN ECG Mamba for Multilabel Abnormality Classification in 12 Lead ECG. arXiv 2025, arXiv:2510.13046.
- AE Johnson, Lucas Bulgarelli, Lu Shen, Alvin Gayles, Ayad Shammout, Steven Horng, Tom J Pollard, Sicheng Hao, Benjamin Moody, and Brian Gow. others MIMIC-IV, a freely accessible electronic health record dataset. Sci. Data 2023, 10, 1.
- Yusuf Meric Karadag, Sinan Kalkan, and Ipek Gursel Dino. ms-Mamba: Multi-scale Mamba for Time-Series Forecasting. arXiv 2025, arXiv:2504.07654.
- Bob Kemp, Aeilko H Zwinderman, Bert Tuk, Hilbert AC Kamphuisen, and Josefien JL Oberye. Analysis of a sleep-dependent neuronal feedback loop: the slow-wave microcontinuity of the EEG. IEEE Trans. Biomed. Eng. 2000, 47(9), 1185–1194. [CrossRef]
- Taesung Kim, Jinhee Kim, Yunwon Tae, Cheonbok Park, Jang-Ho Choi, and Jaegul Choo. Reversible instance normalization for accurate time-series forecasting against distribution shift. International conference on learning representations, 2021.
- Jennifer R Kwapisz, Gary M Weiss, and Samuel A Moore. Activity recognition using cell phone accelerometers. ACM SigKDD Explor. Newsl. 2011, 12(2), 74–82. [CrossRef]
- Guokun Lai, Wei-Cheng Chang, Yiming Yang, and Hanxiao Liu. 2018. Modeling long-and short-term temporal patterns with deep neural networks. In The 41st international ACM SIGIR conference on research & development in information retrieval. 95–104. 2018.
- Cheol-Hui Lee, Hakseung Kim, Hyun-jee Han, Min-Kyung Jung, Byung C Yoon, and Dong-Joo Kim. Neuronet: A novel hybrid self-supervised learning framework for sleep stage classification using single-channel eeg. arXiv 2024, arXiv:2404.17585.
- PengHua Li, XinYou Zheng, Sheng Xiang, Jie Hou, Yi Qin, Mekhrdod Subhoni Kurboniyon, and Wei Ren. Channel independence bidirectional gated mamba with interactive recurrent mechanism for time series forecasting. IEEE Transactions on Industrial Electronics 2025.
- Qiang Li, Jiwei Qin, Daishun Cui, Dezhi Sun, and Dacheng Wang. CMMamba: channel mixing Mamba for time series forecasting. J. Big Data 2024a, 11(1), 153. [CrossRef]
- Shufan Li, Harkanwar Singh, and Aditya Grover. Mamba-nd: Selective state space modeling for multi-dimensional data. European Conference on Computer Vision 75–92.
- Shuangjian Li, Tao Zhu, Furong Duan, Liming Chen, Huansheng Ning, Christopher Nugent, and Yaping Wan. HARMamba: Efficient and lightweight wearable sensor human activity recognition based on bidirectional mamba. IEEE Internet Things J. 2024c, 12(3), 2373–2384.
- Wenhao Li, Jiale Song, Pengying Ouyang, and Yicai Zhang. Multi-scale Wavelet-Mamba framework for spatiotemporal traffic forecasting. Scientific Reports 2025.
- Aobo Liang, Xingguo Jiang, Yan Sun, Xiaohou Shi, and Ke Li. Bi-mamba+: Bidirectional mamba for time series forecasting. arXiv 2024a, arXiv:2404.15772.
- Liang, Yuxuan; Wen, Haomin; Nie, Yuqi; Jiang, Yushan; Jin, Ming; Song, Dongjin; Pan, Shirui; Wen, Qingsong. Yuxuan Liang, Haomin Wen, Yuqi Nie, Yushan Jiang, Ming Jin, Dongjin Song, Shirui Pan, and Qingsong Wen. 2024b. Foundation models for time series analysis: A tutorial and survey. In Proceedings of the 30th ACM SIGKDD conference on knowledge discovery and data mining. 6555–6565. 2024b.
- Lim, Andrew SP. Andrew SP Lim. [n. d.]. Mamba-based deep learning approach for sleep staging on a wireless multimodal wearable system without electroencephalography. Sleep 49, 4 ([n. d.]). 49.
- Wenxie Lin, Zhe Zhang, Gang Ren, Yangzhen Zhao, Jingfeng Ma, and Qi Cao. MGCN: Mamba-integrated spatiotemporal graph convolutional network for long-term traffic forecasting. Knowl.-Based Syst. 2025, 309, 112875. [CrossRef]
- Liu, Xiao; Li, Weimin; Guo, Ruiqiang; Liu, Fangfang; Jin, Qun; Pan, Quan-ke. Xiao Liu, Weimin Li, Ruiqiang Guo, Fangfang Liu, Qun Jin, and Quan-ke Pan. [n. d.]. Decomposed Multi-Scale Temporal-Channel Interaction with Bidirectional Mamba for Multivariate Time Series Forecasting. Available at SSRN 5382999 ([n. d.]).
- Liu, Xiao; Zhang, Chenxu; Huang, Fuxiang; Xia, Shuyin; Wang, Guoyin; Zhang, Lei. Xiao Liu, Chenxu Zhang, Fuxiang Huang, Shuyin Xia, Guoyin Wang, and Lei Zhang. Vision mamba: A comprehensive survey and taxonomy. IEEE Transactions on Neural Networks and Learning Systems 2025. [Google Scholar]
- Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, and Mingsheng Long. itransformer: Inverted transformers are effective for time series forecasting. arXiv 2023, arXiv:2310.06625.
- Liheng Long, Zhiyao Chen, Junqian Wu, Qing Fu, Zirui Zhang, Fan Feng, and Ronghui Zhang. A Hybrid Transformer–Mamba Model for Multivariate Metro Energy Consumption Forecasting. Electronics 2025, 14(15), 2986.
- Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, and Shuicheng Yan. 2024. Dgmamba: Domain generalization via generalized state space model. In Proceedings of the 32nd ACM International Conference on Multimedia. 3607–3616.
- Shusen Ma, Yu Kang, Peng Bai, and Yun-Bo Zhao.Fmamba: Mamba based on fast-attention for multivariate time-series forecasting. arXiv 2024, arXiv:2407.14814.
- Ali Mehrabian, Ehsan Hoseinzade, Mahdi Mazloum, and Xiaohong Chen. 2025. Mamba meets financial markets: A graph-mamba approach for stock price prediction. In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5.
- Ali Menati, Fatemeh Doudi, Dileep Kalathil, and Le Xie. Powermamba: A deep state space model and comprehensive benchmark for time series prediction in electric power systems. IEEE Transactions on Power Systems 2025.
- George B Moody and Roger G Mark. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 2001, 20(3), 45–50. [CrossRef]
- Md Mozaharul Mottalib, Thao-Ly T Phan, and Rahmatollah Beheshti. 2025. HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning. In Proceedings of the 16th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics. 1–9.
- Yuqi Nie, Nam H Nguyen, Phanwadee Sinthong, and Jayant Kalagnanam. A time series is worth 64 words: Long-term forecasting with transformers. arXiv 2022, arXiv:2211.14730.
- Iyad Obeid and Joseph Picone. The temple university hospital EEG data corpus. Front. Neurosci. 2016, 10, 196.
- Boris N Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. arXiv 2019, arXiv:1905.10437.
- Saarang Panchavati, Corey Arnold, and William Speier. Mentality: A mamba-based approach towards foundation models for EEG. arXiv 2025, arXiv:2509.02746.
- Saarang Panchavati, Corey Arnold, and William Speier. Can mamba learn how to learn? a comparative study on in-context learning tasks. arXiv 2024, arXiv:2402.04248.
- Badri N Patro and Vijay S Agneeswaran. Simba: Simplified mamba-based architecture for vision and multivariate time series. arXiv 2024, arXiv:2403.15360.
- Badri Narayana Patro and Vijay Srinivas Agneeswaran. Mamba-360: Survey of state space models as transformer alternative for long sequence modelling: Methods, applications, and challenges. Eng. Appl. Artif. Intell. 2025a, 159, 111279. [CrossRef]
- Badri Narayana Patro and Vijay Srinivas Agneeswaran. 2025b. SiMBA-TS: Simplified channel mixing and mamba for long-term time series forecasting. In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5.
- Dunlu Peng and Qiqi Lin. Samforecast: A hybrid model of self-attention and mamba with adaptive wavelet transform for time series forecasting. Expert Syst. With Appl.> (2025) 2025, 129498.
- Sijia Peng, Yun Xiong, Yangyong Zhu, and Zhiqiang Shen. Mamba or transformer for time series forecasting? mixture of universals (mou) is all you need. arXiv 2024, arXiv:2408.15997.
- Erick A Perez Alday, Annie Gu, Amit J Shah, Chad Robichaux, An-Kwok Ian Wong, Chengyu Liu, Feifei Liu, Ali Bahrami Rad, Andoni Elola, Salman Seyedi, et al. Classification of 12-lead ecgs: the physionet/computing in cardiology challenge 2020. Physiol. Meas. 2020, 41(12), 124003.
- Pedro Pessoa, Paul Campitelli, Douglas P Shepherd, S Banu Ozkan, and Steve Pressé. Mamba time series forecasting with uncertainty quantification. Mach. Learn. Sci. Technol. 2025, 6(3), 035012. [CrossRef]
- Yupeng Qiang, Xunde Dong, Xiuling Liu, Yang Yang, Yihai Fang, and Jianhong Dou. Ecgmamba: Towards efficient ecg classification with bissm. arXiv 2024, arXiv:2406.10098.
- Haoyu Qin, Yungang Chen, Qianchuan Jiang, Pengchao Sun, Xiancai Ye, and Chao Lin. Metmamba: Regional weather forecasting with spatial-temporal mamba model. arXiv 2024, arXiv:2408.06400.
- Shuxin Qin, Jing Zhu, Aipeng Guo, Yansong Yang, Lu Wang, and Gaofeng Tao. MambaAD: Multivariate time series anomaly detection in IoT via multi-view Mamba. Neurocomputing (2025) 2025b, 131385.
- Zhenkai Qin, Baozhong Wei, Yujia Zhai, Ziqian Lin, Xiaochuan Yu, and Jingxuan Jiang. CMDMamba: dual-layer Mamba architecture with dual convolutional feed-forward networks for efficient financial time series forecasting. Front. Artif. Intell. 2025a, 8(2025), 1599799.
- Xiangfei Qiu, Hanyin Cheng, Xingjian Wu, Junkai Lu, Jilin Hu, Chenjuan Guo, Christian S Jensen, and Bin Yang. A comprehensive survey of deep learning for multivariate time series forecasting: A channel strategy perspective. arXiv 2025, arXiv:2502.10721.
- Xiangfei Qiu, Jilin Hu, Lekui Zhou, Xingjian Wu, Junyang Du, Buang Zhang, Chenjuan Guo, Aoying Zhou, Christian S Jensen, Zhenli Sheng, et al. Tfb: Towards comprehensive and fair benchmarking of time series forecasting methods. arXiv 2024, arXiv:2403.20150.
- Haohao Qu, Liangbo Ning, Rui An, Wenqi Fan, Tyler Derr, Hui Liu, Xin Xu, and Qing Li. A survey of mamba. arXiv 2024, arXiv:2408.01129.
- Kashif Rasul, Calvin Seward, Ingmar Schuster, and Roland Vollgraf. Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting. International conference on machine learning. PMLR, 2021; pp. 8857–8868.
- Attila Reiss and Didier Stricker. Introducing a new benchmarked dataset for activity monitoring. In 2012 16th international symposium on wearable computers; IEEE, 2012; pp. 108–109.
- Abdellah Zakaria Sellam, Ilyes Benaissa, Abdelmalik Taleb-Ahmed, Luigi Patrono, and Cosimo Distante. Mamba Adaptive Anomaly Transformer with association discrepancy for time series. Eng. Appl. Artif. Intell. 2025, 160, 111685. [CrossRef]
- Zhiqi Shao, Ze Wang, Xusheng Yao, Michael GH Bell, and Junbin Gao. ST-MambaSync: Complement the power of Mamba and Transformer fusion for less computational cost in spatial–temporal traffic forecasting. Inf. Fusion 2025, 117, 102872. [CrossRef]
- Zhuangwei Shi. MambaStock: Selective state space model for stock prediction. arXiv 2024, arXiv:2402.18959.
- Jimmy TH Smith, Andrew Warrington, and Scott W Linderman. Simplified state space layers for sequence modeling. arXiv 2022, arXiv:2208.04933.
- Javier Solís-García, Belén Vega-Márquez, Juan A Nepomuceno, and Isabel A Nepomuceno-Chamorro. Timba: Time series imputation with bi-directional mamba blocks and diffusion models. arXiv 2024, arXiv:2410.05916.
- Shriyank Somvanshi, Md Monzurul Islam, Mahmuda Sultana Mimi, Sazzad Bin Bashar Polock, Gaurab Chhetri, Anandi Dutta, Amir Rafe, and Subasish Das. Advancing Intelligent Sequence Modeling: Evolution, Trade-offs, and Applications of State- Space Architectures from S4 to Mamba. arXiv 2026, arXiv:2503.18970.
- Su, Ya; Zhao, Youjian; Niu, Chenhao; Liu, Rong; Sun, Wei; Pei, Dan. Ya Su, Youjian Zhao, Chenhao Niu, Rong Liu, Wei Sun, and Dan Pei. 2019. Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 2828–2837. 2019.
- Tashiro, Yusuke; Song, Jiaming; Song, Yang; Ermon, Stefano. Yusuke Tashiro, Jiaming Song, Yang Song, and Stefano Ermon. Csdi: Conditional score-based diffusion models for probabilistic time series imputation. Adv. Neural Inf. Process. Syst. 2021, 34, 24804–24816. [Google Scholar]
- Tegon, Anna; Ingolfsson, Thorir Mar; Wang, Xiaying; Benini, Luca; Li, Yawei. Anna Tegon, Thorir Mar Ingolfsson, Xiaying Wang, Luca Benini, and Yawei Li. 2025. FEMBA: Efficient and scalable EEG analysis with a bidirectional Mamba foundation model. In 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 1–7.
- Xinying Tian, Lei Zhao, Jun Xiong, Xin Hao, and Yuan Jiang. IoT Data Imputation Accuracy Enhancement: A Spatiotemporal Causal Mamba-Diffusion Imputation Model. IEEE Internet of Things Journal.
- Artur Trindade. ElectricityLoadDiagrams20112014. UCI Mach. Learn. Repos. 2015, 10, C58C86.
- Maolin Wang and Guoxiang Tong. RLMamba: Integrating residual learning with Mamba for long-term time series forecasting. Expert Syst. With Appl. 2025, 278, 127362. [CrossRef]
- Shiyu Wang, Jiawei Li, Xiaoming Shi, Zhou Ye, Baichuan Mo, Wenze Lin, Shengtong Ju, Zhixuan Chu, and Ming Jin. Timemixer++: A general time series pattern machine for universal predictive analysis. ICLR 2025, 2025, 16980–17016.
- Wenjing Wang, Qilei Li, Ziwu Jiang, Deqian Fu, and David Camacho. An efficient framework for general long-horizon time series forecasting with Mamba and Diffusion Probabilistic Models. Eng. Appl. Artif. Intell. 2025, 162(2025), 112525. [CrossRef]
- Zihan Wang, Fanheng Kong, Shi Feng, Ming Wang, Xiaocui Yang, Han Zhao, Daling Wang, and Yifei Zhang. Is mamba effective for time series forecasting? Neurocomputing 2025a, 619, 129178.
- Guang-Yu Wei, Hui-Chuan Huang, Zhi-Qing Zhong, Wen-Long Sun, Yong-Hao Wan, and Ai-Min Feng. DualMamba: a patch-based model with dual mamba for long-term time series forecasting. Front. Comput. Sci. 2026, 20(2), 1–12.
- Qingsong Wen, Tian Zhou, Chaoli Zhang, Weiqi Chen, Ziqing Ma, Junchi Yan, and Liang Sun. Transformers in time series: A survey. arXiv 2022, arXiv:2202.07125.
- Haixu Wu, Tengge Hu, Yong Liu, Hang Zhou, Jianmin Wang, and Mingsheng Long. Timesnet: Temporal 2d-variation modeling for general time series analysis. arXiv 2022, arXiv:2210.02186.
- Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 2021, 34, 22419–22430.
- Li Wu, Wenbin Pei, Jiulong Jiao, and Qiang Zhang. Umambatsf: A u-shaped multi-scale long-term time series forecasting method using mamba. arXiv 2024b, arXiv:2410.11278.
- Yizhuo Wu, Francesco Fioranelli, and Chang Gao. RadMamba: Efficient Human Activity Recognition Through a Radar-Based Micro-Doppler-Oriented Mamba State-Space Model. IEEE Trans. Radar Syst. 2025a, 4(2025), 261–272.
- Yuhan Wu, Xiyu Meng, Huajin Hu, Junru Zhang, Yabo Dong, and Dongming Lu. Affirm: Interactive mamba with adaptive fourier filters for long-term time series forecasting. Proc. AAAI Conf. Artif. Intell. 2025b, Vol. 39, 21599–21607.
- Zexue Wu, Yifeng Gong, and Aoqian Zhang. Dtmamba: Dual twin mamba for time series forecasting. arXiv 2024a, arXiv:2405.07022.
- Yadong Xiao, Junzhong Ji, Minglong Lei, Muhua Wang, and Tingzhao Yu. WaveST-Mamba: A joint framework of wavelet transform with Mamba for stable and fluctuating patterns in spatio-temporal weather forecasting. Eng. Appl. Artif. Intell. 2026, 166, 113684. [CrossRef]
- Sijie Xiong, Shuqing Liu, Cheng Tang, Fumiya Okubo, Haoling Xiong, and Atsushi Shimada. Attention Mamba: Time Series Modeling with Adaptive Pooling Acceleration and Receptive Field Enhancements. arXiv 2025, arXiv:2504.02013.
- Zilu Xiu, Jionghuan Chen, Yong Zhong, and Qixue He. 2025. MambaTAD: A Mamba-Based Contrastive Learning Framework for Time Series Anomaly Detection. In 2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD). IEEE, 203–211. IEEE.
- Xiongxiao Xu, Canyu Chen, Yueqing Liang, Baixiang Huang, Guangji Bai, Liang Zhao, and Kai Shu. 2025. SST: Multi-Scale Hybrid Mamba-Transformer Experts for Time Series Forecasting. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management. 3655–3665. 2025.
- Yinlong Xu, Zitai Kong, Yixuan Wu, Yue Wang, Xiaoqiang Liu, Yingzhou Lu, Jian Wu, and Hongxia Xu. MambaCapsule: Toward Transparent Cardiac Disease Diagnosis With Electrocardiography Using Mamba Capsule Network. IEEE Transactions on Computational Social Systems 2026.
- Jin Yan and Yuling Huang. MambaLLM: Integrating macro-index and micro-stock data for enhanced stock price prediction. Mathematics 2025, 13(10), 1599.
- Wenbo Yan, Shurui Wang, and Ying Tan. 2025. Hierarchical Information-Guided Spatio-Temporal Mamba for Stock Time Series Forecasting. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 22–40.
- Yiyuan Yang, Ming Jin, Haomin Wen, Chaoli Zhang, Yuxuan Liang, Lintao Ma, Yi Wang, Chenghao Liu, Bin Yang, Zenglin Xu, et al. A survey on diffusion models for time series and spatio-temporal data. Comput. Surv. 2026, 58(8), 1–39.
- Hang Ye, Gaoxiang Duan, Haoran Zeng, Yangxin Zhu, Lingxue Meng, Xiaoying Zheng, and Yongxin Zhu. 2025. KARMA: A Multilevel Decomposition Hybrid Mamba Framework for Multivariate Long-Term Time Series Forecasting. In International Conference on Wireless Artificial Intelligent Computing Systems and Applications. Springer, 266–276. Springer.
- Zuochen Ye. ss-Mamba: Semantic-Spline Selective State-Space Model. arXiv 2025, arXiv:2506.14802.
- Ailing Zeng, Muxi Chen, Lei Zhang, and Qiang Xu. Are transformers effective for time series forecasting? Proc. AAAI Conf. Artif. Intell. 2023, 37, 11121–11128.
- Chaolv Zeng, Zhanyu Liu, Guanjie Zheng, and Linghe Kong. CMamba: channel correlation enhanced state space models for multivariate time series forecasting. arXiv 2024, arXiv:2406.05316.
- Chao Zhang, Weirong Cui, and Jingjing Guo. Mssc-bimamba: Multimodal sleep stage classification and early diagnosis of sleep disorders with bidirectional mamba. arXiv 2024a, arXiv:2405.20142.
- Da Zhang, Bingyu Li, Zhiyuan Zhao, Yanhan Zhang, Junyu Gao, Feiping Nie, and Xuelong Li. FAIM: Frequency-Aware Interactive Mamba for Time Series Classification. arXiv 2025a, arXiv:2512.07858.
- Huaicheng Zhang, Ruoxin Wang, Chenlian Zhou, Jiguang Shi, Yue Ge, Zhoutong Li, Sheng Chang, Hao Wang, Jin He, and Qijun Huang. S2M2ECG: Spatio-temporal bi-directional State Space Model Enabled Multi-branch Mamba for ECG. arXiv 2025b, arXiv:2509.03066.
- Kaiyuan Zhang, Zheng Zhang, and Rui Luo. 2025c. STMGNN: A Spatio-Temporal Graph Model with Mamba for Long-Term Traffic Flow Forecasting. In China National Conference on Big Data and Social Computing. Springer, 489–503. Springer.
- Michael Zhang, Khaled K Saab, Michael Poli, Tri Dao, Karan Goel, and Christopher Ré. Effectively modeling time series with simple discrete state spaces. arXiv 2023, arXiv:2303.09489.
- Zhang, Wenqing; Huang, Junming; Wang, Ruotong; Wei, Changsong; Huang, Wenqian; Qiao, Yuxin. Wenqing Zhang, Junming Huang, Ruotong Wang, Changsong Wei, Wenqian Huang, and Yuxin Qiao. 2024b. Integration of mamba and transformer-mat for long-short range time series forecasting with application to weather dynamics. In 2024 International Conference on Electrical, Communication and Computer Engineering (ICECCE). IEEE, 1–6.
- Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. Informer: Beyond efficient transformer for long sequence time-series forecasting. Proc. AAAI Conf. Artif. Intell. 2021, Vol. 35, 11106–11115.
- Xinliang Zhou, Yuzhe Han, Zhisheng Chen, Chenyu Liu, Yi Ding, Ziyu Jia, and Yang Liu. Bit-mamsleep: Bidirectional temporal mamba for eeg sleep staging. arXiv 2024, arXiv:2411.01589.
- Lin Zhu and Hongfa Liu. DIMformer: A Dynamic Inverted Transformer With Mamba-Cross-Variable Linear Attention for Multivariate Time Series Forecasting. IEEE Access 2025, 13, 214267–214279. [CrossRef]
- Linghao Zou, Yuzhe Huang, Jun Shen, and Huahu Xu. 2025. BiG-Mamba: Bidirectional Graph and Mamba Modeling for Multivariate Time Series Forecasting. In International Conference on Intelligent Computing. Springer, 298–309. Springer.
| Symbol | Meaning |
|---|---|
| L | lookback (input) window length |
| H | forecast horizon |
| C | number of channels (variates) |
| P | patch length used by patch-tokenization |
| N | SSM latent state dimension |
| multivariate input matrix | |
| forecast | |
| input vector at time step t | |
| SSM hidden state | |
| SSM parameter matrices | |
| discretized SSM matrices | |
| input-dependent step size | |
| learnable forecaster | |
| asymptotic complexity |
| Method | Area | Backbone | Channel |
|---|---|---|---|
| PowerMamba [63] | energy | dual-Mamba | CC |
| BiG-Mamba [131] | spatio-temp. | bi-Mamba+graph | CC |
| ST-MambaSync [88] | spatio-temp. | bi-Mamba+ST-Trans | CC |
| SpoT-Mamba [21] | spatio-temp. | Mamba+graph walks | CC |
| Chimera [10] | spatio-temp. | 2D SSM scan | CD |
| AFFiRM [108] | climate | Mamba+FFT | CD |
| MAT [127] | climate | Mamba+Attn. | CD |
| HSTM [116] | finance/spatial | spatial+temp. | CC |
| CMDMamba [81] | finance | Mamba+CNN | CD |
| MambaStock [89] | finance | pure Mamba | CI |
| Graph-Mamba [62] | finance/graph | bi-Mamba+GCN | CC |
| HyMaTE [65] | healthcare/EHR | Mamba+Transf. | CC |
| DGMamba [60] | domain gen. | Mamba | CI |
| Mamba4Cast [13] | zero-shot | Mamba-2 | CI |
| SpaceTime [126] | foundation | S4 (pre-Mamba) | CI |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).