Submitted to arXiv on 1 December 2023, the paper “Mamba: Linear-Time Sequence Modeling with Selective State Spaces” proposed an interesting approach to sequence modeling. The authors —…

Towards Data Science

This post discusses the Mamba model, which uses selective state space models (SSMs) for sequence modeling. It addresses the limitations of multi-head attention in Transformers and explains how Mamba scales linearly with sequence length. It also covers the core issue with SSMs and walks through an implementation of Mamba in Keras and TensorFlow.

Mamba: SSM, Theory, and Implementation in Keras and TensorFlow

The backbone of Mamba: State Space Models