Mamba-3 is a new state space model (SSM) designed with inference efficiency as the primary goal, contrasting with Mamba-2's training-speed focus. Key improvements include a more expressive recurrence via exponential-trapezoidal discretization, complex-valued state tracking, and a MIMO (multi-input, multi-output) variant that
Table of contents
The Mamba-3 modelArchitectureEmpirical resultsKernels here, there, and everywhereNext upReferencesSort: