Introduction of SiMBA architecture: a new architecture that utilizes EinFFT for channel modeling and Mamba for sequence modeling, effectively addressing stability issues observed in Mamba when scaling to large networks. SiMBA demonstrates superior performance in multiple evaluation metrics and bridges the performance gap with state-of-the-art attention-based transformers on the ImageNet dataset and six standard time series datasets.
Sort: