exploration on Mamba models are still within the earliest phase—Mamba was first released in a 2023 paper—but the novel architecture presents substantial theoretical gain in both velocity and context size.
although https://saulqovj981281.pointblog.net/the-smart-trick-of-mistral-ai-that-nobody-is-discussing-77243606