RecurrentGemma is an open-weights Language Model by Google DeepMind based on the novel Griffin architecture. It achieves fast inference when generating long sequences by using a mixture of local attention and linear recurrences. The Flax implementation is recommended for most users. The repository includes the model

3m read time From github.com
Post cover image
Table of contents
Quick startExamplesSystem RequirementsContributingLicenseDisclaimer

Sort: