Skip to content

Model request: RecurrentGemma #3216

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
2 tasks done
redbrain opened this issue May 9, 2025 · 0 comments
Open
2 tasks done

Model request: RecurrentGemma #3216

redbrain opened this issue May 9, 2025 · 0 comments

Comments

@redbrain
Copy link

redbrain commented May 9, 2025

Model description

RecurrentGemma is a family of open language models built on Griffin, a novel recurrent architecture developed at Google. Both pre-trained and instruction-tuned versions are available in English.

Like Gemma, RecurrentGemma models are well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning. Because of its novel architecture, RecurrentGemma requires less memory than Gemma and achieves faster inference when generating long sequences.

RecurrentGemma is already in transformers; it just lacks the features that TGI's officially supported models have (ie sharding).

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

https://ai.google.dev/gemma/docs/recurrentgemma/model_card
https://huggingface.co/google/recurrentgemma-2b-it
https://www.kaggle.com/models/google/recurrentgemma
https://github.com/huggingface/transformers/tree/main/src/transformers/models/recurrent_gemma
https://github.com/google/gemma.cpp

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant