With the release of Gemma 4, Google aims to enable local, agentic AI for Android development through a family of models designed to support the entire software lifecycle, from coding to production.

InfoQ is a leading online platform for software developers, architects, and technical leaders, providing news, articles, presentations, and interviews on a wide range of topics, including agile practices, DevOps, microservices, and emerging technologies. With a focus on quality content and expert insights, InfoQ helps professionals stay informed about the latest trends, best practices, and industry developments. Developers can learn from real-world experiences, gain  knowledge, and connect with peers in the global software community through InfoQ's diverse and engaging content.

InfoQ

Google has released Gemma 4, a family of models focused on local, on-device AI inference for Android development. The lineup includes three models: Gemma E2B (8GB RAM, 2GB storage), Gemma E4B (12GB RAM, 4GB storage), and Gemma 26B MoE (24GB RAM, 17GB storage). The 26B MoE model targets desktop coding assistance in Android Studio, enabling agentic coding without sharing code with cloud providers — ideal for privacy-sensitive environments. The two smaller models target on-device inference, with E2B offering 3x faster inference than E4B. All models are up to 4x faster than previous versions and use up to 60% less battery. Gemma 4 also serves as the foundation for the next Gemini Nano generation. Developers can access the models via the AICore Developer Preview, Ollama, or LM Studio.

Google Released Gemma 4 with a Focus On Local-First, On-Device AI Inference