Gemma 3n

Gemma 3n was created in close collaboration with leading mobile hardware manufacturers. It shares architecture with the next generation of Gemini Nano to empower a new wave of intelligent, on-device applications.

Slide 1 of 4

Optimized on-device performance

Engineered for speed and quality, with a significantly reduced memory footprint.

Privacy-first, offline-ready

Enables developers to build intelligent, interactive features that respect user privacy and work reliably offline.

Multimodal understanding

Understands and processes audio, text, images, and videos, and is capable of both transcription and translation.

Dynamic resource usage

Features a 4B active memory footprint with nested 2B active memory submodel – with the ability to create submodels for quality-latency tradeoffs.

Live interactive applications

Create apps that understand and respond to real-time visual and audio cues from the user's environment.

Applications based on deep understanding

Using combined audio, image, video, and text inputs—all processed privately on-device.

Advanced audio-centric applications

Including real-time speech transcription, translation, and rich voice-driven interactions.

Optimized on-device performance

Privacy-first, offline-ready

Multimodal understanding

Dynamic resource usage

Live interactive applications

Applications based on deep understanding

Advanced audio-centric applications

Gemini API

Google AI Edge

Hugging Face

Ollama

Kaggle

LM Studio