105.8K Downloads
Description
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models
Use cases
Minimum system memory
Tags
Last update
Updated on May 17byREADME
Optimized with Quantization Aware Training for improved 4-bit performance.
Supports a context length of 128k tokens, with a max output of 8192.
Multimodal supporting images normalized to 896 x 896 resolution.
Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning.
Sources
The underlying model files this model uses