Google released a new version of Gemma 3, optimized with ‘Quantization-Aware Training’
This reduces the 27B model's memory requirements, enabling it to run on consumer GPUs, like the NVIDIA RTX 3090, with maintained performance