Google’s latest attempt to fix token quotas is here: Say hello to Gemini 3.5 Flash Low

Credit: Generated by Gemini

TL;DR

Google has introduced Gemini 3.5 Flash (Low) to optimize token usage for simple tasks, following user complaints about tight limits in Antigravity.
The new Low variant generates roughly 45% fewer tokens than the original model, which has been seemingly renamed to Gemini 3.5 Flash (Medium).
Alongside the new model, Google has reset the Gemini quota across all paid and free plans to assist users with software engineering tasks.

Google’s latest Gemini 3.5 Flash model has been quite a success. However, the company paired it with a quietly nerfed AI Pro plan, and the tighter Gemini usage limits ended up frustrating users, especially those who used it for coding in Antigravity. Google reacted by increasing Antigravity’s limits by 9x (across two increases), but that still doesn’t seem enough. Now, Google has introduced a new Gemini model that uses even fewer tokens than Gemini 3.5 Flash.

Varun Mohan, Director at Google DeepMind, working on Antigravity, noted user concerns that Antigravity was using too many tokens for simple tasks. Consequently, Google has now introduced Gemini 3.5 Flash (Low) as a way to optimize token usage for these simple tasks.

Related Stories

SteelSeries Aerox 3 Wireless mouse hits near-record low of $53.99 in rare deal

This sharing app brings Quick Share support to Android phones without Google services

YouTube Music bug doesn’t play the next song, but there may be an easy fix