
- Google has introduced Gemini 3.5 Flash (Low) to optimize token usage for simple tasks, following user complaints about tight limits in Antigravity.
- The new Low variant generates roughly 45% fewer tokens than the original model, which has been seemingly renamed to Gemini 3.5 Flash (Medium).
- Alongside the new model, Google has reset the Gemini quota across all paid and free plans to assist users with software engineering tasks.
Google’s latest Gemini 3.5 Flash model has been quite a success. However, the company paired it with a quietly nerfed AI Pro plan, and the tighter Gemini usage limits ended up frustrating users, especially those who used it for coding in Antigravity. Google reacted by increasing Antigravity’s limits by 9x (across two increases), but that still doesn’t seem enough. Now, Google has introduced a new Gemini model that uses even fewer tokens than Gemini 3.5 Flash.
Varun Mohan, Director at Google DeepMind, working on Antigravity, noted user concerns that Antigravity was using too many tokens for simple tasks. Consequently, Google has now introduced Gemini 3.5 Flash (Low) as a way to optimize token usage for these simple tasks.