Gemini 3.5 Flash on AI Gateway

Vercel has added Gemini 3.5 Flash to its AI Gateway, offering developers access to the model with improved coding proficiency, reasoning, and multi-turn coherence. The model defaults to a medium thinking level to balance response quality with cost-efficient generation, though it does not support temperature, topP, topK, or thinking_budget parameters. AI Gateway provides a unified API for model calls, usage tracking, and performance optimizations, including custom reporting and intelligent provider routing.

Gemini 3.5 Flash is now available on Vercel AI Gateway (https://vercel.com/ai-gateway).

This model has improved coding proficiency and parallel agentic execution loops versus previous Flash versions. It also brings improvements to core reasoning, instruction following, and multi-turn coherence, with stronger performance on complex tasks and higher-quality reasoning traces in thinking mode. 3.5 Flash defaults to the medium thinking level, balancing response quality with faster, more cost-efficient generation.

To use Gemini 3.5 Flash, set model to google/gemini-3.5-flash in the AI SDK (https://ai-sdk.dev/). Note that temperature, topP, topK, and thinking budget are not supported by this model.

AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in custom reporting (https://vercel.com/docs/ai-gateway/capabilities/custom-reporting), observability (https://vercel.com/docs/observability/ai-sdk-observability), Bring Your Own Key (https://vercel.com/docs/ai-gateway bring-your-own-key) support, and intelligent provider routing with automatic retries.

Learn more about AI Gateway (https://vercel.com/docs/ai-gateway), view the AI Gateway model leaderboard (https://vercel.com/ai-gateway/leaderboards), or try it in our model playground (https://vercel.com/ai-gateway/models/gemini-3.5-flash).
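
As a rough illustration of the model setting and the unsupported sampling parameters noted above, the sketch below builds call options for google/gemini-3.5-flash and drops temperature, topP, and topK before the request is made. The CallSettings shape and the buildCallOptions helper are hypothetical names introduced here for illustration and are not part of the AI SDK. The commented usage at the end assumes the ai package is installed, that a plain string model ID is routed through AI Gateway, and that gateway credentials are already configured.

```typescript
// Model ID from the announcement above.
const MODEL_ID = "google/gemini-3.5-flash";

// Sampling settings the announcement lists as unsupported for this model.
const UNSUPPORTED_KEYS = ["temperature", "topP", "topK"] as const;
type UnsupportedKey = (typeof UNSUPPORTED_KEYS)[number];

// Hypothetical settings shape for illustration; not an AI SDK type.
interface CallSettings {
  prompt: string;
  maxOutputTokens?: number;
  temperature?: number;
  topP?: number;
  topK?: number;
}

// Options that remain once the unsupported settings are removed.
interface CleanOptions {
  model: string;
  prompt: string;
  maxOutputTokens?: number;
}

// Hypothetical helper: copies only supported settings and reports
// which unsupported ones the caller had supplied.
function buildCallOptions(settings: CallSettings): {
  options: CleanOptions;
  dropped: UnsupportedKey[];
} {
  const options: CleanOptions = { model: MODEL_ID, prompt: settings.prompt };
  if (settings.maxOutputTokens !== undefined) {
    options.maxOutputTokens = settings.maxOutputTokens;
  }
  const dropped = UNSUPPORTED_KEYS.filter((key) => settings[key] !== undefined);
  return { options, dropped };
}

// Usage sketch (assumption: run inside a project with the `ai` package):
//
//   import { generateText } from "ai";
//   const { options, dropped } = buildCallOptions({
//     prompt: "Summarize this changelog entry.",
//     temperature: 0.2,
//   });
//   if (dropped.length > 0) console.warn("Ignored settings:", dropped);
//   const { text } = await generateText(options);
```

Returning the list of dropped settings, rather than silently discarding them, makes it easier to notice when shared configuration written for other models does not apply to this one.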