Google Unveils Gemini 3 Flash: Free, Lightning‑Fast, and Outperforms Its Predecessor

Google released Gemini 3 Flash without warning, offering Pro‑level intelligence at Flash‑speed, costing just $0.5 per million input tokens and $3 per million output tokens, delivering three‑times faster inference than Gemini 2.5 Pro and surpassing it on benchmarks such as GPQA Diamond (90.4%), SWE‑bench (78.0%) and MMMU‑Pro (81.2%), while being freely accessible to all users and developers via the Gemini app, AI Studio, or API.

AI Insight Log
AI Insight Log
AI Insight Log
Google Unveils Gemini 3 Flash: Free, Lightning‑Fast, and Outperforms Its Predecessor

Google announced the unexpected launch of Gemini 3 Flash, a large language model that combines the intelligence of Gemini 3 Pro with the speed of the Flash series and a dramatically lower price.

Speed and Low Cost

The Flash series is traditionally associated with high speed but modest reasoning; Gemini 3 Flash breaks that stereotype. Google states the model reaches "Frontier Intelligence" while pushing cost and latency to the limit. It is three times faster than the previous Gemini 2.5 Pro and costs only $0.5 per million input tokens and $3 per million output tokens, far cheaper than GPT‑4o or Claude 3.5 Sonnet.

Gemini 3 Flash cost-performance curve
Gemini 3 Flash cost-performance curve

Performance Beats Previous Pro

Despite the low price, Gemini 3 Flash outperforms its predecessor on several hard benchmarks.

Scientific Knowledge (GPQA Diamond): 90.4% accuracy, exceeding 2.5 Flash (82.8%) and 2.5 Pro (86.4%).

Code Ability (SWE‑bench Verified): 78.0% score, beating 2.5 Pro and slightly surpassing Gemini 3 Pro (76.2%).

Multimodal (MMMU‑Pro): 81.2% score, showing a large improvement in image and video understanding.

Benchmark comparison
Benchmark comparison

For example, when a complex app‑operation video is fed to the model, it can recognize UI changes, identify elements, and infer the underlying interaction logic, a capability that is especially valuable for front‑end development and testing.

The model is also adept at writing code, interpreting complex charts, and answering deep scientific questions.

Who Can Use It?

The model is available immediately and free of charge. Regular users see Gemini 3 Flash as the default model in the Gemini app or web interface. Developers can access it through Google AI Studio or the Gemini API.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

large language modelbenchmarkGoogle AImultimodalpricingGemini 3 Flash
AI Insight Log
Written by

AI Insight Log

Focused on sharing: AI programming | Agents | Tools

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.