Google rolls out Gemini 3 with Deep Think mode, enhanced coding and agentic actions

Google has introduced Gemini 3, the next generation of its AI model series, bringing upgrades in long-form reasoning, multimodal interpretation, interface generation, developer tools, and agent-based task execution.

The rollout includes Gemini 3 Pro, a new Deep Think mode, expanded multimodal learning workflows, redesigned app capabilities, and the first release of Gemini Agent for multi-step automation.

Sundar Pichai explains Gemini’s growth and direction

Google and Alphabet CEO Sundar Pichai shared a brief note highlighting the progress of the Gemini program, which began nearly two years ago. He said Gemini has grown into one of Google’s largest scientific and product efforts, supported by an integrated full-stack approach combining infrastructure, research, models, and products.

Key points he highlighted:

Gemini now powers multiple Google products at large scale.
AI Overviews reach around 2 billion monthly users.
The Gemini app has roughly 650 million monthly users.
More than 13 million developers are building with Google’s generative AI tools.
Each Gemini generation added improvements: multimodality, longer context, higher-level reasoning, and agentic capabilities.

Pichai said Gemini 3 brings together advancements from earlier versions to deliver deeper reasoning, better intent understanding, and more accurate multi-step interpretation with fewer instructions. He confirmed that Gemini 3 is rolling out across Google’s ecosystem, including AI Mode in Search, the Gemini app, AI Studio, Vertex AI, and Google Antigravity. This marks the first time a Gemini release has launched inside Search on day one.

He added that Gemini 3 represents the next phase of Google’s AI roadmap, with continued focus on intelligence, agentic systems, and personalization in future releases.

Enhanced reasoning and benchmark gains

Google says Gemini 3 Pro delivers stronger reasoning, clearer responses, and better multimodal grounding. Benchmark results include:

1501 Elo on LMArena
37.5% on Humanity’s Last Exam (no tools)
91.9% on GPQA Diamond
23.4% on MathArena Apex
81% on MMMU-Pro
87.6% on Video-MMMU
72.1% on SimpleQA Verified

The model is optimized to avoid generic phrasing, offer more direct answers, and maintain accuracy across text, audio, images, video, and code.

Gemini 3 Deep Think

Deep Think mode extends Gemini 3’s step-by-step reasoning and posts higher scores than Gemini 3 Pro on reasoning benchmarks.

Benchmark results:

41.0% on Humanity’s Last Exam (no tools)
93.8% on GPQA Diamond
45.1% on ARC-AGI-2 (with code execution, ARC Prize verified)

Deep Think is undergoing extended safety reviews before wider rollout.
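
Two of the benchmarks above are reported for both Gemini 3 Pro and Deep Think, so the gains can be computed directly. The following is a minimal Python sketch that uses only the scores quoted in this article; the dictionary layout and the function name `point_gains` are illustrative assumptions, not anything published by Google. ARC-AGI-2 is left out because the article gives no Gemini 3 Pro figure for it.

```python
# Scores in percent, as quoted in this article. The data layout is illustrative.
GEMINI_3_PRO = {"Humanity's Last Exam (no tools)": 37.5, "GPQA Diamond": 91.9}
DEEP_THINK = {"Humanity's Last Exam (no tools)": 41.0, "GPQA Diamond": 93.8}


def point_gains(base, improved):
    """Return percentage-point gains for benchmarks present in both dicts."""
    return {
        name: round(improved[name] - base[name], 1)
        for name in base
        if name in improved
    }


gains = point_gains(GEMINI_3_PRO, DEEP_THINK)
for name, gain in gains.items():
    print(f"{name}: +{gain} points")
```

On these figures, Deep Think is 3.5 points ahead on Humanity’s Last Exam and 1.9 points ahead on GPQA Diamond.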

Learning and multimodal workflows

Gemini 3 supports expanded learning tasks through multimodal understanding and a 1M-token context window, enabling:

Converting handwritten multilingual notes into structured documents
Summarizing long videos, lectures, or research papers
Creating flashcards, diagrams, or small tools
Analyzing sports video with detailed performance breakdowns
Generating simulations, visuals, and layouts directly inside AI Mode in Search

Coding and developer improvements

Gemini 3 improves instruction adherence, zero-shot coding, and agentic coding.

Benchmark highlights:

1487 Elo on WebDev Arena
54.2% on Terminal-Bench 2.0
76.2% on SWE-bench Verified

Developers can access Gemini 3 via:

Google AI Studio
Vertex AI
Gemini CLI
Google Antigravity

Google Antigravity: Agent-first development

Antigravity offers a development environment where Gemini 3 can:

Plan tasks
Write and execute code
Run terminal commands
Validate outputs using a browser-based virtual computer

It integrates:

Gemini 3 Pro
Gemini 2.5 Computer Use
Nano Banana (Gemini 2.5 Image)

This enables parallel, consistent end-to-end software workflows.

Generative interfaces in the Gemini app

Two interface-generation modes are rolling out:

Visual Layout: Creates structured, magazine-style arrangements with images and modular sections.
Dynamic View: Uses agentic coding to generate custom interactive interfaces.

Google also notes that Gemini 3 delivers its best-ever vibe coding performance inside Canvas, enabling more feature-rich app generation within the workspace. These capabilities are part of the revamped Gemini app, which now includes a “Thinking” model selector and a My Stuff library for saved outputs.

Gemini Agent for multi-step automation

Gemini Agent, built using insights from Project Mariner, can break down and execute complex tasks. It can:

Organize Gmail inboxes
Prioritize emails
Draft replies
Manage Google Calendar
Extract travel details from emails
Compare options and generate summaries or plans

The agent confirms sensitive actions such as purchases or message sending.
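
The confirmation step described above resembles a common pattern in agent design, where sensitive actions are gated behind explicit user approval. The sketch below illustrates that general pattern in Python. It is not Google’s implementation: the `Action` type, the `SENSITIVE_KINDS` set, and the `run_action` function are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical categories; the article names purchases and message sending.
SENSITIVE_KINDS = {"purchase", "send_message"}


@dataclass
class Action:
    kind: str
    description: str


def run_action(action: Action, confirm: Callable[[Action], bool]) -> str:
    """Run an action, asking for confirmation first if it is sensitive."""
    if action.kind in SENSITIVE_KINDS and not confirm(action):
        return f"skipped: {action.description}"
    return f"done: {action.description}"


# Example: a user who declines every confirmation request.
decline = lambda action: False
print(run_action(Action("draft_reply", "draft a reply"), decline))
print(run_action(Action("purchase", "book a flight"), decline))
```

In this sketch, drafting a reply proceeds without a prompt, while the purchase is skipped because the user declined it.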

Long-horizon planning

Gemini 3 strengthens multi-step planning and avoids tool-use drift during long tasks. It leads Vending-Bench 2, a benchmark designed to test year-long operational decision-making.

Safety improvements

Gemini 3 ships with stronger protections and has undergone extensive internal and external safety assessments.

Key enhancements:

Reduced sycophancy
Better resistance to prompt manipulation
Stronger cyberattack defenses
External reviews with the UK AISI, Apollo, Vaultis, and Dreadnode

Availability

Gemini 3 Pro

Rolling out globally in the Gemini app and AI Mode in Search
Available to Google AI Plus, Pro, and Ultra subscribers
U.S. college students get 1 year of Google AI Pro free

Gemini 3 Deep Think

In extended safety review
Coming soon to Google AI Ultra users

Developers

Access via Gemini API, AI Studio, Vertex AI, Gemini CLI, and Antigravity
Supported on Cursor, GitHub, JetBrains, Manus, and Replit

Enterprise

Available via Vertex AI and Gemini Enterprise

Generative interfaces

Visual Layout and Dynamic View rolling out gradually
Some users may receive only one feature initially

Google says additional Gemini 3 series models will be released soon and that the team looks forward to user feedback during the rollout.