Frontier Tech

Gemini 3.5 Flash and Omni Explained [What Changes]

Jun 14, 2026

Gemini 3.5 is Google's latest generation of AI models — announced at Google I/O 2026 (May 19-20) — that marks the company's declared pivot from a chat-first AI assistant to an agent-first AI platform capable of running tasks in the background, handling any input type, and orchestrating other agents (Google blog).

That pivot has a concrete implication for businesses: the models are faster, cheaper, and more capable on coding and agentic benchmarks than their predecessors, and the ecosystem around them — from desktop assistants to developer platforms — is being rebuilt to run unattended workflows rather than answer-then-stop conversations.

TL;DR: At Google I/O 2026 (May 19-20), Google launched Gemini 3.5 Flash, which Google describes as its strongest agentic and coding model yet, outperforming Gemini 3.1 Pro on challenging coding and agentic benchmarks — Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), and MCP Atlas (83.6%) (Google blog). It also unveiled Gemini Omni — a multimodal model starting with video generation and editing from any input type. The Gemini app gained proactive-assistant features including personalized daily briefs, Gemini Spark, and background inbox management (Google blog). Google's Antigravity developer platform was upgraded with new agent-orchestration capabilities (Google Cloud blog). As of June 2026, these are the foundational building blocks of Google's agent-first AI platform.

Key Takeaways

Gemini 3.5 Flash launched at Google I/O 2026 (May 19-20) and Google describes it as its strongest agentic and coding model yet, outperforming Gemini 3.1 Pro on challenging coding and agentic benchmarks — Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), MCP Atlas (83.6%) — at Flash-series speed and less than half the cost of comparable models (Google blog).
Gemini Omni — an any-input-to-any-output multimodal model — debuted at the same event, starting with video generation and editing capabilities; Google describes it as supporting all major input and output modalities from a single model call (Google blog).
The Gemini app gained proactive-assistant features including personalized daily briefs, a "Gemini Spark" background agent, and AI Inbox management — moving the assistant from reactive to proactive (Google blog).
Antigravity, Google's agent-first development platform, received new agent-orchestration capabilities at I/O 2026, supporting multi-agent pipelines coordinating specialized agents across long-running workflows (Google Cloud blog).
The shift from reactive chat to background-running agents is the core architectural change — not just a performance upgrade.
Small and mid-size businesses using Google Workspace are the most immediately impacted: the tools they already pay for are now connected to agentic capabilities.

What Happened and When (Timeline)

As of June 2026, here is the documented Google I/O 2026 announcement sequence:

Date	Announcement	Key figures	Source
May 19, 2026	Google I/O 2026 opens; Gemini 3.5 Flash announced	Outperforms Gemini 3.1 Pro on coding + agentic benchmarks	Google blog
May 19, 2026	Gemini Omni announced	Any-input-to-any-output; video gen/editing included	Google blog
May 19, 2026	Gemini app proactive features launched	Daily Brief agent, Gemini Spark background agent, AI Inbox management	Google blog
May 19-20, 2026	Antigravity agent-orchestration upgrades shipped	New multi-agent coordination capabilities on Antigravity platform	Google Cloud blog
June 2026	All announced features in active deployment	Gemini 3.5 Flash available via Google AI and Vertex AI	Google Cloud blog

The Mechanism: How Gemini 3.5 Actually Works

From Benchmark-Chasing to Agent-Building

Previous Gemini releases competed primarily on benchmark scores and context window size. Gemini 3.5 Flash is framed differently: Google describes it as delivering frontier performance for agents and coding at Flash-series latency and less than half the cost of comparable models — a positioning that signals they are optimizing for production deployment in agentic pipelines, not just headline model comparisons.

According to the Google blog, Gemini 3.5 Flash outperforms Gemini 3.1 Pro on challenging coding and agentic benchmarks — Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), and MCP Atlas (83.6%) — that matter most for automated workflows. That is a meaningful claim: it means the faster, cheaper model in the Gemini family now beats the prior-generation premium model on the tasks that actually drive business automation ROI.

Gemini 3.5 Flash benchmarks vs. Gemini 3.1 Pro (per Google blog):

Benchmark	Gemini 3.5 Flash	vs. Gemini 3.1 Pro	Type
Terminal-Bench 2.1	76.2%	Exceeds 3.1 Pro	Coding / agentic
GDPval-AA	1656 Elo	Exceeds 3.1 Pro	Agentic
MCP Atlas	83.6%	Exceeds 3.1 Pro	Agentic
Speed tier	Flash (faster)	Pro (slower)	Latency
Relative cost	Less than half comparable models	Pro pricing	Cost per task

Benchmark scores sourced from Google's own I/O 2026 announcement (Google blog).

Gemini model generations: capability comparison (Google I/O 2026 baseline):

Metric	Gemini 1.5 Pro	Gemini 3.1 Pro	Gemini 3.5 Flash	Source
Coding/agentic benchmarks vs. 3.1 Pro	—	Baseline	Exceeds (Terminal-Bench 76.2%, MCP Atlas 83.6%)	Google blog
Speed tier	Pro	Pro	Flash (faster)	Google blog
Cost vs. comparable models	Flagship price	Flagship price	Less than half comparable models	Google blog
Context window (Flash tier)	1M tokens	1M tokens	1M tokens	Google AI Studio docs
Proactive app agent features	0	0	Daily Brief, Spark, AI Inbox	Google blog

Benchmark scores sourced from Google's own I/O 2026 announcement; independent verification pending.

According to Google Workspace pricing, the 2 Gemini-inclusive Workspace tiers are Standard at $14/user/month and Plus at $22/user/month — the entry points for businesses deploying Gemini proactive features without custom development.

Gemini 3.5 deployment cost and time estimates (as of June 2026, per Google Workspace pricing):

Deployment Path	Monthly Cost (est.)	Setup Time	Admin Hours Saved/Month	Payback (at $50/hr staff cost)
Gemini in Workspace Standard	$14/user/month	2–4 hrs	8–15 hrs/user	1–2 months
Gemini in Workspace Plus	$22/user/month	2–4 hrs	8–15 hrs/user	1–2 months
Antigravity / AI Studio (Gemini 3.5 Flash API)	API pricing (per token)	40–80 hrs dev	20–40 hrs/month	3–6 months
Antigravity custom multi-agent	API + dev hours	80–160 hrs dev	40–80+ hrs/month	4–8 months

Subscription pricing for Workspace tiers from Google Workspace pricing. Hours saved and payback are illustrative estimates based on typical workflow automation; independent verification recommended for your specific use case.

Gemini Omni: Any Input, Any Output

Gemini Omni is a separate model launch within the same I/O 2026 release. According to Google's I/O announcement, Gemini Omni blends 4 input types — image, text, video, and audio — and delivers "a highly intuitive approach to video creation and editing using natural language," with video generation and editing as the launch capability. Google's direction is that the model will produce output from any input type over time.

For businesses, the immediate application is content — video drafts, presentation visuals, social clips — generated from text briefs. The longer-term application is process monitoring: agents that watch video feeds, read documents, and listen to calls, all within a single model call.

Proactive Assistants and Background Agents

The Gemini app gained proactive-assistant features at I/O 2026. According to the Google blog, these include personalized daily briefs that analyze your inbox, calendar, and tasks, plus Gemini Spark — a background agent that proactively sends critical updates and checks with you before taking major actions on your behalf — covering at least 3 distinct workflow surfaces (inbox, calendar, and tasks). This is a structural shift from the assistant answering questions to the assistant monitoring and acting on your behalf between interactions.

Teams already routing documents and calendar events through Google Workspace — including those with US Tech Automations workflows connected to Google services — will find these proactive capabilities plug into existing data connections without requiring new integration work.

Antigravity: The Developer Infrastructure Layer

Antigravity is Google's agent-first development platform, and the I/O 2026 upgrades matter most for businesses building or buying custom AI workflows. According to Google Cloud's I/O summary, Antigravity 2.0 received new agent-orchestration capabilities — a centralized workspace to steer, customize, and orchestrate agents with 3 core tools: the Antigravity 2.0 desktop app, the Antigravity CLI, and an agentic browser simulation layer for QA — enabling multi-step automation that previously required custom engineering.

What this means in practice:

A research agent, a writing agent, and a publishing agent can be chained — each doing one thing well, with Antigravity routing outputs between them.
Businesses that build on Antigravity today are building on a platform Google is actively expanding, not maintaining.
The orchestration layer is the piece that previously required custom engineering to build; Antigravity moves it toward a configured-not-coded pattern.

What Gemini 3.5 Does NOT Change (Honest Limits)

Benchmark scores (Terminal-Bench 2.1: 76.2%, GDPval-AA: 1656 Elo, MCP Atlas: 83.6%) are Google's own published figures, not yet independently verified on third-party public leaderboards as of June 2026.
Gemini Omni is starting with video generation; the any-input-to-any-output claim is directional, not fully available across all modalities at launch.
Proactive inbox and calendar management in the Gemini app will surface privacy concerns for regulated industries — healthcare and legal workflows will require data-handling review before deployment.
Antigravity is a developer platform, not a no-code tool. The agent-orchestration upgrades require engineering resources to configure.

The Three Deployment Scenarios for SMBs

Three deployment scenarios at a glance:

Scenario	Setup Time	Time to First Value	Est. Admin Hours Saved/Month	Est. Annual $ Value (at $40/hr)
Google Workspace user	2–4 hrs	1–3 days	8–20 hrs	$3,840–$9,600
Vertex AI developer	40–80 hrs	1–2 weeks	15–40 hrs	$7,200–$19,200
Antigravity builder	80–160 hrs	4–8 weeks	40–80+ hrs	$19,200–$38,400+

Scenario 1: Google Workspace Users (Immediate Impact)

If your business already uses Google Docs, Gmail, Google Calendar, and Google Meet, you are in the most immediate impact zone. The proactive Gemini app features — daily briefs, inbox drafts, scheduling management — layer on top of your existing data. No new infrastructure is required.

The practical question is which workflows to hand to background agents first: email triage, meeting prep briefs, or follow-up draft generation.

Scenario 2: Developers on Vertex AI (Medium-Term Build)

According to Google Cloud's blog, Gemini 3.5 Flash is available across multiple deployment paths as of June 2026 — including Google AI Studio, Antigravity, and the Gemini Enterprise Agent Platform — and achieves a CharXiv multimodal benchmark score of 84.2%. Businesses building customer-facing or internal AI features — chatbots, document processors, data extractors — can access the model across these paths depending on their infrastructure and scale needs.

Teams already using US Tech Automations' agentic workflow platform to route data between applications will find a Gemini 3.5 model swap is a configuration change, not a rebuild — the workflow logic stays the same while the underlying model improves.

Example workflow: A document-processing pipeline receives a trigger document.review_requested, routes the file to Gemini 3.5 Flash for extraction (Terminal-Bench 2.1 score: 76.2%), stores structured output, and posts a summary to Slack — all within a single 1M-token context window at less than $0.01 per run at Flash pricing. The same pipeline that ran on Gemini 3.1 Pro at higher cost completes on Gemini 3.5 Flash with benchmark scores up 83.6% (MCP Atlas) on the agentic routing steps.

Scenario 3: Content and Creative Teams (Gemini Omni)

For marketing agencies, content teams, and creative studios, Gemini Omni's video generation and editing capabilities are the headline feature. The practical unlock is prototype-speed content: video drafts from a brief, edited in the same model call, ready for human review. The limitation is that this is launch functionality — production-quality video generation from text is still an emerging capability with known limitations around consistency and factual accuracy in visual content.

Signal vs Speculation

What is documented fact (as of June 2026, per Google I/O 2026 announcements):

Gemini 3.5 Flash announced May 19-20, 2026 at Google I/O; outperforms Gemini 3.1 Pro on challenging coding and agentic benchmarks — Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), MCP Atlas (83.6%) — at less than half the cost of comparable models (Google blog).
Gemini Omni announced at the same event; any-input-to-any-output with video gen/editing at launch (Google blog).
Gemini app proactive features live: daily briefs, Gemini Spark background agent, AI Inbox management (Google blog).
Antigravity upgraded with agent-orchestration capabilities (Google Cloud blog).

Our read (forward-looking interpretation):
The "agentic Gemini era" framing is Google's clearest signal yet that the Gemini product line is competing on workflow automation, not just conversational quality. If Gemini 3.5 Flash genuinely delivers flagship-class agentic performance at Flash-tier cost, it will pressure the pricing of competing models across the board — similar to how prior cost reductions in the inference market have forced OpenAI and Anthropic to respond with their own pricing adjustments. For SMBs, the most actionable near-term outcome is that Google Workspace is about to become meaningfully more capable without a subscription change. The risk: proactive agents managing email and calendars in the background require careful permission scoping — businesses that deploy without governance policies will face data-handling issues. The opportunity: the businesses that configure governance first and then deploy broadly will capture the productivity gains while competitors are still debating whether to adopt.

FAQ

What is Gemini 3.5 in plain English?

Gemini 3.5 is Google's newest generation of AI models, led by Gemini 3.5 Flash — a fast, cost-efficient model that Google says outperforms its prior premium model (Gemini 3.1 Pro) on coding and agentic tasks. It is designed to run inside automated workflows and background assistants, not just answer questions in a chat window.

What is Gemini Omni and how is it different from Gemini 3.5 Flash?

Gemini Omni is a separate model focused on multimodal tasks — taking any input type (text, audio, image, video) and producing any output type, starting with video generation and editing. Gemini 3.5 Flash is the primary text/code/agent model. They were announced together at Google I/O 2026 but serve different use cases.

Does Gemini 3.5 replace tools like ChatGPT or Claude for business workflows?

Not automatically. Gemini 3.5 Flash is a strong option for teams already embedded in Google Workspace and Vertex AI infrastructure. If your business is running workflows on OpenAI or Anthropic models today, a switch requires evaluating benchmark performance on your specific tasks, not just general claims. The right model choice depends on your actual use case, data infrastructure, and vendor relationships.

What is Antigravity and who should care about it?

Antigravity is Google's agent-first development platform for building multi-agent AI workflows. The I/O 2026 upgrades added agent-orchestration capabilities — tools for coordinating multiple specialized agents in a single pipeline. Businesses with software engineering resources or technology partners who are building custom AI features should evaluate Antigravity as part of their stack. Non-technical operators are better served by the Gemini app's proactive features first.

What is the fastest way for a small business to benefit from Gemini 3.5?

If you use Google Workspace, the Gemini app's proactive features are the zero-friction starting point — daily briefs and inbox management are available without any new infrastructure. The next step is identifying one repeatable internal workflow (meeting prep, email follow-up, document summary) and piloting it with Gemini's background agent capabilities before expanding. The agentic workflow platform at US Tech Automations covers the integration layer for connecting Gemini models to the rest of your business data.

Where This Lands for Your Business

Gemini 3.5 is not a single product — it is a generation of infrastructure that Google is distributing through four simultaneous channels: the Gemini 3.5 Flash model for developers, Gemini Omni for multimodal content work, the Gemini app's proactive assistant for knowledge workers, and Antigravity's orchestration upgrades for agent builders.

For businesses already operating in the Google ecosystem, the practical window is now: the agentic features are live, the performance claims are publicly documented, and the cost structure is Flash-tier, not Pro-tier. The gap between businesses that experiment with background agents today and those that wait for the technology to "mature" is going to widen over the next 12-18 months.

The implications for specific business segments are covered in depth in the spoke posts:

If you are ready to connect these capabilities to your actual workflow stack, the agentic workflows platform is the starting point for building model-agnostic automation pipelines that can take advantage of Gemini 3.5 and other frontier models as they evolve.

About the Author

Garrett Mullins

Workflow Specialist

Helping businesses leverage automation for operational efficiency.

Stateless MCP [What the New Spec Really Changes]