Google Gemini: What Changed at I/O 2026 and What Matters

Gemini 3.5 Flash, AI-generated video, Android as an intelligence system, and limits that annoyed subscribers: a recap of what came out of I/O 2026.

by Cleverson

Google Gemini: What Changed at I/O 2026 and What Matters

Google Gemini was one of the most searched terms of the week — and not by accident. Between May 19 and 20, 2026, Google I/O unleashed a sequence of announcements that change what the assistant does, where it runs, and how much it costs. This post organizes what really matters, separating technical news from marketing noise.

TL;DR

  • Gemini 3.5 Flash is the new main model: it surpasses Gemini 3.1 Pro in code, agentic tasks, and multimodality, at less than half the cost of frontier rivals.
  • Gemini Omni generates content from any input — starting with video, with embedded SynthID watermark.
  • Android has become an intelligence system: Gemini now operates natively on phones, watches, laptops, and cars.
  • The new usage limits based on computation, activated on May 20, 2026, have generated complaints from paying subscribers.
  • The Google Gemini app has surpassed 900 million monthly active users, more than double from a year ago.

Why Google Gemini Became the Topic of the Week

On May 19, 2026, in Mountain View, Google opened its I/O — the annual developer conference. The central theme was singular: turning every company product into an extension of artificial intelligence. Google Gemini was at the center of virtually all announcements, and that's what triggered the search volume you probably noticed.

I've been following AI launches closely for years, and the reading here is straightforward: Google has stopped treating Google Gemini as a standalone product. It is now the layer that stitches together Android, Search, development tools, and hardware. When a company makes this move, the term becomes a search trend because millions of people want to understand, at the same time, what changed in the app they already use.

To gauge this interest: the Gemini app has surpassed 900 million monthly active users, more than double from a year ago, with daily requests growing sevenfold in the period. Search's AI Mode surpassed 1 billion monthly users. When a base this size receives several new models at once, curiosity turns into a search spike.

There are three fronts worth your attention, and I'll go through each separating fact from promise: the new models, the new relationship between Google Gemini and Android, and a controversial change in usage limits. All data in this post comes from the official Google I/O 2026 announcements.

Gemini 3.5 Flash: The Model That Opens the New Generation

The main announcement of the event was Gemini 3.5 Flash, the first of a new line of models that Google describes as the union of frontier intelligence with action capability. In plain English: it's a fast and cheap model that also performs autonomous tasks — browsing, writing code, chaining tools — without relying on the house's heaviest model.

What the Benchmarks Say

According to Google, Gemini 3.5 Flash surpasses Gemini 3.1 Pro in code tests, multimodal reasoning, and agentic tasks. The numbers released were 76.2% on Terminal-Bench 2.1, 1656 Elo on GDPval-AA, and 83.6% on MCP Atlas. The model delivers output tokens about four times faster than other frontier models, at less than half the cost.

Benchmark is not user experience, and the usual skepticism applies — test numbers don't always translate into real gains in your task. But the direction is coherent: with this launch, Google Gemini pushes the fast model upward, instead of reserving high intelligence only for the Pro line. This changes the cost equation for those who use AI in volume.

It's worth translating the term "agentic," which repeats in the announcements. An agentic model doesn't just respond: it executes steps in sequence to complete a goal — open a page, read the content, fill out a form, check the result. It's the difference between asking for an answer and delegating a task. Gemini 3.5 Flash was positioned exactly in this category, and that's why Google insists on tests like Terminal-Bench, which measure execution, not just knowledge.

Where Gemini 3.5 Flash Is Already Available

The model started rolling out on the day of the announcement: in the Gemini app, Search, the Antigravity 2.0 development environment, and the Gemini API. For developers, it also appears in Google AI Studio and Android Studio. In other words, you can test it now, without a waitlist — and this immediate availability is part of the reason for the search spike.

Gemini Omni: When AI Starts Generating Video

The second model novelty is Gemini Omni, a separate line that combines Gemini reasoning with content generation. The first version, Gemini Omni Flash, accepts image, audio, video, and text as input — and returns video as output.

The detail that sets it apart from previous generators is the attempt to anchor the result in the real world. Google claims the model combines factual knowledge with a sense of physics, so that the generated video respects basic rules of movement and gravity. All content comes out with the SynthID watermark — invisible to the eye, but detectable by verification. It's a provenance signal that Google says has already been checked 50 million times worldwide.

This watermark point is not a technical detail. With AI-generated video becoming increasingly realistic, knowing whether a piece is synthetic becomes a matter of trust — for the press, for courts, for any company that publishes content. SynthID doesn't prevent misuse, but it creates a verifiable trail. It's a step in the right direction, though far from solving the problem alone.

In practice, Gemini Omni Flash appears in the Google Gemini app, as well as in Google Flow and the YouTube Shorts remix feature. For content producers, it's the most visible novelty of I/O. For businesses, it's still early: AI-generated video has real use in drafts and prototyping, but requires human review before becoming published content. Verifiable provenance is not the same as approved quality.

Android Becomes an "Intelligence System"

Google repeated a phrase on stage until it stuck: Android is no longer an app platform; it has become an intelligence system. Translating, Google Gemini now operates natively in the system, not as an app you open and close. It spans phones, watches, laptops, and cars.

Googlebooks: Laptops with Built-in Gemini

The most concrete hardware bet is Googlebooks — a new category of premium laptops built on Android and powered by Gemini. The announced manufacturers are Acer, ASUS, Dell, HP, and Lenovo, with arrival expected in the second half of 2026. It's Google's attempt to put the assistant at the center of the work experience, not just the phone.

Android XR Glasses and Gemini Spark

Google also confirmed that Android XR glasses are arriving this year, with partners like Samsung, Xreal, Warby Parker, and Gentle Monster. Alongside comes Gemini Spark, entering beta for Ultra plan subscribers the week after the event. The promise is an assistant that sees what you see. It's the most futuristic front of the announcement — and the one I would treat with more caution, until hardware is in real people's hands.

This movement has a clear parallel with the education sector: when intelligence becomes part of the container, not a loose resource, the user experience changes levels. It was the argument we detailed when comparing the customized Moodle app with the generic app — brand and technology at the center, not the periphery.

The New Usage Limits That Irritated Subscribers

Not every Google Gemini novelty was well received. On May 20, 2026, Google activated a new usage limit system that replaced fixed daily prompt quotas. The reaction from some paying subscribers was negative — and it's worth understanding why.

How the New System Works

The limit stopped counting prompts and started counting computation. The system weighs the complexity of each request, the activated resources — image generation, extended reasoning — and the accumulated session history. The quota renews every five hours, with a weekly cap on top.

There is a positive point: those who exceed the limit are not blocked. The system automatically downgrades the request to a smaller model and continues responding. Access doesn't stop — it degrades in quality.

The Central Complaint: Opacity

The problem that concentrates criticism is the lack of predictability. In the old model, you knew how many prompts you had left. Now, "computation used" offers no intuitive reference before you send the request. An AI Pro plan subscriber reported that a single prompt consumed 13% of the available quota. Others recounted that fewer than five consecutive requests — summarizing a document, debugging a code snippet, generating an image — exhausted half of the five-hour quota.

For those paying between $8 and $250 per month, the complaint is legitimate: the limit may make sense, but it needs to be visible before sending, not after. It's the kind of change that directly affects those who use Google Gemini as a daily work tool. The updated rules are in the official Gemini help center.

For those who depend on Google Gemini at work, there is a way to reduce the impact while the system doesn't become more transparent: concentrate heavy requests — image generation, long document analysis — in one block, and leave simple questions for the smaller model. Distributing the load throughout the day helps avoid exhausting the five-hour quota all at once.

Comparison: Gemini 3.1 Pro, 3.5 Flash, and 3.5 Pro

The Google Gemini line now has three versions on the radar, and it's worth placing them side by side to situate where Gemini 3.5 Flash fits. The table summarizes what Google released — remembering that 3.5 Pro has not yet been released to the public.

Model Position Status as of 05/20/2026 Highlight
Gemini 3.1 Pro Previous generation, focus on reasoning Available Baseline for new benchmarks
Gemini 3.5 Flash Fast, cheap, and agentic In app, Search, and API Surpasses 3.1 Pro in code and autonomous tasks
Gemini 3.5 Pro Top of the new generation In testing, expected June 2026 Already used internally by Google

The practical reading is simple. For most daily uses, Gemini 3.5 Flash already suffices — and it's what you'll find by default in the app. Gemini 3.5 Pro should interest those who need the maximum reasoning limit in long and complex tasks; it can only be truly evaluated when it launches, expected in June 2026.

What These Changes Mean in Practice for Businesses

Taking away the natural excitement of a launch event, what should a Brazilian manager do with all this? I separate the answer into two lists: when to take advantage now and when to hold back anxiety.

When It's Worth Adopting Now

  • Code and automation tasks: the agentic Gemini 3.5 Flash is mature enough to accelerate development and prototyping. It's worth testing via the Gemini API before committing an entire workflow.
  • Team productivity: if your team already uses the Google Gemini app daily, the speed gain from the new model is immediate and requires no extra configuration.
  • Draft content: Gemini Omni helps generate initial versions of video and image to quickly validate an idea before investing in production.

When to Hold Back Anxiety

  • New hardware: Googlebooks and Android XR glasses are not in anyone's hands yet. Corporate purchase decisions wait for independent reviews.
  • Critical workflows with quotas: if your team relies on intense and continuous usage, test the new limit system before migrating sensitive processes to the paid plan.
  • Generated video in production: without human review, do not publish. The SynthID watermark solves provenance, not quality.

It's worth a principle I apply to any technology decision: the right tool depends on the size and risk of the operation, not on what's newest. It was the same reasoning we used when comparing WhatsApp Business App and the Official API — adopting the most advanced is not always adopting the most suitable. And when automation comes into play, the cost model weighs as much as the capability, as we discussed in the case of unlimited agents on WhatsApp.

Conclusion: What to Do Now with Google Gemini

I/O 2026 made it clear that Google Gemini is no longer a chatbot — it's the intelligence layer that Google wants everywhere: on the phone, laptop, search, and code. Gemini 3.5 Flash is the novelty you can already use today and feel the difference. Gemini Omni and the new hardware are promises that deserve monitoring, not immediate decisions. And the new usage limits are a reminder that cheap or free products always have a cost — sometimes paid in predictability.

My practical advice is straightforward: open the Google Gemini app and test Gemini 3.5 Flash on a real task from your routine, comparing it with what you used before. The best way to separate useful novelty from marketing noise is to measure it on your own use case.

If your company is evaluating how to fit AI into customer service, content, or educational platforms, it's worth talking to those who implement this type of integration daily. At Agathas Web, that's exactly the work — translating frontier launches into business decisions that pay off.