WritingGoogle (DeepMind / Gemini)Google (DeepMind / Gemini)published May 20, 2025seen 6d

Our vision for building a universal AI assistant

Open original ↗

Captured source

source ↗

Google I/O 2025: Gemini as a universal AI assistant

Our vision for building a universal AI assistant

May 20, 2025

·

Share

x.com

Facebook

LinkedIn

Mail

Copy link

We’re extending Gemini to become a world model that can make plans and imagine new experiences by simulating aspects of the world.

Demis Hassabis

CEO of Google DeepMind

Share

x.com

Facebook

LinkedIn

Mail

Copy link

Over the last decade, we’ve laid a lot of the foundations for the modern AI era, from pioneering the Transformer architecture on which all large language models are based, to developing agent systems that can learn and plan like AlphaGo and AlphaZero . We’ve applied these techniques to make breakthroughs in quantum computing , mathematics , life sciences and algorithmic discovery . And we continue to double down on the breadth and depth of our fundamental research, working to invent the next big breakthroughs necessary for artificial general intelligence (AGI). This is why we’re working to extend our best multimodal foundation model, Gemini 2.5 Pro, to become a “world model” that can make plans and imagine new experiences by understanding and simulating aspects of the world, just as the brain does. We’ve been taking strides in this direction for a while, from our pioneering work training agents to master complex games like Go and StarCraft , to building Genie 2 , which is capable of generating 3D simulated environments that you can interact with, from a single image prompt. Already, we can see evidence of these capabilities emerging in Gemini’s ability to use world knowledge and reasoning to represent and simulate natural environments , Veo ’s deep understanding of intuitive physics, and the way Gemini Robotics teaches robots to grasp, follow instructions and adjust on the fly. Making Gemini a world model is a critical step in developing a new, more general and more useful kind of AI — a universal AI assistant. This is an AI that’s intelligent, understands the context you are in, and that can plan and take action on your behalf, across any device.

Bringing Project Astra’s live capabilities into our products Our ultimate vision is to transform the Gemini app into a universal AI assistant that will perform everyday tasks for us, take care of our mundane admin and surface delightful new recommendations — making us more productive and enriching our lives. This starts with the capabilities we first explored last year in our research prototype Project Astra , such as video understanding, screen sharing and memory. Over the past year, we’ve been integrating capabilities like these into Gemini Live for more people to experience today. We continue to relentlessly improve and explore new innovations at the frontier. For example, we upgraded voice output to be more natural with native audio, we’ve improved memory and added computer control.

We’re now gathering feedback about these capabilities from trusted testers and are working to bring them to Gemini Live , to new experiences in Search , the Live API for developers and new form factors, like glasses. Through every step of this process, safety and responsibility are central to our work. We recently conducted a large research project, exploring the ethical issues surrounding advanced AI assistants , and this work continues to inform our research, development and deployment.

Building AI that can multitask for you We’ve also been exploring how agentic capabilities can help people multitask, with Project Mariner . This is a research prototype that explores the future of human-agent interaction, starting with browsers. Since launching Project Mariner last December , we’ve been working closely with a group of trusted testers to gather feedback and improve its experimental capabilities. Project Mariner now includes a system of agents that can complete up to ten different tasks at a time. These agents can help you look up information, make bookings, buy things, do research and more — all at the same time.

The updated Project Mariner is available to Google AI Ultra subscribers in the U.S. We're bringing its computer use capabilities into the Gemini API , and we’re planning to bring more of its capabilities to Google products throughout the year. Read more about our agentic capabilities in Search and the Gemini app . With this, and all our groundbreaking work, we’re building AI that’s more personal, proactive and powerful, enriching our lives, advancing the pace of scientific progress and ushering in a new golden age of discovery and wonder.

Get more stories from Google in your inbox. Get more stories from Google in your inbox.

Email address

Your information will be used in accordance with Google's privacy policy.

Subscribe

Done. Just one step more.

Check your inbox to confirm your subscription.

You are already subscribed to our newsletter.

You can also subscribe with a different email address

.

POSTED IN:

Related stories

AI The latest AI news we announced in May 2026

By

The Keyword Team

Jun 05, 2026

AI How we used Gemini to build Google I/O 2026

By

Marvin Chow

Jun 01, 2026

Gemini models 9 demos of Gemini Omni and Gemini 3.5 in action

By

Zahra Thompson

May 29, 2026

AI Catch up on 12 major I/O 2026 moments

By

Zahra Thompson

May 28, 2026

NotebookLM Dive deeper into I/O 2026 with NotebookLM.

May 27, 2026

Google DeepMind Running Guide agent: A step towards running unbounded

By

Robin Dua

&

Miguel de Andrés-Clavera

May 20, 2026

.

Jump to position 1

Jump to position 2

Jump to position 3

Jump to position 4

Jump to position 5

Jump to position 6

Let’s stay in touch. Get the latest news from Google in your inbox.

Subscribe No thanks

Notability

notability 6.0/10

Visionary post from major lab