Google deep mind

From Transcription to Live Music: Gemini's Audio Stack — Thor Schaeff, Google DeepMind

From Transcription to Live Music: Gemini's Audio Stack — Thor Schaeff, Google DeepMind

Thor Schaeff from Google DeepMind demos the advanced audio AI stack, starting with a single API call to Gemini for rich transcription (speaker names, emotions, translation). He showcases speech generation directed by "director's notes" instead of a voice catalog, the real-time, sound-to-sound Gemini 1.5 Flash Live model, and a live demo of Gemini Live using the Lyria 2 model as a tool to generate a full song on stage.

Any-to-Any: Building Native Multimodal Agents - Patrick Löber, Google DeepMind

Any-to-Any: Building Native Multimodal Agents - Patrick Löber, Google DeepMind

Patrick Löber from Google DeepMind provides a technical walkthrough of the Gemini API's "any-to-any" capabilities. The session covers multimodal understanding of complex documents, video, and audio; an agentic loop using function calling to trigger native image and speech generation; and the real-time, audio-to-audio Live API.

How to Build the Future: Demis Hassabis

How to Build the Future: Demis Hassabis

Demis Hassabis, CEO of Google DeepMind, outlines the remaining challenges on the path to AGI, including memory, continual learning, and true reasoning. He discusses how learnings from AlphaGo are shaping agent development, the strategic importance of powerful small models like Gemma, and his vision for AI as the ultimate tool for scientific discovery, offering a framework for identifying breakthrough opportunities and advice for founders building in the age of AI.

The arrival of AGI | Shane Legg (co-founder of DeepMind)

The arrival of AGI | Shane Legg (co-founder of DeepMind)

Shane Legg, Chief AGI Scientist at Google DeepMind, outlines his framework for AGI, predicting 'minimal AGI' within years and 'full AGI' within a decade. He details a path to more reliable systems and introduces 'System 2 Safety' for building ethical AI. Legg issues an urgent call for society to prepare for the massive economic and structural transformations that advanced AI will inevitably bring.

The arrival of AGI | Shane Legg (co-founder of DeepMind)

The arrival of AGI | Shane Legg (co-founder of DeepMind)

Shane Legg, Chief AGI Scientist at Google DeepMind, outlines his framework for AGI levels, predicts a 50% chance of minimal AGI by 2028, and discusses the profound societal and economic transformations that will follow.

Google DeepMind Lead Researchers on Genie 3 & the Future of World-Building

Google DeepMind Lead Researchers on Genie 3 & the Future of World-Building

Google DeepMind researchers Jack Parker-Holder and Shlomi Fruchter detail the creation of Genie 3, a model that generates interactive, persistent worlds from text in real time. They cover its breakthrough spatial memory, emergent physical intuition, and its potential to revolutionize gaming, robotics, and AI agent training.