Nvidia

The Future of AI – Key Trends Shaping What’s Next • Ekaterina Sirazitdinova • YOW! 2025

The Future of AI – Key Trends Shaping What’s Next • Ekaterina Sirazitdinova • YOW! 2025

Ekaterina Sirazitdinova from NVIDIA provides a high-level overview of the latest trends shaping the future of AI, covering the evolution from early deep learning to the rise of agentic and physical AI, and diving deep into the critical optimization techniques required to deploy these powerful models efficiently.

Tokenmaxxing vs AI Hardware Bottlenecks — with Jon Krohn (@JonKrohnLearns)

Tokenmaxxing vs AI Hardware Bottlenecks — with Jon Krohn (@JonKrohnLearns)

While the 'tokenmaxxing' trend grows, the AI industry faces severe physical infrastructure bottlenecks. This summary explores the four key constraints choking AI compute: GPU packaging (CoWoS), high-bandwidth memory (HBM), the surprising surge in CPU demand from agentic AI, and critical electricity shortages, revealing how these challenges are shaping the future of AI development.

Robotics' End Game: Nvidia's Jim Fan

Robotics' End Game: Nvidia's Jim Fan

Jim Fan of Nvidia outlines the endgame for robotics, arguing it will mirror the successful playbook of Large Language Models. He introduces "The Great Parallel," a roadmap where World Models replace Language Models, and data collection shifts from limited teleoperation to scalable egocentric video, culminating in a future of physical APIs and automated research.

My Bets on Where Open LLMs Go Next

My Bets on Where Open LLMs Go Next

An analysis of the current unstable equilibrium between open and closed AI models, arguing that closed models will likely pull ahead due to economic and data feedback advantages. The long-term, stable future for open models lies in a specialized ecosystem of cheaper, faster models, potentially funded by new structures like consortiums.

Jensen Huang – Will Nvidia’s moat persist?

Jensen Huang – Will Nvidia’s moat persist?

Nvidia CEO Jensen Huang discusses the company's core strategy, which he defines as transforming electrons into tokens by orchestrating a vast supply chain. He details how Nvidia's true moat lies in its ecosystem and its ability to manage supply bottlenecks. Huang contrasts Nvidia's versatile 'accelerated computing' platform with competitors like TPUs, arguing programmability via CUDA is key to AI innovation. He also presents a strong case against broad AI chip export controls on China, warning it could backfire by forcing the creation of a competing tech stack. Finally, he explains why Nvidia invests in the ecosystem rather than becoming a hyperscaler itself.

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Chris Fregly discusses his new book, "AI Systems Performance Engineering", covering the co-design and optimization of hardware, software, and algorithms across PyTorch, CUDA, and NVIDIA GPUs. The talk explores GPU architecture, system-level reliability challenges, and the use of modern coding agents for low-level kernel optimization.