Reinforcement Learning

Reinforcement learning

Aug 19, 2025

How Reinforcement Learning can Improve your Agent

This talk addresses the unreliability of current AI agents, arguing that prompting is insufficient. It posits that Reinforcement Learning (RL) is the most promising solution, delving into the mechanisms of RLHF and RLVR. The core challenge identified is 'reward hacking', and the discussion explores future directions to overcome it, such as RLAIF, data augmentation, and the development of interactive, online models that can learn in real-time.

Aug 16, 2025

Google DeepMind Lead Researchers on Genie 3 & the Future of World-Building

Google DeepMind researchers Jack Parker-Holder and Shlomi Fruchter detail the creation of Genie 3, a model that generates interactive, persistent worlds from text in real time. They cover its breakthrough spatial memory, emergent physical intuition, and its potential to revolutionize gaming, robotics, and AI agent training.

Aug 12, 2025

913: LLM Pre-Training and Post-Training 101 — with Julien Launay

Julien Launay, CEO of Adaptive ML, discusses the evolution of Large Language Model (LLM) training, detailing the critical shift from pre-training to post-training with Reinforcement Learning (RL). He explains the nuances of RL feedback mechanisms (RLHF, RLEF, RLAIF), the role of synthetic data, and how his company provides the "RLOps" tooling to make these powerful techniques accessible to enterprises. The conversation also explores the future of AI, including scaling beyond data limitations and the path to a "spiky" AGI.

Aug 08, 2025

Open AI Researchers Breakdown GPT-5

OpenAI researchers discuss the step-change in capabilities in ChatGPT-5, from coding and reasoning to creative writing. They detail the data-centric training processes, the shift toward asynchronous agentic workflows, and the future of AI development and its impact on the startup ecosystem.

Aug 05, 2025

DeepMind's Secret AI Project That Will Change Everything [EXCLUSIVE]

Google DeepMind's Genie 3 is a new generative interactive environment that creates photorealistic, controllable 3D worlds from text prompts in real-time. This summary explores its architecture, the concept of emergent consistency, and its primary application as a powerful simulator for training embodied AI agents.

Jul 31, 2025

Computational models for brain science

Dr. Laschowski discusses his lab's research in computational neuroscience, focusing on three core areas: reverse-engineering human motor control using reinforcement and optimal control models, developing high-accuracy neural decoding algorithms for brain-machine interfaces (BMIs), and creating brain-inspired deep learning models for computer vision. The talk highlights a long-term vision of discovering the fundamental principles of intelligence to build more efficient and robust AI.

← Previous Next →