Reinforcement Learning

Jul 22, 2025

OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent Yet

The OpenAI team details the creation of a new, powerful AI agent in ChatGPT, achieved by unifying the Deep Research and Operator models. They cover its unified architecture with shared state across tools, the reinforcement learning techniques used for training, and the critical safety measures required for an agent that can take real-world actions.

Jul 17, 2025

No Priors Ep. 123 | With ReflectionAI Co-Founder and CEO Misha Laskin

Misha Laskin, co-founder and CEO of Reflection AI and a former researcher at Google DeepMind, discusses the company's mission to build superhuman autonomous systems. He introduces Asimov, a code comprehension agent aimed at the 80% of an engineer's time that is spent understanding complex systems rather than generating code. Laskin also covers co-designing product and research, the critical role of customer-driven evaluations, the bottlenecks in scaling reinforcement learning (RL), particularly the "reward problem", and why he believes the future is one of "jagged superintelligence" emerging in specific, high-value domains like coding.
