Reinforcement Learning

Sep 25, 2025

From Vibe Coding to Vibe Researching: OpenAI’s Mark Chen and Jakub Pachocki

OpenAI’s Chief Scientist, Jakub Pachocki, and Chief Research Officer, Mark Chen, discuss the research behind GPT-5, the push toward long-horizon reasoning, and the grand vision of an automated researcher. They cover how OpenAI evaluates progress beyond saturated benchmarks, the surprising durability of reinforcement learning, and the culture required to protect fundamental research while shipping world-class products.

Sep 18, 2025

Why experts writing AI evals is creating the fastest-growing companies in history | Brendan Foody

Brendan Foody, CEO of Mercor, discusses the critical role of AI evaluations (evals) in model improvement, detailing how his company achieved unprecedented growth by supplying high-skilled experts to top AI labs. He explores the shift to Reinforcement Learning from AI Feedback (RLAIF), the future of work in an AI-driven economy, and why he believes the path to AGI is paved with better evals, not just more data.

Sep 18, 2025

Upwork's Radical Bet on Reinforcement Learning: Building RLEF from Scratch | Andrew Rabinovich (CTO)

Andrew Rabinovich, CTO and Head of AI at Upwork, details their strategy for building AI agents for digital work. He introduces a custom reinforcement learning approach called RLEF (Reinforcement Learning from Experience), explains why digital work marketplaces are ideal training grounds, and shares his vision for a future where AI delivers finished projects, orchestrated by a meta-agent named Uma.

Sep 12, 2025

Fully autonomous robots are much closer than you think – Sergey Levine

Sergey Levine, co-founder of Physical Intelligence, outlines the path to general-purpose robots, predicting a 'self-improvement flywheel' could lead to fully autonomous household robots by 2030. He discusses the architecture of vision-language-action models, the critical role of embodiment in solving the data problem, and how robotics will scale faster than self-driving cars.

Aug 29, 2025

GPT-OSS vs. Qwen vs. Deepseek: Comparing Open Source LLM Architectures

A technical breakdown and comparison of the architectures, training methodologies, and post-training techniques of three leading open-source models: OpenAI's GPT-OSS, Alibaba's Qwen-3, and DeepSeek V3. The summary explores their different approaches to Mixture-of-Experts, long-context handling, and attention mechanisms.

Aug 28, 2025

The $10 Trillion AI Revolution: Why It’s Bigger Than the Industrial Revolution

Sequoia Capital's Konstantine Buhler presents an investment thesis on the AI-driven "Cognitive Revolution," framing it as a transformation larger and faster than the Industrial Revolution. The core of the thesis is the $10 trillion opportunity in automating the US services market and the shift in work from certainty to high leverage. Buhler outlines five current investment trends, including real-world validation over academic benchmarks and compute as the new production function, and five future themes Sequoia is betting on, such as persistent memory, AI-to-AI communication, and AI security.
