Reinforcement learning

Scale AI CEO on Meta’s $14B deal, scaling Uber Eats to $80B, & what frontier labs are building next

Jason Droege, CEO of Scale AI, discusses the evolution of AI training from simple labeling to complex, expert-driven tasks. He shares insights on the future of AI agents, the reality of enterprise AI adoption, and crucial business lessons learned from building Uber Eats from zero to a multi-billion dollar business.

Some thoughts on the Sutton interview

A reflection on Richard Sutton's "Bitter Lesson," arguing that while his critique of LLMs' inefficiency and lack of continual learning is valid, imitation learning is a complementary and necessary precursor to true reinforcement learning, much as fossil fuels were to renewable energy.

Building an AI Physicist: ChatGPT Co-Creator’s Next Venture

Liam Fedus and Ekin Dogus Cubuk, former researchers at OpenAI and Google DeepMind, discuss their new venture, Periodic Labs. They aim to create an "AI physicist" by integrating large language models with real-world, iterative experiments, moving beyond simulation to tackle fundamental challenges in physics and chemistry, starting with high-temperature superconductivity.

Richard Sutton – Father of RL thinks LLMs are a dead end

Richard Sutton, a foundational figure in reinforcement learning, argues that large language models (LLMs) are a flawed paradigm for achieving true intelligence. He posits that LLMs merely mimic human-generated text, lacking genuine goals, world models, and the ability to learn continually from experience. Sutton advocates a return to the principles of reinforcement learning, where an agent learns from the consequences of its actions in the real world, a method he believes is truly scalable and fundamental to all animal and human intelligence.

From Vibe Coding to Vibe Researching: OpenAI’s Mark Chen and Jakub Pachocki

OpenAI’s Chief Scientist, Jakub Pachocki, and Chief Research Officer, Mark Chen, discuss the research behind GPT-5, the push toward long-horizon reasoning, and the grand vision of an automated researcher. They cover how OpenAI evaluates progress beyond saturated benchmarks, the surprising durability of reinforcement learning, and the culture required to protect fundamental research while shipping world-class products.

Upwork's Radical Bet on Reinforcement Learning: Building RLEF from Scratch | Andrew Rabinovich (CTO)

Andrew Rabinovich, CTO and Head of AI at Upwork, details their strategy for building AI agents for digital work. He introduces a custom reinforcement learning approach called RLEF (Reinforcement Learning from Experience), explains why digital work marketplaces are ideal training grounds, and shares his vision for a future where AI delivers finished projects, orchestrated by a meta-agent named Uma.