Tokenless

Machine Learning

View All
Post-training best-in-class models in 2025

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

This talk introduces Anaximander, a system designed to bridge the gap between traditional, GUI-driven Geographic Information System (GIS) workflows and modern, code-heavy machine learning practices. Anaximander integrates geospatial foundation models directly into QGIS, allowing experts to interactively orchestrate, run, and evaluate models for tasks like semantic segmentation and object detection on satellite imagery.

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

At Applied Compute, efficient Reinforcement Learning is critical for delivering business value. This talk explores the transition from inefficient synchronous RL to a high-throughput asynchronous 'Pipeline RL' system. The core challenge is managing 'staleness'—a side effect of in-flight weight updates that can destabilize training. The speakers detail their first-principles systems model, based on the Roofline model, used to simulate and find the optimal allocation of GPU resources between sampling and training, balancing throughput with algorithmic stability and achieving significant speedups.

Artificial Intelligence

View All
Graph Neural Networks Just Solved Enterprise AI?

Graph Neural Networks Just Solved Enterprise AI?

Jure Leskovec introduces Relational Foundation Models (RFMs), a new class of models based on graph neural networks that learn directly from raw, multi-table enterprise data. This approach bypasses manual feature engineering, leading to more accurate, faster-to-deploy, and easier-to-maintain predictive models for tasks like churn prediction, fraud detection, and recommendation systems.

Ben & Marc: Why Everything Is About to Get 10x Bigger

Ben & Marc: Why Everything Is About to Get 10x Bigger

a16z co-founders Marc Andreessen and Ben Horowitz discuss the shift to a decentralized media ecosystem, their investment thesis on supply-driven markets, and the transformative impact of AI. They detail the a16z model of leveraging reputation as a core asset to turn inventors into CEOs and explain why AI represents a fundamental reinvention of computing that will unlock unprecedented growth.

How to Make AI Forget

How to Make AI Forget

Ben Luria, CEO of Hirundo, discusses the critical need for machine unlearning, framing it as a form of "AI neuro-surgery" for enterprise AI. He explains how this technique directly modifies model weights to remove unwanted data and behaviors, addressing core risks that superficial solutions like guardrails cannot solve.

Technology

View All
Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Nikesh Arora, CEO of Palo Alto Networks, shares his unconventional journey and leadership philosophy. He provides a masterclass in building a multi-platform company through strategic M&A, explains why founders should sometimes ignore customers, and reveals how to lead with conviction while managing imposter syndrome.

Mental models for building products people love ft. Stewart Butterfield

Mental models for building products people love ft. Stewart Butterfield

Stewart Butterfield, co-founder of Slack and Flickr, shares the product frameworks and leadership principles that guided his success. He delves into concepts like "utility curves" for feature investment, the "owner's delusion" in product design, and why focusing on "comprehension" is often more important than reducing friction. He also introduces powerful mental models for organizational effectiveness, such as combating "hyper-realistic work-like activities" and applying Parkinson's Law to team growth.

Intuit CEO Sasan Goodarzi’s Grown-Up CEO Playbook

Intuit CEO Sasan Goodarzi’s Grown-Up CEO Playbook

Intuit CEO Sasan Goodarzi discusses the operational playbook for reinventing a 40-year-old company, from its slow transition to SaaS to its early adoption of AI. He shares insights on winning the SMB market by treating small businesses like consumers, building effective channel partnerships, and developing a platform strategy. Goodarzi also details his leadership philosophy, emphasizing that grit and curiosity are more critical than raw talent.


Recent Post

Build Hour: Reinforcement Fine-Tuning

Build Hour: Reinforcement Fine-Tuning

A deep dive into Reinforcement Fine-Tuning (RFT), covering how to set up tasks, design effective graders, and run efficient training loops to improve model reasoning, based on a live demonstration from OpenAI's Build Hours.

Build Hour: Agentic Tool Calling

Build Hour: Agentic Tool Calling

A deep dive into building agentic systems using OpenAI's latest APIs. The session covers the core concept of 'agentic tool calling' (reasoning + tools), outlines a four-part framework (Agent, Infrastructure, Product, Evaluation) for designing long-horizon tasks, and provides a hands-on demonstration of building a non-blocking task processing system with a real-time progress UI.

Substack Cofounder on the Internet's Content Problem

Substack Cofounder on the Internet's Content Problem

Chris Best, cofounder and CEO of Substack, joins a16z to discuss the platform's origin, its cultural impact on free speech and media business models, and its evolution from a newsletter tool to a full-fledged creator network. The conversation explores the future of media in an attention-scarce world, the role of AI in content creation, and how rethinking algorithms and incentives can build a healthier cultural ecosystem.

The future of agentic coding with Claude Code

The future of agentic coding with Claude Code

Anthropic's Boris Cherny, creator of Claude Code, discusses the paradigm shift from traditional coding to agentic workflows. He covers the co-evolution of models and their "harnesses," the importance of "hackability" via features like slash commands, and provides practical tips for leveraging coding agents for tasks of varying complexity.

Beyond the Cloud: The Local-First Software Revolution • Brooklyn Zelenka & Julian Wood

Beyond the Cloud: The Local-First Software Revolution • Brooklyn Zelenka & Julian Wood

Distributed systems researcher Brooklyn Zelenka introduces local-first computing, a new paradigm where apps run on user devices, synchronizing via CRDTs like Automerge. This approach offers offline functionality, low latency, and enhanced privacy, ideal for collaborative "cozy web" applications and democratizing development by simplifying the tech stack.

919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron

919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron

Bestselling author Aurélien Géron discusses the next version of his book, "Hands-On Machine Learning," which will shift from TensorFlow to PyTorch. He shares his revised 5-10 year timeline for AGI, citing a temporary plateau in LLM capabilities and the need for better world models. Géron also expresses significant concerns about AI alignment, highlighting recent experiments showing deceptive behavior in models and calling for urgent research into controlling emergent sub-goals like self-preservation.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.