Tokenless

Machine Learning

View All
Post-training best-in-class models in 2025

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

This talk introduces Anaximander, a system designed to bridge the gap between traditional, GUI-driven Geographic Information System (GIS) workflows and modern, code-heavy machine learning practices. Anaximander integrates geospatial foundation models directly into QGIS, allowing experts to interactively orchestrate, run, and evaluate models for tasks like semantic segmentation and object detection on satellite imagery.

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

At Applied Compute, efficient Reinforcement Learning is critical for delivering business value. This talk explores the transition from inefficient synchronous RL to a high-throughput asynchronous 'Pipeline RL' system. The core challenge is managing 'staleness'—a side effect of in-flight weight updates that can destabilize training. The speakers detail their first-principles systems model, based on the Roofline model, used to simulate and find the optimal allocation of GPU resources between sampling and training, balancing throughput with algorithmic stability and achieving significant speedups.

Artificial Intelligence

View All
Lessons from Building Open Source Libraries

Lessons from Building Open Source Libraries

Thomas Wolf, co-founder of Hugging Face, discusses his journey from physics to AI, the power of open-source models to accelerate innovation, the practical challenges of productionalizing AI demos, and why the biggest opportunities for founders now lie in the application layer on top of powerful foundation models.

Modernizing Manufacturing: AI + Robots + Humans | Daren Fields | Founder & CEO | Virtual Select | 4K

Modernizing Manufacturing: AI + Robots + Humans | Daren Fields | Founder & CEO | Virtual Select | 4K

Daren Fields, Co-Founder & CEO of Virtual Select, discusses the future of manufacturing, emphasizing the role of AI as a tool for human augmentation, not replacement. He explores how to modernize manufacturing by combining a carbon-based workforce with silicon-based systems to prevent defects, reduce costs, and de-risk execution.

Claude Cowork analysis & Apple picks Gemini

Claude Cowork analysis & Apple picks Gemini

The panel discusses Anthropic's Claude Cowork and the challenge of user trust in AI agents for everyday tasks. They then analyze the Apple-Google partnership to integrate Gemini into Siri, debating its implications for edge AI, privacy, and hardware limitations. Finally, they explore Linus Torvalds' use of AI for "vibe coding," considering its impact on hobbyist programming and entrepreneurship versus the current limitations in producing production-ready software.

Technology

View All
Ethical Hacking War Stories: Zero Trust, IAM & Advanced C2 Tactics

Ethical Hacking War Stories: Zero Trust, IAM & Advanced C2 Tactics

Jeff Crume and Patrick Fussell from IBM's X-Force team share a real-world ethical hacking war story, demonstrating an attack from an 'assume breach' perspective. They break down how vulnerabilities in Identity and Access Management (IAM) and legacy systems can lead to a full compromise, starting from an insider threat and escalating to domain administrator privileges through advanced C2 attacks and lateral movement.

Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Nikesh Arora, CEO of Palo Alto Networks, shares his unconventional journey and leadership philosophy. He provides a masterclass in building a multi-platform company through strategic M&A, explains why founders should sometimes ignore customers, and reveals how to lead with conviction while managing imposter syndrome.

Mental models for building products people love ft. Stewart Butterfield

Mental models for building products people love ft. Stewart Butterfield

Stewart Butterfield, co-founder of Slack and Flickr, shares the product frameworks and leadership principles that guided his success. He delves into concepts like "utility curves" for feature investment, the "owner's delusion" in product design, and why focusing on "comprehension" is often more important than reducing friction. He also introduces powerful mental models for organizational effectiveness, such as combating "hyper-realistic work-like activities" and applying Parkinson's Law to team growth.


Recent Post

Useful General Intelligence — Danielle Perszyk, Amazon AGI

Useful General Intelligence — Danielle Perszyk, Amazon AGI

Danielle from the Amazon AGI SF Lab introduces a new paradigm for agent development called "Useful General Intelligence" (UGI). Instead of focusing on making AI smarter than humans, UGI aims to build AI that augments human intelligence and agency. This talk explores the cognitive science principles behind this vision and introduces Nova Act, an agentic model and SDK designed to reliably interact with computer UIs, turning the browser into a programmable tool.

The 2025 AI Engineering Report — Barr Yaron, Amplify

The 2025 AI Engineering Report — Barr Yaron, Amplify

Barr Yaon of Amplify Partners presents early findings from the 2025 State of AI Engineering survey, covering LLM usage, customization techniques like RAG and fine-tuning, the state of AI agents, key challenges like evaluation, and community perspectives on the future of AI.

9 Commandments for Building AI Agents

9 Commandments for Building AI Agents

A deep dive into the design principles for building effective AI agents, covering the evolution of the ReAct loop, the critical role of memory and learning from experience, the 'build vs. buy' dilemma for tooling, and the importance of abstracting all capabilities—including systems and people—as tools.

Solving AI Video: How Fal.ai is making AI Video Generation Fatser & Easier

Solving AI Video: How Fal.ai is making AI Video Generation Fatser & Easier

Fal co-founder Burkay Gur and head of engineering Batuhan Taskaya discuss their journey building a high-performance generative media cloud. They cover their strategic pivot to media models, core optimization principles born from early GPU scarcity, and the development of a customer-obsessed culture to navigate the fast-paced AI model landscape.

Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic

Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic

Dr. Jasper Zhang argues that the relentless construction of new data centers is an inefficient, expensive, and unsustainable solution to the AI compute demand. He proposes a global GPU marketplace as a superior model, designed to aggregate fragmented, idle resources, drastically reduce costs through efficient allocation, and ultimately democratize access to AI infrastructure for developers and startups.

Flipping the Inference Stack — Robert Wachen, Etched

Flipping the Inference Stack — Robert Wachen, Etched

The current AI inference stack, reliant on general-purpose GPUs, is economically and technically unsustainable for real-time AI at scale. AI hardware expert Robert Wachen argues that the future is specialized hardware, like Transformer-specific ASICs, which can unlock currently bottlenecked applications such as real-time video, code generation, and large-scale enterprise deployments by solving critical latency and cost-per-user challenges.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.