Tokenless

Machine Learning

View All
Post-training best-in-class models in 2025

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models

This talk introduces Anaximander, a system designed to bridge the gap between traditional, GUI-driven Geographic Information System (GIS) workflows and modern, code-heavy machine learning practices. Anaximander integrates geospatial foundation models directly into QGIS, allowing experts to interactively orchestrate, run, and evaluate models for tasks like semantic segmentation and object detection on satellite imagery.

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

Efficient Reinforcement Learning – Rhythm Garg & Linden Li, Applied Compute

At Applied Compute, efficient Reinforcement Learning is critical for delivering business value. This talk explores the transition from inefficient synchronous RL to a high-throughput asynchronous 'Pipeline RL' system. The core challenge is managing 'staleness'—a side effect of in-flight weight updates that can destabilize training. The speakers detail their first-principles systems model, based on the Roofline model, used to simulate and find the optimal allocation of GPU resources between sampling and training, balancing throughput with algorithmic stability and achieving significant speedups.

Artificial Intelligence

View All
Ransomware whack-a-mole, AI agents as insider threats and how to hack a humanoid robot

Ransomware whack-a-mole, AI agents as insider threats and how to hack a humanoid robot

A discussion on the evolving cybersecurity landscape, covering the persistent threat of ransomware gangs adapting with AI, the critical failures in identity security highlighted by the Zestix case, the emergence of AI agents as a new class of insider threats, and the physical-world risks demonstrated by hacking humanoid robots.

Moonshot Podcast Deep Dive: Emily Ma on Solving Food Waste

Moonshot Podcast Deep Dive: Emily Ma on Solving Food Waste

In a discussion with Astro Teller, Emily Ma from X (formerly Google X) delves into the multifaceted problem of food waste and the moonshot projects developed to tackle it. They explore Project Delta, an initiative that created an "air traffic control" system for surplus food, and Project Chorus, a broader supply chain moonshot designed to give every object a voice through advanced sensors and software.

Ben Horowitz on Investing in AI: AI Bubbles, Economic Impact, and VC Acceleration

Ben Horowitz on Investing in AI: AI Bubbles, Economic Impact, and VC Acceleration

Ben Horowitz of Andreessen Horowitz discusses how AI is reshaping venture capital, detailing the firm's strategy for managing partners, verticalizing teams, and evaluating investments in real-time. He breaks down the current AI cycle, arguing that application design and model orchestration are key, and explains why the current market's growth is driven by real demand, not just hype.

Technology

View All
Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Palo Alto Networks CEO Nikesh Arora on the Virtues of Being an Outsider

Nikesh Arora, CEO of Palo Alto Networks, shares his unconventional journey and leadership philosophy. He provides a masterclass in building a multi-platform company through strategic M&A, explains why founders should sometimes ignore customers, and reveals how to lead with conviction while managing imposter syndrome.

Mental models for building products people love ft. Stewart Butterfield

Mental models for building products people love ft. Stewart Butterfield

Stewart Butterfield, co-founder of Slack and Flickr, shares the product frameworks and leadership principles that guided his success. He delves into concepts like "utility curves" for feature investment, the "owner's delusion" in product design, and why focusing on "comprehension" is often more important than reducing friction. He also introduces powerful mental models for organizational effectiveness, such as combating "hyper-realistic work-like activities" and applying Parkinson's Law to team growth.

Intuit CEO Sasan Goodarzi’s Grown-Up CEO Playbook

Intuit CEO Sasan Goodarzi’s Grown-Up CEO Playbook

Intuit CEO Sasan Goodarzi discusses the operational playbook for reinventing a 40-year-old company, from its slow transition to SaaS to its early adoption of AI. He shares insights on winning the SMB market by treating small businesses like consumers, building effective channel partnerships, and developing a platform strategy. Goodarzi also details his leadership philosophy, emphasizing that grit and curiosity are more critical than raw talent.


Recent Post

Evals in Action: From Frontier Research to Production Applications

Evals in Action: From Frontier Research to Production Applications

An overview of OpenAI's approach to AI evaluation, covering the GDP-val benchmark for frontier models and the practical tools available for developers to evaluate their own custom agents and applications.

Sora, ImageGen, and Codex: The Next Wave of Creative Production

Sora, ImageGen, and Codex: The Next Wave of Creative Production

Developers from OpenAI showcase "Storyboard", a custom tool built in 48 hours that transforms creative workflows. They detail how, in partnership with a film production, they used AI-assisted coding to rapidly prototype a tool that takes artists from initial sketch to high-fidelity images and even full-motion video, drastically reducing production timelines.

Build a Local LLM App in Python with Just 2 Lines of Code

Build a Local LLM App in Python with Just 2 Lines of Code

Distinguished Engineer Chris Hay demonstrates how to run and program Large Language Models (LLMs) locally in just two lines of Python code. The tutorial covers setting up a local environment with Ollama and UV, using a custom library for simplified interaction, and explores advanced topics like asynchronous streaming, persona customization with system prompts, and managing multi-turn conversations.

Sam Altman on Sora, Energy, and Building an AI Empire

Sam Altman on Sora, Energy, and Building an AI Empire

Sam Altman discusses OpenAI's strategy, the path to AGI through world models like Sora, the importance of societal co-evolution with AI, and the massive infrastructure and energy requirements for future models. He covers topics from AI safety and regulation to monetization and the future of scientific discovery driven by AI.

Evaluating the Cultural Relevance of AI Models and Products: Insights from the YUX Team

Evaluating the Cultural Relevance of AI Models and Products: Insights from the YUX Team

Drawing from their work fine-tuning an ASR model in Wolof and building a stereotype detection dataset, researchers from YUX share a practical toolbox for evaluating the cultural relevance of AI models and products. The session covers methods for data collection, model benchmarking, user testing, and introduces LOOKA, a platform for scalable human evaluation in the African context.

Evaluating Privacy Policies under Modern Privacy Laws At Scale: An LLM-Based Automated Approach

Evaluating Privacy Policies under Modern Privacy Laws At Scale: An LLM-Based Automated Approach

Qinge Xie from Georgia Tech presents a large-scale evaluation of modern website privacy policies using a novel LLM-based framework. The research systematizes privacy practices from 10 major US and EU regulations into 34 clauses and analyzes over 100,000 websites to reveal current trends in data collection, sharing, and consumer rights disclosure.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.