Tokenless

Interactive discovery

Explore the topic map

Follow the connections between themes, people, and ideas across the Tokenless archive in an interactive topic modeling map.

Machine Learning

View All
Frontier results, on device - RL Nabors, Arize

Frontier results, on device - RL Nabors, Arize

RL Nabors discusses the significant costs associated with using frontier AI models, covering security, latency, and financial implications. She introduces a framework for right-sizing AI solutions by leveraging smaller, task-specific models and Small Language Models (SLMs). The framework details how to prove task feasibility, establish success criteria with golden datasets, conduct capability evaluations (using tools like Phoenix), and select the most appropriate "Small And Good Enough" (SAGE) model. Nabors further demonstrates how prompt engineering, particularly few-shot prompting, and post-processing can close performance gaps with larger models, while advocating for continuous regression evaluations to maintain performance integrity. The overarching message is to "prototype big, deploy small" to optimize AI deployments.

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Vaidas Razgaitis, Senior Research Engineer at Higharc, shares three tactical tips to accelerate the transition of novel AI/ML research into production-ready features. He emphasizes addressing the critical handoff challenge between ML researchers and software engineers through structured documentation (Research Prototype Taxonomy Document), a well-organized monorepo utilizing decoupled microservices, and a systematic approach to code decomposition and PR review. These strategies aim to improve legibility, maintainability, and delivery speed for ML-driven products.

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

This session details a data-efficient method for training engineering surrogate models by using uncertainty quantification (UQ) to guide geometric data augmentation. Instead of random deformations, the approach lets the deep ensemble model identify its own knowledge gaps (epistemic uncertainty), then uses Free-Form Deformation (FFD) to generate new shapes specifically in those uncertain regions. This ensures every expensive simulation run yields maximally informative data, significantly improving model accuracy for a fixed computational budget across domains like structural mechanics and aerodynamics.

Artificial Intelligence

View All
The Benchmark With No Instructions — Tufa Labs (ARC-AGI-3)

The Benchmark With No Instructions — Tufa Labs (ARC-AGI-3)

Tim Scarfe visits Tufa Labs to explore their top-ranking ARC-AGI-3 system, a benchmark for agentic intelligence that challenges LLMs in goal discovery and action efficiency. The team delves into the complexities of fractured representations, the role of human priors, and whether LLMs truly plan or merely simulate it effectively, all while balancing the bitter lesson with AI safety concerns.

Session on Reasoning

Session on Reasoning

This session features two talks on optimizing and verifying AI reasoning. Hongxiang Fan discusses cross-stack co-design for efficient AI, focusing on Test-Time Scaling (TTS) challenges, optimal verification granularity, and system-level optimizations for edge deployments. Nagarajan Natarajan introduces 'Advancing Verified Reasoning' with the InterVent platform, aiming to ensure AI agents comply with complex policies through formal verification, dynamic steering, and leveraging verification signals for training. Both emphasize addressing the computational and reliability costs of advanced AI.

Multimodal & Embodied Intelligence (Pt 1), Panel on Multimodal AI: Progress, Pitfalls, Possibilities

Multimodal & Embodied Intelligence (Pt 1), Panel on Multimodal AI: Progress, Pitfalls, Possibilities

This session explored Multimodal and Embodied Intelligence, featuring talks on hybrid AI in robotics (classical vs. end-to-end), AI's role in healthcare (focusing on NCDs, deployment, and uncertainty modeling), and fundamental perception challenges in multimodal reasoning (using educational video QA and visual puzzles). A panel discussed the impact of foundation models, the blurred lines between AGI and human-like AI, critical deployment pitfalls (human factors, efficiency, architectural limits), and future directions, emphasizing task-specific models and the redefinition of 'foundation models.'

Technology

View All
Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

Trisha Gee delivers a compelling talk on Developer Productivity Engineering (DPE) for testing, dissecting common pain points in writing, troubleshooting, and running tests. She advocates for strategic use of IDEs, advanced tooling like build caches and predictive test selection (leveraging ML), and a disciplined approach to test design to overcome these challenges, emphasizing that good tests serve as crucial living documentation.

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

This episode delves into Q-Day, the anticipated future when quantum computers can break public key cryptography, and the U.S. Executive Order accelerating the transition to post-quantum cryptography. Experts discuss why Q-Day is a gradual process rather than a sudden event, the critical importance of "crypto-agility" as a long-term strategy, and the necessity for organizations to begin immediate discovery and planning to secure data against "collect now, decrypt later" threats. The discussion also touches upon the broader, transformative benefits of quantum computing beyond just security.

Plenary Talk 3​: Challenges and research opportunities for global hyperscale services

Plenary Talk 3​: Challenges and research opportunities for global hyperscale services

Jim Kleewein's talk outlines the immense challenges and critical research opportunities in building and operating global hyperscale services like Microsoft 365 and Azure. He emphasizes that at this scale, traditional approaches fail, necessitating a "new golden age of applied research" across areas like continuous availability, data management, security, and sustainability. Kleewein also discusses AI's powerful but limited role, stressing the ongoing need for human expertise, and highlights the ethical imperative to prevent failures that can have life-or-death consequences.


Recent Post

Build Hour: API & Codex

Build Hour: API & Codex

A deep dive into building agent-powered engineering workflows with Codex and OpenAI APIs, covering the shift from pair programming to agentic delegation, the principles of 'Harness Engineering' to make agents reliable, and practical examples from OpenAI and Basis.

10 years of AlphaGo: The turning point for AI | Thore Graepel & Pushmeet Kohli

10 years of AlphaGo: The turning point for AI | Thore Graepel & Pushmeet Kohli

Ten years after the historic match between AlphaGo and Lee Sedol, Google DeepMind's Thore Graepel and Pushmeet Kohli reflect on its legacy. They discuss how AlphaGo's blend of deep learning and tree search conquered the game of Go, the significance of creative breakthroughs like 'Move 37', and how these foundational concepts evolved into systems like AlphaZero, which learns without human data. The conversation bridges the gap from game-playing to solving scientific grand challenges, detailing how the same principles are now used in tools like AlphaTensor to discover novel, more efficient algorithms for fundamental problems like matrix multiplication.

The Top 100 Consumer AI Apps | The a16z Show

The Top 100 Consumer AI Apps | The a16z Show

Anish Acharya and Olivia Moore from a16z discuss the 6th edition of their "Top 100 Gen AI Consumer Apps" report. They analyze the diverging strategies of ChatGPT, Claude, and Gemini; explore global adoption trends and cultural attitudes towards AI; and delve into the rise of agents, the importance of memory, and the future of creative tools and voice interfaces.

Physics Gets a Vote: Nominal Cofounders on Hardware Development in an AI World

Physics Gets a Vote: Nominal Cofounders on Hardware Development in an AI World

Nominal's co-founders discuss the new age of reindustrialization and the critical need for a modern data infrastructure in hardware engineering. They explain how their platform acts as a 'GitHub for hardware data,' providing a system of record for testing that bridges the gap between simulation and reality, and serves as the essential verification layer for the future of 'Physical AI'.

How To Build The Future: Max Hodak

How To Build The Future: Max Hodak

Max Hodak, co-founder of Neuralink and founder of Science, discusses the company's retinal implant that restores sight, the concept of the brain as a computer with a definable API, the surprising parallels between neural representations and latent spaces in AI, and the future of bio-hybrid interfaces.

Every Software Org is Dysfunctional • R. Parsons, G. Hohpe, B. O'Reilly & A. Harmel-Law • GOTO 2025

Every Software Org is Dysfunctional • R. Parsons, G. Hohpe, B. O'Reilly & A. Harmel-Law • GOTO 2025

A panel of distinguished architects—Rebecca Parsons, Gregor Hohpe, Barry O'Reilly, and Andrew Harmel-Law—discuss the multifaceted world of software architecture. They explore the inevitability of organizational dysfunction, the critical role of architects in decision-making, and the impact of non-deterministic technologies like Generative AI and quantum computing on system design and organizational change.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.