Tokenless

Interactive discovery

Explore the topic map

Follow the connections between themes, people, and ideas across the Tokenless archive in an interactive topic modeling map.

Machine Learning

View All
Frontier results, on device - RL Nabors, Arize

Frontier results, on device - RL Nabors, Arize

RL Nabors discusses the significant costs associated with using frontier AI models, covering security, latency, and financial implications. She introduces a framework for right-sizing AI solutions by leveraging smaller, task-specific models and Small Language Models (SLMs). The framework details how to prove task feasibility, establish success criteria with golden datasets, conduct capability evaluations (using tools like Phoenix), and select the most appropriate "Small And Good Enough" (SAGE) model. Nabors further demonstrates how prompt engineering, particularly few-shot prompting, and post-processing can close performance gaps with larger models, while advocating for continuous regression evaluations to maintain performance integrity. The overarching message is to "prototype big, deploy small" to optimize AI deployments.

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Vaidas Razgaitis, Senior Research Engineer at Higharc, shares three tactical tips to accelerate the transition of novel AI/ML research into production-ready features. He emphasizes addressing the critical handoff challenge between ML researchers and software engineers through structured documentation (Research Prototype Taxonomy Document), a well-organized monorepo utilizing decoupled microservices, and a systematic approach to code decomposition and PR review. These strategies aim to improve legibility, maintainability, and delivery speed for ML-driven products.

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

This session details a data-efficient method for training engineering surrogate models by using uncertainty quantification (UQ) to guide geometric data augmentation. Instead of random deformations, the approach lets the deep ensemble model identify its own knowledge gaps (epistemic uncertainty), then uses Free-Form Deformation (FFD) to generate new shapes specifically in those uncertain regions. This ensures every expensive simulation run yields maximally informative data, significantly improving model accuracy for a fixed computational budget across domains like structural mechanics and aerodynamics.

Artificial Intelligence

View All
a16z Goes Global: Why American Tech Must Lead the World

a16z Goes Global: Why American Tech Must Lead the World

This discussion explores a16z's expanding international strategy, emphasizing technology's pivotal role in economic growth and national security. The panel delves into why America's tech leadership is crucial globally, how AI is redefining government-private sector relationships, and the drive for countries to adopt frontier technologies while building local innovation ecosystems. Key topics include AI infrastructure, cybersecurity, defense tech, global startup expansion, and the elements of enduring tech ecosystems, highlighting trusted partnerships and the importance of Western technology.

GPT-5.6 Sol, FIFA AI & Wall Street’s AI nerves

GPT-5.6 Sol, FIFA AI & Wall Street’s AI nerves

OpenAI's new GPT-5.6 Sol model sparks debate on AI safety and release strategies, while Wall Street expresses growing skepticism over the long-term economics of frontier AI models. The discussion also touches on AI's impact on the FIFA World Cup and a thought-provoking paper comparing LLM anthropomorphism to Age of Empires II "goats."

The Prompt Is Still a Punch Card - Ted Johnson, JoinIn AI

The Prompt Is Still a Punch Card - Ted Johnson, JoinIn AI

Ted Johnson argues that current AI interfaces, particularly prompting, operate on an outdated "batch processing" protocol akin to punch cards. Despite advanced LLM capabilities, this interface design forces humans to adapt to machines, hindering natural interaction. He advocates for a shift towards human-compatible interfaces where AI actively participates in real-time conversation, leveraging its intelligence to remove user burdens and amplify human potential.

Technology

View All
Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

Trisha Gee delivers a compelling talk on Developer Productivity Engineering (DPE) for testing, dissecting common pain points in writing, troubleshooting, and running tests. She advocates for strategic use of IDEs, advanced tooling like build caches and predictive test selection (leveraging ML), and a disciplined approach to test design to overcome these challenges, emphasizing that good tests serve as crucial living documentation.

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

This episode delves into Q-Day, the anticipated future when quantum computers can break public key cryptography, and the U.S. Executive Order accelerating the transition to post-quantum cryptography. Experts discuss why Q-Day is a gradual process rather than a sudden event, the critical importance of "crypto-agility" as a long-term strategy, and the necessity for organizations to begin immediate discovery and planning to secure data against "collect now, decrypt later" threats. The discussion also touches upon the broader, transformative benefits of quantum computing beyond just security.

Plenary Talk 3​: Challenges and research opportunities for global hyperscale services

Plenary Talk 3​: Challenges and research opportunities for global hyperscale services

Jim Kleewein's talk outlines the immense challenges and critical research opportunities in building and operating global hyperscale services like Microsoft 365 and Azure. He emphasizes that at this scale, traditional approaches fail, necessitating a "new golden age of applied research" across areas like continuous availability, data management, security, and sustainability. Kleewein also discusses AI's powerful but limited role, stressing the ongoing need for human expertise, and highlights the ethical imperative to prevent failures that can have life-or-death consequences.


Recent Post

Grant Lee: Building Gamma’s AI Presentation Company to 100 Million Users

Grant Lee: Building Gamma’s AI Presentation Company to 100 Million Users

Grant Lee, co-founder of Gamma, shares the journey of building a presentation tool to 100 million users and $100M ARR. He discusses the strategy of being 'different, not just better' than incumbents like Microsoft and Google, the importance of starting with a strong product thesis before integrating AI, and the disciplined 'hire painfully slowly' approach that allowed a small team to achieve massive organic growth.

How Google’s Nano Banana Achieved Breakthrough Character Consistency

How Google’s Nano Banana Achieved Breakthrough Character Consistency

Nicole Brichtova and Hansa Srinivasan, the leads behind Google's Nano Banana image model, detail the technical breakthroughs in character consistency. They discuss how a focus on high-quality data, Gemini's multimodal architecture, and rigorous human evaluation enabled the model to realistically represent individuals from a single photo. The conversation covers the future of visual AI, moving beyond text prompts to specialized UIs, and the ultimate goal of a single, powerful model that can transform any modality into another, unlocking new applications in personalized education, professional design, and creative storytelling.

Michael Truell: How Cursor Builds at the Speed of AI

Michael Truell: How Cursor Builds at the Speed of AI

Michael Truell, CEO of Cursor, discusses the company's rapid growth, emphasizing a strategy of deliberate focus on power users over broad democratization. He shares insights into their unique two-day work trials that test for agency, the technical and relational challenges of out-scaling cloud and API providers, and a talent-first approach to M&A, all while navigating a market poised for its next 'iPhone moment'.

Build Hour: Agent RFT

Build Hour: Agent RFT

Will Hang and Theophile Sautory from OpenAI provide a deep dive into Agent RFT, a powerful method for fine-tuning large language models to become more effective, tool-using agents. They explain how Agent RFT enables models to learn directly from their interactions with custom tools and reward signals, leading to significant improvements in performance, latency, and efficiency on specialized tasks. The session includes a detailed code demo, best practices, and success stories from companies like Cognition, Ambience, and Rogo.

Prompt Engineering for LLMs, PDL, & LangChain in Action

Prompt Engineering for LLMs, PDL, & LangChain in Action

Martin Keen explains the evolution of prompt engineering from an art to a software engineering discipline. He introduces LangChain and Prompt Declaration Language (PDL) as tools to manage the probabilistic nature of LLMs, ensuring reliable, structured JSON output through concepts like contracts, control loops, and observability.

The Debugging Book • Andreas Zeller & Clare Sudbery

The Debugging Book • Andreas Zeller & Clare Sudbery

Professor Andreas Zeller discusses his interactive 'Debugging Book,' arguing that systematic, automated debugging is a critical but neglected skill. He explores powerful techniques like delta debugging and automated repair, explaining how developers can build their own tools to make debugging a more plannable and efficient process.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.