Large language models

Baseten CEO Tuhin Srivastava on Custom Models, and Building the Inference Cloud

Baseten CEO Tuhin Srivastava on Custom Models, and Building the Inference Cloud

Baseten CEO Tuhin Srivastava discusses the explosive growth in AI inference, driven by the adoption of specialized and post-trained open-source models. He covers the strategic importance of owning the software layer on top of compute, navigating the severe GPU supply crunch with a multi-cloud fabric, the evolving landscape of AI workloads, and the operational lessons learned from scaling 30x in one year.

I Read 9,000 AI Papers So You Don't Have To

I Read 9,000 AI Papers So You Don't Have To

Nick Vasiloglou, VP of Research at Relational AI, analyzes the key trends from NeurIPS 2025, highlighting the most impactful and under-the-radar developments for industry professionals. The discussion covers the rise of data markets through real-time attribution, the sophisticated engineering behind capable small language models (SLMs), the explosion of AI for science, and the shift towards post-training models with real-world tools.

Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI

Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI

Adria Grondin, developer of the Locally AI app, provides a technical walkthrough on running large language models like Google's Gemma on an iPhone using Apple's MLX framework. The talk covers the necessary tools, performance expectations, the importance of quantization, and the growing MLX ecosystem.

Cognitive Exhaust Fumes, or: Read-Only AI Is Underrated — Šimon Podhajský, Head of AI, Waypoint

Cognitive Exhaust Fumes, or: Read-Only AI Is Underrated — Šimon Podhajský, Head of AI, Waypoint

A deep dive into a "read-only" personal AI system that analyzes your digital footprint—or "cognitive exhaust fumes"—from sources like email, notes, and browsing history. The author argues that this observer approach provides more profound insights and is inherently safer than action-oriented AI agents, by preventing data contamination and mitigating the high-stakes risks of write-access errors.

Agentic Engineering & PINNs: AI for Simulation Engineers - James Shaw | Podcast #172

Agentic Engineering & PINNs: AI for Simulation Engineers - James Shaw | Podcast #172

James Shaw, a mechanical engineer and Ansys channel partner, delves into the current and future impact of agentic AI and physics-informed neural networks (PINs) on simulation workflows. He explores how AI is revolutionizing aspects from tech support and model setup to the solver itself, particularly in CFD. The discussion also covers the implications for the engineering job market, the 'senior-junior inversion crisis', and the continued irreplaceability of skilled engineers due to the inherent physicality of the world, emphasizing the need for robust, trustworthy data to train AI.

This AI Company Catches Fraud Across the Internet

This AI Company Catches Fraud Across the Internet

Variance, emerging from three years in stealth with a $21 million Series A, is transforming enterprise risk and compliance through purpose-built AI agents. Founded by ex-Apple engineers, the company automates complex tasks like fraud detection, content review, and identity verification for Fortune 500s and platforms such as GoFundMe. They discuss the strategic reasons for stealth, technical challenges of integrating disparate data sources (including UI scraping), the shift from legacy systems to self-healing AI agent architectures, and how their lean, AI-maximalist team detects sophisticated threats like state-sponsored fraud rings.