Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI
Adrien Grondin, developer of the Locally AI app, provides a technical walkthrough on running large language models like Google's Gemma on an iPhone using Apple's MLX framework. The talk covers the necessary tools, performance expectations, the importance of quantization, and the growing MLX ecosystem.
Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi
This hands-on workshop details the construction of a sophisticated two-part AI system for producing high-quality technical content. The first part is an MCP-powered deep research agent that autonomously plans, searches the web, and analyzes sources such as YouTube to synthesize a grounded research artifact. The second part is a constrained, deterministic writing workflow that transforms this research into polished, non-sloppy content using an "Evaluator-Optimizer" pattern for iterative refinement. The session emphasizes key AI engineering principles, such as choosing between agentic and workflow-based architectures, and concludes with a deep dive into implementing practical observability and evaluation pipelines so that the system is both measurable and improvable.
Can we AI our way to a more sustainable world?
Microsoft experts Doug Burger, Amy Luers, and Ishai Menache discuss the dual role of AI in sustainability. They analyze the environmental footprint of datacenters and explore how AI-driven optimization and materials discovery can be pivotal in decarbonizing global systems like energy, industry, and food production.
Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind
A deep dive into Google DeepMind's Gemma 4, the latest family of open models. This summary covers the new model architectures like per-layer embeddings, on-device agentic capabilities, multimodal features, and the growing ecosystem of fine-tuned applications from medicine to sovereign AI.
The New Application Layer - Malte Ubl, CTO Vercel
Malte Ubl, CTO of Vercel, posits that AI engineering is the successor to web development and argues that AI agents will expand, not shrink, the software market. He explores practical agent archetypes being built today and discusses the profound shift required in infrastructure and security as agents become both the builders and primary users of software. He concludes that the true innovation and value will lie in the application layer built by AI engineers, independent of the commoditizing foundation models.
A New Kind of Marketplace
Donné Stevenson (Prosus) and Pedro Chaves (OLX Group) explore the future of e-commerce, where AI agents will automate buying and selling. They discuss the transition from traditional search to agent-driven experiences, the critical challenges of user trust and UI/UX design, and a future vision for an agent-to-agent marketplace that handles everything from negotiation to logistics.