Llm

How a Meta PM ships products without ever writing code | Zevi Arnovitz

How a Meta PM ships products without ever writing code | Zevi Arnovitz

Zevi Arnovitz, a non-technical Product Manager at Meta, shares his complete workflow for building and shipping sophisticated applications using AI tools like Cursor. He details a structured, multi-step process that leverages different AI models for specific tasks, including a novel "peer review" technique where models critique each other's code.

Introducing Our Approach to Design Document Review Using Business-Specific Large Language Models

Introducing Our Approach to Design Document Review Using Business-Specific Large Language Models

Hitachi's Financial Business Unit developed a specialized LLM to automate the review of system design documents, addressing the inadequacy of general-purpose AI for mission-critical systems. This presentation details the model's development using Continued Pre-training and LoRA on proprietary data, its integration into a multi-agent architecture, and the use of Weights & Biases for MLOps, which led to a 70% reduction in manual review workload.

Multi-Agent Systems for the Misinformation Lifecycle

Multi-Agent Systems for the Misinformation Lifecycle

A detailed overview of a modular, five-agent system designed to combat the entire lifecycle of digital misinformation. Based on an ICWSM research paper, this practitioner's guide details the roles of the Classifier, Indexer, Extractor, Corrector, and Verifier agents. The system emphasizes scalability, explainability, and high precision, moving beyond the limitations of single-LLM solutions. The talk covers the complete blueprint, from agent coordination and MLOps to holistic evaluation and optimization strategies for production environments.

DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners

DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners

An in-depth guide to DSPy, a framework for programming with language models, not just prompting them. Learn its core concepts—Signatures, Modules, Adapters, and Optimizers—and see real-world examples of building robust, testable, and transferable AI applications for the enterprise.

Post-training best-in-class models in 2025

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Memory in LLMs: Weights and Activations - Jack Morris, Cornell

Memory in LLMs: Weights and Activations - Jack Morris, Cornell

This talk explores the limitations of current methods for providing knowledge to LLMs, such as large context windows and Retrieval-Augmented Generation (RAG). The speaker argues that the future lies in training knowledge directly into the model's weights. This is achieved through a combination of generating large synthetic datasets from small amounts of source material and using parameter-efficient fine-tuning (PEFT) techniques like LoRA to avoid catastrophic forgetting. The goal is to create more capable, personalized, and efficient models by fundamentally altering how they store and access information.