Retrieval augmented generation

Context Engineering & Agentic Search with the CEO of Chroma

Context Engineering & Agentic Search with the CEO of Chroma

Jeff Huber, CEO of Chroma, discusses "context rot," the degradation of AI performance in large context windows, and outlines a new vision for retrieval infrastructure. He covers the evolution of search, the importance of a two-stage recall-then-precision pipeline, and the challenges of agentic memory, advocating for a shift from AI "alchemy" to reliable engineering.

Overcoming Agentic Memory Management Challenges

Overcoming Agentic Memory Management Challenges

Biswaroop Bhattacharjee from Prem AI discusses Cortex, a novel AI memory system inspired by human cognition. The conversation explores moving beyond traditional flat memory structures to hierarchical, context-aware systems that enable more sophisticated and less noisy retrieval for AI agents.

Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI

Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI

A deep dive into building expert AI systems using a Wisdom-Driven Knowledge Graph. This approach enhances Knowledge-Augmented Generation (KAG) to surpass traditional Retrieval-Augmented Generation (RAG) by enabling systems to understand, reason, and provide expert-level quantitative analysis and advice.

Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)

Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)

David Karam, formerly of Google Search, presents a pragmatic framework for enhancing RAG systems, advocating a "quality engineering" approach. The talk progresses through a ladder of techniques, from in-memory retrieval and BM25 to custom embeddings, re-ranking, and advanced orchestration, emphasizing that the choice of technique should be driven by empirical analysis of system failures ("loss analysis") and balanced by a "complexity-adjusted impact" mindset.

Building Production-Grade RAG at Scale

Building Production-Grade RAG at Scale

Douwe Kiela, CEO of Contextual AI, explains the evolution from basic RAG to "RAG 2.0", an end-to-end, trainable system. He argues that this system-level approach, which integrates optimized document parsing, retrieval, reranking, and grounded models, is superior to relying on massive context windows alone and is a fundamental tool for next-generation AI agents.