Vector database

Is RAG Still Needed? Choosing the Best Approach for LLMs

Is RAG Still Needed? Choosing the Best Approach for LLMs

Martin Keen compares Retrieval Augmented Generation (RAG) with the emerging long context window approach in LLMs. He analyzes the pros and cons of each, from infrastructure simplicity and retrieval accuracy to computational costs and the 'needle in the haystack' problem, providing guidance on when to use each solution.

Context Engineering 2.0

Context Engineering 2.0

Simba Khadder explains the evolution of feature stores and MLOps, detailing why they remain crucial in the age of LLMs for high-scale use cases. He discusses the acquisition of his company, Featureform, by Redis and outlines their new vision: building a "Context Engine" for AI. This engine aims to unify structured data, unstructured data, and memory into a single pane of glass, moving beyond simple RAG to a more sophisticated "Context Engineering 2.0" that empowers agents with rich, queryable context.

Context Engineering & Agentic Search with the CEO of Chroma

Context Engineering & Agentic Search with the CEO of Chroma

Jeff Huber, CEO of Chroma, discusses "context rot," the degradation of AI performance in large context windows, and outlines a new vision for retrieval infrastructure. He covers the evolution of search, the importance of a two-stage recall-then-precision pipeline, and the challenges of agentic memory, advocating for a shift from AI "alchemy" to reliable engineering.

Serverless Apps on Cloudflare • Ashley Peacock & Ricky Robinett

Serverless Apps on Cloudflare • Ashley Peacock & Ricky Robinett

Ashley Peacock, author of 'Serverless Apps on Cloudflare,' joins Ricky Robinett to provide a deep dive into Cloudflare's evolution from a security provider to a comprehensive developer platform. They explore its unique global architecture, serverless offerings like Workers and Durable Objects, and its growing suite of AI and data tools, contrasting its developer experience with traditional cloud providers.

Building Advanced Agents Over Complex Data // Jerry Liu

Building Advanced Agents Over Complex Data // Jerry Liu

Jerry from LlamaIndex explains why naive Retrieval-Augmented Generation (RAG) fails in production and dives deep into advanced data quality techniques—from parsing complex documents and hierarchical indexing to chunking best practices—required to build robust, high-quality LLM applications.

Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)

Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)

A summary of the talk by Harvey and LanceDB on building a highly optimized retrieval architecture for the legal profession. It covers challenges like query complexity and data scale, the importance of evaluation, and how LanceDB's multimodal lakehouse architecture provides the necessary foundation.