Context Window

May 26, 2026

The Four Types of Memory Every AI Agent Needs

AI agents utilize four distinct types of memory, analogous to human cognition, to move beyond simple chatbot responses. This summary explores the CoALA framework, detailing working, semantic, procedural, and episodic memory and how they enable agents to learn, recall skills, and leverage past experiences.

May 10, 2026

Hierarchical Memory: Context Management in Agents — Sally-Ann Delucia

The Arize team shares lessons from building their AI agent, Alyx, which analyzes its own trace data. They detail their journey from failed attempts like naive truncation and summarization to a successful strategy combining head/tail preservation with a retrievable memory store and using sub-agents to manage context complexity.

Apr 25, 2026

Every API Is a Tool for Agents - Matt Carey, Cloudflare

This talk explores how to overcome the context window limitations that prevent AI agents from accessing large APIs. It introduces "Codemode," a technique where agents write code against a typed SDK in a secure, sandboxed environment, moving beyond static tool definitions and enabling full API accessibility.

Feb 06, 2026

We're All Addicted To Claude Code

Calvin French-Owen, co-founder of Segment and former OpenAI Codex team member, discusses the rise of powerful coding agents. He contrasts the architectures of Codex and Claude Code, explores the future of work where engineers become managers of AI, and shares tips for becoming a top 1% power user.

Jan 23, 2026

Why AI Agents Forget Everything (And How To Fix That)

Mem0 is building a model-neutral, persistent memory layer for AI agents to solve the fundamental statelessness of LLMs. Co-founders Taranjeet Singh and Deshraj Yadav discuss their hybrid memory architecture, which reduces cost and latency compared to context stuffing, and their vision for a future where user memory is portable across all AI applications.

Dec 29, 2025

Memory in LLMs: Weights and Activations - Jack Morris, Cornell

This talk explores the limitations of current methods for providing knowledge to LLMs, such as large context windows and Retrieval-Augmented Generation (RAG). The speaker argues that the future lies in training knowledge directly into the model's weights. This is achieved through a combination of generating large synthetic datasets from small amounts of source material and using parameter-efficient fine-tuning (PEFT) techniques like LoRA to avoid catastrophic forgetting. The goal is to create more capable, personalized, and efficient models by fundamentally altering how they store and access information.