This is the canonical reference for building AI agents that work in production. Organized by layer, it provides a curated reading path through our research library.
How to use this guide:
- Start at Foundation if you're new to agent development
- Jump directly to a specific layer if you have targeted questions
- Each layer has a featured deep dive plus supporting articles
Before you orchestrate agents or worry about operations, you need to understand the foundation layer: how models work with external context, the tradeoffs of RAG, and the protocols connecting agents to tools.
Start here
RAG Is Oversold: The Gap Between Tutorial and Production
95% of RAG projects fail to reach production. The gap isn't infrastructure—it's retrieval accuracy, data processing, and reasoning. Naive RAG is obsolete; production requires rigorous engineering.
The Prompt DNA Hypothesis: Evolving Agent Instructions
What if we treated prompts like genetic code—subject to mutation, selection, and evolution? The best agent prompts aren't written. They're bred.
MCP: The Protocol That Won (For Now)
MCP solved the N×M integration crisis and achieved escape velocity through strategic open-sourcing and the Linux Foundation play. The de facto standard for AI connectivity—though not without costs.
The MCP Tax: When Standards Cost You 99% of Your Token Budget
The design decisions that grant MCP its universality—verbose schemas, data through context—create a compounding tax on tokens, latency, and model intelligence. Anthropic's own fixes prove the original architecture is broken.
The Probabilistic Stack: Engineering for Non-Determinism
LLMs break the fundamental assumption of software engineering: deterministic inputs produce deterministic outputs. New patterns required.
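One pattern that recurs throughout the probabilistic stack is validate-and-retry: treat every model response as a candidate, check it against a schema, and retry on failure instead of assuming determinism. A minimal sketch of that pattern; `call_model` is a hypothetical stand-in for whatever LLM client you use, not an API from the article:

```python
import json

MAX_RETRIES = 3

def call_model(prompt: str) -> str:
    """Placeholder for your LLM client call (hypothetical)."""
    raise NotImplementedError

def extract_order(prompt: str) -> dict:
    """Treat model output as probabilistic: validate it, retry on failure."""
    last_error = None
    for _ in range(MAX_RETRIES):
        raw = call_model(prompt)
        try:
            data = json.loads(raw)
            # Schema check: required keys must be present with the right types.
            if isinstance(data.get("sku"), str) and isinstance(data.get("quantity"), int):
                return data
            last_error = f"missing or mistyped fields: {data}"
        except json.JSONDecodeError as exc:
            last_error = str(exc)
        # Tighten the instruction before the next attempt.
        prompt += "\nReturn ONLY valid JSON with string 'sku' and integer 'quantity'."
    raise ValueError(f"output failed validation after {MAX_RETRIES} attempts: {last_error}")
```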
Architecture decisions determine what your agent can do. Single agent or swarm? Stateless or stateful? Chat-based or graph-based? These choices compound.
Framework comparison
The Orchestration Decision: LangGraph vs AutoGen
Choosing the wrong agent framework costs months. LangGraph excels at production determinism. AutoGen excels at rapid prototyping. Here is when to use each, and why the answer is often both.
Agent Memory: From Stateless to Stateful AI
LLMs are stateless by design. Agents require state. The memory architectures—context management, vector stores, knowledge graphs—that transform amnesiacs into collaborators.
Swarm Patterns: When Agents Learn to Collaborate
Single agents hit ceilings. Multi-agent swarms break through them. Here are the coordination patterns separating toy demos from production systems.
The Graph Mandate: Why Chat-Based Agents Fail in Production
The "Chat Loop" is the "goto" statement of the AI era. 70-90% of enterprise AI projects stall in Pilot Purgatory. Graph architectures are the path to production.
The Durable Agent: Why Infrastructure Beats Prompts
A 15-minute task that crashes at 99% complete wastes $4.50 in compute. Temporal eliminates the Restart Tax and turns debugging into DVR replay.
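The Restart Tax is easy to reproduce on the back of an envelope. A sketch with an assumed compute rate; the $0.30/minute figure is illustrative, not from the article:

```python
# Restart Tax: compute lost when a long-running task crashes near completion
# and has to start over from scratch.
task_minutes = 15
progress_at_crash = 0.99      # crashed at 99% complete
cost_per_minute = 0.30        # assumed compute rate (illustrative)

wasted_cost = task_minutes * progress_at_crash * cost_per_minute
print(f"Compute wasted per crash: ${wasted_cost:.2f}")  # ~$4.46 under these assumptions

# With durable execution (checkpointing each step), a crash resumes from the
# last completed step instead of minute zero.
```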
The demo worked. Now ship it. This layer covers what breaks in production, how to see it breaking, and how to build systems that recover automatically.
Why pilots fail
Why 90% of AI Pilots Still Fail (And How to Beat the Odds)
Only 5-10% of enterprise AI initiatives escape pilot phase to deliver measurable ROI. The problem isn't the technology—it's data readiness, the performance illusion, and organizational deficits.
The 5 Agent Failure Modes (And How to Prevent Them)
Most AI agents fail silently in production. Here are the five failure modes killing your deployments—and the architecture patterns that prevent them.
Agent Observability: Monitoring AI Systems in Production
Evaluation ends at deployment. Observability begins. Distributed tracing, guardrails, and the monitoring stack that keeps production agents reliable.
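To make "distributed tracing for agents" concrete, the OpenTelemetry Python API is one common option. A minimal sketch; span names and attributes are illustrative, and the SDK/exporter setup is assumed to live elsewhere:

```python
from opentelemetry import trace

tracer = trace.get_tracer("agent-runtime")

def call_model(prompt: str) -> str:
    """Placeholder for your LLM client call (hypothetical)."""
    return "stub response"

def run_agent_step(task_id: str, prompt: str) -> str:
    # Each agent step becomes a span, so a slow model response or a failed
    # tool call shows up in the same trace as the request that triggered it.
    with tracer.start_as_current_span("agent.step") as span:
        span.set_attribute("agent.task_id", task_id)
        span.set_attribute("agent.prompt_chars", len(prompt))
        response = call_model(prompt)
        span.set_attribute("agent.response_chars", len(response))
        return response
```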
The Self-Healing Agent: How AI Systems Learn to Fix Themselves
Static prompts in dynamic environments lead to performance decay. Here is the architecture for building agents that automatically analyze their failures and optimize themselves.
The Agent Operations Playbook: SRE for AI Systems
Traditional SRE fails with non-deterministic systems. Here are the SLAs, incident response patterns, and deployment strategies that work for production AI agents.
Agents that work but don't pay for themselves don't ship. Understanding unit economics separates production deployments from eternal pilots.
Unit economics
Agent Economics: The Unit Economics of Autonomous Work
Stop measuring cost per token. The metric that matters is Cost Per Completed Task. Here is the framework for measuring, optimizing, and governing the economics of AI agents.
The CPCT Standard: Why Cost-Per-Token is a Vanity Metric
Cost-per-token is the new "hits per second"—a vanity metric that obfuscates business health. The "cheap" model that fails 50% of the time costs 3.75x more than the premium alternative.
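The core of the CPCT argument is a one-line formula: divide the expected spend per attempt (model cost plus any cleanup of failures) by the rate at which attempts actually complete the task. A sketch with illustrative numbers, not the article's; the exact 3.75x figure depends on its specific cost assumptions:

```python
def cost_per_completed_task(model_cost: float, success_rate: float,
                            failure_cleanup_cost: float) -> float:
    """CPCT = expected spend per attempt / probability the attempt completes the task."""
    expected_cost_per_attempt = model_cost + (1 - success_rate) * failure_cleanup_cost
    return expected_cost_per_attempt / success_rate

# Illustrative numbers only.
cheap   = cost_per_completed_task(model_cost=0.02, success_rate=0.50, failure_cleanup_cost=0.50)
premium = cost_per_completed_task(model_cost=0.10, success_rate=0.95, failure_cleanup_cost=0.50)
print(f"cheap CPCT:   ${cheap:.3f}")    # $0.540
print(f"premium CPCT: ${premium:.3f}")  # $0.132
```

Under these assumed numbers the "cheap" model is roughly 4x more expensive per completed task, which is the shape of the argument even if the exact multiple differs.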
The Hallucination Tax: Calculating the True Cost of AI Errors
Every AI hallucination has a cost—lost trust, wasted time, incorrect decisions. Here's how to calculate yours and the architecture that minimizes it.
The Agent Scorecard: Translating Technical Metrics to Business ROI
Engineers track latency and tokens. Executives want ROI. Here is the framework for translating agent performance into board-ready business metrics.
Agents that can take actions can take wrong actions. The security layer isn't optional—it's the difference between a demo and something you'd let touch production data.
Threat model
The Agent Attack Surface: Security Beyond Safety
The shift from chat to agency creates a new threat model. AI Security differs from AI Safety. Prompt injection is unsolved—defense requires architectural containment, not prevention.
The Agent Safety Stack: Defense-in-Depth for Autonomous AI
Agents that take actions have different risk profiles than chatbots. Here is the defense-in-depth architecture: prompt injection defense, red teaming, kill switches, and guardrail benchmarks.
The HITL Firewall: How Human Oversight Doubles Your AI ROI
Full autonomy is a myth for high-stakes tasks. Smart thresholds with human review deliver 85% cost reduction at 98% accuracy. Here are the approval patterns that work.
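Most of the approval patterns the article describes reduce to threshold routing: the agent acts autonomously only when its confidence (or a verifier's score) clears a bar, and everything else goes to a human queue. A minimal sketch; the threshold value and the `Decision` shape are assumptions for illustration:

```python
from dataclasses import dataclass

APPROVAL_THRESHOLD = 0.90   # illustrative; tune per task risk

@dataclass
class Decision:
    action: str
    confidence: float        # from a verifier model or a calibration step

def route(decision: Decision) -> str:
    """HITL firewall: auto-execute only high-confidence actions."""
    if decision.confidence >= APPROVAL_THRESHOLD:
        return "auto_execute"
    return "queue_for_human_review"

print(route(Decision(action="refund $12", confidence=0.97)))   # auto_execute
print(route(Decision(action="refund $900", confidence=0.71)))  # queue_for_human_review
```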
The Input Assurance Boundary: Treating Prompts Like SQL Injection
Prompt injection is not a bug. It is an architectural feature of LLMs. Security audits show 73% of systems are vulnerable. Safety is not a prompt. Safety is architecture.
'It seems to work' isn't a deployment criterion. Rigorous evaluation separates agents you trust from agents you hope work.
Understanding where the market is going helps you build for the right future. Vertical beats horizontal. Context beats capability.
Market thesis
Vertical Agents Are Eating Horizontal Agents
Harvey ($8B), Cursor ($29B), Abridge ($2.5B): vertical agents are winning. The "do anything" agent was a transitional form—enterprises buy solutions, not intelligence.
The Autonomous Revolution: AI Agents Rewriting Work
The workforce is evolving—literally. AI agents are no longer experimental tools but genetically optimized systems driving 50%+ of enterprise operations autonomously.
Solve Intelligence: The AI Operating System for Patent Law
Solve Intelligence exemplifies the vertical agent thesis—domain depth, proprietary fine-tuning, and workflow integration create moats that horizontal AI cannot replicate.
Why Legal AI Breaks Every Rule About Agent Adoption
In every vertical, small companies deploy AI faster than enterprises. Legal is the exception. Content moats and liability costs invert the landscape.
The State of Legal AI: When Research Takes Minutes and Arguments Write Themselves
Legal AI evolved from search engines to autonomous research partners. CoCounsel, Harvey, and the new wave are rebuilding the profession.
The Agent Ecosystem Map: A Buyer's Guide to Vendor Selection
The $7.6B agent market in three tiers: Foundational (Microsoft, Google), Orchestration (Kore.ai, Airia), and Vertical (Harvey, Devin). Vendor evaluation guide.
The Top 100 AI Agent Companies: A Strategic Directory
The definitive directory of 100 AI agent companies. Three tiers: Foundational platforms, Integration partners, and Vertical specialists for enterprise automation.
Voice AI is a distinct vertical with its own constraints: latency, streaming, turn-taking. A separate reading path for voice-first applications.
The 500ms rule
The 500ms Threshold: Why Latency Kills Voice AI
Voice AI has a hard latency ceiling. Exceed 500ms round-trip and users abandon. This shapes every architectural decision from model selection to interrupt handling.
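A useful way to internalize the 500ms ceiling is to write the round-trip as a budget and watch how quickly it gets spent. The component figures below are illustrative assumptions, not measurements from the article:

```python
# Voice round-trip latency budget (all figures illustrative).
BUDGET_MS = 500

pipeline_ms = {
    "network (both directions)": 60,
    "speech-to-text (streaming, endpoint detection)": 150,
    "LLM time-to-first-token": 200,
    "text-to-speech (first audio chunk)": 80,
}

total = sum(pipeline_ms.values())
print(f"total: {total}ms of a {BUDGET_MS}ms budget ({BUDGET_MS - total}ms headroom)")
# Every added hop (a tool call, a guardrail pass, a larger model) comes out of
# that remaining headroom, which is why latency drives architecture choices.
```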
Voice: The Universal API for Human-Computer Interaction
Voice is not a feature—it's an interface paradigm shift. The trajectory from CLI to Voice, and why getting turn management right matters more than raw speed.
ElevenLabs: The Voice Infrastructure Play
ElevenLabs pivoted from creative TTS tool to real-time voice infrastructure. At $3.3B valuation, they bet on becoming the "Voice OS" of the enterprise.