Foundations

Before picking a pattern, build the mental models. This section covers the vocabulary, anatomy, and decision frameworks you'll use throughout the repository.

Terminology

Precise definitions of core concepts: LLM, Prompt, Tool, State, Memory, Context Window, and more.

Anatomy of an Agent

The components that make up an agent system: Agent Core, Goal, Tools, State, Memory, Constraints.

Choosing a Pattern

A decision flowchart to help you pick the right workflow or agent pattern for your problem.

Frameworks & Integrations

How patterns map to LangGraph, Claude Agent SDK, CrewAI, AutoGen, LlamaIndex, and MCP. A map, not a tutorial.

Security & Safety

Prompt injection, tool-use safety, secrets, output filtering, MCP supply chain — the security primer for LLM agents.

Hallucination & Grounding

Why agents hallucinate, grounding strategies, abstention patterns, eval-gated deployment.

Evals & Quality

Evals as tests (not benchmarks), golden datasets, metric selection, online vs offline, regression suites.

Cost & Model Selection

Model tier selection (Haiku/Sonnet/Opus), token budgets, per-pattern cost shape, cost guardrails.

Foundations

This section establishes the mental models and vocabulary you need before diving into any specific pattern. Read this first.

What Are LLM Applications?

An LLM application is any system that uses a large language model to process inputs and produce outputs. These range from a single API call with a well-crafted prompt to complex multi-agent systems that plan, reason, and act autonomously.

Most production LLM systems fall somewhere on this spectrum:

Single LLM Call — One prompt in, one response out. No loops, no tools. Useful for classification, summarization, and extraction.

Prompt Chain — Multiple LLM calls in sequence where each output feeds the next input. Adds structure and validation gates between steps.

LLM Workflow — Orchestrated patterns of LLM calls with branching, parallelism, or evaluation loops. The system follows a predetermined control flow — the code decides what happens next, not the LLM.

Agent — An LLM-driven system where the model itself decides what to do next. The LLM chooses which tools to call, when to stop, and how to respond to observations. Control flow is dynamic.

Multi-Agent System — Multiple agents collaborating, each with specialized capabilities. A supervisor or protocol coordinates their interactions.

The Key Insight

The difference between a workflow and an agent is who controls the loop:

In a workflow, the developer writes the control flow. The LLM fills in the blanks.
In an agent, the LLM is the control flow. The developer provides tools and constraints.

This distinction matters because it determines your system's predictability, debuggability, and failure modes. Workflows are easier to test and reason about. Agents are more flexible but harder to control.

How This Repository Is Organized

Section	What It Covers	Start Here If...
Foundations	Core concepts, terminology, pattern selection	You're new to agent design
Workflows	Pre-agent LLM patterns (chaining, parallelism, orchestration, evaluation)	You need structured LLM pipelines
Agent Patterns	Agent architectures (ReAct, planning, memory, RAG, multi-agent)	You need autonomous LLM behavior
Composition	How to combine patterns into complete systems	You're designing a production system

Reading Order

If you're learning from scratch, read in this order:

Terminology — Get the vocabulary right
Anatomy of an Agent — Understand what makes agents tick
Choosing a Pattern — Pick the right tool for your problem
Workflows — Learn the foundational patterns
Agent Patterns — See how workflows evolve into agents

Before shipping to production, also read:

Anti-Patterns — The 12 most common design mistakes and how to avoid them

For background and positioning:

System Design Heritage — How blueprints map to classical distributed-systems patterns, and which patterns are scoped to agent-deployments

In This Section

The foundations split into three groups. Read them in the order that matches your goal.

Core concepts — read first

The vocabulary, mental models, and decision frameworks you'll use everywhere else.

Terminology — Precise definitions of agent, workflow, tool, and other overloaded terms
Anatomy of an Agent — The components every agent has and what distinguishes agents from workflows
Choosing a Pattern — Decision flowchart and guidance for selecting the right pattern
Anti-Patterns — What not to build, why people build it anyway, and the correct alternative

Production concerns — read before shipping

Cross-cutting concerns every production agent inherits. Not pattern-specific; safe to read at any depth tier.

Security & Safety — Prompt injection, tool-use safety, secrets, output filtering, MCP supply chain
Hallucination & Grounding — Why agents hallucinate, grounding strategies, abstention, eval-gated deployment
Evals & Quality — Evals as tests (not benchmarks), golden datasets, metric selection, online vs offline, regression suites
Cost & Model Selection — Model tier selection, token budgets, per-pattern cost shape, the latency/cost/quality triangle, guardrails
Context Engineering — Context as a finite resource, the four levers (select / compress / prune / persist), per-pattern context shape, context-window awareness, compaction
Testing Strategies — Unit tests, mock LLMs, integration tests, evaluation, and regression testing for LLM systems

Reference and positioning

The repo's lineage and its place in the broader ecosystem.

System Design Heritage — Lineage map from classical system-design patterns to the blueprints here, and the reliability gap scoped to agent-deployments
Frameworks & Integrations — A map from patterns to LangGraph, Claude Agent SDK, CrewAI, AutoGen, LlamaIndex, and MCP