# ReAct

Reason-act loop: the LLM reasons, calls a tool, observes, and repeats until done.

## Overview

ReAct (Reason + Act) is the foundational agent pattern: a loop where the LLM reasons about what to do, acts by calling a tool, observes the result, and repeats until the task is complete. The LLM controls when to act and when to stop.

**Evolves from:** Prompt Chaining — adds dynamic tool selection and LLM-controlled looping.
## Architecture

*Figure: The ReAct loop. The LLM thinks, decides whether to act or respond, executes a tool if needed, and observes the result. A max-iteration guard prevents infinite loops.*
## How It Works

1. The LLM receives the task and the available tool schemas.
2. It generates a reasoning step ("I need to search for X because...").
3. It selects a tool and provides arguments.
4. Your code executes the tool and returns the observation.
5. The LLM reasons about the observation and decides the next action.
6. Repeat until the LLM produces a final answer or hits the iteration limit.
The key insight: the LLM interleaves thinking with acting. It doesn't just plan all steps upfront — it adapts based on what it discovers.
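This loop can be sketched in a few lines of Python. The sketch below is an illustration only, not the library's implementation — the `llm` callable, the plain-string history, and the `Action: name | Input: arg` parsing convention are all assumptions:

```python
import re

def react_loop(llm, tools, task, max_steps=8):
    """Minimal ReAct sketch: Thought -> Action -> Observation until done.

    `llm` is any callable taking the history string and returning the next
    reasoning step; `tools` maps tool names to callables.
    """
    history = [f"Task: {task}"]
    for _ in range(max_steps):
        reply = llm("\n".join(history))      # one Thought (+ Action) step
        history.append(reply)
        if "Final Answer:" in reply:         # the LLM decides when to stop
            return reply.split("Final Answer:", 1)[1].strip()
        match = re.search(r"Action: (\w+) \| Input: (.+)", reply)
        if match:                            # dispatch the chosen tool
            name, arg = match.group(1), match.group(2).strip()
            history.append(f"Observation: {tools[name](arg)}")
    return None                              # iteration guard hit, no answer
```

Note that the loop itself is plain code; all control decisions — which tool, which arguments, when to stop — come from the model's output.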
## Minimal Example

Answer a compound question using search and a calculator — the agent decides which tools to call and when to stop.

```python
from patterns.react.code.python.react_agent import ReActAgent, Tool

agent = ReActAgent(
    llm=your_llm,
    tools=[
        Tool("search", "Search the web for current information", lambda q: search_api(q)),
        # Note: eval is a demo shortcut — never eval untrusted input in production.
        Tool("calculator", "Evaluate a math expression", lambda expr: str(eval(expr))),
    ],
    max_steps=8,
)

result = agent.run(
    "What is the compound interest on $5,000 at the current US federal funds rate for 10 years?"
)
# result.answer → final answer once the agent emits "Final Answer:"
# result.steps → full Thought / Action / Observation trace
# result.stopped_by_guard → True if max_steps was hit before a final answer
```
Example trace:

```
Thought: I need the current federal funds rate first.
Action: search | Input: "current US federal funds rate 2024"
Observation: The federal funds rate is 5.25–5.50% as of late 2024.
Thought: Now I'll calculate compound interest.
Action: calculator | Input: 5000 * (1 + 0.0525) ** 10
Observation: 8340.48
Thought: I now know the final answer.
Final Answer: At 5.25%, $5,000 grows to approximately $8,340 over 10 years — about $3,340 in compound interest.
```
Full implementation: [`code/python/react_agent.py`](code/python/react_agent.py)
## Input / Output
- Input: A user task/question + a set of available tools (with schemas)
- Output: A final answer after zero or more tool calls
- State: Message history accumulating reasoning steps and observations
## Key Tradeoffs
| Strength | Limitation |
|---|---|
| Handles open-ended, exploratory tasks | Unpredictable number of steps and cost |
| Adapts strategy based on observations | Can get stuck in loops or repeat failed actions |
| Simple to implement — one loop, one LLM | No upfront planning — may take inefficient paths |
| General-purpose — works for many task types | Reasoning quality degrades with long histories |
| Easy to add new tools without structural changes | Hard to test deterministically |
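The "stuck in loops" limitation above is commonly mitigated with a cheap repetition check on the action trace before each step. A sketch — the function name and window size are illustrative choices, not part of any library:

```python
def is_stuck(actions, window=3):
    """True if the last `window` (tool, input) pairs are identical — a cheap
    signal that the agent is repeating a failed action and should be nudged
    (e.g. by injecting a "try a different approach" observation) or stopped.
    """
    if len(actions) < window:
        return False
    return len(set(actions[-window:])) == 1
```

This complements the max-iteration guard: the guard caps total cost, while a repetition check catches tight loops early instead of burning the whole budget on the same failing call.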
## When to Use
- Open-ended tasks where the steps aren't known in advance
- Tasks requiring tool use with adaptive behavior
- Question-answering that may need multiple information sources
- When you want the simplest possible agent architecture
- As the starting point before deciding you need a more complex pattern
## When NOT to Use
- When steps are known in advance — use Prompt Chaining
- When the task needs upfront strategic planning — use Plan & Execute
- When quality needs iterative self-improvement — use Reflection
- When multiple specialized capabilities are needed — use Multi-Agent
## Related Patterns
- Evolves from: Prompt Chaining — see evolution.md
- Builds on: Tool Use — ReAct requires tool use as a component
- Extends into: Plan & Execute (add planning), Reflection (add self-critique), RAG (add retrieval), Memory (add persistence)
## Deeper Dive
- Design — Loop mechanics, message history management, tool dispatch, termination strategies
- Implementation — Pseudocode, interfaces, prompt templates, testing approach
- Evolution — How ReAct emerges from prompt chaining