# Prompt Chaining
Sequential LLM calls with validation gates between steps.
## Overview
Prompt chaining is the simplest workflow pattern: a sequence of LLM calls where each call's output becomes the next call's input, with optional validation gates between steps.
## Architecture
*Figure: A 3-step prompt chain with validation gates between steps. Each LLM call has a focused task. Gates check output quality before proceeding.*
## How It Works
- Decompose the task into discrete, sequential steps
- Each step gets its own prompt, optimized for that specific subtask
- Gates between steps validate output format, quality, or content before passing it forward
- The chain runs to completion or fails at a gate
Each LLM call has a narrow, well-defined job. This makes prompts simpler, outputs more reliable, and debugging straightforward — you know exactly which step produced a given output.
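The loop described above can be sketched in a few lines. This is an illustrative, self-contained version (not the repo's implementation): each step pairs a transform, which stands in for an LLM call, with an optional gate.

```python
from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class Step:
    name: str
    run: Callable[[str], str]                      # stands in for an LLM call
    gate: Optional[Callable[[str], bool]] = None   # validation gate, if any

def run_chain(steps: list[Step], data: str) -> dict:
    """Run steps in order; stop at the first step whose gate rejects its output."""
    for step in steps:
        data = step.run(data)
        if step.gate and not step.gate(data):
            return {"success": False, "failed_at": step.name, "output": data}
    return {"success": True, "failed_at": None, "output": data}

# Stub "LLM calls" so the loop is runnable without a model.
steps = [
    Step("extract", lambda s: s.upper(), gate=lambda out: len(out) > 3),
    Step("format", lambda s: f"- {s}"),
]
print(run_chain(steps, "spec text"))  # success, output "- SPEC TEXT"
```

Because the chain either runs to completion or stops at a named gate, the failure site is always identifiable, which is what makes the pattern easy to debug.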
## Minimal Example
Extract key requirements from a spec, prioritize them by complexity, then format the result as a checklist for engineers — three focused LLM calls in sequence.
```python
from workflows.prompt_chaining.code.python.prompt_chaining import PromptChain, ChainStep

chain = PromptChain(
    llm=your_llm,
    steps=[
        ChainStep(
            name="extract",
            prompt_template="Extract the key technical requirements from:\n\n{input}",
            validate=lambda out: len(out) > 20,  # Gate: reject empty/trivial output
        ),
        ChainStep(
            name="prioritize",
            prompt_template="Rank these requirements by implementation complexity:\n\n{input}",
        ),
        ChainStep(
            name="format",
            prompt_template="Format this as a numbered checklist for engineers:\n\n{input}",
        ),
    ],
)

result = chain.run(raw_spec_document)
# result.success   → True/False
# result.failed_at → name of the step that failed its gate, or None
# result.output    → final formatted checklist
```
Full implementation: [`code/python/prompt_chaining.py`](code/python/prompt_chaining.py)
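Gates need not be simple length checks like the one in the example above. A structural validator can reject malformed output before it propagates to later steps; here is a hedged sketch using only the standard library (the gate name and its use with `validate=` are illustrative):

```python
import json

def json_list_gate(out: str) -> bool:
    """Gate: accept only output that parses as a non-empty JSON list."""
    try:
        parsed = json.loads(out)
    except json.JSONDecodeError:
        return False
    return isinstance(parsed, list) and len(parsed) > 0

# Would be wired up as, e.g., ChainStep(..., validate=json_list_gate)
assert json_list_gate('["req-1", "req-2"]') is True
assert json_list_gate("not json") is False
```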
## Input / Output
- Input: Any data that needs multi-step LLM processing
- Output: Transformed result after passing through all steps
- Intermediate: Each step produces an output consumed by the next step
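Keeping those intermediate outputs around is useful for the step-by-step debugging this pattern enables. A minimal sketch of a traced runner, assuming nothing about the repo's API:

```python
def run_chain_traced(steps, data):
    """Run (name, fn) steps in order, recording every intermediate output."""
    trace = []
    for name, fn in steps:
        data = fn(data)
        trace.append((name, data))  # keep each step's output for inspection
    return data, trace

# Stub transforms stand in for LLM calls.
out, trace = run_chain_traced(
    [("extract", str.strip), ("format", str.title)],
    "  build the api  ",
)
# out == "Build The Api"; trace holds each step's intermediate output
```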
## Key Tradeoffs
| Strength | Limitation |
|---|---|
| Simple to understand and debug | Rigid — steps are fixed at design time |
| Each step has a focused prompt | Latency scales linearly with step count |
| Gates catch errors early | No ability to adapt based on intermediate results |
| Easy to test step-by-step | Information can be lost between steps |
| Predictable cost (fixed call count) | Adding new steps requires code changes |
## When to Use
- Tasks with a clear, fixed sequence of transformations
- When each step's output can be validated before proceeding
- When you need deterministic behavior and easy debugging
- Multi-step content generation (draft → edit → format)
- ETL-style processing (extract → transform → load)
## When NOT to Use
- When the number of steps depends on the input or intermediate results — use ReAct instead
- When steps are independent and can run concurrently — use Parallel Calls instead
- When output needs iterative quality improvement — use Evaluator-Optimizer instead
- When the LLM needs to decide which operations to perform — use an agent pattern
## Related Patterns
- Evolves into: ReAct (add dynamic tool selection and LLM-controlled looping), Tool Use (add structured function calling), Memory (add persistent state between runs)
- Combines with: Evaluator-Optimizer (add quality gates that loop), Parallel Calls (parallelize independent steps)
- Simpler alternative to: Orchestrator-Worker (when you don't need dynamic task decomposition)
## Deeper Dive
- Design — Component breakdown, data flow, gate strategies, error handling
- Implementation — Pseudocode, interfaces, testing strategy, common pitfalls