Map-Reduce

A specialisation of manager-worker for the classic map-reduce shape: apply the same operation to N inputs in parallel, then combine the N outputs.

When to use

Homogeneous tasks. Every sub-task is structurally the same — same prompt, same tools, different input.
Results combine cleanly. There's a well-defined reduce step: sum, union, merge, concatenate, pick-best.
Scale matters. N is large enough that running sequentially is painful.

Examples: summarise each file in a directory, generate tests for each function in a module, extract a field from each of N JSON documents, translate each paragraph.

Shape

          input list [i1, i2, i3, ..., iN]
                       │
                       ▼
                  ┌──── map ────┐
                  ▼     ▼   ... ▼
                 A1    A2       AN      ← worker agents, same prompt
                  │     │        │
                  ▼     ▼        ▼
                 r1    r2       rN      ← partial results
                  │     │        │
                  └──── reduce ──┘
                       │
                       ▼
                    result

Map vs manager-worker

Map-reduce is manager-worker with two constraints:

All workers run the same agent configuration.
The combine step is a pure function of the partial results, not a new round of decision-making.

If either constraint is false, it's just manager-worker.

Implementation

// pseudocode for the orchestrator agent
const items = loadInputs();
const results = await Promise.all(
  items.map(item =>
    toolService.call({
      tool: "codebolt_agent.start",
      args: {
        agent: "per-item-worker",
        input: item,
        wait: true,
      },
    })
  )
);
return reduce(results);

In Codebolt terms:

codebolt_agent.start with wait: true creates a child run via agentService.startRun and blocks until it finishes.
The orchestrator uses Promise.all (or the equivalent in the agent framework) to run children concurrently.
Concurrency is bounded by AgentProcessManager's capacity — set a project-level cap so a map of 1000 items doesn't try to spawn 1000 processes at once.

The reduce step

Two flavours:

Pure reduce (deterministic)

The orchestrator collects results and applies a deterministic function: sum the numbers, merge the sets, concatenate the strings. No LLM needed. Cheapest and most predictable.

LLM reduce

The orchestrator asks an LLM to synthesise the partials into a unified result. Use when the partials need judgment to combine — e.g. "here are 10 draft summaries of different files, write a unified summary of the module".

Prefer pure reduce when possible. LLM reduce re-introduces non-determinism and cost.

Variations

Bounded parallelism

Cap concurrency to K workers at a time. Use coordinationService or a simple semaphore. Essential at large N.

Map-reduce with filtering

The map step can drop items ("this file isn't relevant"). The reduce step only sees items that survived. Cheap way to combine classification + processing.

Hierarchical reduce

For very large N, combine in a tree: reduce groups of 10, then reduce the reductions. Keeps any single reduce step's context manageable.

Pitfalls

Unbounded parallelism. N=1000 with no cap will open 1000 processes, spend your LLM budget in seconds, and probably hit provider rate limits. Always cap.
Heterogeneous items pretending to be homogeneous. If some items need different prompts, you don't have a map-reduce — you have manager-worker. Don't force it.
Losing the per-item context in the reduce. If the reduce needs to know which item each result came from, carry the item ID in the partial result. Easy to forget.
Rate limits. Bursty concurrent LLM calls will hit provider rate limits before they hit Codebolt's own caps. Set worker concurrency below your provider's sustained rate.

Example: "summarise each file in src/"

# orchestrator
task: "produce a one-paragraph summary of each file in src/"
steps:
  - list_files: { path: "src/", pattern: "**/*.ts" }
  - map:
      over: files
      agent: file-summariser
      concurrency: 8
  - reduce:
      kind: concat
      sep: "\n\n"
  - write: "docs/file-summaries.md"

When to use​

Shape​

Map vs manager-worker​

Implementation​

The reduce step​

Pure reduce (deterministic)​

LLM reduce​

Variations​

Bounded parallelism​

Map-reduce with filtering​

Hierarchical reduce​

Pitfalls​

Example: "summarise each file in src/"​

See also​