Drift Detection

Keeping long-running swarms on-task. Drift is what happens when an agent (or a swarm) gradually moves away from the original goal, one reasonable-looking step at a time, until you look up an hour later and it's working on something else entirely.

What drift looks like

Real examples from real runs:

A refactoring agent starts cleaning up unrelated files because they "look similar".
A planning agent expands the scope of a feature to add "obviously useful" extras.
A debugging agent spirals into a library it shouldn't have entered to chase a phantom bug.
A swarm picks up stigmergic work items that were supposed to be scoped to one subdirectory.

Each step looks locally reasonable. The problem is only visible when you compare the current state to the original goal.

The detector

Codebolt's drift detector runs as a background check on long runs. It periodically:

Loads the original task and the initial plan.
Loads the last N phases from the event log.
Asks an LLM (a cheap one, not the agent's main model): "does what's happening in these phases still align with the original task?"
Produces a verdict: on_track, drifting, or off_track, plus a reason.

When it fires drifting or off_track, it doesn't kill the agent — it emits a warning event the orchestrator can react to. Typical reactions:

drifting — the orchestrator injects a reminder into the next step's context ("your original task was X, you're currently working on Y — confirm this is still on the path").
off_track — the orchestrator triggers a replan at the appropriate level of the planning hierarchy, or pauses the run for human input.

Why it runs as a sidecar

The drift detector is deliberately not part of the agent's own loop. An agent can't reliably detect its own drift — it's already decided the current path is reasonable. An external observer with a fresh context window is the only way to catch gradual movement.

This is the same reason code review is a separate agent in plan-execute-review: independence of judgment matters.

Configuration

Drift detection is per-flow or per-agent:

drift_detection:
  enabled: true
  interval_phases: 5         # check every 5 phases
  model: "small-judge-model"
  on_drifting: reminder
  on_off_track: replan       # or: pause_for_review, kill

Default is enabled: false for short runs and enabled: true for flows with more than a few iterations. You can override per-agent if you know a particular agent is prone to wandering.

Cost

Drift detection is a small recurring cost — one cheap LLM call every few phases. For long runs the savings from catching drift early massively outweighs the detection cost. For short runs the detector rarely fires before the run ends anyway, so disabling it saves noise.

Complementary mechanisms

Drift detection works best alongside:

Guardrails — deterministic rules catch hard violations; drift catches soft ones.
Reputation — agents with chronic drift lose reputation over time and get assigned less work.
Explicit scope in the task. A task written as "only touch files under src/auth/" gives the detector something concrete to check against. Vague tasks ("improve the auth code") are harder to drift-check.
Checkpoints — when drift is detected and the run is rolled back, you already have the state from before the drift started.

Pitfalls

Over-eager replans. If the detector is too sensitive, it replans constantly and nothing gets done. Tune the threshold per project.
Drift detector drifting. Yes, really — if the detector's model or prompt changes, its sense of "on track" changes. Treat the detector as infrastructure and version it.
False on-track. A drifting agent can sometimes talk the detector into thinking everything is fine. Reduce this by feeding the detector the original plan, not just the task description.
Using drift detection instead of good task definition. If your tasks are vague, drift detection is a patch. Better to write clearer tasks upfront.

What drift looks like​

The detector​

Why it runs as a sidecar​

Configuration​

Cost​

Complementary mechanisms​

Pitfalls​

See also​