Scaling Managed Agents: Decoupling the Brain from the Hands

Source: Scaling Managed Agents — Decoupling the Brain from the Hands

Original: raw/docs/anthropic-managed-agents.md
URL: https://www.anthropic.com/engineering/managed-agents
Author: Anthropic Engineering

Summary

Anthropic's Managed Agents service redesigns agentic infrastructure around a single insight: harnesses encode assumptions that go stale as models improve. The fix is to decouple the three logical layers of an agent system so each can evolve or fail independently.

The three-layer model

Layer What it is
Brain Claude + its harness (reasoning, tool-calling logic)
Hands Sandboxes + tools that actually execute actions
Session Append-only event log (persists outside both Brain and Hands)

Key design decisions

Harness outside containers

Previously: harness ran inside the container alongside execution.
Now: harness lives outside, talks to containers via execute(name, input) → string.

Benefits:

Durable session storage

The session log is stored outside the harness. On crash/restart:

Security through separation

Performance results

Metric Improvement
p50 time-to-first-token ~60% reduction
p95 time-to-first-token >90% reduction

Gains come primarily from on-demand container provisioning.

Scalability model

"Many brains, many hands": multiple Claude instances access multiple execution environments independently. No shared state between Brain instances → no single point of failure.