claude-engineering-plugin/docs/solutions/skill-design/pass-paths-not-content-to-subagents-2026-03-26.md at c1f68d4d55ebf6085eaa7c177bf5c2e7a2cfb62c

john/claude-engineering-plugin

Fork 0

Files

Trevin Chow c1f68d4d55

CI / pr-title (push) Has been cancelled

Details

CI / test (push) Has been cancelled

Details

Release PR / release-pr (push) Has been cancelled

Details

Release PR / publish-cli (push) Has been cancelled

Details

feat(doc-review, learnings-researcher): tiers, chain grouping, rewrite (#601 )

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-19 20:25:47 -07:00

6.1 KiB

Raw Blame History

title, problem_type, component, root_cause, resolution_type, severity, tags, date

title

problem_type

component

root_cause

resolution_type

severity

Problem

When orchestrating sub-agents that need codebase reference material (config files, standards docs, etc.), passing full file contents in the sub-agent prompt bloats context and makes the orchestrator do expensive upfront work that may go unused.

Symptoms

Orchestrator skill reads multiple files, concatenates their contents into a block (e.g., <standards> with full CLAUDE.md/AGENTS.md content), and injects it into the sub-agent prompt
Sub-agent receives all content regardless of how much is relevant to its specific task
In repos with directory-scoped config files, the orchestrator must discover and read every file before invoking a single sub-agent
Sub-agent prompts grow linearly with the number of reference files, even when the agent needs only specific sections

What Didn't Work

Having the orchestrator read all relevant file contents and pass them in a content block. This was the initial approach for the project-standards-reviewer agent in ce:review: Stage 3b collected all CLAUDE.md/AGENTS.md content into a <standards> block passed in the sub-agent prompt.

Problems:

Orchestrator did expensive read work that may be partially wasted
Sub-agent prompt inflated with content it may not fully use
Scales poorly as the number of directory-scoped config files grows
Sub-agent loses agency to decide what's relevant

Solution

Separate discovery (cheap) from reading (expensive). The orchestrator discovers file paths via glob or search, passes a path list, and the sub-agent reads only the files and sections it needs.

Pattern from Anthropic's code-review command:

"Use another Haiku agent to give you a list of file paths to (but not the contents of) any relevant CLAUDE.md files from the codebase: the root CLAUDE.md file (if one exists), as well as any CLAUDE.md files in the directories whose files the pull request modified"

The reviewing agents then receive those paths and read the files themselves.

How we applied it in ce:review:

Stage 3b: orchestrator globs for CLAUDE.md/AGENTS.md paths in changed directories, emits a <standards-paths> block
Sub-agent prompt: project-standards-reviewer reads the listed files itself, targeting sections relevant to the changed file types
Standalone fallback: if no <standards-paths> block is present, the agent discovers paths independently

General template:

Orchestrator:
1. Discover paths (glob/search) -> emit <reference-paths> block
2. Pass path list to sub-agent

Sub-agent:
1. If <reference-paths> present, read listed files
2. If absent, discover paths independently (standalone fallback)
3. Read only sections relevant to the specific task

Why This Works

Discovery is cheap; reading and processing file contents is expensive. The sub-agent is closer to the task (it knows what it's reviewing) and is better positioned to decide which sections of which files are relevant. This is lazy evaluation applied to agent orchestration: don't pay the cost of reading until you know you need the content.

Prevention

When designing orchestrator skills that invoke sub-agents needing repo reference material:

Default to path-passing. Orchestrator discovers paths, sub-agent reads content.
Include a standalone fallback. If the paths block is absent, the sub-agent discovers paths on its own. This enables both orchestrated and standalone invocation.
Content-passing is acceptable when: the reference material is small, static, and guaranteed to be fully consumed by every invocation (e.g., a JSON schema under 50 lines that the sub-agent always needs in full).
Signal to refactor: if you catch an orchestrator reading file contents before invoking sub-agents, treat it as a candidate for the path-passing pattern.

Instruction phrasing matters more than meta-rules

Empirical testing showed that how the skill phrases a search instruction has a dramatic effect on tool call count. For the same task (find ancestor CLAUDE.md/AGENTS.md files for changed paths):

Instruction phrasing	Claude Code tool calls	Codex shell commands
"for each changed file, walk its ancestor directories and check for X at each level"	14	2
"find all X in the repo, then filter to ancestors of changed files"	2	2

The "per-item walk" phrasing caused Claude Code to glob each directory level individually. The "bulk find, then filter" phrasing produced two globs total. Codex was resilient to both phrasings (it wrote a Python script to batch the work either way).

When in doubt about whether an instruction phrasing is efficient, test it empirically before committing. Both claude -p and codex exec support JSON output that reveals tool call counts:

# Claude Code: stream-json + verbose shows each tool call
claude -p "instruction here" --output-format stream-json --verbose 2>/dev/null > out.jsonl

# Codex: --json shows command_execution events
codex exec --json --full-auto "instruction here" > out.jsonl

This is worth doing for orchestration-heavy skills where instructions drive search or file discovery — a small phrasing change can produce a large difference in tool calls, latency, and token cost. Not every instruction needs benchmarking, but when the skill will run on every review or every plan, the cost compounds.

docs/solutions/skill-design/compound-refresh-skill-improvements.md — establishes "no shell commands for file operations in subagents"; complementary pattern about letting sub-agents use appropriate tools rather than orchestrating reads on their behalf
docs/solutions/skill-design/script-first-skill-architecture.md — complementary pattern: scripts pre-process large datasets so orchestrators don't load raw data
docs/solutions/agent-friendly-cli-principles.md — Principle #7 (Bounded, High-Signal Responses) reinforces that agents pay real cost for extra output; paths are bounded, content is not

6.1 KiB Raw Blame History

Problem

Symptoms

What Didn't Work

Solution

Why This Works

Prevention

Instruction phrasing matters more than meta-rules

Related

6.1 KiB

Raw Blame History