Reduce LLM token usage by up to 50% through intelligent context pruning. Auto-supersede duplicates, manually discard/distill content, and preserve critical state.
npm install @tuanhung303/opencode-acp



Reduce token usage by up to 50% through intelligent context management.
ACP optimizes LLM context windows by automatically pruning obsolete content (tool outputs, messages, and reasoning blocks) while preserving critical operational state.
---
```mermaid
flowchart TB
    subgraph Input["Input Layer"]
        U[User Message]
        T[Tool Outputs]
        M[Assistant Messages]
        R[Thinking Blocks]
    end
    subgraph Processing["ACP Processing"]
        direction TB
        Auto["Auto-Supersede"]
        Manual["Manual Pruning"]
        subgraph AutoStrategies["Auto-Supersede Strategies"]
            H["Hash-Based<br/>Duplicates"]
            F["File-Based<br/>Operations"]
            Todo["Todo-Based<br/>Updates"]
            URL["Source-URL<br/>Fetches"]
            SQ["State Query<br/>Dedup"]
        end
        subgraph ManualTools["Manual Tools"]
            D[Discard]
            Dist[Distill]
        end
    end
    subgraph Output["Optimized Context"]
        Clean["Clean Context<br/>~50% smaller"]
        L[LLM Provider]
    end
    U --> Processing
    T --> Auto
    M --> Manual
    R --> Manual
    Auto --> H
    Auto --> F
    Auto --> Todo
    Auto --> URL
    Auto --> SQ
    Manual --> D
    Manual --> Dist
    H --> Clean
    F --> Clean
    Todo --> Clean
    URL --> Clean
    SQ --> Clean
    D --> Clean
    Dist --> Clean
    Clean --> L
    style Input fill:#e1f5fe,stroke:#01579b,stroke-width:2px
    style Processing fill:#f3e5f5,stroke:#4a148c,stroke-width:2px
    style AutoStrategies fill:#fff3e0,stroke:#e65100,stroke-width:1px
    style ManualTools fill:#e8f5e9,stroke:#1b5e20,stroke-width:1px
    style Output fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
```
---
```bash
npm install @tuanhung303/opencode-acp
```

Add to your OpenCode config:

```jsonc
// opencode.jsonc
{
  "plugin": ["@tuanhung303/opencode-acp@latest"],
}
```
ACP handles most pruning automatically. The following tools give agents granular control over context:
```typescript
// Discard completed work
context_prune({ action: "discard", targets: [["a1b2c3"]] })

// Distill large outputs
context_prune({
  action: "distill",
  targets: [["d4e5f6", "Found 15 TypeScript files"]],
})

// Batch operations
context_prune({
  action: "discard",
  targets: [["hash1"], ["hash2"], ["hash3"]],
})
```
---
| Document | Purpose |
| --------------------------------------------------------------- | ------------------------------------------ |
| Validation Guide | 43 comprehensive test cases |
| Test Harness | Ready-to-run test scripts |
| Todo Write Testing Guide | Testing todowrite & stuck task detection |
| Context Architecture | Memory management strategies |
| Decision Tree | Visual pruning flowcharts |
| Limitations Report | What cannot be pruned |
| Changelog | Version history and migration guides |
---
ACP provides the `context_prune` tool for intelligent context management:

```typescript
context_prune({
  action: "discard" | "distill" | "replace",
  targets: [string, string?, string?][] // Format depends on action
})
```
| Type | Format | Example |
| ------------------- | --------------------------- | ---------------------------------------------- |
| Tool outputs | 6 hex chars | `44136f`, `01cb91` |
| Thinking blocks | 6 hex chars | `abc123` |
| Messages | 6 hex chars | `def456` |
| Pattern replace | `[start, end, replacement]` | `["Start marker:", "End marker.", "[pruned]"]` |
```typescript
// Prune multiple items at once
context_prune({
  action: "discard",
  targets: [
    ["44136f"], // Tool output
    ["abc123"], // Thinking block
    ["def456"], // Message
  ],
})

// Distill with shared summary
context_prune({
  action: "distill",
  targets: [
    ["44136f", "Research phase complete"],
    ["01cb91", "Research phase complete"],
  ],
})

// Pattern replace - replace content between markers
context_prune({
  action: "replace",
  targets: [
    ["Detailed findings from analysis:", "End of detailed findings.", "[analysis complete]"],
    ["Debug output started:", "Debug output ended.", "[debug pruned]"],
  ],
})
```
- Matched content must be ≥ 30 characters
- Start OR end pattern must be > 15 characters
- Literal matching only (no regex)
- Exactly one match per pattern
- No overlapping patterns
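These rules could be enforced by a validator along these lines — a hypothetical sketch of the documented constraints, not ACP's actual code:

```typescript
// Hypothetical validator for the pattern-replace constraints above.
// This is NOT ACP's implementation; it only illustrates the documented rules.
type ReplaceTarget = [start: string, end: string, replacement: string];

function validateReplaceTarget(content: string, [start, end]: ReplaceTarget): string | null {
  // Start OR end pattern must be > 15 characters
  if (start.length <= 15 && end.length <= 15) return "pattern too short";

  // Literal matching only: each marker must occur exactly once
  const count = (s: string, needle: string) => s.split(needle).length - 1;
  if (count(content, start) !== 1) return "start must match exactly once";
  if (count(content, end) !== 1) return "end must match exactly once";

  const from = content.indexOf(start);
  const to = content.indexOf(end);
  if (to < from) return "end marker precedes start marker";

  // Matched span must be >= 30 characters
  const matched = content.slice(from, to + end.length);
  if (matched.length < 30) return "matched span shorter than 30 chars";

  return null; // valid
}
```

Running this against the pattern-replace example from the previous section returns `null` (valid), while a too-short marker pair is rejected.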
---
ACP automatically removes redundant content through multiple strategies:
Duplicate tool calls with identical arguments are automatically deduplicated.
```
BEFORE:                                AFTER:
1. read(package.json)  #a1b2c3         ...other work...
2. ...other work...                    read(package.json)  #d4e5f6
3. read(package.json)  #d4e5f6
                                       First call superseded (hash match)
Tokens: ~15,000                        Tokens: ~10,000 (-33%)
```
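The supersede behavior above boils down to keying each call by a hash of its name and arguments and keeping only the latest entry per key. A minimal illustration of the assumed mechanics (not ACP's internals):

```typescript
// Sketch of hash-based supersede: identical tool calls share a key,
// and only the latest output survives. Assumed mechanics, not ACP source.
import { createHash } from "node:crypto";

type ToolCall = { tool: string; args: unknown; output: string };

function callKey(call: ToolCall): string {
  return createHash("sha256")
    .update(call.tool + JSON.stringify(call.args))
    .digest("hex")
    .slice(0, 6); // short hex id, like the hashes shown in ACP output
}

function supersedeByHash(history: ToolCall[]): ToolCall[] {
  const latest = new Map<string, ToolCall>();
  for (const call of history) latest.set(callKey(call), call); // later wins
  // Keep original order, retaining only the surviving (latest) entries
  return history.filter((call) => latest.get(callKey(call)) === call);
}
```

With the three-call history from the diagram, the first `read(package.json)` is dropped and the later one kept.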
File operations automatically supersede previous operations on the same file.
```
BEFORE:                                AFTER:
1. read(config.ts)                     edit(config.ts)
2. write(config.ts)
3. edit(config.ts)                     Previous operations pruned
Tokens: ~18,000                        Tokens: ~6,000 (-67%)
```
Todo operations automatically supersede previous todo states.
```
BEFORE:                                AFTER:
1. todowrite: pending                  todowrite: completed
2. todowrite: in_progress
3. todowrite: completed                Previous states auto-pruned
Tokens: ~4,500                         Tokens: ~1,500 (-67%)
```
Identical URL fetches are deduplicated; only the latest response is retained.
State queries (ls, find, pwd, git status) are deduplicated; only the latest results matter.
New context_prune tool calls supersede previous context operations, preventing context management overhead from accumulating.
Only the latest snapshot per file is retained. Previous snapshots are automatically pruned.
Failed tool attempts are automatically removed when the operation succeeds on retry.
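Retry pruning can be illustrated with a small sketch, under the assumed semantics that once an operation succeeds, identical earlier failed attempts are dropped (not ACP's actual code):

```typescript
// Sketch of retry pruning: failed attempts at an operation are removed
// once the same operation has succeeded. Assumed semantics, not ACP source.
type Attempt = { op: string; ok: boolean; output: string };

function pruneFailedRetries(history: Attempt[]): Attempt[] {
  // Collect operations that eventually succeeded
  const succeeded = new Set(history.filter((a) => a.ok).map((a) => a.op));
  // Keep successes, and failures only for operations that never succeeded
  return history.filter((a) => a.ok || !succeeded.has(a.op));
}
```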
---
These tools are exempt from pruning to ensure operational continuity:
```
context_info, task, todowrite, todoread, context_prune, batch, write, edit, plan_enter, plan_exit
```
Additional tools can be protected via configuration:
```jsonc
{
  "commands": {
    "protectedTools": ["my_custom_tool"],
  },
}
```
---
ACP uses its own config file with multiple levels:
```
Priority: Defaults → Global → Config Dir → Project
```

- Global: `~/.config/opencode/acp.jsonc`
- Config Dir: `$OPENCODE_CONFIG_DIR/acp.jsonc`
- Project: `.opencode/acp.jsonc`
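The priority chain amounts to a last-wins merge over the four layers. A simplified sketch (shallow merge for illustration; the real plugin may merge nested objects and arrays such as `protectedTools` differently):

```typescript
// Sketch of layered config resolution: later layers override earlier ones.
// Shallow merge only - an illustration of the priority order, not ACP source.
type Config = Record<string, unknown>;

function resolveConfig(...layers: Config[]): Config {
  // Priority: Defaults -> Global -> Config Dir -> Project (last wins)
  return layers.reduce((acc, layer) => ({ ...acc, ...layer }), {});
}
```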
```jsonc
{
  "$schema": "https://raw.githubusercontent.com/tuanhung303/opencode-agent-context-pruning/master/acp.schema.json",
  "enabled": true,
  "debug": false,
  "pruneNotification": "minimal",
  "commands": {
    "enabled": true,
    "protectedTools": [], // Additional tools to protect (merged with defaults)
  },
  "protectedFilePatterns": [
    "**/.env",
    "**/.env.*",
    "**/credentials.json",
    "**/secrets.json",
    "**/*.pem",
    "**/*.key",
    "**/package.json",
    "**/tsconfig.json",
    "**/pyproject.toml",
    "**/Cargo.toml",
  ],
  "tools": {
    "settings": {
      "protectedTools": [], // Merged with built-in protected tools
      "enableAssistantMessagePruning": true,
      "enableReasoningPruning": true,
      "enableVisibleAssistantHashes": true,
    },
    "discard": { "enabled": true },
    "distill": { "enabled": true, "showDistillation": false },
    "todoReminder": {
      "enabled": true,
      "initialTurns": 5,
      "repeatTurns": 4,
      "stuckTaskTurns": 12,
    },
    "automataMode": { "enabled": true, "initialTurns": 8 },
  },
  "strategies": {
    "purgeErrors": { "enabled": false, "turns": 4 },
    "aggressivePruning": {
      // All enabled by default - see Aggressive Pruning section
    },
  },
}
```
All aggressive pruning options are enabled by default for up to 50% token savings.
#### Pruning Presets
Use presets for quick configuration:
```jsonc
{
  "strategies": {
    "aggressivePruning": {
      "preset": "balanced", // Options: "compact", "balanced", "verbose"
    },
  },
}
```
| Preset | Description | Use Case |
| ------------ | ----------------------------------------- | -------------------------------- |
| compact | Maximum cleanup, all options enabled | Long sessions, token-constrained |
| balanced | Good defaults, preserves user code blocks | Most use cases (default) |
| verbose | Minimal cleanup, preserves everything | Debugging, audit trails |
#### Individual Options
Override preset values with individual flags:
```jsonc
{
  "strategies": {
    "aggressivePruning": {
      "preset": "balanced",
      "pruneToolInputs": true, // Strip verbose inputs on supersede
      "pruneStepMarkers": true, // Remove step markers entirely
      "pruneSourceUrls": true, // Dedup URL fetches
      "pruneFiles": true, // Mask file attachments
      "pruneSnapshots": true, // Keep only latest snapshot
      "pruneRetryParts": true, // Prune failed retries on success
      "pruneUserCodeBlocks": false, // Keep user code blocks (balanced default)
      "truncateOldErrors": false, // Keep full errors (balanced default)
      "aggressiveFilePrune": true, // One-file-one-view
      "stateQuerySupersede": true, // Dedup state queries (ls, git status)
    },
  },
}
```
---
| Metric | Without ACP | With ACP | Savings |
| ------------------- | ------------ | ----------- | ------- |
| Typical Session | ~80k tokens | ~40k tokens | 50% |
| Long Session | ~150k tokens | ~75k tokens | 50% |
| File-Heavy Work | ~100k tokens | ~35k tokens | 65% |
Cache Impact: ~65% cache hit rate with ACP vs ~85% without. The token savings typically outweigh the cache miss cost, especially in long sessions.
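To see why, compare effective input cost under an illustrative pricing assumption — cached input tokens billed at 10% of the uncached rate (actual provider pricing varies):

```typescript
// Illustrative cost model. The 10% cached-token discount is an assumption
// for the sketch, not ACP data or any specific provider's pricing.
function inputCost(tokens: number, cacheHitRate: number, cachedDiscount = 0.1): number {
  const cached = tokens * cacheHitRate;
  const uncached = tokens - cached;
  // Cost expressed in "full-price token" units
  return uncached + cached * cachedDiscount;
}

// Without ACP: ~80k tokens at ~85% cache hit -> ~18,800 units
const withoutAcp = inputCost(80_000, 0.85);
// With ACP: ~40k tokens at ~65% cache hit -> ~16,600 units
const withAcp = inputCost(40_000, 0.65);
```

Under these assumptions the pruned session is still cheaper despite the lower cache hit rate, and the gap widens as sessions grow.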
---
Run the comprehensive test suite:
```typescript
// Load test todos
todowrite({ /* copy from docs/VALIDATION_GUIDE.md */ })
```
See Validation Guide for detailed test procedures.
---
## Pruning Workflow
Complete example: execute tool → find hash → prune.

Step 1: Run a tool

```typescript
read({ filePath: "src/config.ts" })
// Output includes: a1b2c3
```

Step 2: Find the hash in the output

```
... file contents ...
a1b2c3
```

Step 3: Prune when no longer needed

```typescript
context_prune({ action: "discard", targets: [["a1b2c3"]] })
// Response confirms the discarded read output
// Available: Tools(5), Messages(2), Reasoning(1)
```

Batch multiple targets:

```typescript
context_prune({ action: "discard", targets: [["a1b2c3"], ["d4e5f6"], ["g7h8i9"]] })
```

Distill with summary:

```typescript
context_prune({
  action: "distill",
  targets: [["abc123", "Auth: chose JWT over sessions"]],
})
```

---
## Architecture Overview
```mermaid
flowchart TD
    subgraph OpenCode["OpenCode Core"]
        direction TB
        A[User Message] --> B[Session]
        B --> C[Transform Hook]
        C --> D[toModelMessages]
        D --> E[LLM Provider]
    end
    subgraph ACP["ACP Plugin"]
        direction TB
        C --> F[syncToolCache]
        F --> G[injectHashes]
        G --> H[Apply Strategies]
        H --> I[prune]
        I --> C
    end
    style OpenCode fill:#F4F7F9,stroke:#5A6B8A,stroke-width:1.5px
    style ACP fill:#E8F5F2,stroke:#9AC4C0,stroke-width:1.5px
```

ACP hooks into OpenCode's message flow to reduce context size before sending to the LLM:
1. Sync Tool Cache - Updates internal tool state tracking
2. Inject Hashes - Makes content addressable for pruning
3. Apply Strategies - Runs auto-supersede mechanisms
4. Prune - Applies manual and automatic pruning rules
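The four stages above can be pictured as a simple pipeline of message transforms. The stage bodies here are placeholders, not ACP source — only the composition pattern and ordering are the point:

```typescript
// Sketch of the four-stage transform pipeline. Stage names come from the
// list above; the implementations are placeholders, not ACP internals.
type Message = { role: string; content: string };
type Stage = (msgs: Message[]) => Message[];

const pipeline = (stages: Stage[]): Stage =>
  (msgs) => stages.reduce((acc, stage) => stage(acc), msgs);

// Placeholder stages in the documented order
const syncToolCache: Stage = (m) => m;    // 1. update tool state tracking
const injectHashes: Stage = (m) => m;     // 2. make content addressable
const applyStrategies: Stage = (m) => m;  // 3. run auto-supersede
const prune: Stage = (m) =>               // 4. drop pruned content
  m.filter((msg) => msg.content !== "[pruned]");

const transform = pipeline([syncToolCache, injectHashes, applyStrategies, prune]);
```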
---
## Commands

| Command | Description |
| ------------ | ------------------------------- |
| `/acp` | Show ACP statistics and version |
| `/acp stats` | Show ACP statistics and version |

---
## Advanced Features

### Todo Reminder

Monitors `todowrite` usage and prompts when tasks are neglected:

```jsonc
{
  "tools": {
    "todoReminder": {
      "enabled": true,
      "initialTurns": 8, // First reminder after 8 turns without todo update
      "repeatTurns": 4, // Subsequent reminders every 4 turns
      "stuckTaskTurns": 12, // Threshold for stuck task detection
    },
  },
}
```
Reminder Behavior:

- First reminder: fires after `initialTurns` (8) turns without `todowrite`
- Repeat reminders: fire every `repeatTurns` (4) turns thereafter
- Auto-reset: each `todowrite` call resets the counter to 0
- Deduplication: only ONE reminder exists in context at a time; new reminders replace old ones
- Stuck task detection: tasks in `in_progress` for `stuckTaskTurns` (12) turns are flagged with guidance
- Prunable outputs: the reminder lists prunable tool outputs to help with cleanup

Reminder Sequence:
```
Turn 0:  todowrite() called (resets counter)
Turn 8:  First reminder (if no todowrite since turn 0)
Turn 12: Repeat reminder
Turn 16: Repeat reminder
...
```
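The schedule reduces to a small predicate — a sketch of the documented behavior, with `initialTurns` and `repeatTurns` as parameters (not ACP's actual implementation):

```typescript
// Sketch of the reminder schedule: first reminder after initialTurns,
// then every repeatTurns; todowrite resets the counter externally.
// Assumed logic mirroring the documented sequence, not ACP source.
function shouldRemind(
  turnsSinceTodoWrite: number,
  initialTurns = 8,
  repeatTurns = 4,
): boolean {
  if (turnsSinceTodoWrite < initialTurns) return false;
  return (turnsSinceTodoWrite - initialTurns) % repeatTurns === 0;
}
```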
### Automata Mode
Autonomous reflection triggered by the "automata" keyword:

```jsonc
{
  "tools": {
    "automataMode": {
      "enabled": true,
      "initialTurns": 8, // Turns before first reflection
    },
  },
}
```
### Stuck Task Detection

Identifies tasks stuck in `in_progress` for too long:

```jsonc
{
  "tools": {
    "todoReminder": {
      "stuckTaskTurns": 12, // Threshold for stuck detection
    },
  },
}
```
---

## Limitations

- Subagents: ACP is disabled for subagent sessions
- Cache invalidation: pruning mid-conversation invalidates prompt caches
- Protected tools: some tools cannot be pruned by design
---
## Troubleshooting

### 400 errors in thinking mode

Cause: using Anthropic/DeepSeek/Kimi thinking mode with an outdated ACP version or missing reasoning sync.

Fix:

1. Update to ACP v3.0.0+: `npm install @tuanhung303/opencode-acp@latest`
2. Ensure your config has thinking-compatible settings
3. See Thinking Mode Compatibility for details
### Plugin commands not recognized

Symptoms: commands like `/acp` return "Unknown command"

Fix:

1. Verify the plugin is in `opencode.jsonc`: `"plugin": ["@tuanhung303/opencode-acp@latest"]`
2. Run `npm run build && npm link` in the plugin directory
3. Restart OpenCode
### Token savings lower than expected

Check:

- Is aggressive pruning enabled in config? See Configuration
- Are you using protected tools excessively? (`task`, `write`, `edit` can't be pruned)
- Is your session >100 turns? Consider starting a fresh session

---
## Provider Compatibility

### Thinking Mode Compatibility

ACP is fully compatible with extended thinking mode APIs that require the `reasoning_content` field. The `context_prune` tool automatically syncs reasoning content to prevent 400 Bad Request errors.

Supported providers: Anthropic, DeepSeek, Kimi
Not required: OpenAI, Google

See the detailed technical documentation for implementation details and the root cause of the original compatibility issue.
---
## npm Package

Package: `@tuanhung303/opencode-acp`
License: MIT
Repository: https://github.com/tuanhung303/opencode-agent-context-pruning

### Installation Methods

```bash
# Via npm
npm install @tuanhung303/opencode-acp

# Via OpenCode config
# Add to opencode.jsonc: "plugin": ["@tuanhung303/opencode-acp@latest"]

# Via URL (for agents)
curl -s https://raw.githubusercontent.com/tuanhung303/opencode-acp/master/README.md
```
### CI/CD

- CI: every PR triggers linting, type checking, and unit tests
- CD: merges to `main` auto-publish to npm

---
## Contributing

1. Fork the repository
2. Create a feature branch
3. Run tests: `npm test`
4. Submit a pull request

---

## License

MIT © tuanhung303
---
## Known Pitfalls for Agents

> Read this section before modifying ACP code. These are hard-won lessons from debugging production issues.
### Always fetch messages before early returns

❌ WRONG:

```typescript
async function executeContextToolDiscard(ctx, toolCtx, hashes) {
  const { state, logger } = ctx
  // Validate hashes...
  if (validHashes.length === 0) {
    // Early return without fetching messages
    const currentParams = getCurrentParams(state, [], logger) // BUG: empty array
    return "No valid hashes"
  }
  // Only fetch messages in success path
  const messages = await client.session.messages(...)
}
```
✅ CORRECT:

```typescript
async function executeContextToolDiscard(ctx, toolCtx, hashes) {
  const { client, state, logger } = ctx
  // ALWAYS fetch messages first - required for thinking mode API compatibility
  const messagesResponse = await client.session.messages({
    path: { id: toolCtx.sessionID },
  })
  const messages = messagesResponse.data || messagesResponse
  // ALWAYS initialize session - syncs reasoning_content
  await ensureSessionInitialized(client, state, toolCtx.sessionID, logger, messages)
  // Now validate hashes...
  if (validHashes.length === 0) {
    const currentParams = getCurrentParams(state, messages, logger) // Use actual messages
    return "No valid hashes"
  }
}
```

Why? Anthropic's thinking mode API requires `reasoning_content` on all assistant messages with tool calls. Skipping `ensureSessionInitialized` causes 400 errors.

---
### ensureSessionInitialized is mandatory

This function syncs `reasoning_content` from message parts to `msg.info`. Without it:

```
error, status code: 400, message: thinking is enabled but reasoning_content is missing
in assistant tool call message at index 2
```

Rule: call `ensureSessionInitialized` at the START of every `context_prune` tool function, before any early returns.

---
### Never fully remove reasoning_content

❌ WRONG:

```typescript
// Completely removing reasoning_content breaks the API
state.prune.reasoningPartIds.push(partId)
// No replacement content -> field removed -> API error
```

✅ CORRECT:

```typescript
// Convert discard to distill with a minimal placeholder
if (reasoningHashes.length > 0) {
  const minimalSummaries = reasoningHashes.map(() => "…")
  await executeContextReasoningDistill(
    ctx,
    toolCtx,
    reasoningHashes.map((h, i) => [h, minimalSummaries[i]]),
  )
}
```

Why? The `reasoning_content` field must exist (even if minimal) for thinking mode providers.

---
### Test all code paths

Always test `context_prune` tool changes with:

1. Valid existing hashes (success path)
2. Non-existing hashes like `"zzzzzz"` (no-op path)
3. A mix of valid and invalid hashes

The no-op path is where most bugs hide because it's less tested.
---
### Provider compatibility matrix

| Provider | Thinking Mode | reasoning_content Required | Notes |
| --------- | ----------------- | -------------------------- | -------------------- |
| Anthropic | Extended thinking | Yes | Strict validation |
| DeepSeek | DeepThink | Yes | Similar to Anthropic |
| Kimi | K1 thinking | Yes | Similar to Anthropic |
| OpenAI | N/A | No | No thinking mode |
| Google | N/A | No | No thinking mode |

Always test with Anthropic when modifying context/pruning logic; it has the strictest API validation.
---
### Transform hook timing

```
User Message → Transform Hook → ensureReasoningContentSync → API Call
                     ↑
         Context tool runs HERE
```

If a `context_prune` tool returns early (no-op), the transform hook may not have synced `reasoning_content` yet. That's why each tool function must call `ensureSessionInitialized` itself.

---
### Debugging checklist

When you see `reasoning_content is missing` errors:

1. Check the error index: "at index 2" means the 3rd message in the array
2. Verify message fetch: is the function fetching messages before early returns?
3. Verify session init: is `ensureSessionInitialized` called in ALL code paths?
4. Check the no-op path: does the early return path have proper initialization?
5. Test with an invalid hash: `context_prune({ action: "discard", targets: [["zzzzzz"]] })`

---
## For LLM Agents

Quick pruning reference and safety guidelines. If you're an AI assistant reading this:

> Quick Pruning Reference:
>
> - Light pressure (<50%): no action
> - Moderate (50-75%): prune old tool outputs using `context_prune({ action: "discard", targets: [[hash]] })`

Ask the user: "Would you like me to run a context optimization check?"