An r/ClaudeCode thread captured the bind: Opus 4.7 makes the agent better but burns through the Plus plan. The reflex is upgrade to Max. The cheaper fix is attacking where the tokens actually go.
Prerequisites
- Claude Code CLI
- An honest 1-day session log
Walkthrough
Step 1: Audit where tokens are going (1-day log)
Open the trace, count.
// Categories: grep+read fanout, MCP descriptions, skill folder, attached docs, system prompt, model choice.Step 2: Replace grep+read with a local code search MCP on large repos
Biggest single win.
// Install Semble or sourcegraph-cody-bridge.Step 3: Trim attached MCPs to 4-6 named ones
Reduce description bloat.
// Drop unused MCPs. Replace 5-8 narrow web skills with one Scavio MCP.Step 4: Trim skill folder
Same logic, different surface.
// 70 skills → 20-30. See the trim-skills tutorial.Step 5: Use Sonnet 4.6 by default; switch to Opus 4.7 only for hard tasks
Cheapest move.
// Routine ops → Sonnet 4.6. Architecture decisions, novel logic, hard debugging → Opus 4.7.Step 6: Re-measure after one week
Honest before/after.
// Track total cost per week vs feature output. Expect 30-50% drop.Python Example
# Tuned Plus: ~$50/mo all-in. Blanket Max: $100-200/mo for similar output.JavaScript Example
// Config-discipline tutorial.Expected Output
Same agent quality on Opus 4.7 for hard tasks, lower model + tuned tools for everything else, and a per-month bill cut by 30-50%.