Claude Sonnet 4.6 Burns 4x More Tokens Than Predecessor on Reasoning Tasks

Anthropic's new Claude Sonnet 4.6 model consumes up to four times more tokens than its predecessor during extended reasoning tasks, offsetting its advertised 40% per-token cost cut compared to Opus 4.6. Sam Witteveen, an Anthropic expert, highlighted this trade-off as users weigh performance gains against higher overall expenses.

G Geeky Gadgets By World Headlines Newsroom

Updated Mar 26, 2026, 9:44 PM 2 min read Audio Published Feb 20, 2026, 2:23 PM

Claude Sonnet 4.6 Burns 4x More Tokens Than Predecessor on Reasoning Tasks

Key Points

Claude Sonnet 4.6 uses up to 4x more tokens than Sonnet 4.5 on long reasoning tasks
40% cheaper per token than Opus 4.6, but higher usage offsets savings
Strong in adaptive reasoning; weaker on complex, multi-step problems versus Opus

SAN FRANCISCO — Anthropic’s Claude Sonnet 4.6 promises sharper adaptive reasoning and better context retention for complex AI workloads. Those gains come at a price. The model chews through tokens at a much higher rate, especially in long-chain problem-solving, according to observations from Sam Witteveen.

Sonnet 4.6 costs 40% less per token than Opus 4.6, Anthropic states. Real-world use tells a different story. Tasks demanding sustained reasoning or multi-step processes can demand four times as many tokens as Sonnet 4.5 required. A large dataset analysis, for instance, might look cheaper at first glance. Total bills climb fast once token volume spikes.

Enhancements like context compaction and programmatic tool calling shine in automation workflows. Sonnet 4.6 narrows the gap with Opus 4.6 on benchmarks for adaptive thinking. It stumbles, though, on complex puzzles or prolonged logic chains. Performance dips as complexity rises. Straightforward queries? No issue. Elaborate scenarios expose the limits.

API quirks add another layer. Anthropic’s own platform handles advanced features smoothly. Third-party APIs lag, with spotty support for tool calling. Developers integrating across stacks report headaches. The result: uneven results that force workarounds.

Organizations eye Sonnet 4.6 for targeted jobs—quick adaptive tasks or moderate context needs. High-volume processing or deep reasoning? Opus 4.6 holds the edge. Some hold out for Opus 4.7 or even 5.0, betting on fixes to token hunger and consistency.

Anthropic positions Sonnet 4.6 as a step up from Sonnet 4.5 in computational chops. Benchmarks show progress. Token math remains the sticking point. Users running budget-sensitive operations crunch numbers twice before switching. One developer testing dataset workflows found costs doubling despite the per-token .

Platform choice matters too. Anthropic’s API outperforms rivals in speed and feature access. Third-party options falter, limiting Sonnet 4.6’s reach for diverse setups. Companies with mixed tech environments face integration snags that erode efficiency gains.

Sonnet 4.6 fits niches where its strengths—context hold and tool use—dominate. Broad applications demand caution. Witteveen urges matching model traits to exact workloads. Mismatched picks waste money and time.

The AI field shifts quickly. Sonnet 4.6 advances Anthropic’s lineup. Token efficiency lags behind. Teams tracking costs for reasoning-heavy apps stick with proven options or wait for refinements.

Asia Pacific

Europe

Claude Sonnet 4.6 Burns 4x More Tokens Than Predecessor on Reasoning Tasks

Key Points

Read Next

Ajay Mitchell Shines as Thunder Dominate Playoffs

Ebola Returns to Eastern DRC: Panic, 88 Deaths

Pope Issues Text on Human Dignity and AI

Trending

Greece to Pay Off €6.9 Billion, Reducing Debt Burden

Xi Jinping hosts Putin as China emerges as diplomatic hub

Ebola Spreading Faster Than First Thought, WHO Warns

Comments