Claude Opus 4.8 cuts fast-mode pricing by two-thirds, adds reasoning sliders
Anthropic released Claude Opus 4.8 on May 28, keeping base pricing flat while slashing fast-inference costs and adding granular reasoning-length controls.

Anthropic released Claude Opus 4.8 on May 28 with three headline changes: granular reasoning controls that let users dial inference depth up or down, a fast mode now priced at one-third the cost of the previous fast tier, and what the company calls "improved honesty," a claim that the model will hallucinate less often. Base pricing for Opus 4.8 is unchanged from the prior version.
The fast-mode change is the most tangible cost win. Anthropic's fast inference option already delivered 2.5× faster generation; the new pricing drops the fast-mode price from 6× the base rate to 2×, a two-thirds cut that makes the fast tier viable for higher-volume workloads. The reasoning controls mirror the slider OpenAI added to ChatGPT, giving practitioners fine-grained control over how much compute the model spends on chain-of-thought steps before answering.
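
To make the arithmetic concrete, here is a minimal Python sketch of what the multiplier change means for a bill. The 6× and 2× multipliers come from the figures above; the base price and the monthly token volume are hypothetical placeholders chosen for illustration, not Anthropic's published rates, and the function names are our own.

```python
# Sketch of the fast-mode pricing arithmetic.
# ASSUMPTION: BASE_PRICE_PER_MTOK is a made-up placeholder, not a published rate.
BASE_PRICE_PER_MTOK = 10.0   # hypothetical dollars per million tokens at the base rate

OLD_FAST_MULTIPLIER = 6.0    # previous fast tier: 6x the base rate (from the article)
NEW_FAST_MULTIPLIER = 2.0    # new fast tier: 2x the base rate (from the article)


def fast_mode_cost(tokens: int, base_price_per_mtok: float, multiplier: float) -> float:
    """Cost in dollars of generating `tokens` tokens at `multiplier` times the base rate."""
    return tokens / 1_000_000 * base_price_per_mtok * multiplier


def relative_saving(old_multiplier: float, new_multiplier: float) -> float:
    """Fraction of the old price that is saved by moving to the new multiplier."""
    return (old_multiplier - new_multiplier) / old_multiplier


if __name__ == "__main__":
    monthly_tokens = 50_000_000  # hypothetical workload
    old_cost = fast_mode_cost(monthly_tokens, BASE_PRICE_PER_MTOK, OLD_FAST_MULTIPLIER)
    new_cost = fast_mode_cost(monthly_tokens, BASE_PRICE_PER_MTOK, NEW_FAST_MULTIPLIER)
    saving = relative_saving(OLD_FAST_MULTIPLIER, NEW_FAST_MULTIPLIER)

    print(f"old fast tier: ${old_cost:,.2f}")   # old fast tier: $3,000.00
    print(f"new fast tier: ${new_cost:,.2f}")   # new fast tier: $1,000.00
    print(f"saving: {saving:.0%}")              # saving: 67%
```

The saving depends only on the ratio of the two multipliers, so the two-thirds figure holds whatever the actual base rate turns out to be.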

