Claude Opus 4.8: Release Date, Model ID and Availability

Verified status

Last updated: July 18, 2026 · Release status, model ID, availability, pricing, context window and migration guidance checked against official Anthropic sources.

Short answer: Claude Opus 4.8 was officially released on May 28, 2026 and remains listed by Anthropic as the current Opus-tier model. Its Claude API model ID is claude-opus-4-8. Standard API pricing is $5 per million input tokens and $25 per million output tokens; the Claude API lists a 1M-token context window and up to 128K output tokens. Fast mode remains an API research preview at $10/$50, while Claude Code dynamic workflows are now generally available.

Released May 28, 2026 Model ID: claude-opus-4-8 1M context Last verified July 18

Table of Contents

Is Claude Opus 4.8 released? Current status

Yes. Anthropic launched Claude Opus 4.8 on May 28, 2026. Its current models overview still lists claude-opus-4-8, so this is a released model rather than a roadmap name, leak or unconfirmed future version.

Status field	Verified answer
Released?	Yes — May 28, 2026
Current Claude API model ID	`claude-opus-4-8`
Standard API price	$5 input / $25 output per million tokens
Claude API context window	1M tokens by default
Maximum output	128K tokens on the synchronous Messages API
Fast mode	Research preview on the Claude API; access-controlled
Dynamic workflows	Generally available in Claude Code and supported platform surfaces
Last verified	July 18, 2026

Do not confuse “current Opus” with “newest Claude model”

Opus 4.8 remains the current Opus-tier model, but Anthropic’s model lineup has continued to change. For the current family-wide comparison, use our Claude models 2026 guide. Historical Sonnet 4.8, Capybara and leak queries belong to the archived Claude roadmap, not this status page.

Claude Opus 4.8 release date, availability and API model ID

The official Claude Opus 4.8 release date is May 28, 2026. Anthropic’s launch announcement said the model was available immediately and gave developers the API identifier claude-opus-4-8. The current platform documentation lists the same identifier for the Claude API, with provider-specific identifiers documented separately for Amazon Bedrock and Google Cloud.

Anthropic describes the dateless claude-opus-4-8 name as a pinned model snapshot, not an evergreen alias that silently moves to a future Opus release. That matters for production evaluation: a test against this model ID remains tied to Opus 4.8 rather than automatically becoming a later model.

Implementation note

Use the exact documented ID for your platform and query the Models API for current token limits. Cloud availability and endpoint identifiers can differ even when the product name is the same.

What changed in Claude Opus 4.8?

Opus 4.8 builds on Opus 4.7 rather than introducing a new API family. Anthropic’s current documentation highlights better long-horizon agentic coding, tool triggering, compaction recovery and effort calibration, plus several concrete platform changes.

Dynamic workflows are now generally available

Dynamic workflows let Claude Code plan a large task, dispatch tens to hundreds of parallel subagents, verify their work and return one coordinated result. The feature launched as a research preview with Opus 4.8, but Anthropic’s current product page now says it is generally available in Claude Code CLI, Desktop and the VS Code extension for Pro, Max, Team and Enterprise plans, as well as on supported API and cloud surfaces.

This is useful for decomposable work such as codebase-wide audits, migrations and independent verification. It is not a free performance multiplier: Anthropic warns that workflows can consume substantially more tokens than a typical Claude Code session. Start with a scoped task and measure usage before making workflows a default.

For a broader explanation of the orchestration pattern, see our agentic workflows guide.

Mid-conversation system messages

Opus 4.8 accepts a role: "system" message immediately after a user turn in the Messages API, subject to Anthropic’s placement rules. This lets an application update instructions during a long-running conversation without rebuilding the full history, which can preserve prompt-cache hits. The top-level system field remains the correct place for instructions that apply from the beginning.

Effort defaults and adaptive thinking

The effort parameter defaults to high across Opus 4.8 surfaces. Adaptive thinking is supported, but API requests without a thinking field run without thinking. Enable it with {"type":"adaptive"} and set depth through output_config.effort. Manual extended-thinking budgets are not supported on Opus 4.8.

Fast mode remains a research preview

Fast mode uses the same Opus 4.8 model with up to 2.5× higher output tokens per second. It is not a different or more capable model, and the benefit targets output speed rather than time to first token. As of the verification date it is an access-controlled research preview on the Claude API, not a generally available cloud-platform feature.

Lower cache minimum and documented refusal details

The minimum cacheable prompt length is 1,024 tokens, down from 2,048 on Opus 4.7. Anthropic also documents the stop_details object on refusals, which helps applications route different refusal categories without treating them as generic failures.

Claude Opus 4.8 pricing, context window and Fast mode

Mode	Input / MTok	Output / MTok	Status and use
Standard Opus 4.8	$5	$25	Current standard Claude API pricing
Fast mode	$10	$50	Research preview; latency-sensitive output streaming
Batch API	$2.50	$12.50	Asynchronous workloads; Fast mode is not available with Batch

The official model overview lists a 1M-token context window and 128K maximum output for Opus 4.8 on the synchronous Messages API. The 1M window is included at standard per-token pricing. Provider-specific endpoint limits and identifiers can differ, so check the current platform documentation before assuming identical behavior across every cloud.

Fast mode is exactly 2× the standard input and output price. Because it accelerates output token generation rather than model intelligence, use it only where response speed has measurable product value. Requests at fast and standard speeds do not share prompt-cache prefixes, and a fast-mode rate or capacity error does not silently fall back to standard speed.

Claude Opus 4.8 benchmarks: what the results do and do not show

Anthropic’s launch materials report improvements across coding, agentic work, reasoning and practical knowledge tasks. These results are useful release signals, but most are vendor-run or partner-reported. They should guide what to test in your own harness, not replace workload-specific evaluation.

Reported result	What it indicates	Important limit
84% on Online-Mind2Web	Strong browser-agent and computer-use signal	Partner-reported in Anthropic’s announcement
About 4× less likely to let self-written code flaws pass unremarked	Better uncertainty and flaw flagging	Not a claim that the model writes 4× fewer bugs
First reported model above 10% all-pass on Legal Agent Benchmark	Progress on long multi-step professional workflows	All-pass is strict and the result is partner-reported
Improved long-horizon coding, tool triggering and compaction recovery	More reliable agentic work than Opus 4.7	Re-test against your own prompts, tools and cost limits

Read the “4×” claim precisely

Anthropic says Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it wrote to pass unremarked. That is an honesty/self-review measure, not a blanket fourfold improvement in code correctness.

How to migrate from Opus 4.7 to Opus 4.8

Anthropic documents no breaking API changes for code already running on Opus 4.7. The required change is the model name; the main operational work is re-baselining behavior, latency and cost at your chosen effort setting.

Python · Anthropic SDK

from anthropic import Anthropic

client = Anthropic()

response = client.messages.create(
    model="claude-opus-4-8",       # was claude-opus-4-7
    max_tokens=16000,
    thinking={"type": "adaptive"},
    output_config={"effort": "high"},
    messages=[
        {"role": "user", "content": "Review this migration plan."}
    ],
)

Before moving production traffic:

Swap the model ID from claude-opus-4-7 to claude-opus-4-8.
Re-run your evals. Anthropic expects strong out-of-the-box performance on Opus 4.7 prompts, but behavior and effort calibration changed.
Re-baseline latency and cost. The default effort level is high; set it explicitly if your product depends on a stable trade-off.
Remove obsolete thinking configuration. Use adaptive thinking and output_config.effort, not budget_tokens.
Remove an old long-context beta header if your Opus 4.7 client still sends one; the 1M window is now the default on supported endpoints.
Handle refusal details and verify that downstream parsers tolerate the model’s updated response behavior.

For a practical coding-tool next step, compare Claude Code with Cursor.

Where Opus 4.8 fits in the current Claude model lineup

Opus 4.8 is the current Opus-tier model for complex agentic coding and enterprise work. It is a stable production choice when you specifically need the Opus balance of long-horizon execution, tool use and 1M context. It should not be described as Anthropic’s newest model across every tier, because the wider Claude lineup evolves independently.

Choose by workload rather than model name alone:

Use Opus 4.8 when complex agentic coding, long tool chains and high-stakes knowledge work justify its cost.
Use a cheaper current Claude tier when latency and unit economics matter more than maximum Opus capability.
Use the current Claude model lineup for family-wide availability and price comparisons.
Use the dedicated Claude Mythos status guide for restricted-model context rather than treating a May launch forecast as current Opus information.

FAQ

When was Claude Opus 4.8 released?

Anthropic released Claude Opus 4.8 on May 28, 2026. It is a released model, not an unconfirmed roadmap name.

What is the Claude Opus 4.8 API model ID?

The Claude API model ID is claude-opus-4-8. Anthropic describes this dateless ID as a pinned model snapshot rather than an evergreen alias.

Is Claude Opus 4.8 still available?

Yes. Anthropic’s current models overview still lists Claude Opus 4.8. Platform identifiers and endpoint availability can differ, so verify the documentation for the Claude API, Amazon Bedrock or Google Cloud before deployment.

How much does Claude Opus 4.8 cost?

Standard Claude API pricing is $5 per million input tokens and $25 per million output tokens. Fast mode is $10 input and $50 output per million tokens, while Batch API pricing is $2.50 input and $12.50 output.

What context window does Claude Opus 4.8 have?

Anthropic’s Claude API model overview lists a 1M-token context window and up to 128K output tokens for synchronous Messages API requests. Check provider-specific documentation for cloud endpoint differences.

Is Fast mode generally available for Opus 4.8?

No. Fast mode remains an access-controlled research preview on the Claude API. It can deliver up to 2.5× higher output tokens per second from the same model, but it does not improve model intelligence.

Will existing Opus 4.7 code work with Opus 4.8?

Anthropic documents no breaking API changes for code already running on Opus 4.7. Change the model ID to claude-opus-4-8, rerun your evals, and re-baseline effort, latency and cost before moving production traffic.

Update log

July 18, 2026: Rebuilt the page around release status, model ID and availability. Reverified pricing, context, maximum output and migration guidance; corrected adaptive-thinking code; updated dynamic workflows from research preview to generally available; removed stale Mythos timing and dated competitor rankings.
June 2, 2026: Original release-week analysis published after the May 28 launch.

Sources and further reading (12)

Time-sensitive model, pricing and product-status claims were checked against official Anthropic or Claude documentation on July 18, 2026. Benchmark claims are identified as Anthropic- or partner-reported where applicable.

Anthropic — Introducing Claude Opus 4.8 (May 28, 2026 release date, launch availability, pricing and reported evaluations).
Claude Platform Docs — What’s new in Claude Opus 4.8 (model ID, features, behavior and API constraints).
Claude Platform Docs — Models overview (current listing, identifiers, pricing, context and maximum output).
Claude Platform Docs — Pricing (standard, Fast mode and Batch API rates).
Claude Platform Docs — Migration guide (Opus 4.7 to 4.8 changes and checklist).
Claude Platform Docs — Fast mode (research-preview status, access, speed behavior and pricing).
Claude Platform Docs — Prompting Claude Opus 4.8 (effort, thinking and prompting behavior).
Claude Platform Docs — Context windows (current context behavior and limits).
Claude — Introducing dynamic workflows in Claude Code (current GA status, plan/platform availability and usage warning).
Anthropic — Claude Opus 4.8 System Card (evaluation and safety methodology).
Anthropic — Claude Opus model page (current product positioning).
Claude Platform Docs — Prompt caching (cache behavior and model-specific minimums).

2 COMMENTS

Claude Opus 4.8 + Dynamic Workflows 2026: Full Breakdown 2026-06-06 At 14:12

[…] Decode The Future. (2026, June 2). Claude Opus 4.8: 7 Changes + Dynamic Workflows. https://decodethefuture.org/en/claude-opus-4-8-explained/ […]

Claude Opus 4 Review 2026: Still the Best AI Model? 2026-06-13 At 05:17

[…] subagents. This positions Opus 4.8 directly in the enterprise agentic pipeline market. According to Decode the Future, this is the biggest architectural shift since the base Opus 4 […]

Claude Opus 4.8: Release Date, Model ID and Availability

Is Claude Opus 4.8 released? Current status

Claude Opus 4.8 release date, availability and API model ID

What changed in Claude Opus 4.8?

Dynamic workflows are now generally available

Mid-conversation system messages

Effort defaults and adaptive thinking

Fast mode remains a research preview

Lower cache minimum and documented refusal details

Claude Opus 4.8 pricing, context window and Fast mode

Claude Opus 4.8 benchmarks: what the results do and do not show

How to migrate from Opus 4.7 to Opus 4.8

Where Opus 4.8 fits in the current Claude model lineup

FAQ

Update log

Sources and further reading (12)

AI Agent Observability: Traces, Evals and Costs

Best LLM Observability Tools 2026: 9 Compared

AI Agent Security 2026: Injection, Sandboxing & Permissions

2 COMMENTS

LEAVE A REPLY Cancel reply

Most Popular

AI Agent ROI Calculator: Cost, Payback and Break-Even

AI Agent Observability: Traces, Evals and Costs

Best LLM Observability Tools 2026: 9 Compared

Day Trading vs Swing Trading: Risks & Costs

Recent Comments

Inwestowanie

AI Agent ROI Calculator: Cost, Payback and Break-Even

AI Agent Observability: Traces, Evals and Costs

Best LLM Observability Tools 2026: 9 Compared

POPULAR POSTS

AI Agent ROI Calculator: Cost, Payback and Break-Even

AI Agent Observability: Traces, Evals and Costs

Best LLM Observability Tools 2026: 9 Compared

POPULAR CATEGORY

ABOUT US

FOLLOW US