HomeArtificial IntelligenceClaude Sonnet 4.8 & Opus 4.8: What Actually Shipped (2026)

Claude Sonnet 4.8 & Opus 4.8: What Actually Shipped (2026)

Last updated: June 2026

The Claude Code source leak of March 31, 2026 exposed Anthropic’s internal model codenames — version references Opus 4.7 and Sonnet 4.8 plus animal names Fennec (Opus 4.6), Capybara (the Mythos-class tier above Opus), and Numbat (in testing). Here is what has actually shipped since: Claude Opus 4.8 went GA on 28 May 2026, and the Capybara/Mythos line launched as Claude Fable 5 and Mythos 5 on 9 June 2026 (Mythos 5 access was then restricted). Notably, no “Sonnet 4.8” has shipped — the current Sonnet is 4.6. This article decodes the leaked codename system and tracks each prediction against what Anthropic actually released.

Claude Models Opus 4.8 Sonnet 4.8 Capybara Fable 5 / Mythos
✅ Update — June 2026: what the leak got right (and wrong)

Months on, the codenames can be scored against reality. Opus 4.8 shipped GA on 28 May 2026 — read our Claude Opus 4.8 breakdown. The Capybara / Mythos-class prediction landed: Anthropic launched Claude Fable 5 and Mythos 5 on 9 June 2026 (see what Claude Mythos is). The miss: there is no “Sonnet 4.8” — Anthropic’s current Sonnet is 4.6, and the version jumped straight to the Mythos-class tier. The earlier Opus 4.7 also shipped before being superseded by 4.8.

When the Claude Code source code leaked on March 31, 2026, the community immediately found what Anthropic tried hardest to hide. Not just unreleased features — but explicit references to future model versions and a complete internal codename system that maps Anthropic’s past, present, and planned AI lineup.

The irony is almost poetic: the leaked codebase contains an „Undercover Mode” that explicitly instructs Claude to never reveal internal model codenames in public git commits. The forbidden list includes „animal names like Capybara, Tengu” and „unreleased model version numbers (e.g., opus-4-7, sonnet-4-8).” Those exact strings — intended to be hidden — are now the most searched terms from the leak.

This article decodes the full codename system, maps what each animal name corresponds to, and analyzes the internal quality metrics that Anthropic never intended to publish.

What model codenames did the leak reveal?

Anthropic uses an animal-based codename system for internal model development. The leaked source code, analyzed by independent researchers and documented on GitHub, confirms the following mappings:

Codename Maps to Status (March 2026) Key detail
Tengu Claude Code (project) ✅ Live Appears hundreds of times as prefix for feature flags and analytics events. The project codename for Claude Code itself.
Fennec Opus 4.6 (originally), then also Sonnet 4.6 ✅ Released Migration function migrateFennecToOpus confirms Fennec was an Opus codename. Sonnet 4.6 launched Feb 17, 2026 under this codename.
Capybara Claude 4.6 variant / Mythos tier 🧪 In testing Currently on v8 iteration. Internal benchmarks show 29–30% false claims rate. First model in the new tier above Opus.
Numbat Unknown model 🔬 Pre-launch Referenced as still undergoing testing. No public details beyond the name.

The naming convention is consistent: small-to-medium mammals and mythological creatures. Fennec (the fox), Capybara, Numbat — each one a development codename that gets replaced by the marketing name before public release. What makes this leak unique is that we can now trace the evolution path of models through the migration functions embedded in the code.

What do the migration functions tell us about Anthropic’s release history?

The leaked codebase contains model migration functions — code that handles transitioning users from one model version to the next. These functions act as a fossil record of Anthropic’s internal release timeline:

TypeScript (from leaked source)
// Model migration history — extracted from Claude Code v2.1.88
migrateFennecToOpus        // Fennec → Opus (codename to product name)
migrateSonnet1mToSonnet45  // Sonnet 1M context → rebranded as Sonnet 4.5
migrateSonnet45ToSonnet46  // Sonnet 4.5 → Sonnet 4.6
resetProToOpusDefault      // Pro users reset to Opus as default model

Reading these in sequence tells a clear story. Fennec started as an internal Opus development build. The 1-million-token context window Sonnet was initially a separate variant that got folded into the Sonnet 4.5 release. When Sonnet 4.6 shipped on February 17, 2026, another migration function handled the transition for existing users. And at some point, Pro tier subscribers had their default switched to Opus — likely during the Opus 4.6 launch alongside Agent Teams in early February.

The most revealing detail? The Undercover Mode instructions list opus-4-7 and sonnet-4-8 as examples of „unreleased model version numbers” that must never appear in public commits. This isn’t hypothetical — it confirms these version numbers exist internally at Anthropic, even if the models behind them aren’t finished yet.

💡 Why „4.7” and „4.8” — not „5.0”?

Anthropic has avoided integer version jumps since Claude 4. The progression from 4.5 → 4.6 → (likely) 4.7 and 4.8 suggests incremental improvements within the Claude 4 architecture rather than a complete generational leap. The true „next generation” appears to be Mythos/Capybara — a new tier above Opus entirely, not just Opus 5.

What is Capybara and how does it connect to Claude Mythos?

This is where two separate leaks converge. Five days before the Claude Code source leak, on March 26, 2026, a different incident exposed Anthropic’s internal content management system. A misconfigured data store left nearly 3,000 unpublished assets publicly accessible — draft blog posts, PDFs, images, and internal memos describing an unreleased model called Claude Mythos.

Anthropic confirmed that leak was real. A spokesperson acknowledged the company had completed training on Mythos, describing it as a „step change” and „the most capable we’ve built to date.” The leaked blog drafts positioned Mythos not as Opus 5, but as the first model in a new tier above Opus.

The Claude Code source leak now confirms the internal codename: Capybara = Mythos. The code references „Capybara” as a Claude 4.6 variant, but the Mythos leak documents describe it as sitting above the existing Opus tier. The most likely interpretation is that Capybara started as an experimental 4.6 variant and evolved into something significantly more capable during development — powerful enough to warrant its own tier.

This matches the leaked cybersecurity assessment that moved markets. Mythos was described internally as being „currently far ahead of any other AI model in cyber capabilities,” a statement that triggered 4–7% drops in cybersecurity stocks including CrowdStrike, Palo Alto Networks, and Zscaler on March 27–28.

What do the internal quality metrics reveal?

The most technically significant finding from the source leak isn’t a codename — it’s internal benchmark data that Anthropic would never willingly publish.

Comments attached to Capybara’s development code reveal that its eighth iteration (v8) carries a false claims rate of 29–30%. This is not a typo. It’s actually a regression from v4, which had a false claims rate of 16.7%. The model got almost twice as unreliable between versions.

This data point is extraordinary for several reasons:

It shows that more capable ≠ more reliable. The AI industry’s public narrative focuses on benchmark scores going up — SWE-bench, AIME, ARC-AGI. Anthropic’s internal data shows that as Capybara gained capabilities, its tendency to make false claims nearly doubled. This is the classic capability-reliability tradeoff that researchers have warned about but companies rarely quantify publicly.

It reveals the „assertiveness counterweight.” Alongside the false claims metric, the code references a mechanism designed to prevent the model from becoming „too aggressive in its refactors.” This suggests that as Capybara became more powerful in code generation, it also became more confident in making changes — including wrong ones. Anthropic built a dedicated counterweight to throttle this tendency, which implies the unweighted model was aggressively rewriting code in ways that introduced errors.

It sets a benchmark for the entire industry. Before this leak, nobody outside Anthropic (or its competitors) knew what internal false claims rates look like for frontier large language models. A 29% false claims rate in a model described as a „step change” suggests that even the most advanced systems hallucinate at rates that would be unacceptable in high-stakes applications without human oversight.

Anthropic Claude Model Evolution — Codenames & Timeline Timeline diagram showing the evolution of Claude models from Opus 4.5 through the leaked Opus 4.7 and Sonnet 4.8, with animal codenames (Fennec, Capybara, Numbat) and the new Mythos tier above Opus. Internal false claims metrics shown for Capybara v4 (16.7%) and v8 (29-30%). Anthropic Claude Model Evolution Timeline DecodeTheFuture.org Claude Opus 4.7, Claude Sonnet 4.8, Capybara, Fennec, Numbat, model roadmap, Anthropic Visual timeline of Claude model releases and upcoming versions decoded from the March 2026 source code leak. Diagram image/svg+xml en © DecodeTheFuture.org Claude Model Evolution Decoded from source code leak • March 31, 2026 Opus 4.5 Nov 2025 • 77.2% SWE-bench • 200K context RELEASED Sonnet 4.5 Late 2025 • 1M context variant migrateSonnet1mToSonnet45 Opus 4.6 🦊 Fennec Feb 6, 2026 • Agent Teams • 80.8% SWE-bench migrateFennecToOpus RELEASED Sonnet 4.6 🦊 Fennec Feb 17, 2026 • 79.6% SWE-bench • 1M ctx • $3/$15 migrateSonnet45ToSonnet46 RELEASED AT LEAK: UNRELEASED ▼ Opus 4.7 → 4.8 Opus 4.8 shipped GA 28 May 2026 SHIPPED Sonnet 4.8 Never shipped — current Sonnet is 4.6 DID NOT SHIP Numbat (unknown tier) Pre-launch testing. No public mapping. UNRELEASED Mythos 🐹 Capybara New tier ABOVE Opus • „Step change” Shipped: Fable 5 + Mythos 5, 9 Jun 2026 SHIPPED Shipped Did not ship Unreleased 🦊 = codename

What does the „assertiveness counterweight” mean in practice?

One of the subtler findings from the leak is a mechanism called the assertiveness counterweight. This is a guardrail specifically designed to prevent Capybara from being too aggressive when modifying user code.

The problem it solves is straightforward: as deep learning models become more capable at code generation, they also become more confident in making changes — including changes that are wrong. A model that hesitates before rewriting a function is annoying but safe. A model that confidently rewrites an entire module and introduces subtle bugs is dangerous.

The assertiveness counterweight suggests Anthropic discovered this failure mode during internal testing of Capybara and had to engineer a dedicated mechanism to throttle it. This isn’t a prompt-level instruction like „be careful with code changes” — it’s a system-level architectural constraint built into the model’s operating logic.

For developers building with AI agents, this is a critical design pattern to understand. More capable models need more sophisticated restraint mechanisms. The pattern holds whether you’re building a coding agent, a trading bot, or any autonomous system: capability without calibrated confidence is a liability.

What does the „over-commenting” problem reveal?

A less dramatic but equally telling detail from the leak: developers noted that Capybara has an over-commenting tendency. The model adds excessive code comments — explaining obvious operations, annotating every function, padding code with verbose documentation that no experienced developer would write.

This matters because over-commenting is a classic signal of AI-generated code. In a world where Anthropic’s Undercover Mode explicitly instructs Claude to hide its AI identity in open-source contributions, a model that compulsively comments every line of code is undermining its own stealth. It’s the AI equivalent of a spy who can’t stop explaining what they’re doing.

More importantly, over-commenting increases the token count of every response, raising costs for API users. For high-volume users of machine learning pipelines that use Claude as a coding agent, even a 20% increase in output tokens translates directly to 20% higher bills. This isn’t a style preference — it’s a cost problem at scale.

How does Anthropic’s model versioning compare to competitors?

The leak provides an unusually clear picture of how Anthropic stages model releases internally. Comparing this to what we know about OpenAI and Google:

Aspect Anthropic (from leak) OpenAI (public) Google (public)
Codename system Animal names (Fennec, Capybara, Numbat) Alphanumeric (e.g. „gpt-5.3-codex”) Animal names (Snow Bunny for Gemini 3.5)
Version increments Sub-integer (4.5 → 4.6 → 4.7) Mixed (GPT-4 → GPT-5 → GPT-5.1 → 5.2 → 5.3) Integer + variant (Gemini 3 → 3.1 → 3.5)
Tier structure Haiku / Sonnet / Opus / Mythos (new) Mini / Standard / Reasoning (o-series) Flash / Pro / Ultra
Internal metrics leaked False claims rate (16.7% → 29%) Not leaked Not leaked
Release cadence ~2 weeks between major updates (2026) ~4–6 weeks between versions ~6–8 weeks between versions

The most significant structural difference is the new Mythos tier. Both OpenAI and Google have maintained three-tier hierarchies (cheap/balanced/powerful). Anthropic introduced a fourth tier — a „beyond Opus” class that sits above their previous best, which shipped as Fable 5 and Mythos 5 on 9 June 2026. It represents a genuine architectural leap rather than an incremental version bump.

What should developers and AI practitioners expect next?

The combination of the Mythos CMS leak (March 26) and the Claude Code source leak (March 31) provides an unusually complete picture of Anthropic’s roadmap. Three conclusions stand out:

The Opus strings were real; the “Sonnet 4.8” string was not (update). Opus 4.7 shipped and was quickly superseded by Opus 4.8 (GA 28 May 2026). But despite appearing in the same forbidden-strings list, no Sonnet 4.8 ever launched — Anthropic’s Sonnet line still sits at 4.6, and the next leap went to the Mythos-class tier instead. The lesson: a version string in operational code signals intent, not a guaranteed release.

Capybara/Mythos was the real next-generation leap — and it landed (update). On 9 June 2026 Anthropic launched the Mythos-class tier as Claude Fable 5 (generally available) and Mythos 5 (restricted, then suspended). Exactly as the codenames implied, this was not “Opus 5” but a qualitatively new tier sitting above Opus. The leak’s framing of Capybara as a generational jump rather than an increment proved correct.

The false claims regression is the most honest AI metric ever accidentally published. Anthropic’s internal acknowledgment that their most capable model hallucinates at nearly 30% — and that this is worse than a less capable version — is the kind of data the industry needs more of. It confirms what practitioners building with RAG systems and context engineering already suspect: scaling capabilities does not automatically scale reliability, and external verification layers remain essential.

⚠️ Important caveat

This analysis is built on code comments, feature flags, and migration functions from the leak — not official Anthropic announcements. Several predictions have since been confirmed by real releases (Opus 4.8 on 28 May 2026; Fable 5 and Mythos 5 on 9 June 2026), while others never shipped (a “Sonnet 4.8” — the current Sonnet is 4.6). The internal false-claims metrics may have been measured under conditions that don’t reflect general use. Treat any still-unreleased codename details as directional, not definitive.

Related Guides

Building with these Claude models? Start here:

FAQ

Are Claude Opus 4.7 and Sonnet 4.8 confirmed models?

Partly — and the outcome is now known. Opus did ship: Opus 4.7 launched and was then superseded by Opus 4.8 (GA 28 May 2026). But Sonnet 4.8 never shipped — despite appearing in the leaked forbidden-strings list, Anthropic’s Sonnet line is still at 4.6. The strings confirmed internal intent, but a version number in leaked code did not guarantee a public release under that name.

What is the Capybara model — is it Claude 5?

Capybara was the internal codename for the Mythos-class tier — a class above Opus, not simply „Opus 5.” It shipped on 9 June 2026 as Claude Fable 5 (generally available) and Claude Mythos 5 (restricted, later suspended). So the codename mapped to a real, qualitatively new performance class — exactly as the leak implied.

What does a 29% false claims rate mean?

In internal benchmarks, Capybara v8 generates false or inaccurate claims in approximately 29–30% of relevant test cases. This is nearly double the 16.7% rate in v4. The specific benchmark methodology hasn’t been disclosed, but the regression suggests that increasing the model’s capabilities also increased its tendency to confidently assert incorrect information.

What is the „assertiveness counterweight”?

It’s a mechanism built into Capybara to prevent the model from being too aggressive when rewriting user code. As the model became more capable, it also became more confident in making changes — including wrong ones. The counterweight throttles this tendency, ensuring the model is less likely to make sweeping, potentially destructive code refactors without user approval.

What is Numbat?

Numbat is an internal codename for an unreleased model that is still in pre-launch testing, according to the leaked source code. No additional details — tier, capabilities, or timeline — have been disclosed. It follows the animal naming convention used for other Claude models (Fennec, Capybara).

When will Opus 4.7 and Sonnet 4.8 be released?

This is now answered. Opus shipped — Opus 4.8 reached GA on 28 May 2026. And Anthropic did skip straight to the Capybara/Mythos tier (Claude Fable 5 and Mythos 5, 9 June 2026) rather than releasing a “Sonnet 4.8,” which never materialized — the current Sonnet remains 4.6. The 4.7/4.8 Sonnet strings appear to have been internal-only iterations that were never shipped publicly.

Sources & Further Reading

  1. Kuberwastaken. (2026, March 31). Claude Code’s Source Code & Breakdown from a leaked map file in their NPM registry. GitHub. https://github.com/Kuberwastaken/claude-code
  2. Wiggers, K. (2026, March 31). Claude Code’s source code appears to have leaked: here’s what we know. VentureBeat. https://venturebeat.com
  3. NerdLevelTech. (2026, March 27). Anthropic Claude Mythos leak 2026: What we know. https://nerdleveltech.com
  4. WaveSpeedAI. (2026, March 29). Claude Mythos (Opus 5) leaked: What we know so far. https://wavespeed.ai
  5. thehuman2ai. (2026, March 31). Claude Code source has been available for 13 months, and nothing happened — why? https://thehuman2ai.com
  6. Naprys, E. (2026, March 31). Full source code for Anthropic’s Claude Code leaks. Cybernews. https://cybernews.com
  7. Anthropic. (2026). Models overview — Claude API Docs. https://platform.claude.com
  8. NxCode. (2026, February 17). Claude Sonnet 5 ‘Fennec’ leak: What actually launched as Claude Sonnet 4.6. https://www.nxcode.io
RELATED ARTICLES

2 COMMENTS

  1. Model roadmaps are useful, but creative workflows still need identity control. For generated character assets, I check whether the same face and style remain consistent before using outputs across scenes.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -

Most Popular

Recent Comments