Claude Fable 5: What It Is and When to Use It
Fable 5 is Anthropic's most capable public model. What it excels at, how the safety fallback works, and when Opus 4.8 is still the better call.
On this page
Claude Fable 5 launched on June 9, 2026, as Anthropic's most capable publicly available model. It is the first model from Anthropic's Mythos class, a capability tier that sits above the Opus family, built for long-horizon tasks that previous models could not sustain.
A note on timing: within three days of launch, a US government export-control directive forced Anthropic to suspend access globally. That suspension was lifted on July 1, 2026. Fable 5 is fully available again across Claude.ai, the Claude API, AWS, Google Cloud, and Microsoft Foundry.
This post cuts through the launch noise to answer the questions that actually matter for developers: what tasks is Fable 5 genuinely better at, what does it cost, how does the safety fallback affect your integration, and when should you stick with Opus 4.8 instead.
What Makes Fable 5 Different from Opus 4.8
Fable 5 is not the next Opus update. The architecture is fundamentally different in how it approaches long tasks. Four changes matter most for developers.
- Adaptive thinking is always on. Every request runs extended reasoning by default, with depth controlled by the effort parameter. There is no mode to switch on or off, and an explicit request to disable thinking is rejected.
- The context and output limits are larger. Fable 5 supports a 1 million token context window and up to 128,000 output tokens per request. The 1M window is both the maximum and the default.
- The lead grows with task complexity. On quick, isolated questions the gap between Fable 5 and Opus 4.8 is small. On tasks that span thousands of files and require planning, execution, self-checking, and course correction across many steps, Fable 5's advantage compounds. Anthropic's own framing is explicit: the longer and more complex the task, the larger Fable 5's lead.
- It works autonomously for extended periods. In a Claude Code or agent harness, Fable 5 can plan across stages, delegate to sub-agents, write and run its own tests, and verify its own outputs across multi-hour sessions. This is qualitatively different from asking a model to write a single function.
Benchmark Snapshot
All numbers below come from Anthropic's official launch announcement on June 9, 2026, comparing Claude Fable 5 to Claude Opus 4.8.
| Benchmark | Fable 5 | Opus 4.8 | Notes |
|---|---|---|---|
| SWE-Bench Pro (agentic coding) | 80.3% | 69.2% | Largest coding benchmark lead |
| FrontierCode Diamond | 29.3% | 13.4% | Hardest long-horizon coding problems |
| Terminal-Bench 2.1 | 88.0%* | N/A | *Mythos 5 score; Fable 5 lower due to cybersecurity fallbacks |
| GDPval-AA (knowledge work) | 1932 | 1890 | General domain performance |
| Legal | 13.3% | 10.4% | Contract and statute reasoning |
Starred benchmarks represent Mythos 5 results, a sibling model with the same capabilities. On cybersecurity-related benchmarks, Fable 5 scores closer to Opus 4.8 because its safety classifiers route those queries elsewhere. For non-cybersecurity tasks, Fable 5 and Mythos 5 perform within 1 to 3 percentage points of each other.
Third-party evaluations at launch pointed the same direction. Hex reported Fable 5 was the first model to score 90% on its core analytics benchmark for complex, long-running analytical tasks. Genspark said Fable 5 beat every other model in its evaluations, with particularly strong UI design and game coding results. Wharton professor Ethan Mollick, who had early access, found it capable of working autonomously for up to 12 hours on multi-page specifications.
Where Fable 5 Shines: Task by Task
1. Long-horizon agentic coding
This is Fable 5's clearest advantage. Large-scale migrations, repository-wide refactors, complex implementations that need repeated test-and-fix loops, and multi-day autonomous sessions are exactly what the model was designed for.
The most concrete example from launch week: Stripe reported that Fable 5 compressed a codebase-wide migration across a 50-million-line Ruby infrastructure into a single day. The team estimated the same work would take a group of engineers more than two months by hand. The task required semantic understanding of thousands of files, correct application of transformations in unusual edge cases, and autonomous verification that each change did not break anything downstream.
If you run these jobs through an agent harness, the practical setup matters as much as the model. Our Claude Code cheatsheet covers running long-horizon coding sessions, and the multi-agent AI coding workflow guide covers delegating across parallel sub-agents, which is where Fable 5 pulls furthest ahead.
For shorter coding tasks, such as writing a unit test, explaining a snippet, or generating a simple component, the gap between Fable 5 and Opus 4.8 shrinks considerably. The premium is harder to justify there.
2. Complex knowledge work
Research synthesis across dozens of sources, multi-stage analysis feeding into a final deliverable, and projects that teams hand off and review as completed work rather than supervising step by step. Fable 5's larger context window and stronger sustained reasoning make it more reliable across this kind of work than Opus 4.8.
3. Document-heavy work in finance, legal, and analytics
Fable 5 uses vision to understand diagrams, charts, and tables nested in PDFs and documents, not just plain-text extraction. For contract review, financial report analysis, and architecture documentation, it can reason over the full document structure rather than working from text alone. Multiple law firms reported at launch that in blind review, Fable 5's redlines matched or beat their existing model on every comparison.
4. Multi-step autonomous workflows
Agent harnesses where the model must plan across stages, use tools, check intermediate results, change direction when an approach is not working, and complete the full job without human supervision in between. Fable 5 is more reliable at maintaining coherent state and goals across long chains of tool calls than earlier models. When you run several of these agents at once, git worktrees for parallel AI coding agents keep each session isolated on its own branch.
5. Code generation with self-validation
Fable 5 can write its own tests, run them, check the results, and use vision to compare its output against a design spec or goal image. This changes the economics of code review for high-value tasks: the model provides a validated result rather than one that still needs full human verification.
The Safety Fallback and Your Integration
This is the section most launch-week reviews handle poorly. The fallback behavior is genuinely novel and has real integration implications.
What triggers it. Fable 5 includes safety classifiers that watch for certain query types, primarily in three categories: offensive cybersecurity (exploit development, agentic hacking), biology with dual-use potential, and chemistry. When a query trips a classifier, Fable 5 does not return an error. It reroutes the request to Opus 4.8 and gives you the Opus 4.8 response instead.
What the API response looks like. When a fallback occurs, the API returns a standard HTTP 200 response with stop_reason: "refusal" in the message metadata, plus a stop_details object naming the classifier category that triggered it. This is different from a traditional refusal that returns no useful content: you still get a usable answer, just from Opus 4.8 rather than Fable 5.
response = client.messages.create(
model="claude-fable-5",
max_tokens=4096,
messages=[{"role": "user", "content": prompt}],
)
# Check stop_reason BEFORE reading content, or you will hit an
# index error on a rerouted request.
if response.stop_reason == "refusal":
category = response.stop_details.category # e.g. "cyber", "bio"
# This is a success case: the response came from the fallback model.
handle_fallback_response(response)
else:
print(response.content[0].text)Three practical changes for your integration. First, treat stop_reason: "refusal" as a success case, not an error. Your existing handling probably does not cover it if you have only used earlier Claude models. Second, decide whether fallback responses need to be flagged differently in your UX; for most enterprise applications a high-quality Opus 4.8 answer is still a good outcome. Third, on the raw API the fallback is opt-in through a fallbacks parameter, so make sure your request enables it rather than assuming a refusal is automatically rerouted.
How often does it trigger? Anthropic reports the classifiers fire in fewer than 5% of sessions on average. For data analysis, code review, content generation, and general knowledge work, the restriction will not appear at all. The covered categories are narrow and targeted.
Billing. You are not charged Fable 5 prices for rerouted requests. A fallback response is billed at Opus 4.8 rates, and a request declined before any output is not billed at all.
Pricing and When the Cost Premium Is Worth It
Fable 5 costs twice as much as Opus 4.8 at every tier. The right way to think about whether that premium pays off is cost per completed task, not cost per million tokens.
| Claude Fable 5 | Claude Opus 4.8 | |
|---|---|---|
| Input (per million tokens) | $10 | $5 |
| Output (per million tokens) | $50 | $25 |
| Context window | 1M tokens | 1M tokens |
| Max output | 128K tokens | 128K tokens |
| Prompt cache read discount | Up to 90% | Up to 90% |
If Fable 5 completes a complex migration in one attempt and Opus 4.8 fails twice before succeeding on the third, you have spent triple the Opus cost before getting a result. Even at double the per-token rate, Fable 5 can be the cheaper option when first-attempt success rates differ significantly.
Concretely: suppose a difficult task uses 500,000 input tokens and 100,000 output tokens. That is one Fable 5 run at about $10. If a cheaper model costs $4 per attempt and needs three attempts to produce a usable result, Fable 5 comes out ahead. Input tokens served from the prompt cache receive up to a 90% discount on the input price, which brings the effective cost down further for applications with large system prompts or repeated context.
Operational Considerations
Mandatory data retention. Fable 5 requires a 30-day data retention window on all API traffic, even for organizations that previously had zero-retention agreements. A request from an organization configured for zero retention returns a 400 error on every call. Anthropic states retained data is not used for training and is deleted after 30 days in most cases, with exceptions for safety investigations and legal obligations. For regulated industries, this needs to clear your compliance review before you migrate.
Model ID and availability. Use claude-fable-5 as the model string in API calls. Fable 5 is available on the Claude API, Claude.ai (Pro, Max, Team, and Enterprise), Amazon Bedrock, Google Cloud, and Microsoft Foundry.
US-only inference. If your workload must run on US infrastructure for compliance reasons, US-only inference is available at a 1.1x multiplier on standard pricing.
Decision Framework
The practical approach most teams are using: route high-complexity, high-stakes, long-horizon jobs to Fable 5, and default routine traffic to Opus 4.8 or a smaller model. This hybrid gets frontier-quality results where they matter without paying frontier prices everywhere.
| If your task looks like this | Use this model |
|---|---|
| Multi-day autonomous coding session | Fable 5 |
| Large codebase migration (100k+ lines) | Fable 5 |
| Long-horizon research with multi-step synthesis | Fable 5 |
| Contract review across many documents using vision | Fable 5 |
| Agentic workflow running for hours unattended | Fable 5 |
| A task that already failed several times with Opus 4.8 | Fable 5 |
| Writing a unit test for one function | Opus 4.8 or Sonnet |
| Quick code explanation or review | Opus 4.8 or Sonnet |
| High-volume chatbot responses | Sonnet or Haiku |
| Latency-sensitive or cost-sensitive pipeline | Opus 4.8 or Sonnet |
| Any task involving cybersecurity or biology topics | Opus 4.8 directly (avoids fallback overhead) |
Frequently Asked Questions
What is Claude Fable 5?
Claude Fable 5 is Anthropic's most capable publicly available model, released on June 9, 2026, and the first model in the Mythos capability tier that sits above the Opus family. It is built for long-horizon agentic work: multi-day coding sessions, large migrations, and autonomous workflows that plan, execute, and self-verify across many steps. Its lead over Opus 4.8 grows as tasks get longer and more complex.
Is Fable 5 worth double the price of Opus 4.8?
It depends on the task. Fable 5 costs $10 per million input tokens and $50 per million output tokens, versus $5 and $25 for Opus 4.8. Measure cost per completed task, not cost per token: if Fable 5 finishes a hard job on the first attempt where a cheaper model needs three, it can be cheaper overall despite the higher rate. For short, simple tasks, Opus 4.8 performs nearly as well for half the price.
What is the Fable 5 safety fallback?
When a request touches cybersecurity, dual-use biology, or chemistry, Fable 5's safety classifiers reroute it to Opus 4.8 instead of Fable 5. The API returns a normal HTTP 200 response with stop_reason: "refusal" and a category in stop_details, and the answer comes from Opus 4.8. Anthropic reports it triggers in fewer than 5% of sessions.
How does the fallback affect my API integration?
Add handling for stop_reason: "refusal" as a success case rather than an error, since a usable Opus 4.8 response is returned. Check stop_reason before reading response.content, and enable the fallbacks parameter on the request so a declined query is rerouted rather than simply stopping. Rerouted requests are billed at Opus 4.8 rates.
What is the model ID for Fable 5, and where can I use it?
Use claude-fable-5 as the model string. It is available on the Claude API, Claude.ai (Pro, Max, Team, and Enterprise), Amazon Bedrock, Google Cloud, and Microsoft Foundry. US-only inference is offered at a 1.1x pricing multiplier for workloads that must run on US infrastructure.
Does Fable 5 require data retention?
Yes. Fable 5 requires a 30-day data retention window on all API traffic, even for organizations that previously had zero-retention agreements. Requests from an organization configured for zero retention return a 400 error. Anthropic states retained data is not used for training and is deleted after 30 days in most cases, with exceptions for safety investigations and legal obligations. Regulated teams should clear this with compliance before migrating.
How large is the Fable 5 context window and output limit?
Fable 5 has a 1 million token context window, which is both the maximum and the default, and supports up to 128,000 output tokens per request. Adaptive thinking is always on, so a portion of the output budget is spent on reasoning; for very large outputs, use streaming to avoid request timeouts.
Related Articles
Claude Code Cheatsheet: Commands, Hooks & Subagents
The complete Claude Code reference: every slash command, keyboard shortcut, hook, subagent, and CLAUDE.md tip, with real examples for developers.
Multi-Agent AI Coding Workflow: Step-by-Step (2026)
Build a 3-agent AI coding workflow with CrewAI and Python. One agent writes, one reviews, one writes tests. Full code included.