Claude Opus 4.7

Claude Opus 4.7 extends the Opus line with task budgets, an xhigh effort level, and stronger visual verification. It targets long-running asynchronous agents, multi-step knowledge work, and workflows requiring pixel-level chart transcription and high-resolution image handling.

Tool Use, Reasoning, Vision (Image), File Input, Explicit Caching, Web Search
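index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'anthropic/claude-opus-4.7',
  prompt: 'Why is the sky blue?'
})

// Print the response text as it streams in.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}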

Playground

Try out Claude Opus 4.7 by Anthropic. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

| Provider | Legal | Context | Latency | Throughput | Input | Output | Cache Read | Cache Write | Web Search (per query) | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anthropic | Terms, Privacy | 1M | 3.5s | 69tps | $5.00/M | $25.00/M | $0.5/M | $6.25/M | $10/K + input costs | 04/16/2026 |
| Amazon Bedrock | Terms, Privacy | 1M | 3.0s | 48tps | $5.00/M | $25.00/M | $0.5/M | $6.25/M | $10/K + input costs | 04/16/2026 |
| Google Vertex AI | Terms, Privacy | 1M | 1.2s | 55tps | $5.00/M | $25.00/M | $0.5/M | $6.25/M | $10/K + input costs | 04/16/2026 |
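
The listed rates translate into a per-request cost with simple arithmetic. The sketch below is only an illustration: the rates are copied from the provider table on this page, while the `TokenUsage` shape and the `estimateCostUsd` helper are hypothetical names introduced here, not part of any SDK. Rates shift when providers update pricing, so treat the constants as a snapshot.

```typescript
// USD rates copied from the provider table on this page (snapshot, may change).
const RATES = {
  inputPerMillion: 5.0,
  outputPerMillion: 25.0,
  cacheReadPerMillion: 0.5,
  cacheWritePerMillion: 6.25,
  webSearchPerThousandQueries: 10.0,
} as const

// Hypothetical usage shape: token and query counts for one request.
interface TokenUsage {
  input: number
  output: number
  cacheRead?: number
  cacheWrite?: number
  webSearchQueries?: number
}

// Estimate the USD cost of one request from its usage counts.
function estimateCostUsd(usage: TokenUsage): number {
  const million = 1_000_000
  return (
    (usage.input * RATES.inputPerMillion) / million +
    (usage.output * RATES.outputPerMillion) / million +
    ((usage.cacheRead ?? 0) * RATES.cacheReadPerMillion) / million +
    ((usage.cacheWrite ?? 0) * RATES.cacheWritePerMillion) / million +
    ((usage.webSearchQueries ?? 0) * RATES.webSearchPerThousandQueries) / 1000
  )
}

// 200,000 input tokens plus 10,000 output tokens: 1.00 + 0.25 USD.
const exampleCost = estimateCostUsd({ input: 200_000, output: 10_000 })
console.log(exampleCost.toFixed(2)) // prints "1.25"
```

Web search queries also consume input tokens (the table notes "+ input costs"), so those tokens belong in the `input` count rather than in the per-query fee.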
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Anthropic

| Model | Context | Latency | Throughput | Input | Output | Cache Read | Cache Write | Web Search (per query) | Providers | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| | 1M | 0.8s | 53tps | $3.00/M | $15.00/M | $0.3/M | $3.75/M | $10/K + input costs | anthropic, bedrock, vertexAnthropic | 02/17/2026 |
| | 1M | 0.7s | 52tps | $5.00/M | $25.00/M | $0.5/M | $6.25/M | $10/K + input costs | anthropic, bedrock, vertexAnthropic | 02/05/2026 |
| | 200K | 0.5s | 115tps | $1.00/M | $5.00/M | $0.1/M | $1.25/M | $10.00/K + input costs | anthropic, bedrock, vertexAnthropic | 10/15/2025 |
| | 1M | 0.8s | 60tps | $3.00/M | $15.00/M | $0.3/M | $3.75/M | $10.00/K + input costs | anthropic, bedrock, vertexAnthropic | 09/29/2025 |
| | 1M | 0.6s | 68tps | $3.00/M | $15.00/M | $0.3/M | $3.75/M | $10.00/K + input costs | anthropic, bedrock, vertexAnthropic | 05/22/2025 |
| | 200K | 0.6s | 51tps | $5.00/M | $25.00/M | $0.5/M | $6.25/M | $10.00/K + input costs | anthropic, bedrock, vertexAnthropic | 11/24/2024 |

About Claude Opus 4.7

Claude Opus 4.7 launched on AI Gateway on April 16, 2026. The model is optimized for long-running, asynchronous agents that execute complex, multi-step tasks. Compared with Opus 4.6, it strengthens knowledge-work capabilities where visual verification of outputs matters and improves programmatic tool calling with image-processing libraries.

Two API additions define the release. First, taskBudget lets you set token limits that cap individual agentic turns, bounding runaway cost on open-ended autonomous work. Second, adaptive thinking gains an xhigh effort level that sits between high and the max ceiling, giving finer-grained control over how deeply the model reasons. Thinking content is no longer returned by default; configure the display option when you want to surface it.
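
The three settings described above can be gathered in one place before a request is built. The following is a minimal sketch under stated assumptions: the names `taskBudget`, `xhigh`, and the `'summarized'` display value come from this page, but the nesting of the fields, the unit of `taskBudget`, and the default effort of `'high'` are assumptions, and `Opus47Options` and `buildOpus47Options` are hypothetical helpers rather than a documented API. Check the provider documentation for the actual request shape.

```typescript
// Effort levels named on this page, from shallower to deeper reasoning.
type Effort = 'high' | 'xhigh' | 'max'

// Hypothetical option shape: the nesting and units are assumptions.
interface Opus47Options {
  effort: Effort
  taskBudget?: number // assumed: token limit for one agentic turn
  thinking: { display?: 'summarized' } // no display value = thinking not returned
}

interface BuildArgs {
  effort?: Effort
  taskBudget?: number
  showThinking?: boolean
}

// Validate the inputs and assemble the (assumed) option object.
function buildOpus47Options(args: BuildArgs = {}): Opus47Options {
  const { effort = 'high', taskBudget, showThinking = false } = args
  if (taskBudget !== undefined && (!Number.isInteger(taskBudget) || taskBudget <= 0)) {
    throw new RangeError('taskBudget must be a positive integer token count')
  }
  return {
    effort,
    ...(taskBudget !== undefined ? { taskBudget } : {}),
    thinking: showThinking ? { display: 'summarized' } : {},
  }
}

const agentOptions = buildOpus47Options({
  effort: 'xhigh',
  taskBudget: 50_000,
  showThinking: true,
})
console.log(JSON.stringify(agentOptions))
```

Leaving `showThinking` at its default mirrors the model's behavior of omitting thinking content unless a display option is configured.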

Claude Opus 4.7 handles pixel-level data transcription from charts and figures, high-resolution images for computer use and screenshot analysis, and document parsing where detail accuracy matters. Structured memory across conversation turns keeps state reliable over extended sessions.

Through AI Gateway, Claude Opus 4.7 is available with the standard unified API, observability, and provider routing. Set the model to anthropic/claude-opus-4.7 in the AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.
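
For the Chat Completions format, the request is an ordinary JSON POST with the model slug in the body. The sketch below only builds the request so it can be inspected without network access. The endpoint URL is an assumption based on the gateway's OpenAI-compatible API and should be confirmed in the docs; `buildChatRequest` and `ChatRequest` are hypothetical names used for illustration.

```typescript
// Assumed OpenAI-compatible endpoint for AI Gateway; confirm in the docs.
const GATEWAY_CHAT_URL = 'https://ai-gateway.vercel.sh/v1/chat/completions'

// Minimal description of a fetch call: a URL plus its init options.
interface ChatRequest {
  url: string
  init: { method: 'POST'; headers: Record<string, string>; body: string }
}

// Build a Chat Completions request for Claude Opus 4.7 without sending it.
function buildChatRequest(token: string, prompt: string): ChatRequest {
  return {
    url: GATEWAY_CHAT_URL,
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${token}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model: 'anthropic/claude-opus-4.7',
        messages: [{ role: 'user', content: prompt }],
      }),
    },
  }
}

const chatRequest = buildChatRequest('YOUR_API_KEY', 'Why is the sky blue?')
console.log(chatRequest.init.body)
// To send it: const response = await fetch(chatRequest.url, chatRequest.init)
```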

What To Consider When Choosing a Provider

  • Configuration: Thinking content is omitted from responses by default in Claude Opus 4.7. Configure the thinking display option explicitly when you want reasoning traces surfaced in the response.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
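
Because the gateway accepts either an API key or an OIDC token, client code can resolve whichever credential is present. This is a sketch, not the gateway's own logic: the environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN` are assumptions, the preference for the API key over the OIDC token is a design choice made here, and `resolveGatewayCredential` is a hypothetical helper.

```typescript
// A plain map of environment variables, e.g. process.env in Node.js.
type Env = Record<string, string | undefined>

interface GatewayCredential {
  kind: 'api-key' | 'oidc'
  token: string
}

// Return the first usable credential, preferring the API key (a choice made
// for this sketch). The variable names are assumptions, not confirmed names.
function resolveGatewayCredential(env: Env): GatewayCredential {
  const apiKey = env['AI_GATEWAY_API_KEY']?.trim()
  if (apiKey) return { kind: 'api-key', token: apiKey }

  const oidcToken = env['VERCEL_OIDC_TOKEN']?.trim()
  if (oidcToken) return { kind: 'oidc', token: oidcToken }

  throw new Error('No AI Gateway credential found in the environment')
}

const credential = resolveGatewayCredential({ VERCEL_OIDC_TOKEN: 'example-oidc-token' })
console.log(credential.kind) // prints "oidc"
```

Either credential would then be sent as a bearer token; no provider-specific keys are involved.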

When to Use Claude Opus 4.7

Best For

  • Long-running asynchronous agents: Multi-step workflows where taskBudget caps cost on open-ended autonomous execution
  • Visual verification tasks: Knowledge work that requires pixel-level transcription from charts, figures, and technical diagrams
  • Computer use at high resolution: Screenshot analysis, GUI automation, and document processing where image detail affects accuracy
  • Programmatic tool-calling with images: Agents that invoke image-processing libraries as part of multi-step reasoning
  • Extended agent sessions: Structured memory across turns keeps state reliable across long conversations

Consider Alternatives When

  • Cost-sensitive production workloads: Claude Sonnet 4.6 provides near-Opus intelligence at Sonnet pricing for high-volume traffic
  • Interactive low-latency use cases: Claude Haiku 4.5 delivers faster responses for well-bounded high-throughput requests
  • Fixed thinking budget control: Claude Opus 4.6 accepts fixed thinking budgets when you need deterministic compute per request rather than adaptive scaling
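
The guidance in the two lists above can be encoded as a small routing table. This is an illustrative sketch only: the workload labels are invented here to summarize the bullets, and the slugs for Sonnet 4.6, Haiku 4.5, and Opus 4.6 are assumptions patterned on `anthropic/claude-opus-4.7`, so verify them against the model catalog before use.

```typescript
// Workload categories summarizing the bullets above (labels are hypothetical).
type Workload =
  | 'long-running-agent'
  | 'visual-verification'
  | 'high-volume'
  | 'low-latency'
  | 'fixed-thinking-budget'

// Slugs other than claude-opus-4.7 are assumed to follow the same pattern.
const MODEL_FOR_WORKLOAD: Record<Workload, string> = {
  'long-running-agent': 'anthropic/claude-opus-4.7',
  'visual-verification': 'anthropic/claude-opus-4.7',
  'high-volume': 'anthropic/claude-sonnet-4.6',
  'low-latency': 'anthropic/claude-haiku-4.5',
  'fixed-thinking-budget': 'anthropic/claude-opus-4.6',
}

// Look up the suggested model slug for a workload category.
function pickModel(workload: Workload): string {
  return MODEL_FOR_WORKLOAD[workload]
}

console.log(pickModel('low-latency')) // prints "anthropic/claude-haiku-4.5"
```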

Conclusion

Claude Opus 4.7 targets the hardest agentic workloads where tasks run for extended periods, span many steps, and involve visual verification. Task budgets, xhigh effort, and stronger image tool-calling make it a practical fit for knowledge-worker pipelines that previously pushed against Opus 4.6's limits.

Frequently Asked Questions

  • What does the taskBudget parameter do?

    It sets a token limit for an individual agentic turn, capping the maximum compute the model can use on open-ended autonomous work. Use it to bound runaway cost on long-running agents.

  • How is the xhigh effort level different from high?

    xhigh sits between high and max, offering deeper adaptive thinking than high without going to the max ceiling. Use it when the task warrants more depth than high provides but max would be overkill.

  • Why don't I see reasoning traces in my responses?

    Thinking content is omitted by default in Claude Opus 4.7. Configure the thinking display option (for example, 'summarized') when you want reasoning visible in the response.

  • What makes Claude Opus 4.7 better for image-heavy agents?

    High-resolution image support for computer use, screenshots, and documents, plus stronger programmatic tool-calling with image-processing libraries. The model also transcribes pixel-level data from charts and figures with higher fidelity.

  • Does Vercel AI Gateway support Zero Data Retention for Claude Opus 4.7?

    Yes. Zero Data Retention is available for this model and is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.

  • What is the pricing for Claude Opus 4.7?

    Rates are listed on this page. They reflect the providers routing through AI Gateway and shift when providers update their pricing.

  • How do I call Claude Opus 4.7 through AI Gateway?

    Set the model to anthropic/claude-opus-4.7 in the AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python. AI Gateway handles authentication, retries, and failover across the anthropic, bedrock, and vertexAnthropic providers.
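
A provider preference can be expressed as an ordered list of slugs. The sketch below validates such a list against the three slugs named on this page and removes duplicates. The returned `{ gateway: { order } }` shape is an assumption about how the preference is passed in request options, and `buildProviderOrder` is a hypothetical helper; consult the provider routing docs for the exact format.

```typescript
// Provider slugs as listed on this page.
const PROVIDER_SLUGS = ['anthropic', 'bedrock', 'vertexAnthropic'] as const
type ProviderSlug = (typeof PROVIDER_SLUGS)[number]

// Validate and de-duplicate a preferred provider order.
// The { gateway: { order } } wrapper is an assumed option shape.
function buildProviderOrder(preferred: string[]): { gateway: { order: ProviderSlug[] } } {
  const order: ProviderSlug[] = []
  for (const slug of preferred) {
    const known = PROVIDER_SLUGS.find((candidate) => candidate === slug)
    if (known === undefined) {
      throw new Error(`Unknown provider slug: ${slug}`)
    }
    // Keep only the first occurrence so the order stays unambiguous.
    if (order.indexOf(known) === -1) order.push(known)
  }
  return { gateway: { order } }
}

const routing = buildProviderOrder(['vertexAnthropic', 'anthropic', 'vertexAnthropic'])
console.log(routing.gateway.order.join(', ')) // prints "vertexAnthropic, anthropic"
```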