Claude Opus 4.7

Claude Opus 4.7 extends the Opus line with task budgets, an xhigh effort level, and stronger visual verification. It targets long-running asynchronous agents, multi-step knowledge work, and workflows requiring pixel-level chart transcription and high-resolution image handling.

Tool Use, Reasoning, Vision (Image), File Input, Explicit Caching, Web Search

index.ts

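import { streamText } from 'ai'

// Start a streaming request for the model through AI Gateway.
const result = streamText({
  model: 'anthropic/claude-opus-4.7',
  prompt: 'Why is the sky blue?'
})

// Print the response text as it streams in.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}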

Playground

Try out Claude Opus 4.7 by Anthropic. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
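As a rough illustration of setting a provider preference, the sketch below builds an options object from an ordered list of provider slugs. The slugs (`anthropic`, `bedrock`, `vertexAnthropic`) are the ones named in the FAQ on this page. The helper name `preferProviders` and the `gateway.order` shape are assumptions, so check the AI Gateway docs for the exact option names before relying on them.

```typescript
// Hypothetical helper: builds a provider-preference object for a request.
// Assumption: the gateway reads the preference list from `gateway.order`.
type ProviderSlug = 'anthropic' | 'bedrock' | 'vertexAnthropic';

interface GatewayProviderOptions {
  gateway: { order: ProviderSlug[] };
}

function preferProviders(...order: ProviderSlug[]): GatewayProviderOptions {
  if (order.length === 0) {
    throw new Error('Pass at least one provider slug');
  }
  // Drop duplicates while keeping the caller's order of preference.
  const unique = order.filter((slug, index) => order.indexOf(slug) === index);
  return { gateway: { order: unique } };
}

const providerOptions = preferProviders('bedrock', 'anthropic', 'bedrock');
console.log(JSON.stringify(providerOptions));
// prints {"gateway":{"order":["bedrock","anthropic"]}}
```

If the assumed shape matches your SDK version, the returned object would be passed as the `providerOptions` argument of `streamText`.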

Provider	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date
Legal: Terms • Privacy		3.5s	69tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—				04/16/2026
Legal: Terms • Privacy		3.0s	48tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—				04/16/2026
Legal: Terms • Privacy		1.2s	55tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—				04/16/2026
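The listed rates translate into a per-request cost with simple arithmetic. The sketch below uses the Claude Opus 4.7 rates shown above ($5.00/M input, $25.00/M output, $0.5/M cache read, $6.25/M cache write). The function name `estimateCostUsd` and the `TokenUsage` shape are illustrative, the four token counts are assumed to be disjoint, and web search charges ($10/K plus input costs) are left out.

```typescript
// USD per one million tokens, copied from the provider listing; rates can change.
const RATES_PER_MILLION = {
  input: 5.0,
  output: 25.0,
  cacheRead: 0.5,
  cacheWrite: 6.25,
};

// Hypothetical usage record; counts are assumed not to overlap.
interface TokenUsage {
  input: number;
  output: number;
  cacheRead?: number;
  cacheWrite?: number;
}

function estimateCostUsd(usage: TokenUsage): number {
  const { input, output, cacheRead = 0, cacheWrite = 0 } = usage;
  // Multiply first, divide once, to keep the arithmetic easy to follow.
  const total =
    input * RATES_PER_MILLION.input +
    output * RATES_PER_MILLION.output +
    cacheRead * RATES_PER_MILLION.cacheRead +
    cacheWrite * RATES_PER_MILLION.cacheWrite;
  return total / 1_000_000;
}

// 200,000 input tokens and 20,000 output tokens:
// (200000 * 5 + 20000 * 25) / 1000000 = 1.5
console.log(estimateCostUsd({ input: 200_000, output: 20_000 }).toFixed(2)); // prints 1.50
```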

More models by Anthropic

Model	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date
		0.8s	53tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10/K + input costs	—					02/17/2026
		0.7s	52tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—					02/05/2026
	200K	0.5s	115tps	$1.00/M	$5.00/M	Read: $0.1/M, Write: $1.25/M	$10.00/K + input costs	—					10/15/2025
		0.8s	60tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10.00/K + input costs	—					09/29/2025
		0.6s	68tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10.00/K + input costs	—					05/22/2025
	200K	0.6s	51tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10.00/K + input costs	—					11/24/2024

About Claude Opus 4.7

Claude Opus 4.7 launched on AI Gateway on April 16, 2026. The model is optimized for long-running, asynchronous agents that execute complex, multi-step tasks. Compared with Claude Opus 4.6, it strengthens knowledge-work capabilities where visual verification of outputs matters and improves programmatic tool-calling with image-processing libraries.

Two API additions define the release. First, taskBudget lets you set token limits that cap individual agentic turns, bounding runaway cost on open-ended autonomous work. Second, adaptive thinking gains an xhigh effort level that sits between high and the max ceiling, giving finer-grained control over how deeply the model reasons. Thinking content is no longer returned by default; configure the display option when you want to surface it.
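To make the three settings concrete, here is a minimal sketch that assembles them into one options object. Only the names `taskBudget`, `xhigh`, and `'summarized'` come from this page. The nesting under `anthropic`, the fields `effort`, `thinking.display`, and `taskBudget.total`, and the helper `buildAgentTurnOptions` are assumptions for illustration and may not match the real request shape.

```typescript
// Sketch only: field names and nesting below are assumed, not confirmed.
type Effort = 'high' | 'xhigh' | 'max';

interface AgentTurnOptions {
  anthropic: {
    effort: Effort;
    taskBudget?: { type: 'tokens'; total: number };
    thinking?: { type: 'adaptive'; display: 'summarized' };
  };
}

interface AgentTurnParams {
  effort: Effort;
  taskBudgetTokens?: number; // token cap for one agentic turn
  showThinking?: boolean; // thinking stays omitted unless requested
}

function buildAgentTurnOptions(params: AgentTurnParams): AgentTurnOptions {
  const options: AgentTurnOptions = { anthropic: { effort: params.effort } };

  if (params.taskBudgetTokens !== undefined) {
    if (!Number.isInteger(params.taskBudgetTokens) || params.taskBudgetTokens <= 0) {
      throw new RangeError('taskBudgetTokens must be a positive integer');
    }
    options.anthropic.taskBudget = { type: 'tokens', total: params.taskBudgetTokens };
  }

  if (params.showThinking) {
    // Ask for summarized reasoning to be surfaced in the response.
    options.anthropic.thinking = { type: 'adaptive', display: 'summarized' };
  }

  return options;
}

const turnOptions = buildAgentTurnOptions({
  effort: 'xhigh',
  taskBudgetTokens: 50_000,
  showThinking: true,
});
console.log(JSON.stringify(turnOptions, null, 2));
```

If the assumed shape holds, the result would be passed as the `providerOptions` argument of `streamText`. Validating the budget up front keeps a zero or fractional cap from reaching the API.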

Claude Opus 4.7 handles pixel-level data transcription from charts and figures, high-resolution images for computer use and screenshot analysis, and document parsing where detail accuracy matters. Structured memory across conversation turns keeps state reliable over extended sessions.

Through AI Gateway, Claude Opus 4.7 is available with the standard unified API, observability, and provider routing. Set the model to anthropic/claude-opus-4.7 in the AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.
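For the Chat Completions format, a request can be prepared as plain data and then sent with `fetch`. The sketch below is one way to do that. The base URL in `ASSUMED_BASE_URL`, the bearer-token header, and the helper `buildChatCompletionRequest` are assumptions, so confirm the endpoint in the AI Gateway docs.

```typescript
// Assumption: an OpenAI-compatible Chat Completions endpoint lives at this base URL.
const ASSUMED_BASE_URL = 'https://ai-gateway.vercel.sh/v1';

interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: 'POST'; headers: Record<string, string>; body: string };
}

function buildChatCompletionRequest(token: string, messages: ChatMessage[]): PreparedRequest {
  if (token.trim() === '') {
    throw new Error('A gateway API key or OIDC token is required');
  }
  if (messages.length === 0) {
    throw new Error('At least one message is required');
  }
  return {
    url: `${ASSUMED_BASE_URL}/chat/completions`,
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${token}`,
        'Content-Type': 'application/json',
      },
      // The model ID is the one given on this page.
      body: JSON.stringify({ model: 'anthropic/claude-opus-4.7', messages }),
    },
  };
}

const prepared = buildChatCompletionRequest('example-token', [
  { role: 'user', content: 'Why is the sky blue?' },
]);
console.log(prepared.url);
// To send it: const response = await fetch(prepared.url, prepared.init);
```

Keeping the builder free of network calls makes it easy to inspect or log the exact payload before sending.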

What To Consider When Choosing a Provider

Configuration: Thinking content is omitted from responses by default in Claude Opus 4.7. Configure the thinking display option explicitly when you want reasoning traces surfaced in the response.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
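Since requests authenticate with either an API key or an OIDC token, application code often needs to pick whichever credential is present. The sketch below shows one way to do that. The environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN`, the helper `resolveGatewayCredential`, and the choice to prefer the API key are all assumptions.

```typescript
type GatewayCredential =
  | { kind: 'api-key'; token: string }
  | { kind: 'oidc'; token: string };

// Assumed environment variable names; adjust them to your deployment.
const API_KEY_VAR = 'AI_GATEWAY_API_KEY';
const OIDC_TOKEN_VAR = 'VERCEL_OIDC_TOKEN';

function resolveGatewayCredential(
  env: Record<string, string | undefined>,
): GatewayCredential {
  // Prefer an explicit API key, then fall back to an OIDC token.
  const apiKey = env[API_KEY_VAR];
  if (apiKey) {
    return { kind: 'api-key', token: apiKey };
  }
  const oidcToken = env[OIDC_TOKEN_VAR];
  if (oidcToken) {
    return { kind: 'oidc', token: oidcToken };
  }
  throw new Error(`Set ${API_KEY_VAR} or ${OIDC_TOKEN_VAR}`);
}

const credential = resolveGatewayCredential({ AI_GATEWAY_API_KEY: 'example-key' });
console.log(credential.kind); // prints api-key
```

In a Node.js process the argument would typically be `process.env`. Taking the environment as a parameter keeps the function easy to test.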

When to Use Claude Opus 4.7

Best For

Long-running asynchronous agents: Multi-step workflows where taskBudget caps cost on open-ended autonomous execution
Visual verification tasks: Knowledge work that requires pixel-level transcription from charts, figures, and technical diagrams
Computer use at high resolution: Screenshot analysis, GUI automation, and document processing where image detail affects accuracy
Programmatic tool-calling with images: Agents that invoke image-processing libraries as part of multi-step reasoning
Extended agent sessions: Structured memory across turns keeps state reliable across long conversations

Consider Alternatives When

Cost-sensitive production workloads: Claude Sonnet 4.6 provides Opus-approaching intelligence at Sonnet pricing for high-volume traffic
Interactive low-latency use cases: Claude Haiku 4.5 delivers faster responses for well-bounded high-throughput requests
Fixed thinking budget control: Claude Opus 4.6 accepts fixed thinking budgets when you need deterministic compute per request rather than adaptive scaling
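The guidance in the two lists above can be captured as a small lookup from workload type to model. This is only a sketch: the `Workload` labels and `pickModel` helper are invented for illustration, and only `anthropic/claude-opus-4.7` is a model ID confirmed on this page. The other three IDs are assumed to follow the same naming pattern and should be verified.

```typescript
type Workload =
  | 'long-running-agent'
  | 'visual-verification'
  | 'high-volume'
  | 'low-latency'
  | 'fixed-thinking-budget';

// Only the Opus 4.7 ID appears on this page; the rest are assumed IDs.
const MODEL_FOR_WORKLOAD: Record<Workload, string> = {
  'long-running-agent': 'anthropic/claude-opus-4.7',
  'visual-verification': 'anthropic/claude-opus-4.7',
  'high-volume': 'anthropic/claude-sonnet-4.6', // assumed ID
  'low-latency': 'anthropic/claude-haiku-4.5', // assumed ID
  'fixed-thinking-budget': 'anthropic/claude-opus-4.6', // assumed ID
};

function pickModel(workload: Workload): string {
  return MODEL_FOR_WORKLOAD[workload];
}

console.log(pickModel('long-running-agent')); // prints anthropic/claude-opus-4.7
```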

Conclusion

Claude Opus 4.7 targets the hardest agentic workloads where tasks run for extended periods, span many steps, and involve visual verification. Task budgets, xhigh effort, and stronger image tool-calling make it a practical fit for knowledge-worker pipelines that previously pushed against Opus 4.6's limits.

Frequently Asked Questions

What does the taskBudget parameter do?
It sets a token limit for an individual agentic turn, capping the maximum compute the model can use on open-ended autonomous work. Use it to bound runaway cost on long-running agents.
How is the xhigh effort level different from high?
xhigh sits between high and max, offering deeper adaptive thinking than high without going to the max ceiling. Use it when the task warrants more depth than high provides but max would be overkill.
Why don't I see reasoning traces in my responses?
Thinking content is omitted by default in Claude Opus 4.7. Configure the thinking display option (for example, 'summarized') when you want reasoning visible in the response.
What makes Claude Opus 4.7 better for image-heavy agents?
High-resolution image support for computer use, screenshots, and documents, plus stronger programmatic tool-calling with image-processing libraries. The model also transcribes pixel-level data from charts and figures with higher fidelity.
Does Vercel AI Gateway support Zero Data Retention for Claude Opus 4.7?
Yes. Zero Data Retention is available for this model and is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.
What is the pricing for Claude Opus 4.7?
Rates are listed on this page. They reflect the providers routing through AI Gateway and shift when providers update their pricing.
How do I call Claude Opus 4.7 through AI Gateway?
Set the model to anthropic/claude-opus-4.7 in the AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python. AI Gateway handles authentication, retries, and failover across the anthropic, bedrock, and vertexAnthropic providers.
