Bytedance Seed 1.8
Bytedance Seed 1.8 is ByteDance's generalized agentic model. It combines a Search Agent, Code Agent, and GUI Agent in one multimodal system with token-efficient visual encoding and three adaptive thinking modes.
import { streamText } from 'ai'
const result = streamText({ model: 'bytedance/seed-1.8', prompt: 'Why is the sky blue?'})Playground
Try out Bytedance Seed 1.8 by ByteDance. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by ByteDance
| Model |
|---|
About Bytedance Seed 1.8
Bytedance Seed 1.8 integrates three agent functions into one system. The Search Agent handles information retrieval across web and document sources. The Code Agent writes, debugs, and runs code. The GUI Agent interacts with graphical interfaces on desktop, web, and mobile using native vision rather than scripted automation, and operates software the way a human would.
Visual token efficiency is a core engineering focus. Bytedance Seed 1.8 reduces image encoding token requirements without sacrificing reasoning quality. This matters for GUI-heavy workloads where dozens of screenshots may pass through a single session. Three adaptive thinking modes calibrate processing depth to task complexity. They skip unnecessary compute on straightforward steps and use deeper reflection on ambiguous decisions.
In ByteDance's published benchmarks, Bytedance Seed 1.8 reaches 67.6 on BrowseComp-en, 87.8 on VideoMME (long-form video understanding), and 11.0 on ZeroBench (multimodal reasoning). It scores 62.0 on VLMsAreBiased and 47.2 on WorldTravel, up from its predecessor Seed 1.5-VL in those tables. Evaluations cover simulated workflows including travel planning, financial analysis, and software engineering. See https://docs.byteplus.com/en/docs/ModelArk/2123228 for methodology, tables, and comparisons.
What To Consider When Choosing a Provider
- Configuration: For agentic pipelines with repeated GUI observation steps, confirm that your provider supports streaming for incremental processing of long action sequences. Compare token pricing ($0.25 in, $2 out per million tokens when listed).
- Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use Bytedance Seed 1.8
Best For
- Browser and desktop automation: The GUI Agent observes, clicks, types, and navigates interfaces without custom scripting
- Research workflows: Combine live information retrieval with synthesis and code execution in a single agent loop
- Agentic programming: The model writes, tests, and iterates on code autonomously
- Long-form video understanding: Supported by an 87.8 VideoMME score in ByteDance's published results
- Business process automation: Financial analysis, itinerary generation, and document-heavy enterprise workflows
Consider Alternatives When
- Single-turn generation: A lighter model would cost less when no agentic component is needed
- Strict JSON validation: Deterministic tool-call schemas require verification of multi-step function calling before production
- Pure text workloads: The model's image encoding overhead wastes compute when no visual inputs are involved
- Formal reasoning specialists: A model optimized for mathematics or formal logic may suit better than a generalized agent
Conclusion
Bytedance Seed 1.8 consolidates search, code, and GUI agency into one model. You don't need separate specialized systems for each capability. Token-efficient visual encoding and adaptive thinking depth make it practical for multi-step pipelines where input modality and task type vary across turns.
Frequently Asked Questions
What does "generalized agentic model" mean for Bytedance Seed 1.8?
It completes multi-step tasks autonomously across search, code, and graphical interfaces. It covers all three instead of specializing in one capability.
How does the GUI Agent in Bytedance Seed 1.8 work without traditional scripted automation?
It uses native vision to interpret screenshots and decide which actions to take: clicks, keystrokes, and form entries. It adapts to any UI layout without pre-defined selectors or automation scripts.
What is the BrowseComp-en benchmark and why is Bytedance Seed 1.8's score notable?
BrowseComp-en tests retrieval and synthesis through web browsing in English. Bytedance Seed 1.8 scores 67.6 in ByteDance's published table. See https://docs.byteplus.com/en/docs/ModelArk/2123228 for the full benchmark context.
How does token-efficient visual encoding benefit agentic applications?
Each GUI observation step sends one or more screenshots to the model. Fewer tokens per image means more steps fit within the context window at lower cost. This is especially important for long automation sessions with many intermediate observations.
Does Bytedance Seed 1.8 support video understanding in addition to image inputs?
Yes. Bytedance Seed 1.8 scores 87.8 on VideoMME (long-form video understanding) in ByteDance's published results. It processes temporal sequences of visual content alongside text instructions.
What kinds of real-world workflows was Bytedance Seed 1.8 evaluated on?
Evaluations cover simulated practical scenarios including travel planning, financial and business analysis, software engineering tasks, and multi-step information retrieval.
Is Bytedance Seed 1.8 accessible without setting up a Volcano Engine account?
Yes. Through AI Gateway, you authenticate with an API key or OIDC token and route requests to Bytedance Seed 1.8. You don't need a separate Volcano Engine account.