ByteDance: UI-TARS 7B

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Developed by ByteDance, it extends the UI-TARS framework with reinforcement-learning-based reasoning, enabling robust action planning and execution across virtual interfaces. The model achieves state-of-the-art results on a range of interactive and grounding benchmarks, including OSWorld, WebVoyager, AndroidWorld, and ScreenSpot. It also demonstrates perfect task completion across diverse Poki games and outperforms prior models on Minecraft agent tasks. UI-TARS-1.5 supports thought decomposition during inference and scales well across model sizes, with the 1.5 version notably exceeding the performance of the earlier 72B and 7B checkpoints.

Context Window: 128K tokens
Parameters: 7B
Input Price: $0.10 per 1M tokens
Output Price: $0.20 per 1M tokens
Price Tier: budget
Provider: ByteDance
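
To make the listed prices concrete, here is a minimal sketch of the per-request arithmetic. It uses the input and output rates shown above. The helper name `estimate_cost` is hypothetical, and the sketch assumes that billing is strictly proportional to token counts. Actual charges through CoreAI may differ.

```python
# Rates copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.20


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 50,000-token prompt that produces a 2,000-token reply.
    cost = estimate_cost(50_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")  # prints "Estimated cost: $0.0054"
```

At these rates a request costs fractions of a cent even with a long prompt, which is consistent with the "budget" price tier.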

How to Use ByteDance: UI-TARS 7B

With CoreAI, you can start chatting with ByteDance: UI-TARS 7B right away, with no separate subscription needed. CoreAI bundles access to ByteDance: UI-TARS 7B with 300+ other AI models from ByteDance and other providers, including OpenAI, Anthropic, Google, and Meta.

  1. Download the CoreAI app for iOS or Android, or use the Web App
  2. Select ByteDance: UI-TARS 7B from the model selector
  3. Start chatting, comparing, or creating with AI

