UI-TARS-desktop

bytedance
12562
A GUI Agent application based on UI-TARS (a Vision-Language Model) that allows you to control your computer using natural language.
#agent #vlm #electron #vision #vite #browser-use #computer-use #gui-agents #mcp #mcp-server

Overview

What is UI-TARS-desktop

UI-TARS-desktop is a GUI Agent application built on UI-TARS, a Vision-Language Model, that lets users control their computers using natural language.

How to Use

To use UI-TARS-desktop, simply install the application, launch it, and interact with your computer by typing or speaking commands in natural language.

Key Features

Key features of UI-TARS-desktop include natural language processing, multimodal interaction, seamless integration with web browsers, command lines, and file systems, as well as the ability to visually interpret web pages.

Where to Use

UI-TARS-desktop can be used in various fields such as personal computing, software development, web automation, and accessibility tools for users with disabilities.

Use Cases

Use cases for UI-TARS-desktop include retrieving weather information, sending tweets, automating repetitive tasks, and controlling applications through voice commands.

Usage

Installation

Generate your exclusive SSE URL:

MCP Server Configuration (for all clients):

Installation in Cursor

Click on the Cursor settings icon in the top right, find the MCP option, then click the + button in the top right of that page and add the configuration in the config panel.

Note: Do not delete previously added configuration items.

When finished, restart Cursor. The MCP functions are then available in Cursor chat.

Installation in Claude

Find the JSON configuration file at the following path:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

If the file doesn't exist, create it, then paste the MCP server configuration above into the file.

Note: Do not delete previously added configuration items.

When finished, restart Claude. The MCP functions are then available in the Claude chat window.
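
The note about not deleting previously added configuration items can be followed mechanically by merging the new entry into the existing file instead of overwriting it. The following is a minimal sketch, not part of UI-TARS-desktop: the helper names `claude_config_path` and `merge_mcp_server` are hypothetical, and it assumes server entries live under a top-level `"mcpServers"` key, since the configuration itself is not shown above. The entry you pass in should be the one from the MCP Server Configuration step.

```python
import json
import os
import sys
from pathlib import Path


def claude_config_path() -> Path:
    """Return the Claude config path for the current OS, as listed above."""
    if sys.platform == "darwin":
        return (Path.home() / "Library" / "Application Support"
                / "Claude" / "claude_desktop_config.json")
    if sys.platform == "win32":
        return Path(os.environ["APPDATA"]) / "Claude" / "claude_desktop_config.json"
    raise RuntimeError("Only the macOS and Windows paths are documented above")


def merge_mcp_server(config_path: Path, name: str, entry: dict) -> dict:
    """Add or update one server entry, keeping every existing item.

    Assumption: server entries are stored under the "mcpServers" key.
    """
    config = {}
    if config_path.exists():
        text = config_path.read_text(encoding="utf-8")
        if text.strip():  # tolerate an empty file
            config = json.loads(text)

    # Only the named entry is touched; other servers and keys are preserved.
    servers = config.setdefault("mcpServers", {})
    servers[name] = entry

    # Create the folder if the file did not exist yet, then write it back.
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config
```

Calling `merge_mcp_server(claude_config_path(), name, entry)` with your own server name and entry would update the Claude file in place. The same function should work for any other client whose configuration file uses the same key, given that file's path.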

SDK

Installation in Python

Install the composio_openai package by running the following command in your terminal:

pip install composio_openai

Python example code:

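from composio_openai import ComposioToolSet, App
from openai import OpenAI

# OpenAI() reads the API key from the OPENAI_API_KEY environment variable.
openai_client = OpenAI()
# entity_id selects the Composio entity whose connected accounts are used.
composio_toolset = ComposioToolSet(entity_id="default")

# Fetch the tool definitions for the GitHub app in OpenAI's tool format.
tools = composio_toolset.get_tools(apps=[App.GITHUB])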

Installation in TypeScript

Install composio-core by running the following command in your project directory:

npm install composio-core

TypeScript example code:

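import { OpenAI } from "openai";
import { OpenAIToolSet } from "composio-core";

// new OpenAI() reads the API key from the OPENAI_API_KEY environment variable.
const openaiClient = new OpenAI();
const composioToolset = new OpenAIToolSet();

// Top-level await requires an ES module context.
const tools = await composioToolset.getTools({
  apps: ["github"],
});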

Content