<a href="https://www.stork.ai/en/cua" target="_blank" rel="noopener noreferrer"><img src="https://www.stork.ai/api/badge/cua?style=dark" alt="cua - Featured on Stork.ai" height="36" /></a>

Markdown

[![cua - Featured on Stork.ai](https://www.stork.ai/api/badge/cua?style=dark)](https://www.stork.ai/en/cua)

overview

What is cua?

cua is an open-source infrastructure for Computer-Use Agents developed by Cua AI that enables engineers and developers to build, run, and test AI agents that control full desktops. It provides sandboxes, SDKs, and benchmarks to train and evaluate AI agents across macOS, Linux, and Windows environments.

quick facts

Quick Facts

Attribute	Value
Developer	Cua AI
Business Model	Freemium
Pricing	Freemium
Platforms	macOS, Linux, Windows, Android
API Available	Yes
Integrations	LLM Agnostic

features

Key Features of cua

Cua provides a comprehensive open-source infrastructure designed for the development, deployment, and evaluation of Computer-Use Agents. This system allows AI agents to interact with graphical user interfaces (GUIs) across various operating systems, mimicking human user behavior without relying on traditional APIs. Key functionalities include sandboxed environments, developer SDKs, and robust benchmarking tools.

1Open-source infrastructure for Computer-Use Agents.
2Sandboxed environments for macOS, Linux, Windows, and Android.
3SDKs for AI agent development.
4Benchmarks for training and evaluating AI agents.
5API access for programmatic control.
6Cloud desktop provisioning with prebuilt OS images.
7Environment configuration with custom dependencies.
8Snapshotting and forking of agent environments.
9Side-by-side model comparison for evaluation.
10Live GUI sessions and interactive shell control.

use cases

Who Should Use cua?

Cua is primarily designed for engineers and developers focused on creating and deploying AI agents capable of interacting with desktop environments. Its capabilities extend to automating complex digital tasks, streamlining business processes, and facilitating advanced software testing across diverse operating systems.

1Engineers and Developers: For building, running, and testing computer use AI agents across desktop environments.
2Automation Specialists: Automating routine digital tasks and business processes (BPA) across various enterprise software.
3Software Testers: Automating testing workflows for applications, including iOS apps, by interacting with the user interface.
4Researchers: Generating UI screenshots, agent action logs, recording multi-step interactions, and creating training data.
5Enterprises: Streamlining workflows and controlling applications that lack modern APIs, including legacy systems.

pricing

cua Pricing & Plans

Cua operates on a freemium model, providing access to core open-source infrastructure and functionalities without an upfront cost. Specific details regarding paid tiers, usage limits, or advanced feature access for commercial or high-volume deployments are not publicly detailed beyond the freemium designation.

1Freemium: Offers a free tier with core functionalities, with paid options for advanced features or increased usage.

competitors

cua vs Competitors

Cua positions itself as a foundational platform for developing AI agents that interact directly with operating systems and applications via their graphical user interfaces. This approach differentiates it from traditional automation tools and other AI agent frameworks by emphasizing sandboxed environments, cross-OS compatibility, and a focus on the underlying infrastructure for agent training and evaluation.

Bytebot↗

Bytebot is a self-hosted, open-source AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.

Similar to cua, Bytebot provides an open-source solution for AI agents to control desktops. While cua emphasizes sandboxes and SDKs for training and evaluation across macOS, Linux, and Windows, Bytebot focuses on a containerized Linux desktop environment for task execution and offers a live desktop view.

Eigent AI (Open Source Cowork)↗

Eigent Open Source Cowork is a desktop multi-agent workforce that connects to your context and can control the browser and desktop apps to automate real work, with options for self-hosting.

Like cua, Eigent is open-source and enables AI agents to interact with desktop environments. Eigent emphasizes a multi-agent workforce with dashboard controls and the ability to deploy on your own server, whereas cua focuses on providing the core infrastructure, sandboxes, SDKs, and benchmarks for agent development and evaluation.

goose↗

goose is a native open-source AI agent with desktop apps, CLI, and API for macOS, Linux, and Windows, supporting various LLMs and extensible via the Model Context Protocol.

goose directly competes with cua by offering a native, open-source AI agent that runs on multiple desktop operating systems. While cua provides infrastructure for building and evaluating agents, goose is the agent itself, offering a more complete end-user experience with a desktop app, CLI, and API.

Open Interpreter↗

Open Interpreter brings natural language control to local machines, allowing AI agents to execute Python, bash, and browser commands directly on the user's computer across macOS, Linux, and Windows.

Open Interpreter is similar to cua in its open-source nature and ability for AI agents to control local desktop environments. However, Open Interpreter focuses on direct command execution via a conversational interface, whereas cua provides the underlying infrastructure, sandboxes, and SDKs for developing and evaluating such agents.

❓

Frequently Asked Questions

+What is cua?

+Is cua free?

Cua operates on a freemium model, offering a free tier with core open-source functionalities. Specific details on paid tiers for advanced features or increased usage are not publicly detailed beyond this designation.

+What are the main features of cua?

Key features of cua include open-source infrastructure for Computer-Use Agents, sandboxed environments for macOS, Linux, Windows, and Android, SDKs for AI agent development, benchmarks for training and evaluation, API access, cloud desktop provisioning, environment configuration, snapshotting and forking, side-by-side model comparison, and live GUI sessions with interactive shell control.

+Who should use cua?

Cua is primarily intended for engineers and developers building, running, and testing computer use AI agents across desktop environments. It is also suitable for automation specialists, software testers, researchers, and enterprises looking to automate tasks, streamline business processes, and interact with applications lacking modern APIs.

+How does cua compare to alternatives?

Cua differentiates itself by providing core open-source infrastructure, sandboxes, SDKs, and benchmarks for training and evaluating AI agents across multiple operating systems (macOS, Linux, Windows). Unlike Bytebot, which focuses on a containerized Linux desktop for execution, or goose, which is a native agent itself, cua provides the foundational tools. Compared to Open Interpreter's direct command execution, cua offers the underlying infrastructure for developing such agents.