Jarvis is a state-of-the-art multimodal AI agent stack that gives you control over your computer through natural language and voice commands.
Features • Quick Start • Showcase • Contributing
Jarvis transforms your interaction with the digital world. No more complex menus or repetitive tasks: just tell Jarvis what you need, and watch it navigate your PC, browser, and apps with human-like precision.
- Voice & Text Control - Seamlessly switch between typing and talking to your computer.
- Visual Intelligence - Powered by advanced Vision-Language Models (VLMs) for screen understanding.
- Native GUI Automation - Precise mouse and keyboard control across all applications.
- Hybrid Browser Agent - Advanced web navigation using DOM and visual grounding.
- MCP Ecosystem - Extensible through Model Context Protocol (MCP) tool integration.
- Privacy First - Secure processing with support for local and private models.
"Help me book the earliest flight from San Jose to New York on September 1st and the last return flight on September 6th on Priceline"
agent-tars-new-flight.mp4
| Action | Local Operator | Remote Operator |
|---|---|---|
| VS Code Automation | computer-use-triple-speed.mp4 | remote-computer-operators.mp4 |
| GitHub Exploration | browser-use-triple-speed.mp4 | remote-browser-operators.mp4 |
Get up and running in seconds with our high-performance CLI.
# Launch instantly with npx
npx @agent-tars/cli@latest
# Or install globally (Requires Node.js >= 22)
npm install @agent-tars/cli@latest -g
# Run Jarvis with your preferred provider
agent-tars --provider anthropic --model claude-3-7-sonnet-latest --apiKey YOUR_API_KEY

For a full native experience with a sleek UI:
- Clone the repository.
- Run `pnpm install`.
- Start the dev environment: `npm run dev:ui-tars`.
- Follow the Desktop Quick Start Guide for deeper configuration.
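The global CLI install requires Node.js >= 22, so it can help to check the local version before running `npm install -g`. The following is a minimal sketch of such a preflight check, not part of Jarvis itself. The helper names `node_major` and `meets_requirement` are hypothetical and exist only for illustration.

```shell
#!/bin/sh
# Sketch: check the local Node.js version against the ">= 22" requirement
# stated for the global CLI install. Helper names are illustrative only.
REQUIRED_MAJOR=22

# Print the major version from a string such as "v22.3.0".
node_major() {
  version="${1#v}"              # drop the leading "v"
  printf '%s\n' "${version%%.*}" # keep everything before the first dot
}

# Succeed when the given version string satisfies the requirement.
meets_requirement() {
  [ "$(node_major "$1")" -ge "$REQUIRED_MAJOR" ]
}

if command -v node >/dev/null 2>&1; then
  current="$(node --version)"
  if meets_requirement "$current"; then
    echo "Node.js $current is new enough for a global install."
  else
    echo "Node.js $current is older than $REQUIRED_MAJOR; upgrade before installing globally."
  fi
else
  echo "Node.js was not found on PATH."
fi
```

The check only reports the result and never installs or upgrades anything, so it is safe to run before either install path.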
| Resource | Link |
|---|---|
| Website | TBD |
| Guides | Documentation |
| SDK | Build on Jarvis |
| Showcase | Use Cases & Examples |
We welcome contributions from the community! Whether it's a bug fix, a new feature, or better documentation, check out our CONTRIBUTING.md to get started.
Jarvis is open-source software licensed under the Apache License 2.0.
Note: This project is based on and was originally forked from UI-TARS-desktop.