#

computer-use

https://static.github-zh.com/github_avatars/bytedance?size=40

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 11.48 k
9 小时前
Upsonic/Upsonic
https://static.github-zh.com/github_avatars/Upsonic?size=40
Python 7.32 k
6 天前
https://static.github-zh.com/github_avatars/nanobrowser?size=40

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 5.14 k
5 天前
https://static.github-zh.com/github_avatars/trycua?size=40

Create and run high-performance macOS and Linux VMs on Apple Silicon, with built-in support for AI agents.

Python 4.07 k
21 小时前
https://static.github-zh.com/github_avatars/A9T9?size=40

Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.

JavaScript 1.52 k
24 天前
OpenAdaptAI/OpenAdapt
https://static.github-zh.com/github_avatars/OpenAdaptAI?size=40

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

Python 1.23 k
1 个月前
https://static.github-zh.com/github_avatars/showlab?size=40

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1.18 k
1 个月前
https://static.github-zh.com/github_avatars/trycua?size=40

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1.16 k
19 天前
https://static.github-zh.com/github_avatars/e2b-dev?size=40
Python 1.02 k
1 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40

An open-sourced end-to-end VLM-based GUI Agent

Python 899
11 天前
https://static.github-zh.com/github_avatars/deedy?size=40

A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.

Python 784
4 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 657
1 个月前
https://static.github-zh.com/github_avatars/suitedaces?size=40

Desktop app powered by Claude’s computer use capability to control your computer

Python 430
3 个月前
https://static.github-zh.com/github_avatars/BandarLabs?size=40

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python 426
2 个月前
https://static.github-zh.com/github_avatars/francedot?size=40

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

379
3 个月前
https://static.github-zh.com/github_avatars/OS-Agent-Survey?size=40

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".

249
5 天前
https://static.github-zh.com/github_avatars/BandarLabs?size=40

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python 232
3 个月前
https://static.github-zh.com/github_avatars/OS-Agent-Survey?size=40

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".

165
3 个月前
loading...
Website
Wikipedia