
WorkBuddy: desktop-level AI intelligence for manipulating local files
WorkBuddy is a desktop-level AI Agent workbench launched by Tencent Cloud, aiming to let AI evolve from “chatting with you” to “working for you”. Unlike traditional AI dialog tools that run in a browser, WorkBuddy is a standalone software installed on a computer that has the ability to manipulate local files...

QoderWork: The Intelligent Desktop Agent Assistant That Automates Complex Tasks
QoderWork is a revolutionary desktop AI agent tool from Alibaba's Qoder team, designed to expand the capabilities of artificial intelligence from mere “conversation and chat” to true “task execution”. As an intelligent assistant running on the desktop, QoderWork is capable of using natural language...

OpenAdapt: an open source tool for automated manipulation of computer applications using large models
OpenAdapt is an open-source software tool that connects powerful Large Multimodal Models (LMMs) to a computer's Graphical User Interface (GUI) with the aim of automating processes. Traditionally, a great deal of mental effort has been wasted on repetitive computer operations, and OpenAdapt aims to solve this problem. It works originally ...

Step AI desktop intelligences: desktop intelligences that use natural language to operate computers
Step AI Desktop Partner is an artificial intelligence assistant that runs on the operating system of a personal computer and understands and executes the user's natural language commands for various computer operations. The tool is not limited to browsers and can interact directly and deeply with the operating system to manage local files, access the Internet and perform cross-application tasks. The user is reached via a hovering desktop right...

Youtu-agent: a framework for AI intelligences that operate computers to automate tasks
Youtu-agent is a powerful and cleanly designed AI Intelligent Body framework developed by Tencent Youtu Lab. It is specifically designed for building, running, and evaluating autonomous AI intelligences, and its core feature is to fully embrace open source models and achieve excellent performance without relying on any closed source big models. The framework has been validated by rigorous benchmarking in making...

Ninja AI: Automating Browser Tasks with AI Intelligentsia
Ninja AI is an Artificial Intelligence (AI) intelligence that runs in a user's browser and is used to automate various online tasks. The tool is designed to act like a ninja, silently handling tasks that need to be done in the browser and are repetitive or time-consuming. The user can give commands to Ninja A...

Asteroid AI: Artificial Intelligence Browser Intelligence for Business Process Automation
Asteroid AI is an Artificial Intelligence browser automation platform that allows users to quickly build “browser intelligences” to automate repetitive web page operations instead of humans. The tool can be used by both technical developers and non-technical business people. Users can give commands in natural language through a visual interface...

AutoGLM: Using voice and text to operate intelligences to complete automated computer and cell phone operations
AutoGLM is an AI intelligent body application developed by ZhipuAI (ZhipuAI). It is not a simple chatbot, but an executive assistant that can actually operate. Users can let AutoGLM autonomously complete various tasks on a virtual computer or cell phone in the cloud through simple natural language commands. For example, it can automatically manipulate...

Bytebot: Automating Desktop Tasks in Linux Containers with Natural Language
Bytebot is an open source, self-hosted AI desktop agent that runs in a containerized Linux environment and automates computer tasks through natural language commands. It mimics the way a human operates a computer, using the keyboard, mouse, and screen to perform tasks such as web browsing, data processing, file management, etc. Bytebot emphasizes privacy and can...

Browserfly: the smart plugin that lets AI automate browsers
Browserfly is an AI-powered browser plugin that runs directly in the user's existing browser. It allows AI to manipulate web pages like a human through natural language commands for tasks such as searching, organizing information or managing tabs. No virtual machine or additional browser is required, it can be used on Chrome or Edge after installation.Browse...

Eigent: an open source desktop application for automated multi-intelligence collaboration
Eigent is the world's first multi-intelligence collaborative desktop application, based on the CAMEL-AI open source project, designed to help users build and manage teams of AI intelligences and automate complex tasks. It supports local deployment and cloud operation, and offers highly customizable tool integration and data privacy protection.Eigent performs tasks in parallel by...

CopyCat: AI tool for automating browser tasks
CopyCat is an AI-powered browser automation tool designed to help businesses and individuals simplify repetitive web tasks. It allows users to create automated workflows without writing code by combining intelligent browser agents and deterministic operations.CopyCat supports handling complex web operations such as filling out forms, crawling data, or navigating web...

NeuralAgent: an AI intelligence that uses speech and text to operate a computer to accomplish tasks
NeuralAgent is an open source AI intelligent body tool that runs on the user's local computer. It accomplishes various tasks by simulating human actions such as clicking, typing, scrolling and navigating the application. Users simply give commands in natural language and NeuralAgent automatically executes them, such as filling out a form, sending an email, or searching for information...

Gabriel Operator: the AI assistant that transforms browsers into smart workspaces
Gabriel Operator is a tool that transforms the browser into an intelligent workspace. It helps users automate tasks, provide assistance and adapt to different work scenarios through AI browser agents. Users can use AI functions directly in the browser to simplify daily operations and increase productivity. The website has a simple design, functionality...

Magentic-UI: An Intelligent Agent Tool to Support User Collaboration on Web Tasks
Magentic-UI is an open source intelligent agent tool developed by Microsoft Research, designed to help users accomplish complex web tasks through collaboration. It is based on the AutoGen framework and combines a multi-agent system to provide a transparent and controlled user experience.Magentic-UI not only automates web browsing and code execution, but also manages...

OpenDia: An Open Source Tool to Connect Browsers to AI Models
OpenDia is an open source project that aims to seamlessly connect AI models to browsers through the Model Context Protocol (MCP) protocol. Users can install the OpenDia extension on browsers such as Chrome, Firefox, etc., and combine it with the locally running MCP...

Omni-Bot-SDK-OSS: A Visual Recognition-based Automation Framework for WeChat RPA
Omni-Bot-SDK-OSS is an open source WeChat automation framework based on visual recognition technology that supports WeChat version 4.0 RPA (Robot Process Automation) operations. It achieves zero runtime intrusion through custom YOLO models and OCR technology, suitable for developers to build automation tasks. Users can dynamically access plug-ins to adapt o...

Simular Browser: an AI browser that intelligently automates web operations
Simular Browser is an artificial intelligence-based browser designed to automate web operations and help users complete repetitive tasks efficiently. It automates web browsing, form filling, and data capture through natural language commands without the need to write complex code.Simular Browser supports cross-platform use and integrates intelligent generation...

Simular Pro: an AI intelligence that uses voice and text to operate computers to accomplish automated tasks
Simular Pro is an AI intelligence based on a neural-symbolic framework designed for macOS (Apple Silicon). It automates complex desktop tasks through natural language commands that mimic human behavior in operating a computer, such as clicking, typing, and scrolling. The product emphasizes transparent execution, and users can check and modify each step at any time...
Top