Youtu-agent: a framework for AI intelligences that operate computers to automate tasks
Youtu-agent is a powerful and cleanly designed AI Intelligent Body framework developed by Tencent Youtu Lab. It is specifically designed for building, running, and evaluating autonomous AI intelligences, with the core feature of fully embracing open source models and achieving excellent performance without relying on any closed-source big models. The framework has been ...
Ninja AI: Automating Browser Tasks with AI Intelligentsia
Ninja AI is an Artificial Intelligence (AI) intelligence that runs in a user's browser and is used to automate various online tasks. The tool is designed with the goal of acting like a ninja, silently handling tasks that need to be done in the browser and are repetitive or time-consuming. Users can...
Asteroid AI: Artificial Intelligence Browser Intelligence for Business Process Automation
Asteroid AI is an Artificial Intelligence browser automation platform that allows users to quickly build "browser intelligences" to automate repetitive web page operations instead of humans. The tool can be used by both technical developers and non-technical business people. Users visualize the interface...
AutoGLM: Using voice and text to operate intelligences to complete automated computer and cell phone operations
AutoGLM is an AI intelligent body application developed by ZhipuAI (ZhipuAI). It is not a simple chatbot, but an executive assistant that can actually operate. Users can let AutoGLM autonomously complete various tasks on a virtual computer or cell phone in the cloud through simple natural language commands...
Bytebot: Automating Desktop Tasks in Linux Containers with Natural Language
Bytebot is an open source, self-hosted AI desktop agent that runs in a containerized Linux environment and automates computer tasks through natural language commands. It simulates the way a human operates a computer, using the keyboard, mouse and screen to perform tasks such as web browsing, data processing, file management, etc.Byte...
Browserfly: the smart plugin that lets AI automate browsers
Browserfly is an AI-powered browser plugin that runs directly in the user's existing browser. It allows AI to manipulate web pages like a human through natural language commands for tasks such as searching, organizing information or managing tabs. No need for a virtual machine or additional browser, it installs on Chrome or Edge...
Eigent: an open source desktop application for automated multi-intelligence collaboration
Eigent is the world's first multi-intelligence collaborative desktop application, developed based on the CAMEL-AI open source project, designed to help users build and manage teams of AI intelligences and automate complex tasks. It supports local deployment and cloud operation, providing highly customizable tool integration and data privacy protection.Eigent...
CopyCat: AI tool for automating browser tasks
CopyCat is an AI-powered browser automation tool designed to help businesses and individuals simplify repetitive web tasks. It allows users to create automated workflows without writing code by combining intelligent browser agents and deterministic operations.CopyCat supports handling complex web page operations such as...
NeuralAgent: an AI intelligence that uses speech and text to operate a computer to accomplish tasks
NeuralAgent is an open source AI intelligent body tool that runs on the user's local computer. It accomplishes various tasks by simulating human actions such as clicking, typing, scrolling, and navigating the application. Users simply give commands in natural language and NeuralAgent automatically executes them, such as filling out forms, sending...
Gabriel Operator: the AI assistant that transforms browsers into smart workspaces
Gabriel Operator is a tool that transforms the browser into an intelligent workspace. It helps users automate tasks, provide assistance and adapt to different work scenarios through AI browser agents. Users can use AI features directly in the browser to simplify daily operations and increase productivity....
Magentic-UI: An Intelligent Agent Tool to Support User Collaboration on Web Tasks
Magentic-UI is an open source intelligent agent tool developed by Microsoft Research, designed to help users accomplish complex web tasks through collaboration. It is based on the AutoGen framework and combines a multi-agent system to provide a transparent and controlled user experience.Magentic-UI not only automates web browsing, code execution...
OpenDia: An Open Source Tool to Connect Browsers to AI Models
OpenDia is an open source project that aims to seamlessly connect AI models to browsers through the Model Context Protocol (MCP) protocol. Users can install the OpenDia extension on browsers such as Chrome, Firefox, etc., and combine it with the locally running MCP...
Omni-Bot-SDK-OSS: A Visual Recognition-based Automation Framework for WeChat RPA
Omni-Bot-SDK-OSS is an open source WeChat automation framework based on visual recognition technology that supports WeChat version 4.0 RPA (Robot Process Automation) operations. It is suitable for developers to build automation tasks by customizing the YOLO model and OCR technology to achieve zero runtime intrusion. Users can dynamically pick up...
Simular Browser: an AI browser that intelligently automates web operations
Simular Browser is an artificial intelligence-based browser designed to automate web operations and help users perform repetitive tasks efficiently. It uses natural language commands for web browsing, form filling, and data capture without the need to write complex code.Simular Browser is cross-platform...
Simular Pro: an AI intelligence that uses voice and text to operate computers to accomplish automated tasks
Simular Pro is an AI intelligence based on a neural-symbolic framework designed for macOS (Apple Silicon). It automates complex desktop tasks through natural language commands that mimic human behavior in operating a computer, such as clicking, typing, and scrolling. The product emphasizes transparent execution, and users can always check...
WebAgent: An Intelligent Web Information Search and Processing Tool
WebAgent is an open source project developed by Alibaba Tongyi Lab, focusing on intelligent web information search and processing. It consists of three main components: WebWalker, WebDancer and WebSailor.These tools utilize advanced language modeling and reinforcement learning techniques to help users high...
legacy-use: adding AI automation interfaces to legacy software without APIs
legacy-use is an open source tool whose core role is to provide a modern REST API interface to old, API-less desktop software (often called "legacy software"). It uses an AI intelligence to "observe" the software's graphical user interface (GUI) and simulate a human user's key...
BrowserOS: Open Source AI Smart Browser
BrowserOS is an open source AI smart browser, developed on Chromium and compatible with all Chrome extensions. It emphasizes privacy protection, and all data and AI models run locally, with users having the option of using their own API keys or local models such as Ollama.BrowserO...
Windows-MCP: Open Source Tool for Lightweight AI Control of Windows Systems
Windows-MCP is a lightweight open source project designed to allow AI agents to directly control the Windows operating system through a large-scale language model (LLM). It simplifies the setup process by eliminating the need to rely on traditional computer vision techniques or specific models. Users can use simple tools to realize keyboard and mouse operations as well as capture...
Top