Suna is an open source, general-purpose AI agent developed by Kortix AI. It is hosted on GitHub under the Apache 2.0 license, so users can download, modify, and self-host it for free. It helps users with complex tasks such as web browsing, file management, data crawling and website ..... through natural language conversations.
Strawberry is a smart browser with a built-in AI assistant designed to help users automate their daily tasks and improve efficiency. Unlike traditional browsers, it integrates AI technology to understand web content in real-time and perform complex tasks such as quick research, content writing and data organization. Users can simply...
Fellou, from Fellou AI, is billed as the world's first AI-enabled, action-oriented browser. In addition to the web browsing functions of a traditional browser, it automates tasks and performs in-depth information searches using AI technology. The core of Fellou is the "Deep Actions" technology, which transforms complex operations into simple finger...
AiPy is an open source Python command-line tool developed by the Knownsec team. It combines a large language model (LLM) with the Python runtime environment, allowing users to automatically generate and run Python code by describing tasks in natural language. AiPy is suitable for data engineers, programmers, and people who need to quickly...
DroidRun is an open source tool that lets AI operate an Android phone like a human. It helps AI automate tasks such as opening apps, sending messages, or browsing the web by extracting interactive elements such as on-screen buttons, input boxes, etc. DroidRun combines visual parsing and UI structure analysis to operate fine...
Agent S is an open source framework developed by Simular AI that lets agents operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and experience-based learning techniques to perform tasks such as browsing the web, editing documents, and using software. The project is open source on GitHub and developed...
Libra is a tool from Greenbit.ai whose core function is to generate locally running AI agents through natural language conversations. Called the "Vibe Agent", it allows users to quickly create their own agents by describing their needs in simple terms, and perform web search, data analysis, visualization, and more...
Optexity is an open source project on GitHub, developed by the Optexity team. Its core idea is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project consists of three code libraries: ComputerGYM, AgentAI and Playwright...
RunRabbit is an AI-based tool that allows users to control their browsers to accomplish various tasks through simple voice or text commands. Its best feature is that it understands the user's needs and then automatically manipulates web pages, such as searching for information, filling out forms or performing repetitive tasks. The website was developed by a company off...
LangGraph CUA is an open source project developed by the LangChain team. It is based on the LangGraph framework and allows developers to use Python to build AI agents that can directly operate computers. The core of this tool is "Computer Use Agent" (CU...
Agent TARS is a multimodal AI agent open-sourced by ByteDance. It helps users complete complex computer tasks by visually understanding web content and combining that understanding with command line and file system operations. Instead of requiring manual operations like traditional tools, it automates browser tasks, editing...
Playwright MCP is an open source tool developed by Microsoft and hosted on GitHub. It allows artificial intelligence models to directly control browsers through the Model Context Protocol (MCP), performing actions such as opening web pages, clicking on elements, and entering text. The tool is based on Pl...
Airtop is an AI-based browser automation tool. Through simple natural language commands, users can control cloud browsers to perform complex web operations such as logging into a website, crawling data, or running automation tasks. It solves the problem of complex and error-prone traditional scripting...
BrowserAgent is a tool that creates and runs AI workflows directly in the browser. It is easy to use and requires no coding: the user simply describes the desired workflow and the AI generates it automatically. Its core feature is complete privacy: all data is handled in your browser, no need to worry about privacy...
Highlight AI is a desktop AI assistant for Windows and macOS (mobile version in development) that helps users quickly complete tasks in any app through voice commands and screen content analysis. It captures screen content, generates code, answers questions, and works with GitHub, Notion, ....
autoMate is a local automation tool developed and open-sourced by yuruotong1 on GitHub, with AI+RPA (Artificial Intelligence + Robotic Process Automation) as its core feature. It combines the language understanding of large language models with the process execution capabilities of RPA, and users only need to describe tasks in natural language ....
Nanobrowser is an open source Chrome extension designed to automate web tasks through an AI-powered multi-agent system. It is a free alternative to OpenAI Operator: users simply provide their own LLM (large language model) API key. It supports OpenAI...
Proxy Lite is an open source, lightweight web automation tool developed by Convergence AI as an open-weights mini-version of Proxy. It is based on a 3B-parameter vision-language model (VLM) and can autonomously navigate the web and perform tasks such as finding information or manipulating browsers. .....
Rabbit Android Agent is an AI agent developed by Rabbit, designed to help users complete single- or multi-step tasks on their Android devices through voice and text commands. The technology is based on Rabbit's LAM (Large Action Model...