FantasyTalking is an open source project developed by the Fantasy-AMAP team, focusing on generating realism talking portrait videos through audio-driven generation. The project is based on the advanced video diffusion model Wan2.1 , combined with the audio encoder Wav2Vec and proprietary model weights , using artificial intelligence techniques to achieve high ...
ChatAnyone is an innovative project developed by the HumanAIGC team. It utilizes artificial intelligence techniques to generate digital human portrait videos with upper body movements from a single photo and audio input. The project is based on a hierarchical motion diffusion model that generates head movements, gestures and expressions suitable for presenting virtual...
VirtualWife is an open source virtual digital person project created by developer yakami129. It is currently in the incubation stage, and its goal is to create a virtual character with a "soul" that users can interact with like a friend. The project supports B-station live streaming, and can communicate with users through Chinese voice and text....
Tavus is a developer platform focused on human-AI interactions, providing easy-to-use APIs that allow developers to build AI agents with visual, voice, and emotional intelligence. Its core product, Conversational Video Interface (CVI), mimics the human brain...
HeyGem is a fully offline video compositing tool designed for Windows systems, developed by the GuijiAI (Silicon Intelligence) team and open-sourced on GitHub. It utilizes advanced AI algorithms to accurately clone the user's appearance and voice to generate realistic avatars, and supports text or voice driven...
AI Studios is an online AI video generation platform developed by DeepBrain AI to help users quickly create high-quality video content by simply entering text. Without the need for complex software or specialized skills, users can leverage its AI technology to transform text, documents, or web links into videos with virtual...
LiteAvatar is an open source tool developed by the HumanAIGC team (part of Ali) that focuses on generating facial animations from audio-driven 2D avatars in real-time. It runs at 30 frames per second (fps) relying only on the CPU, and is especially suited for scenarios that require low power consumption, such as real-time 2D video chat...
Yuanzhen Digital People is a leading AIGC (Artificial Intelligence Generated Content) platform dedicated to providing users with one-stop services such as digital people live broadcasting, short video production and AI assistant. The platform integrates AI algorithm synthesis and GPT-style big models, supports users to create exclusive Q&A models, and provides real-time voice-driven, Chinese-to...
Digital Man Generation System is a website that provides free digital man generation service. The site supports sound cloning, sound reproduction, digital person image template, digital split cloning, video watermark removal and other functions, aiming to provide users with efficient and convenient digital person generation solutions. Users can upload audio text...
SadTalker-Video-Lip-Sync is a video lip-synthesis tool based on the SadTalkers implementation. The project generates lip shapes through voice-driven generation and uses configurable facial region enhancement to improve the clarity of the generated lip shapes. The project also uses the DAIN frame interpolation algorithm to complement the generated video...
Linly-Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates multiple technologies such as Whisper, Linly, Microsoft Speech Services, and Sad...
Humva is an innovative AI video generation tool designed to create professional or customized digital avatar videos by providing a user-friendly solution. The platform utilizes generative AI and advanced lip-sync technology to provide free customized video spokespersons for social media content, product introductions, customer testimonials, and more....
Rapport Cloud is a cloud-based platform focused on creating and deploying interactive digital characters powered by artificial intelligence. Developed by the team at Speech Graphics, the platform utilizes its award-winning audio-driven facial animation technology, which is widely used in the AAA game publishing industry.Rapport Cloud works through detailed...
MetaWorld AI (open source version) is a project hosted on GitHub, developed by the libn-net team. It can clone digital human images and voices through AI technology to generate short videos, and also supports dubbing and subtitling. The tool is available as a Windows installer, a Web version, an H5 version, and an applet version. .....
Dreamface is a powerful AI tool designed to help users easily create high-quality videos and images. With simple operations, users can generate personalized animated avatar videos, repair old photos, remove photo backgrounds, and more. The site offers a variety of AI-driven features that make video and image...
Gan.AI is a company dedicated to providing video personalization solutions through artificial intelligence technology. The platform allows users to quickly generate high-quality video content without the need for a camera or filming crew.Gan.AI's main products include video personalization, avatar generation and customization, voice-overs, and pairs of...
Hello everyone, today I'm sharing a digital people maker tool with you! It is easy to use and supports batch processing. (Integration package at the end of the article to take their own) I believe we have learned something about the technology of digital people, before the fire Guo Degang speak English, Russian beauty speak Chinese, etc. are the embodiment of digital people technology. Digital People ...
LiveTalking is an open source real-time interactive digital human system, dedicated to building high-quality digital human live solution. The project uses the Apache 2.0 open source protocol and integrates a number of cutting-edge technologies , including ER-NeRF rendering , real-time audio and video stream processing , lip synchronization and so on. The system supports real-time digital human ...
JoyGen is an innovative two-stage talking face video generation framework focused on solving the problem of audio-driven facial expression generation. Developed by a team from Jingdong Technology, the project uses advanced 3D reconstruction techniques and audio feature extraction methods to accurately capture the identity features and expression coefficients of the speaker and realize high...