OmniAvatar is an open-source project jointly developed by Zhejiang University and Alibaba that generates full-body avatar videos from audio input. Built on deep learning, it takes user-provided audio and text prompts and produces highly natural-looking animation of a virtual character.
Its core features include:
- Audio-driven video generation: automatically generates full-body animation of the avatar based on the input audio, ensuring that the lip movements are highly synchronized with the audio.
- Text prompt control: supports controlling the avatar's emotions, actions, and background environment through text commands.
- Multi-language support: lip synchronization in 31 languages, including Chinese, English, and Japanese.
- Full-body movement coordination: generates natural shoulder movements, gesture rhythms, and other full-body animation.
- Scene interaction: avatars can interact with objects in the scene.
- Multi-resolution output: currently supports 480p video generation.
This answer comes from the article "OmniAvatar: Generating Audio-Driven Full-Body Avatar Videos".