Installation and operational requirements
IndexTTS currently requires a local build environment, the main installation steps are as follows:
- Preparing the base environment: Python 3.8+, Git version control tools
- hardware requirement: NVIDIA graphics card (GPU) recommended for accelerated processing, CUDA support
- Download Code: Fetch project source code via git clone
- Installation of dependencies: includes libraries such as PyTorch, NumPy and Torchaudio
- Getting the model: You need to contact the officials by email to get the pre-training model files.
Minimum Configuration Recommendations
- CPU version: at least 8GB of RAM, but slower processing speeds
- GPU version: 4GB or more of video memory recommended
- Storage space: Enough space should be reserved to store the model files (usually several hundred MB to several GB).
Note: 完整功能需要联系xuanwu@bilibili.com获取预训练模型.
This answer comes from the articleIndexTTS: Text-to-Speech Tool with Chinese-English Mixing SupportThe































