How can I improve efficiency and quality when generating audiobooks with Auto-Audio-Book?

2025-08-28

1.6 K

Optimizing generation effects can be done in three ways:

Efficiency gains

Use multi-threaded acceleration: e.g.python app/createAudio.py --threads 20command to start 20 threads.
Distributed processing: measured 5 machines in parallel can process 2000 chapters in 5 hours.

Replacement of TTS engine: default CosyVoice2-0.5B has limited effect, better speech synthesis model can be integrated.
Manual review: bygui.pytool to check audio order and integrity.

Re-run in case of network outagegetZjList.pyCrawl through the missing chapters.
Consider using a proxy server to switch IPs when you encounter IP restrictions.

Note: In silico models have API call limitations, and large-scale generation requires reasonable planning of task scheduling.