How can pronunciation errors be avoided when Spark-TTS reads mixed-language text?

2025-08-30


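Technical approaches for handling mixed-language text

When the text mixes languages, for example Chinese and English, apply the following strategies: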

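Language tagging: mark each language span with XML tags:
<lang code='en'>Hello</lang> 世界
Automatic detection: integrate the langid.py library to identify the language of each segment (classify() returns a single label for the whole string it is given):
import langid
langid.classify('你好world')  # returns a (language, score) tuple, e.g. ('en', -54.4)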
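Because a detector labels a whole string at once, mixed text has to be cut into single-language segments before tagging or detection. The following is a minimal sketch of that step, assuming a simple script-based heuristic: runs of CJK characters are treated as Chinese and everything else as English. The function names split_by_script and tag_segments are hypothetical helpers, not part of Spark-TTS or langid.py, and the heuristic does not distinguish between other Latin-script languages.

```python
import re

# Runs of CJK ideographs, CJK punctuation, and fullwidth forms.
_CJK_RUN = re.compile(r'[\u4e00-\u9fff\u3000-\u303f\uff00-\uffef]+')

def split_by_script(text):
    """Split text into (lang, segment) pairs: 'zh' for CJK runs, 'en' otherwise.

    Heuristic assumption: any non-CJK run (letters, digits, ASCII
    punctuation) is labeled 'en'.
    """
    segments = []
    pos = 0
    for match in _CJK_RUN.finditer(text):
        if match.start() > pos:
            segments.append(('en', text[pos:match.start()]))
        segments.append(('zh', match.group()))
        pos = match.end()
    if pos < len(text):
        segments.append(('en', text[pos:]))
    # Drop segments that contain only whitespace.
    return [(lang, seg) for lang, seg in segments if seg.strip()]

def tag_segments(text, default_lang='zh'):
    """Wrap every segment that is not in default_lang in a <lang> tag."""
    parts = []
    for lang, seg in split_by_script(text):
        if lang == default_lang:
            parts.append(seg)
        else:
            parts.append(f"<lang code='{lang}'>{seg}</lang>")
    return ''.join(parts)

if __name__ == '__main__':
    print(split_by_script('你好world'))
    print(tag_segments('你好world'))
```

Where the non-Chinese parts may be in several languages, each 'en' segment could instead be passed to langid.classify to pick the tag code, at the cost of less reliable results on very short segments.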

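Quick fallback:
1. Split the text into single-language segments at punctuation marks
2. Synthesize each segment separately, then merge the audio with FFmpeg:
ffmpeg -i ch.wav -i en.wav -filter_complex concat=n=2:v=0:a=1 output.wav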
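With more than two segments, the FFmpeg command is easier to generate than to type by hand. The sketch below builds the same concat filter for any number of input files and runs it through subprocess. The helper names build_concat_command and concat_audio are hypothetical, the file names are the ones from the command above, and the sketch assumes ffmpeg is available on the PATH.

```python
import subprocess

def build_concat_command(inputs, output):
    """Build an ffmpeg argument list that joins audio files in the given order."""
    if len(inputs) < 2:
        raise ValueError("need at least two input files")
    cmd = ["ffmpeg"]
    for path in inputs:
        cmd += ["-i", path]
    # Name each input's audio stream explicitly, e.g. [0:a][1:a].
    streams = "".join(f"[{i}:a]" for i in range(len(inputs)))
    filter_graph = f"{streams}concat=n={len(inputs)}:v=0:a=1[out]"
    cmd += ["-filter_complex", filter_graph, "-map", "[out]", output]
    return cmd

def concat_audio(inputs, output):
    """Run ffmpeg; raises CalledProcessError if ffmpeg exits with an error."""
    subprocess.run(build_concat_command(inputs, output), check=True)

if __name__ == "__main__":
    # Segments must be listed in the order they appear in the original text.
    print(" ".join(build_concat_command(["ch.wav", "en.wav"], "output.wav")))
```

Passing the arguments as a list avoids shell quoting problems with the bracketed stream labels.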
