Installation steps
- Cloning Project Warehouse:
git clone https://github.com/yangb05/PengChengStarling cd PengChengStarling
- Install the dependencies:
pip install -r requirements.txt export PYTHONPATH=/tmp/PengChengStarling:$PYTHONPATH
Usage Process
Data preparation
运行prepare.py脚本适配数据格式,生成data.list文件和相关特征。
model training
- 配置config_train目录中的参数
- 执行训练命令
./train.sh
speech recognition
- 准备待识别音频文件
- (of a computer) run
./eval.sh
启动推理
微调建议
如需在特定领域提升识别准确率:
- 准备领域相关数据
- fulfillment
./train.sh --finetune
fine tune - 微调后模型可直接部署使用
This answer comes from the articlePengChengStarling: Smaller and Faster Multilingual Speech-to-Text Tool than Whisper-Large v3The