Preparation of the basic environment
- Python 3.10运行环境
- Git Version Control Tools
- 推荐NVIDIA GPU(16GB+显存)
Step-by-step installation guide
- Creating a Virtual Environment::
conda create -n tpo python=3.10
- cloning project::
git clone https://github.com/Airmomo/tpo-llm-webui.git
- Installation of dependencies::
pip install -r requirements.txt
- 配置TextGrad(核心优化引擎):
cd textgrad-main pip install -e .
模型配置要点
- 基础模型需从Hugging Face下载(如DeepSeek-R1系列)
- 奖励模型建议选择与任务匹配的专业评估模型
- 通过config.yaml文件设置模型路径
This answer comes from the articleTPO-LLM-WebUI: An AI framework where you can input questions to train a model to output results in real timeThe