environmental preparation
The following preparations are required to install the VLM-R1:
- Python environment: It is recommended to create a virtual environment for Python 3.10 using Anaconda
- cloning project: Use git clone to get the VLM-R1 code repository.
- Installation of dependencies: The project provides a setup.sh script to automatically install the required dependencies.
Detailed installation steps
- Create a specialized environment via conda: conda create -n vlm-r1 python=3.10
- Activation environment: conda activate vlm-r1
- Clone the project: git clone https://github.com/om-ai-lab/VLM-R1.git
- Go to the project directory: cd VLM-R1
- Run the setup script: bash setup.sh
The setup.sh script automatically installs the following core dependencies:
- PyTorch deep learning framework
- Transformers Library
- Other necessary Python libraries
caveat
NVIDIA GPU is recommended for optimal performance. If CPU mode is used, a performance bottleneck may be encountered.
This answer comes from the articleVLM-R1: A Visual Language Model for Localizing Image Targets through Natural LanguageThe