WaterCrawl recommends using Docker for rapid deployment, here is the standard deployment process:
- environmental preparation: Ensure that Docker and Docker Compose are installed
- Get Code: Cloning repositories via git
git clone https://github.com/watercrawl/watercrawl.git
- Configuration environment: Copy and edit the .env file to configure the database and storage parameters
- Starting services: Implementation
docker compose up -d
Launch Container - Verify Installation: Access
http://localhost
Confirmation of service uptime
For production environment deployment, you need to pay special attention to modify the MINIO related configuration items to ensure that the file storage service is running normally. Detailed deployment instructions can refer to the DEPLOYMENT.md document in the project.
This answer comes from the articleWaterCrawl: transforming web content into data usable for large modelsThe