Okteto AI uses an intelligent GPU allocation mechanism to solve resource wastage:
- dynamic allocation: in
okteto.yml
Declare the GPU requirements (e.g.gpu: 1
), the platform only allocates resources during training - auto-recovery: After the training is complete run
okteto down
Immediate GPU release to avoid idle consumption - Resource Monitoring: Admin Dashboard displays real-time GPU utilization, temperatures, and more!
- cost control: Enterprise Edition supports setting GPU usage quotas and alert rules
Tests show that the method saves 471 TP3T in cloud computing costs compared to the traditional fixed allocation method.
This answer comes from the articleOkteto AI: A Cloud Development Platform for Running Trusted AI AgentsThe