How does UniAPI's model selection mechanism work?

2025-08-28

1.3 K

UniAPI's model selection mechanism is the core function of its intelligent routing and operates as follows:

Assessment of indicators: The system tracks two key indicators:
1. API request success rate within 72 hours
2. First Token Response Time (First Token Latency)
dynamic selectionWhen a request is received, the system will automatically select the current service provider with the best performance based on a comprehensive evaluation of the above indicators.
Real-time adjustments: The selection algorithm continually updates evaluation data as API calls are made, ensuring that routing decisions are always based on the most up-to-date scenarios
fault tolerance: When a service has a problem, the mechanism will automatically lower its priority to avoid affecting the overall quality of service.

The advantage of this mechanism is:

Developers do not need to manually intervene in model selection
The system is able to adapt to changes in the performance of services over time.
Automatically provide the best backup solution in the event of service fluctuations from the vendor
Particularly suitable for applications requiring stability and responsiveness