
Step3 employs a Mixture-of-Experts (MoE) architecture to optimize inference speed

2025-08-19


Step3 uses a Mixture-of-Experts (MoE) architecture that significantly improves inference speed, making it suitable for real-time applications. Because an MoE model activates only a subset of its expert sub-networks for each token, it allocates compute more efficiently and lowers hardware requirements while maintaining performance. Developers can adjust parameters such as max_new_tokens (recommended values: 512 to 32768) to control the output length and meet the needs of different application scenarios.
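As a rough illustration of tuning the output length per scenario, the sketch below keeps max_new_tokens inside the recommended 512 to 32768 range. The helper names and the scenario budgets are hypothetical and chosen only for this example; they are not part of Step3.

```python
# Minimal sketch: pick an output-length budget per scenario and keep it
# inside the recommended range for max_new_tokens (512 to 32768).
# All names and per-scenario budgets here are illustrative assumptions.

RECOMMENDED_MIN = 512
RECOMMENDED_MAX = 32768

# Hypothetical budgets for a few application scenarios.
SCENARIO_BUDGETS = {
    "chat": 512,          # short, low-latency replies
    "summary": 2048,      # medium-length output
    "long_form": 32768,   # longest output in the recommended range
}


def clamp_max_new_tokens(requested: int) -> int:
    """Clamp a requested token budget into the recommended range."""
    if requested < 1:
        raise ValueError("max_new_tokens must be a positive integer")
    return max(RECOMMENDED_MIN, min(requested, RECOMMENDED_MAX))


def generation_kwargs(scenario: str) -> dict:
    """Build generation keyword arguments for a named scenario.

    Unknown scenarios fall back to the lower bound of the range.
    """
    requested = SCENARIO_BUDGETS.get(scenario, RECOMMENDED_MIN)
    return {"max_new_tokens": clamp_max_new_tokens(requested)}


if __name__ == "__main__":
    for name in ("chat", "summary", "long_form"):
        print(name, generation_kwargs(name))
```

If the model is served through a library whose generation call accepts max_new_tokens (Hugging Face Transformers' generate() does), the resulting dictionary can be unpacked into that call. Smaller budgets generally shorten response time, which matters most for real-time use.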

This answer comes from the article "Step3: An open-source large model for efficient multimodal content generation".
