Open-Sora 2.0 indeed represents a major breakthrough in open source video generation. According to the official data, the difference between this version and OpenAI Sora in the VBench review is only 0.69%, with performance close to that of commercial-grade models such as HunyuanVideo 11B and Step-Video 30B. This achievement was realized with only $200,000 in training costs, highlighting its algorithmic efficiency advantage.
The technical advantages of this project are mainly reflected in three aspects: firstly, the inference speed, 256×256 resolution video on H100 GPU takes only 60 seconds to generate; secondly, it supports up to 768p resolution; and lastly, it realizes the efficient use of hardware resources through xformers and flash-attn acceleration libraries. These technological advances make Open-Sora 2.0 the optimal solution for current open source video generation.
This answer comes from the articleOpen Sora: An Open Source Video Generation Tool for Optimizing Face ConsistencyThe