OmniParser V2.0 Feature Upgrades
The newly released OmniParser V2.0 is a significant upgrade over the original version, with the largest gains in icon detection and in predicting which elements are interactive. The new version uses improved computer vision algorithms to recognize icons in an interface more accurately and to predict their function and interactivity.
Key improvements in V2.0 include:
- A new Florence model for generating icon descriptions
- An improved neural network architecture that raises detection accuracy
- An interaction prediction success rate of over 95%
- Processing speed 40% higher than its predecessor's
These improvements make V2.0 better suited to automated testing, accessibility feature development, and similar scenarios. According to Microsoft's official test data, V2.0 parses complex interfaces in under 2 seconds on average.
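To make the idea of icon descriptions and interactivity predictions more concrete, the parsed result of a screenshot can be pictured as a list of structured elements. The following is only a minimal sketch in Python: the class `ParsedElement`, its field names, and the helper functions are hypothetical and chosen for illustration; they are not OmniParser's actual output schema or API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ParsedElement:
    """One UI element found in a screenshot (hypothetical schema, for illustration only)."""
    bbox: Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), normalized to [0, 1]
    description: str    # text description of the icon, e.g. from a captioning model
    interactable: bool  # predicted: can the user click or type into this element?
    confidence: float   # detection confidence in [0, 1]


def interactive_elements(elements: List[ParsedElement],
                         min_confidence: float = 0.5) -> List[ParsedElement]:
    """Keep only elements predicted to be interactive and detected with enough confidence."""
    return [e for e in elements if e.interactable and e.confidence >= min_confidence]


def center(element: ParsedElement) -> Tuple[float, float]:
    """Return the center point of the bounding box, a natural click target."""
    x_min, y_min, x_max, y_max = element.bbox
    return ((x_min + x_max) / 2, (y_min + y_max) / 2)


# Made-up example data standing in for a parsed screenshot.
elements = [
    ParsedElement((0.25, 0.5, 0.75, 1.0), "Save button", True, 0.9),
    ParsedElement((0.0, 0.0, 0.5, 0.25), "Application logo", False, 0.95),
    ParsedElement((0.5, 0.0, 1.0, 0.25), "Search field", True, 0.3),
]

for e in interactive_elements(elements):
    print(e.description, center(e))
```

A downstream agent or test script could pass the filtered list to a large model as text and use the center point of the chosen element as the click location; the confidence threshold here is an arbitrary illustrative value.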
This answer comes from the article "OmniParser: user interface screenshots parsed into structured elements for easy understanding and manipulation by large models".