DeepSeek-TNG-R1T2-Chimera: Enhanced version of DeepSeek released by TNG, Germany
DeepSeek-TNG-R1T2-Chimera is an open source large language model developed by TNG Technology Consulting GmbH and hosted on the Hugging Face platform. The model was released on July 2, 2025, and is d...
ERNIE 4.5
ERNIE 4.5 is an open-source family of large models developed by Baidu on the PaddlePaddle framework, ranging from 0.3B to 424B parameters and supporting text processing, image generation and multimodal tasks. The project is hosted on GitHub, combined with Hugging Face to provide models ...
Hunyuan-A13B: Efficient Open Source Large Language Model with Ultra-Long Context and Intelligent Reasoning Support
Hunyuan-A13B is an open-source large language model developed by Tencent's Hunyuan team, built on a Mixture of Experts (MoE) architecture. The model has 80 billion parameters in total, of which 13 billion are active, balancing high performance against low computational cost. Hunyuan-A13B supports 256K ultra-long context processing, suitable for...
Qwen3 Released: A New Generation of Large Language Models for Thinking Deeply and Responding Fast
The field of large language models has a new member. The Qwen family of large language models recently released its latest version, Qwen3. According to the development team, its flagship model, Qwen3-235B-A22B, performs comparably to DeepSeek-R1, o1, and o3 on benchmarks of coding, math, and general-purpose...