SpatialLM: Sweep the room, AI automatically draws 3D models for you
SpatialLM is a large language model designed specifically for processing three-dimensional (3D) point cloud data. Its core function is to understand unstructured 3D geometric data and transform it into structured 3D scene representations. These structured outputs contain architectural elements (e.g., walls, doors, windows) as well as objects with orientation...
Baichuan-M2: A Large Language Model for Augmented Reasoning in Healthcare
Baichuan-M2 is an open source large language model with 32 billion (32B) parameters from Baichuan Intelligence. The model focuses on the medical domain and is designed to handle real-world medical reasoning tasks. It is based on the Qwen2.5-32B model, which was developed by introducing an innovative "Large Validator System" (L...
Genie 3: Generating virtual worlds that can be interacted with in real time
Genie 3 is a generalized world model (world model) released by Google DeepMind that represents the latest advancement in AI for simulating and creating virtual environments. The model's most central feature is that it can generate a diverse and dynamic world that supports real-time interaction based solely on a textual description...
HRM: Hierarchical Reasoning Model for Complex Reasoning
HRM (Hierarchical Reasoning Model) is a hierarchical reasoning model with only 27 million parameters designed to solve complex reasoning tasks in the field of artificial intelligence. The design of the model is inspired by the hierarchical, multi-timescale information processing of the human brain. It is implemented through a high-level module (negative .....
Seed Diffusion: Validating High-Speed Language Models for Next-Generation Architectures
Seed Diffusion is an experimental language model, launched by the ByteDance Seed team in conjunction with the Academy of Intelligent Industry Research (AIR) at Tsinghua University. This website is a technology demonstration platform for the model. The model is based on the discrete diffusion technique, and the main goal is to explore the underlying framework of the next generation language model that can be...
HunyuanWorld-1.0: Generating Interactive 360° 3D Worlds from Text or Images
HunyuanWorld-1.0 is an open source project developed by Tencent's Hunyuan team, aiming to generate interactive 360° 3D worlds through text descriptions or single images. It uses panoramic agent generation , semantic layering and hierarchical 3D reconstruction techniques to generate high-quality , explorable 3D scenes . The project is based on the Flux framework...
Qwen3-MT: An Intelligent Translation Tool Supporting 92 Languages
Qwen3-MT is an intelligent translation tool developed by Alibaba Cloud Qwen team, based on the powerful Qwen3 Big Language Model. It supports translation of 92 languages and major dialects, covering more than 95% of the global population. Users can experience its efficient translation via Qwen API or online demo page ....
OpenMed: an open source platform for free AI models in healthcare
OpenMed is an open source AI modeling platform dedicated to healthcare and life sciences, hosted on Hugging Face.It offers over 380 free Named Entity Recognition (NER) models focused on extracting key information such as drugs, diseases, genes, and anatomical structures from clinical texts and research literature....
Seed-X-7B: Efficient Multilingual Translation of Large Models
Seed-X-7B is an open source multi-language translation large language model developed by the Seed team of ByteDance, focusing on providing efficient and accurate translation functions. It is based on the Mistral architecture with 7B parameters and supports translation in 28 languages, covering a wide range of fields such as Internet, technology, e-commerce, biomedicine, etc....
Qwen3-Coder: open source code generation and intelligent programming assistant
Qwen3-Coder is an open source family of large-scale language models developed by the Alibaba Cloud Qwen team, focusing on code generation and intelligent programming. Its core product is Qwen3-Coder-480B-A35B-Instruct, a Hybrid Model of Expertise (MoE) with 48 billion parameters, activated...
EduChat: Open Source Education Dialogue Model
EduChat is an open source educational dialog model developed by the ICALK team at East China Normal University. It focuses on educational scenarios, supports conversations in English and Chinese, and aims to provide intelligent conversation tools for students, teachers and researchers. The model is based on open source frameworks such as LLaMA, Qwen, etc., and through a large number of education field numbers...
MedGemma: a collection of open source AI models for medical text and image understanding
MedGemma is a set of open source AI models released by Google on the Hugging Face platform, focusing on text and image understanding in the medical field. It is based on the Gemma 3 model and is designed to help developers build healthcare-related AI applications.MedGemma offers a variety of model variations...
Jan-nano: a lightweight and efficient model for text generation
Jan-nano is a 4 billion parameter language model optimized for the Qwen3 architecture, developed by Menlo Research and hosted on the Hugging Face platform. It is designed for efficient text generation, combining small size and long context processing capabilities for local or embedded environments. The model supports...
Zerank-1: A reordering model for improving the precision of search results
Zerank-1 is a state-of-the-art reranker model developed by ZeroEntropy. It plays a key role as a "second filter" in information retrieval or semantic search systems. First, a preliminary retrieval system (e.g., vector search) will quickly find a large number of documents from a ...
Windsurf SWE-1
SWE-1: A New Generation of Cutting-Edge Models for Software Engineering Recently, the much-anticipated SWE-1 family of models was released. Designed to optimize the entire software engineering process, the SWE-1 family of models goes far beyond the traditional task of writing code. Currently, the SWE-1 family consists of three well-positioned models:...
LaWGPT
LaWGPT is an open source project supported by the Machine Learning and Data Mining Research Group of Nanjing University, which is dedicated to building a large language model based on Chinese legal knowledge. It extends the proprietary word lists in the legal domain based on generalized Chinese models (such as Chinese-LLaMA and ChatGLM), and through large-scale...
Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice
Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translators, Hibiki is able to generate natural speech translations in the target language and provide text translations in real-time while the user is speaking. The model adopts a multi-stream architecture, capable of simultaneously processing the input language...
Top