
SpatialLM: Scan a room and AI automatically draws the 3D model for you
SpatialLM is a large language model designed specifically for processing three-dimensional (3D) point cloud data. Its core function is to understand unstructured 3D geometric data and transform it into structured 3D scene representations. These structured outputs contain architectural elements (e.g., walls, doors, windows) as well as object bounding boxes with orientation and their semantic categories. Unlike many of the needs ...

Baichuan-M2: A Large Language Model for Augmented Reasoning in Healthcare
Baichuan-M2 is an open source large language model with 32 billion (32B) parameters from Baichuan Intelligence. The model focuses on the medical domain and is designed to handle real-world medical reasoning tasks. It is based on the Qwen2.5-32B model, which was developed by introducing an innovative “Large ...

Genie 3: Generating virtual worlds that can be interacted with in real time
Genie 3 is a general-purpose world model released by Google DeepMind, representing its latest advance in simulating and creating virtual environments. Its core feature is that it can generate diverse, dynamic worlds that support real-time interaction from a text description alone. Users can use this...

HRM: Hierarchical Reasoning Model for Complex Reasoning
HRM (Hierarchical Reasoning Model) is a model with only 27 million parameters, designed to solve complex reasoning tasks in artificial intelligence. Its design is inspired by the hierarchical, multi-timescale information processing of the human brain. It does this through a high-level module (responsible for easing ...

Seed Diffusion: Validating High-Speed Language Models for Next-Generation Architectures
Seed Diffusion is an experimental language model launched by the ByteDance Seed team together with the Institute for AI Industry Research (AIR) at Tsinghua University. This website is a technology demonstration platform for the model. The model is based on discrete diffusion, and its main goal is to explore the feasibility of a foundational framework for next-generation language models. It is in code generation this...

HunyuanWorld-1.0: Generating Interactive 360° 3D Worlds from Text or Images
HunyuanWorld-1.0 is an open source project developed by Tencent's Hunyuan team that aims to generate interactive 360° 3D worlds from text descriptions or single images. It uses panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to produce high-quality, explorable 3D scenes. The project is based on the Flux framework and supports interaction with ...

Qwen3-MT: An Intelligent Translation Tool Supporting 92 Languages
Qwen3-MT is an intelligent translation tool developed by the Alibaba Cloud Qwen team, based on the Qwen3 large language model. It supports translation across 92 languages and major dialects, covering more than 95% of the global population. Users can try its translation capabilities through the Qwen API or the online demo page...

OpenMed: an open source platform for free AI models in healthcare
OpenMed is an open source AI modeling platform dedicated to healthcare and life sciences, hosted on Hugging Face. It offers over 380 free Named Entity Recognition (NER) models focused on extracting key information such as drugs, diseases, genes, and anatomical structures from clinical texts and research literature. These models are all based...

Seed-X-7B: An Efficient Large Model for Multilingual Translation
Seed-X-7B is an open source multilingual translation large language model developed by ByteDance's Seed team, focused on efficient and accurate translation. It is based on the 7B-parameter Mistral architecture and supports translation across 28 languages, covering a wide range of domains such as the Internet, technology, e-commerce, and biomedicine. The model works by pre...

Qwen3-Coder: open source code generation and intelligent programming assistant
Qwen3-Coder is an open source family of large language models developed by the Alibaba Cloud Qwen team, focusing on code generation and intelligent programming. Its flagship model is Qwen3-Coder-480B-A35B-Instruct, a Mixture-of-Experts (MoE) model with 480 billion parameters, activated...

EduChat: Open Source Education Dialogue Model
EduChat is an open source educational dialogue model developed by the ICALK team at East China Normal University. It focuses on educational scenarios, supports conversations in English and Chinese, and aims to provide intelligent conversation tools for students, teachers, and researchers. The model is built on open source base models such as LLaMA and Qwen, fine-tuned on a large amount of education-domain data, and has the ability to handle...

MedGemma: a collection of open source AI models for medical text and image understanding
MedGemma is a set of open source AI models released by Google on the Hugging Face platform, focusing on text and image understanding in the medical field. It is based on the Gemma 3 model and is designed to help developers build healthcare-related AI applications. MedGemma offers a variety of model variations...

Jan-nano: a lightweight and efficient model for text generation
Jan-nano is a 4-billion-parameter language model built on the Qwen3 architecture, developed by Menlo Research and hosted on the Hugging Face platform. It is designed for efficient text generation, combining a small size with long-context processing for local or embedded environments. The model supports...

Zerank-1: A reranking model for improving the precision of search results
Zerank-1 is a state-of-the-art reranker model developed by ZeroEntropy. It plays a key role as a “second filter” in information retrieval or semantic search systems. First, a preliminary retrieval system (e.g., vector search) will quickly identify a set of possible...

Windsurf SWE-1
SWE-1: A New Generation of Cutting-Edge Models for Software Engineering
Recently, the much-anticipated SWE-1 family of models was released. Designed to optimize the entire software engineering process, the SWE-1 family of models goes far beyond the traditional task of writing code. Currently, the SWE-1 family consists of three well-positioned models: SWE-1: This main...

LaWGPT
LaWGPT is an open source project supported by the Machine Learning and Data Mining Research Group of Nanjing University, dedicated to building a large language model grounded in Chinese legal knowledge. It extends general-purpose Chinese base models (such as Chinese-LLaMA and ChatGLM) with a legal-domain vocabulary, and is pre-trained on a large-scale legal corpus...

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice
Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki can generate a natural spoken translation in the target language in real time while the user is speaking, and also provides a text translation. The model adopts a multi-stream architecture, capable of simultaneously processing the input speech stream and generating the target language...