Overseas access: www.kdjingpai.com
Bookmark Us
Current Position:fig. beginning " AI Answers

What is the Dolphin speech recognition model and what are its main developers and features?

2025-08-25 1.5 K

Dolphin is an open source speech recognition model jointly developed by DataoceanAI and Tsinghua University, focusing on versatile processing of Asian languages. Its core features include:

  • Wide language coverageSupport for 40 Asian languages and 22 Chinese dialects, covering East Asia, South Asia, Southeast Asia and the Middle East.
  • multitasking: Integrated speech-to-text (ASR), voice activity detection (VAD), audio segmentation, and language identification (LID) capabilities
  • Strong data base: trained on over 210,000 hours of proprietary and publicly available audio data
  • Architecture Innovation: A hybrid CTC-Attention architecture is used, with the encoder using E-Branchformer and the decoder using Transformer
  • Dual-layer marking system: Precise differentiation of dialect variants by (e.g. )

The project has been open-sourced in GitHub, providing two model specifications, base (140M parameters) and small (372M parameters), taking into account the processing speed and recognition accuracy requirements.

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Top

en_USEnglish