
AI-Chatbox: Speech-to-Text Intelligent Dialogue Project based on ESP32S3
AI-Chatbox 是一个基于 ESP32S3 开发板的语音交互项目。用户通过语音与大模型(LLM)对话,设备将语音转为文字,发送给大模型,获取回答后可进一步转为语音播报。项目使用 Rust 语言开发,集成了 Vosk 语音识别工具,适合...

TEN: An open source tool for building real-time multimodal speech AI intelligences
TEN框架是一个开源的软件平台,专注于帮助开发者构建实时、多模态、低延迟的语音AI智能体。它支持多种编程语言,包括C、C++、Go、Python、JavaScript和TypeScript。开发者可以通过TEN框架快速创建具有语音、视觉和文...

Zaia Health: the AI voice assistant that monitors and improves health habits
Zaia Health是一款人工智能健康应用,它的核心是一个名为“Zaia”的语音助手。 这款应用旨在帮助用户关注并改善自己的健康习惯。 它通过语音交互的方式,像一个私人健康伴侣一样,引导用户在睡眠、锻炼、营养和心理健康等方面养成更规律的生...

wukong-robot: a smart speaker project to create personalized Chinese voice conversations
wukong-robot 是一个开源的中文语音对话机器人和智能音箱项目,旨在帮助开发者快速构建个性化的智能音箱。它支持中文语音识别、语音合成和多轮对话功能,集成了ChatGPT、百度、科大讯飞等技术。项目设计模块化,插件和功能可自由扩展,适...

RealtimeVoiceChat
RealtimeVoiceChat 是一个开源项目,专注于通过语音与人工智能进行实时、自然的对话。用户使用麦克风输入语音,系统通过浏览器捕获音频,快速转为文字,由大型语言模型(LLM)生成回复,再将文字转为语音输出,整个过程接近实时。项目采...

gibberlink: a demonstration project for efficient audio communication between two AI intelligences
gibberlink is an open source project on GitHub by developer PennyroyalTea that focuses on enabling communication optimization between two conversational AI intelligences. When two AI intelligences talk on the phone and recognize each other as AI, they switch from human language (English) to a...

OpenAI Realtime Agents
OpenAI Realtime Agents is an open source project that aims to show how OpenAI's real-time APIs can be utilized to build multi-intelligent body speech applications. It provides a high-level intelligent body model (borrowed from OpenAI Swarm) that allows developers to build complex multi-intelligent body speech systems in a short period of time. The project ...

Bailing
百聆(Bailing)是一个开源的语音对话助手,旨在通过语音与用户进行自然的对话。该项目结合了语音识别(ASR)、语音活动检测(VAD)、大语言模型(LLM)和语音合成(TTS)技术,实现了类似GPT-4o的语音对话机器人。百聆的端到端时延...

"Always-On" Deepseek AI Assistant: Building an Intelligent Voice Interaction System Based on Deepseek-V3
Always-On AI Assistant is an innovative AI assistant project that creates a powerful and permanently online AI assistant system by integrating advanced technologies such as Deepseek-V3, RealtimeSTT and Typer. The project is especially optimized for engineering development scenarios, providing a complete...

Xiaozhi AI Chatbot
小智 AI 聊天机器人是一个基于ESP32开发板的开源项目,旨在帮助用户构建自己的AI聊天伴侣。该项目由虾哥开发,主要用于教学目的,帮助更多人入门AI硬件开发,并了解如何将大语言模型应用到实际的硬件设备中。项目支持多种语言的语音识别和对话功...

Fish Agent
Fish Speech 衍生项目 Fish Agent 是一款革命性的端到端AI语音克隆系统,基于V0.1 3B模型架构开发。作为一个完全端到端的语音克隆处理系统,其最大特点是采用创新的无语义标记架构设计,无需依赖Whisper等传统语义编...

Ichigo (llama3-s)
Ichigo是一个开源的实时语音AI项目,旨在扩展基于文本的语言模型,使其具备原生的“听力”能力。该项目采用了早期融合技术,灵感来自Meta的Chameleon论文。Ichigo的目标是成为一个开源数据、开源权重的本地设备语音助手,类似于S...

Hume AI: Empowering AI with Emotion Recognition | Recognizing Emotional States from Sounds and Expressions | Generating Speech with Emotional States
Hume AI 是一家专注于情感智能的人工智能公司,致力于开发能够理解和响应人类情感的多模态AI技术。其旗舰产品同理心语音界面(EVI)能够通过语音、面部表情和语言等多种形式识别和回应用户的情感,提升人机交互的情感体验。Hume AI 的目...
Top