The core features of AI-Chatbox mainly include:
- Wake to Voice and Command Recognition: Supports recording triggered by the wake-up word "hi, Loxin" and the command word "I have a question".
- speech-to-text: Convert recorded WAV audio to text using Vosk tool, supports Chinese recognition.
- Large Model Interaction: Send text questions via the DeepSeek API to get intelligent answers.
- Logging: Provide real-time log view function, convenient for debugging and monitoring equipment status.
- cross-device access: Build a REST service via Flask that allows other devices on the LAN to call the speech-to-text function.
- Embedded Optimization: Rust code optimized for the ESP32S3 development board, balancing performance and resource usage.
This answer comes from the articleAI-Chatbox: Speech-to-Text Intelligent Dialogue Project based on ESP32S3The