Core Interaction Functions and Switching Mechanisms
The Xiaozhi client provides two core interaction functions:
- real time voice interactionSpeech recognition and synthesis via WebSocket technology, support echo cancellation, voice interruption (triggered by command words such as "stop") and ambient noise filtering.
- text dialog system: Asynchronous message processing mechanism to support multiple rounds of dialog and contextualization for complex issues
There are two types of switching:
- Proactive switching: The main interface displays both a microphone icon and an input box, allowing the user to choose how to interact at any time.
- hybrid model: After asking a question by voice, you can view a text reply or trigger a switch by using the voice command "switch to text input". The system remembers the user's preferred default interaction method in the settings.
This answer comes from the articleXiaozhi MCP Client: a cross-platform AI assistant supporting voice and text interactionThe