xiaozhi-esp32-server: Build Your Own ESP32 Device Control Server with AI

Summary
The `xiaozhi-esp32-server` project is the backend service for `xiaozhi-esp32`. It lets users set up their own ESP32 device control server with integrated AI capabilities, including large language models, visual perception, and voiceprint recognition. It supports MQTT+UDP and WebSocket communication and can be deployed in either a simplified or a full-featured configuration.
Introduction
xiaozhi-esp32-server is an open-source backend service designed to complement the xiaozhi-esp32 hardware project. Developed by Professor Siyuan Liu's team at South China University of Technology, it lets you build an ESP32 device control server with integrated AI functionality. It supports the MQTT+UDP and WebSocket protocols, MCP access points, and voiceprint recognition, which makes it suitable for building intelligent human-machine interaction systems.
Installation
Setting up your xiaozhi-esp32-server is straightforward, with options tailored to different needs. The project provides two primary deployment methods:
- Simplified Installation: Ideal for lower-spec environments, focusing on intelligent dialogue, IoT, MCP, and visual perception, with data stored in configuration files.
- Full Module Installation: Offers a complete experience, including intelligent dialogue, IoT, MCP access points, voiceprint recognition, visual perception, OTA, and a smart control console, storing data in a database.
Both methods support Docker and local source code deployment. For step-by-step instructions, refer to the Deployment Documentation in the project's README; video tutorials are also available in the official repository.
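The choice between the two installation options can be sketched as a small lookup. This is only an illustration and is not part of the project: the function name `choose_installation` and the feature labels are hypothetical, taken literally from the two lists above, and the sketch assumes the full installation covers everything the simplified one does.

```python
# Hypothetical helper, not part of xiaozhi-esp32-server.
# Feature labels are copied from the two installation options listed above.
SIMPLIFIED_FEATURES = frozenset({
    "intelligent dialogue", "iot", "mcp", "visual perception",
})
# Features that appear only in the Full Module Installation list.
FULL_ONLY_FEATURES = frozenset({
    "mcp access points", "voiceprint recognition", "ota",
    "smart control console",
})
# Assumption: the full installation is a superset of the simplified one.
FULL_FEATURES = SIMPLIFIED_FEATURES | FULL_ONLY_FEATURES


def choose_installation(required):
    """Return 'simplified' or 'full' for an iterable of feature names."""
    wanted = {name.strip().lower() for name in required}
    unknown = wanted - FULL_FEATURES
    if unknown:
        raise ValueError(f"unknown feature(s): {sorted(unknown)}")
    # The simplified option is enough only if every requested feature is in it.
    return "simplified" if wanted <= SIMPLIFIED_FEATURES else "full"


if __name__ == "__main__":
    print(choose_installation(["Intelligent dialogue", "IoT"]))
    print(choose_installation(["IoT", "Voiceprint recognition"]))
```

In this sketch, requesting any feature that only the full list mentions (for example voiceprint recognition or OTA) selects the full installation; everything else fits the simplified one.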
Examples
The xiaozhi-esp32-server empowers your ESP32 devices with a range of intelligent capabilities. Here are some examples of what you can achieve:
- Responsive Voice Control: Experience rapid response times for voice commands.
- Smart Home Integration: Control home appliances, such as switches, with voice commands.
- Complex Task Execution: Handle multi-instruction tasks efficiently.
- Voiceprint Recognition: Identify different users by their voice for personalized interactions.
- Visual Perception: Utilize visual large language models for object recognition and other visual tasks.
- Media Playback: Play music and broadcast news through your ESP32 device.
- Real-time Interruption: Interrupt ongoing tasks or conversations seamlessly.
- Multi-language Support: Interact using various languages, including Cantonese.
Numerous video demonstrations showcasing these features are available in the project's README.
Why Use It
Choosing xiaozhi-esp32-server provides several compelling advantages for developers and IoT enthusiasts:
- Customizable Backend: Host your own ESP32 control server, gaining full control over your smart devices and data.
- Advanced AI Integration: Use AI models for ASR (Automatic Speech Recognition), TTS (Text-to-Speech), LLM (Large Language Models), and VLLM (Visual Large Language Models), with both free and streaming configurations supported.
- Comprehensive Feature Set: From core communication protocols to voiceprint recognition, memory systems, and a plugin architecture, it offers a rich set of functionalities.
- Open-Source Ecosystem: Benefit from an active community and a growing ecosystem of related projects, fostering collaboration and innovation.
- Flexible Deployment: Choose between Docker or local source deployment, with options for simplified or full-featured setups to match your resource availability and project scope.
Links
- GitHub Repository: https://github.com/xinnan-tech/xiaozhi-esp32-server
- Related ESP32 Hardware Project: xiaozhi-esp32
- Deployment Documentation: Deployment Docs
- Frequently Asked Questions: FAQ
- Performance Research: xiaozhi-performance-research