Show HN: EchoKit – An open-source, ESP32-based AI voice agent with a Rust server

instructables.com

1 point by 3Sophons 2 months ago · 0 comments · 1 min read

Have you ever wanted a voice AI agent that you could fully control, customize, and understand from the ground up, rather than a black box?

EchoKit is a DIY, open-source voice agent running on an ESP32-S3. The fun part is the server backend, which I wrote entirely in Rust to handle the AI pipeline (ASR, LLM, TTS).

The stack is:

Hardware: EchoKit board (ESP32-S3)

Firmware: ESP-IDF

Server: Rust (Actix Web/Tungstenite)

AI: Customizable pipeline. The tutorial uses Groq-hosted models for all three stages (Whisper for ASR, Llama 3 for the LLM, and a TTS model), which keeps the full ASR->LLM->TTS round trip to a few seconds in most cases.
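To make the ASR->LLM->TTS round trip concrete, here is a minimal sketch of how a swappable three-stage pipeline could be shaped in Rust. This is not taken from echokit_server: the trait names (`Asr`, `Llm`, `Tts`), the `Pipeline` struct, and the stub stages are all hypothetical, and the stubs only stand in for real model calls.

```rust
// Hypothetical sketch, not the echokit_server code: one trait per stage,
// so any stage can be swapped for a different model or provider.
trait Asr {
    fn transcribe(&self, audio: &[i16]) -> String;
}

trait Llm {
    fn reply(&self, system_prompt: &str, user_text: &str) -> String;
}

trait Tts {
    fn synthesize(&self, text: &str) -> Vec<i16>;
}

struct Pipeline<A: Asr, L: Llm, T: Tts> {
    asr: A,
    llm: L,
    tts: T,
    system_prompt: String,
}

impl<A: Asr, L: Llm, T: Tts> Pipeline<A, L, T> {
    /// Audio in, audio out: speech -> text -> answer -> speech.
    fn round_trip(&self, audio: &[i16]) -> Vec<i16> {
        let text = self.asr.transcribe(audio);
        let answer = self.llm.reply(&self.system_prompt, &text);
        self.tts.synthesize(&answer)
    }
}

// Stub stages (assumptions for illustration). A real stage would call a model here.
struct StubAsr;
impl Asr for StubAsr {
    fn transcribe(&self, audio: &[i16]) -> String {
        format!("{} samples", audio.len())
    }
}

struct StubLlm;
impl Llm for StubLlm {
    fn reply(&self, system_prompt: &str, user_text: &str) -> String {
        format!("[{}] heard: {}", system_prompt, user_text)
    }
}

struct StubTts;
impl Tts for StubTts {
    // Encodes each byte of the text as one fake audio sample.
    fn synthesize(&self, text: &str) -> Vec<i16> {
        text.bytes().map(|b| b as i16).collect()
    }
}
```

With this shape, swapping a model means providing another type that implements the same trait, and changing the system prompt is a single field on `Pipeline`.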

It's designed to be easy for makers, students, or anyone curious about AI to build in just a few minutes. You can modify the system prompts, swap out models, or even add custom actions (Step 6 in the guide).
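As a rough illustration of what a custom action hook might look like, the sketch below maps action names to Rust closures. It is an assumption for discussion only and does not describe how Step 6 of the guide or echokit_server implements actions; `ActionRegistry`, `register`, and `dispatch` are hypothetical names.

```rust
use std::collections::HashMap;

// Hypothetical: an action takes a text argument and returns a text result.
type Action = Box<dyn Fn(&str) -> String>;

struct ActionRegistry {
    actions: HashMap<String, Action>,
}

impl ActionRegistry {
    fn new() -> Self {
        Self { actions: HashMap::new() }
    }

    /// Add an action under a name, replacing any earlier one with that name.
    fn register(&mut self, name: &str, action: Action) {
        self.actions.insert(name.to_string(), action);
    }

    /// Run the named action, or return None so the caller can fall back
    /// to a normal spoken reply.
    fn dispatch(&self, name: &str, argument: &str) -> Option<String> {
        self.actions.get(name).map(|action| action(argument))
    }
}
```

A caller would register something like `"set_led"` with a closure and then call `dispatch("set_led", "blue")` when the model's output names that action.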

The tutorial (linked) walks through assembly, flashing, and setting up the server. The server code is on GitHub (also linked).

Happy to answer any questions. Would love to hear your thoughts and what you think we could build with this!

Server Repo: https://github.com/second-state/echokit_server
