Activity
Mon
Wed
Fri
Sun
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Jan
Feb
What is this?
Less
More

Memberships

Open Source Voice AI Community

826 members โ€ข Free

AI Automation Security

65 members โ€ข Free

Voice AI Accelerator

7.3k members โ€ข Free

Learn Voice AI

366 members โ€ข Free

IA al telรฉfono

69 members โ€ข $39/m

AI Automation Society

254k members โ€ข Free

10 contributions to Open Source Voice AI Community
Open-sourced my site's voice AI demo
I built a voice AI assistant for my website using Pipecat and Gemini's native audio model. People kept calling it trying to reverse-engineer how it works, so I just open-sourced the whole thing. It's a good starting point if you want to build your own web-based voice AI demo with low latency and multilingual support. Repo: https://github.com/askjohngeorge/askjg-demo-gemini-pcc System prompt: https://github.com/askjohngeorge/askjg-demo-gemini-pcc/blob/main/bot/prompts/demo_system_prompt.md You can try the live demo at https://askjohngeorge.com/demo (click the mic). Happy to answer questions if you have any.
2 likes โ€ข 5d
@John George - that's amazing! Suddenly the demo I just finished building on LiveKit feels so ordinary. Back to the drawing board๐Ÿ™ˆ.
1 like โ€ข 5d
@John George I had to Google Kevin Gates - showing my age. It's brilliant - I just tried speaking to it Spanish: Accent from Andalucรญaโœ… Accent from Argentinaโœ… Then.. Accent from Manchester (Oasis style)โœ…
I cooked up a raw Voice AI orchestration engine from scratch using ๐—Ÿ๐—ถ๐˜ƒ๐—ฒ๐—ž๐—ถ๐˜ & ๐—ฃ๐˜†๐˜๐—ต๐—ผ๐—ป. ๐Ÿณ
While wrappers are great for MVPs, building your own orchestration layer gives you ๐—ณ๐˜‚๐—น๐—น ๐—ผ๐˜„๐—ป๐—ฒ๐—ฟ๐˜€๐—ต๐—ถ๐—ฝ, ๐˜€๐—ถ๐—ด๐—ป๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐—ป๐˜๐—น๐˜† ๐—น๐—ผ๐˜„๐—ฒ๐—ฟ ๐—ฐ๐—ผ๐˜€๐˜๐˜€, ๐—ฎ๐—ป๐—ฑ ๐—ด๐—ฟ๐—ฎ๐—ป๐˜‚๐—น๐—ฎ๐—ฟ ๐—ฐ๐—ผ๐—ป๐˜๐—ฟ๐—ผ๐—น over the entire conversational pipeline. I designed this engine to fully replace third-party wrappers like Vapi & Retell AI. Here is a deep dive into whatโ€™s under the hood: ๐Ÿ”„ ๐——๐˜†๐—ป๐—ฎ๐—บ๐—ถ๐—ฐ ๐—”๐—ด๐—ฒ๐—ป๐˜ ๐—–๐—ผ๐—ป๐—ณ๐—ถ๐—ด๐˜‚๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป (๐—ฅ๐—ฒ๐—ฎ๐—น-๐—ง๐—ถ๐—บ๐—ฒ ๐—›๐˜†๐—ฑ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป) Hardcoding agents is a trap. I implemented a system that executes an API call upon call initialization. โ€ข ๐—›๐—ผ๐˜-๐—ฆ๐˜„๐—ฎ๐—ฝ๐—ฝ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—ฃ๐—ฒ๐—ฟ๐˜€๐—ผ๐—ป๐—ฎ๐˜€: A single engine instance can instantly apply unique System Prompts, Voice IDs, and Temperature settings based on backend parameters. โ€ข ๐—ฅ๐—ฒ๐˜€๐˜‚๐—น๐˜: You can power thousands of unique agents (e.g., specific to different businesses) without ever redeploying the core code or creating a new instance. ๐Ÿ› ๏ธ ๐—–๐—ผ๐—ป๐˜๐—ฒ๐˜…๐˜-๐—”๐˜„๐—ฎ๐—ฟ๐—ฒ ๐—™๐˜‚๐—ป๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ฅ๐—ผ๐˜‚๐˜๐—ฒ๐—ฟ When building raw infrastructure, manually mapping tools to agents is a major architectural hassle. I built specialized helper logic for ๐——๐˜†๐—ป๐—ฎ๐—บ๐—ถ๐—ฐ ๐—ง๐—ผ๐—ผ๐—น ๐—œ๐—ป๐—ท๐—ฒ๐—ฐ๐˜๐—ถ๐—ผ๐—ป to solve this. โ€ข ๐— ๐—ผ๐—ฑ๐˜‚๐—น๐—ฎ๐—ฟ ๐—Ÿ๐—ผ๐—ด๐—ถ๐—ฐ: The router decouples the orchestration engine from business logic. It parses the backend setup and assignsย onlyย the specific tools defined in that agent's configuration (e.g., loading "Appointment Booking" tools only when the specific use-case demands it). ๐Ÿ’พ ๐——๐—ฎ๐˜๐—ฎ ๐—ฃ๐—ฒ๐—ฟ๐˜€๐—ถ๐˜€๐˜๐—ฒ๐—ป๐—ฐ๐—ฒ & ๐—ฃ๐—ผ๐˜€๐˜-๐—–๐—ฎ๐—น๐—น ๐—œ๐—ป๐˜๐—ฒ๐—น๐—น๐—ถ๐—ด๐—ฒ๐—ป๐—ฐ๐—ฒ Logs aren't enough. I built a save_conversation function that aggregates the full session payload and triggers intelligent sub-functions immediately after the call: โ€ข ๐—–๐—ฎ๐—น๐—น ๐—ฆ๐˜‚๐—บ๐—บ๐—ฎ๐—ฟ๐˜†: Generates a natural language recap via LLM. โ€ข ๐—–๐—ฎ๐—น๐—น ๐—˜๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฎ๐˜๐—ถ๐—ผ๐—ป: Structurally classifies the outcome (e.g., "Booked", "Inquiry", "Failed"). โ€ข ๐—ง๐—ฒ๐—น๐—ฒ๐—บ๐—ฒ๐˜๐—ฟ๐˜†: Captures precise Token Usage (for billing) and Latency statistics alongside the transcript. ๐Ÿ›ก๏ธ ๐—ฃ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—š๐˜‚๐—ฎ๐—ฟ๐—ฑ๐—ฟ๐—ฎ๐—ถ๐—น๐˜€ To prevent runaway costs and "zombie" connections, I engineered active background monitors: โ€ข ๐—œ๐—ป๐—ฎ๐—ฐ๐˜๐—ถ๐˜ƒ๐—ถ๐˜๐˜† ๐— ๐—ผ๐—ป๐—ถ๐˜๐—ผ๐—ฟ: Detects silence (30s default) and gracefully terminates the session.
0 likes โ€ข 12d
Hi @Jin Park , thanks for sharing. I particularly like this: ->๐Ÿ› ๏ธ ๐—–๐—ผ๐—ป๐˜๐—ฒ๐˜…๐˜-๐—”๐˜„๐—ฎ๐—ฟ๐—ฒ ๐—™๐˜‚๐—ป๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ฅ๐—ผ๐˜‚๐˜๐—ฒ๐—ฟ - that make this agent truly scalable.
Pipecat vs Livekit
Might be a silly one but Is one better than the other .? 1.In terms of latency and 2.agent orchestration ?
3 likes โ€ข Jan 2
There was a discussion on this a couple of months ago if it helps: https://www.skool.com/open-source-voice-ai-community-6088/pipecat-vs-livekit
Best Observability Tools for Voice AI Frameworks?
What observability tools are others using with Pipecat or similar voice AI frameworks? I've built a production voice agent using Pipecat and currently track basic metrics (call duration, sentiment, summary, transcripts) in a custom dashboard. Tomorrow it's going in production so problem I think I can face is When errors will occur, debugging is painful. My current logging approach creates massive log files that are nearly impossible to analyze efficiently when tracking down issues.
1 like โ€ข Nov '25
@Mohammad Mussab I've just started using Logfire from Pydantic (https://pydantic.dev/logfire). I'm working on a whatsapp chatbot at the moment so not tested it with voice but it's so handy to quickly identify errors through their logs. Will be checking out Whisker though for sure.
Just sayin
Honestly, I've been building voice AI agents with Ultravox AI and the help of Claude. I understand that this community is really about LiveKit and Pipecat and open source, but Ultravox is also open source. I've had a lot of success using Ultravox AI. I've had some success with LiveKit and Pipecat, but I've had the most success with Ultravox AI. I think it's undervalued and overlooked as a source for open source AI agents and building them.oh, andโ€ฆClaude is king!
1 like โ€ข Nov '25
@Robert Figueroa Loving Claude Code too - (except the token limits - $20 to $100 is a big jump!).
1 like โ€ข Nov '25
@Robert Figueroa Not using much in the way of sub-agents yet. But I did introduce Serena recently which apparently helps reduce token use significantly.
1-10 of 10
Dan Quixote
2
3points to level up
@dan-quixote-8098
Madrid based

Active 4h ago
Joined Nov 8, 2025
Spain / UK