Activity
Mon
Wed
Fri
Sun
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Jan
Feb
What is this?
Less
More

Memberships

Open Source Voice AI Community

827 members โ€ข Free

5 contributions to Open Source Voice AI Community
I cooked up a raw Voice AI orchestration engine from scratch using ๐—Ÿ๐—ถ๐˜ƒ๐—ฒ๐—ž๐—ถ๐˜ & ๐—ฃ๐˜†๐˜๐—ต๐—ผ๐—ป. ๐Ÿณ
While wrappers are great for MVPs, building your own orchestration layer gives you ๐—ณ๐˜‚๐—น๐—น ๐—ผ๐˜„๐—ป๐—ฒ๐—ฟ๐˜€๐—ต๐—ถ๐—ฝ, ๐˜€๐—ถ๐—ด๐—ป๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐—ป๐˜๐—น๐˜† ๐—น๐—ผ๐˜„๐—ฒ๐—ฟ ๐—ฐ๐—ผ๐˜€๐˜๐˜€, ๐—ฎ๐—ป๐—ฑ ๐—ด๐—ฟ๐—ฎ๐—ป๐˜‚๐—น๐—ฎ๐—ฟ ๐—ฐ๐—ผ๐—ป๐˜๐—ฟ๐—ผ๐—น over the entire conversational pipeline. I designed this engine to fully replace third-party wrappers like Vapi & Retell AI. Here is a deep dive into whatโ€™s under the hood: ๐Ÿ”„ ๐——๐˜†๐—ป๐—ฎ๐—บ๐—ถ๐—ฐ ๐—”๐—ด๐—ฒ๐—ป๐˜ ๐—–๐—ผ๐—ป๐—ณ๐—ถ๐—ด๐˜‚๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป (๐—ฅ๐—ฒ๐—ฎ๐—น-๐—ง๐—ถ๐—บ๐—ฒ ๐—›๐˜†๐—ฑ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป) Hardcoding agents is a trap. I implemented a system that executes an API call upon call initialization. โ€ข ๐—›๐—ผ๐˜-๐—ฆ๐˜„๐—ฎ๐—ฝ๐—ฝ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—ฃ๐—ฒ๐—ฟ๐˜€๐—ผ๐—ป๐—ฎ๐˜€: A single engine instance can instantly apply unique System Prompts, Voice IDs, and Temperature settings based on backend parameters. โ€ข ๐—ฅ๐—ฒ๐˜€๐˜‚๐—น๐˜: You can power thousands of unique agents (e.g., specific to different businesses) without ever redeploying the core code or creating a new instance. ๐Ÿ› ๏ธ ๐—–๐—ผ๐—ป๐˜๐—ฒ๐˜…๐˜-๐—”๐˜„๐—ฎ๐—ฟ๐—ฒ ๐—™๐˜‚๐—ป๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ฅ๐—ผ๐˜‚๐˜๐—ฒ๐—ฟ When building raw infrastructure, manually mapping tools to agents is a major architectural hassle. I built specialized helper logic for ๐——๐˜†๐—ป๐—ฎ๐—บ๐—ถ๐—ฐ ๐—ง๐—ผ๐—ผ๐—น ๐—œ๐—ป๐—ท๐—ฒ๐—ฐ๐˜๐—ถ๐—ผ๐—ป to solve this. โ€ข ๐— ๐—ผ๐—ฑ๐˜‚๐—น๐—ฎ๐—ฟ ๐—Ÿ๐—ผ๐—ด๐—ถ๐—ฐ: The router decouples the orchestration engine from business logic. It parses the backend setup and assignsย onlyย the specific tools defined in that agent's configuration (e.g., loading "Appointment Booking" tools only when the specific use-case demands it). ๐Ÿ’พ ๐——๐—ฎ๐˜๐—ฎ ๐—ฃ๐—ฒ๐—ฟ๐˜€๐—ถ๐˜€๐˜๐—ฒ๐—ป๐—ฐ๐—ฒ & ๐—ฃ๐—ผ๐˜€๐˜-๐—–๐—ฎ๐—น๐—น ๐—œ๐—ป๐˜๐—ฒ๐—น๐—น๐—ถ๐—ด๐—ฒ๐—ป๐—ฐ๐—ฒ Logs aren't enough. I built a save_conversation function that aggregates the full session payload and triggers intelligent sub-functions immediately after the call: โ€ข ๐—–๐—ฎ๐—น๐—น ๐—ฆ๐˜‚๐—บ๐—บ๐—ฎ๐—ฟ๐˜†: Generates a natural language recap via LLM. โ€ข ๐—–๐—ฎ๐—น๐—น ๐—˜๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฎ๐˜๐—ถ๐—ผ๐—ป: Structurally classifies the outcome (e.g., "Booked", "Inquiry", "Failed"). โ€ข ๐—ง๐—ฒ๐—น๐—ฒ๐—บ๐—ฒ๐˜๐—ฟ๐˜†: Captures precise Token Usage (for billing) and Latency statistics alongside the transcript. ๐Ÿ›ก๏ธ ๐—ฃ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—š๐˜‚๐—ฎ๐—ฟ๐—ฑ๐—ฟ๐—ฎ๐—ถ๐—น๐˜€ To prevent runaway costs and "zombie" connections, I engineered active background monitors: โ€ข ๐—œ๐—ป๐—ฎ๐—ฐ๐˜๐—ถ๐˜ƒ๐—ถ๐˜๐˜† ๐— ๐—ผ๐—ป๐—ถ๐˜๐—ผ๐—ฟ: Detects silence (30s default) and gracefully terminates the session.
0 likes โ€ข 16d
๐—ฉ๐—ผ๐—ถ๐—ฐ๐—ฒ ๐—”๐—œ
Want to Host a Live Session?
Iโ€™m planning a few LiveKit and Pipecat live sessions over the next weeks, and Iโ€™d love to open them up for community contributions. If youโ€™re interested in hosting a session or sharing your expertise, feel free to DM me. Here are some topic ideas to spark inspiration: - Latency optimization: Strategies to achieve sub-600 ms latency - Interruption handling - Industry-specific use cases: Real estate, dental, medical, HVAC, restaurants, hotels, etc. - Integrations with niche software that rarely gets covered: ServiceTitan, FieldEdge, and Housecall Pro for HVAC and home services; Dentrix, Open Dental, and Eaglesoft for dental practices ; Epic, Athenahealth, and Oracle Health for medical practices; Toast, Aloha, and Square for restaurants; Opera PMS, Cloudbeds, and Mews for hotels; Buildium, Propertyware, and Follow Up Boss for real estate and property management. - Telephony provider integrations beyond Twilio: Telnyx, Zadarma, RingCentral, Ringba - Custom PBX integrations - Multilingual implementations: Best voices for specific languages, best transcription models for specific languages and best prompting strategies, deep dives into a single language - Non-technical Voice AI topics: Project management, hiring and evaluating developers, marketing Voice AI products and services, finding clients, content creation, proposal & contract creation, compliance Youโ€™re also welcome to use your session to showcase your own product or service, as long as it aligns with the theme of open-source Voice AI.
1 like โ€ข Dec '25
what's the plan on this ?
๐Ÿš€ Scale Your AI Voice Agent by Launching a Full-Stack SaaS Platform
Youโ€™re probably still building AI voice agents manually for clients. Thatโ€™s not a scalable business. Itโ€™s time to take your AI agency to the next level by launching your own full-stack SaaS platform โ€” just like I did. The good news: Iโ€™ve already built the entire AI voice agent SaaS platform, so you donโ€™t have to. Hereโ€™s whatโ€™s inside the codebase: ๐Ÿ—ฃ AI Voice Agent โ€” powered by Vapi & LiveKitโš™ Configurations โ€” prompts, models, knowledge base, and more ๐Ÿ“ Call Logs ๐Ÿ“ˆ Analytics Dashboard ๐Ÿ” Sign in / Sign up โ€” Google Authentication ready ๐Ÿ’ณ Payment Collection โ€” via Stripe ๐Ÿข Multi-Tenant Architecture โ€” one user can be linked to multiple organizations ๐Ÿ“ž SIP Connection โ€” integrated with Telnyx โ›” Rate Limiter โ€” manage usage efficiently And everything else you need to launch a production-ready SaaS platform. ๐Ÿ’ฌ Comment below or DM me if youโ€™re interested in using or acquiring the full codebase for your own AI voice agent business.
๐Ÿš€ Scale Your AI Voice Agent by Launching a Full-Stack SaaS Platform
0 likes โ€ข Nov '25
Interested @Jin Park
Voice AI without pipecat or livekit
Has any one built an voice bot without pipecat and livekit would love to connect with them I have built it and facing some issues in latency part My tech stack is openai llm , Deepgram for ASR and Azure for TTS and one telephony vendor which connects everything
1 like โ€ข Nov '25
@Mohammad Mussab am also built the same things with open source things available in the market but without pipecat and live kit . i can customize anything and everything
0 likes โ€ข Nov '25
@Johann Tagle yes well put . That's why i haven't used any library to get things done
Welcome to the Open Source Voice AI Community!
Hey everyone, Thank you so much for your patience while we got this community ready to launch. Itโ€™s finally happening! ๐ŸŽ‰ Iโ€™ve put together a short video explaining why I started this group and what itโ€™s all about. Iโ€™m really excited to meet all of you โ€” passionate, like-minded people working in the voice AI space. Our first meetup is next Friday, and itโ€™ll be all about getting to know each other, hearing about your voice AI projects, and understanding what youโ€™d like to learn on here. In the meantime, letโ€™s start with introductions right here under this post ๐Ÿ‘‡ Please share: - Who you are - What youโ€™re building or working on - What youโ€™d love to learn or explore within this community Canโ€™t wait to see what everyoneโ€™s up to!
Welcome to the Open Source Voice AI Community!
0 likes โ€ข Nov '25
Hello everyone , myself prasanna kumar have been working in voice AI before the LLM era have certain questions to ask and i have not used livekit or pipecat for building the solutions have used langchain , ASR and TTS models and integrated with telephony vendor that's how my solutions is implemented
1-5 of 5
Prasanna Kumar
2
13points to level up
@prasanna-kumar-4285
Conversational AI

Active 8h ago
Joined Nov 7, 2025
India