User
Write something
Pinned
Want to Host a Live Session?
I’m planning a few LiveKit and Pipecat live sessions over the next weeks, and I’d love to open them up for community contributions. If you’re interested in hosting a session or sharing your expertise, feel free to DM me. Here are some topic ideas to spark inspiration: - Latency optimization: Strategies to achieve sub-600 ms latency - Interruption handling - Industry-specific use cases: Real estate, dental, medical, HVAC, restaurants, hotels, etc. - Integrations with niche software that rarely gets covered: ServiceTitan, FieldEdge, and Housecall Pro for HVAC and home services; Dentrix, Open Dental, and Eaglesoft for dental practices ; Epic, Athenahealth, and Oracle Health for medical practices; Toast, Aloha, and Square for restaurants; Opera PMS, Cloudbeds, and Mews for hotels; Buildium, Propertyware, and Follow Up Boss for real estate and property management. - Telephony provider integrations beyond Twilio: Telnyx, Zadarma, RingCentral, Ringba - Custom PBX integrations - Multilingual implementations: Best voices for specific languages, best transcription models for specific languages and best prompting strategies, deep dives into a single language - Non-technical Voice AI topics: Project management, hiring and evaluating developers, marketing Voice AI products and services, finding clients, content creation, proposal & contract creation, compliance You’re also welcome to use your session to showcase your own product or service, as long as it aligns with the theme of open-source Voice AI.
Pinned
Best Time for Live Calls?
We are an international group with members across multiple time zones, which makes it challenging to find a time that works for everyone. I’d still like to identify a slot that most of you are likely to attend. Please mark your available times on this calendar. The tool will then suggest the best options. No login is required (but recommended, so you can make changes later): https://community-scheduler.com/#/event/3a8549cf-8c82-40e6-855b-1a2fed0afe20
I cooked up a raw Voice AI orchestration engine from scratch using 𝗟𝗶𝘃𝗲𝗞𝗶𝘁 & 𝗣𝘆𝘁𝗵𝗼𝗻. 🍳
While wrappers are great for MVPs, building your own orchestration layer gives you 𝗳𝘂𝗹𝗹 𝗼𝘄𝗻𝗲𝗿𝘀𝗵𝗶𝗽, 𝘀𝗶𝗴𝗻𝗶𝗳𝗶𝗰𝗮𝗻𝘁𝗹𝘆 𝗹𝗼𝘄𝗲𝗿 𝗰𝗼𝘀𝘁𝘀, 𝗮𝗻𝗱 𝗴𝗿𝗮𝗻𝘂𝗹𝗮𝗿 𝗰𝗼𝗻𝘁𝗿𝗼𝗹 over the entire conversational pipeline. I designed this engine to fully replace third-party wrappers like Vapi & Retell AI. Here is a deep dive into what’s under the hood: 🔄 𝗗𝘆𝗻𝗮𝗺𝗶𝗰 𝗔𝗴𝗲𝗻𝘁 𝗖𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗶𝗼𝗻 (𝗥𝗲𝗮𝗹-𝗧𝗶𝗺𝗲 𝗛𝘆𝗱𝗿𝗮𝘁𝗶𝗼𝗻) Hardcoding agents is a trap. I implemented a system that executes an API call upon call initialization. • 𝗛𝗼𝘁-𝗦𝘄𝗮𝗽𝗽𝗮𝗯𝗹𝗲 𝗣𝗲𝗿𝘀𝗼𝗻𝗮𝘀: A single engine instance can instantly apply unique System Prompts, Voice IDs, and Temperature settings based on backend parameters. • 𝗥𝗲𝘀𝘂𝗹𝘁: You can power thousands of unique agents (e.g., specific to different businesses) without ever redeploying the core code or creating a new instance. 🛠️ 𝗖𝗼𝗻𝘁𝗲𝘅𝘁-𝗔𝘄𝗮𝗿𝗲 𝗙𝘂𝗻𝗰𝘁𝗶𝗼𝗻 𝗥𝗼𝘂𝘁𝗲𝗿 When building raw infrastructure, manually mapping tools to agents is a major architectural hassle. I built specialized helper logic for 𝗗𝘆𝗻𝗮𝗺𝗶𝗰 𝗧𝗼𝗼𝗹 𝗜𝗻𝗷𝗲𝗰𝘁𝗶𝗼𝗻 to solve this. • 𝗠𝗼𝗱𝘂𝗹𝗮𝗿 𝗟𝗼𝗴𝗶𝗰: The router decouples the orchestration engine from business logic. It parses the backend setup and assigns only the specific tools defined in that agent's configuration (e.g., loading "Appointment Booking" tools only when the specific use-case demands it). 💾 𝗗𝗮𝘁𝗮 𝗣𝗲𝗿𝘀𝗶𝘀𝘁𝗲𝗻𝗰𝗲 & 𝗣𝗼𝘀𝘁-𝗖𝗮𝗹𝗹 𝗜𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲 Logs aren't enough. I built a save_conversation function that aggregates the full session payload and triggers intelligent sub-functions immediately after the call: • 𝗖𝗮𝗹𝗹 𝗦𝘂𝗺𝗺𝗮𝗿𝘆: Generates a natural language recap via LLM. • 𝗖𝗮𝗹𝗹 𝗘𝘃𝗮𝗹𝘂𝗮𝘁𝗶𝗼𝗻: Structurally classifies the outcome (e.g., "Booked", "Inquiry", "Failed"). • 𝗧𝗲𝗹𝗲𝗺𝗲𝘁𝗿𝘆: Captures precise Token Usage (for billing) and Latency statistics alongside the transcript. 🛡️ 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗚𝘂𝗮𝗿𝗱𝗿𝗮𝗶𝗹𝘀 To prevent runaway costs and "zombie" connections, I engineered active background monitors: • 𝗜𝗻𝗮𝗰𝘁𝗶𝘃𝗶𝘁𝘆 𝗠𝗼𝗻𝗶𝘁𝗼𝗿: Detects silence (30s default) and gracefully terminates the session.
OPBX Goes Multi-Tenant and FREE SaaS
Hi All, So, as I've told you all a few weeks ago, I've published an open source tool called OPBX - which is a business PBX system, that works on top of Cloudonix and provides some interesting capabilities when working with voice agents. I've added multi-tenant capabilities to it - so if you install it, you can use it to service all your customers. At the same time, I'll be launching a SaaS version of OPBX, completely FREE of charge, so that you can use it and build with it. Next week, I'll be holding a special OPBX training session, showing how to integrate OPBX with VAPI, Retell, etc. In addition, I'll show how to build multi-agent IVR trees, warm transfers that work as they should and more. Cloudonix Velocity Training Registration - https://us02web.zoom.us/meeting/register/6D63tRaYSDihkJtUlpNp-A#/registration OPBX Github Repository - https://github.com/greenfieldtech-nirs/opbx Looking forward to seeing you all. Nir S
Coval for Simulations? Evals?
I'm currently handling about 6k calls per day between a few different enterprises. We're likely going to implement and build this all out in LiveKit but I also need a better eval + simulation solution and I don't really want to build it out as well. I have seen Coval but I don't know the cost yet -- what else it out there that you guys use that can give me actionable feedback of missed tool calls, hallucinations, etc.
1-30 of 153
powered by
Open Source Voice AI Community
skool.com/open-source-voice-ai-community-6088
Voice AI made open: Learn to build voice agents with Livekit & Pipecat and uncover what the closed platforms are hiding.
Build your own community
Bring people together around your passion and get paid.
Powered by