Build a WhatsApp AI Voice Agent with n8n, Twilio & ElevenLabs
Build a WhatsApp AI voice agent using n8n, Twilio and ElevenLabs! In this step-by-step tutorial, you'll learn how to capture WhatsApp voice notes, transcribe them with AI, generate smart responses, convert them back into natural-sounding audio, and send them to users automatically. Perfect for businesses wanting after-hours automation or creators building advanced AI workflows.

In this video, we walk through the full pipeline: receiving the WhatsApp message, routing it through Twilio, normalizing payloads in n8n, validating voice files, transcribing audio using ElevenLabs, generating an AI response with OpenAI (or any LLM), converting that reply back into speech, hosting the file, and finally delivering the voice note back to WhatsApp. You'll also learn how to customize voices, use your own cloned voice, and securely configure gateways, webhooks and credentials.

This tutorial expands on the previous WhatsApp automation video, but focuses specifically on AI voice note handling, error-proofing workflows, and production-ready best practices for routing media, managing storage, and preventing failed API calls. The full n8n template is attached.
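For readers who want a feel for the "normalize payloads" and "validate voice files" steps before opening the template, here is a minimal Python sketch. It is not the logic from the n8n template. It assumes the inbound webhook carries Twilio's standard form fields (`From`, `Body`, `NumMedia`, `MediaUrl0`, `MediaContentType0`). The `InboundMessage` class, the `normalize_twilio_payload` helper, and the `ALLOWED_AUDIO_TYPES` set are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Mapping, Optional

# Assumption: the audio types we are willing to transcribe.
# WhatsApp voice notes usually arrive as audio/ogg; adjust as needed.
ALLOWED_AUDIO_TYPES = {"audio/ogg", "audio/mpeg", "audio/mp4", "audio/amr"}


@dataclass
class InboundMessage:
    """Hypothetical normalized shape for one inbound WhatsApp message."""
    sender: str
    text: str
    media_url: Optional[str]
    media_type: Optional[str]

    @property
    def is_voice_note(self) -> bool:
        # A voice note needs both a media URL and an accepted audio type.
        return self.media_url is not None and self.media_type in ALLOWED_AUDIO_TYPES


def normalize_twilio_payload(form: Mapping[str, str]) -> InboundMessage:
    """Flatten Twilio's webhook form fields into one predictable structure."""
    num_media = int(form.get("NumMedia", "0") or 0)
    media_url = form.get("MediaUrl0") if num_media > 0 else None
    media_type = form.get("MediaContentType0") if num_media > 0 else None
    if media_type:
        # Drop parameters such as "; codecs=opus" before comparing.
        media_type = media_type.split(";")[0].strip().lower()

    sender = form.get("From", "")
    if sender.startswith("whatsapp:"):
        sender = sender[len("whatsapp:"):]

    return InboundMessage(
        sender=sender,
        text=form.get("Body", ""),
        media_url=media_url,
        media_type=media_type,
    )
```

A workflow built this way would branch on `is_voice_note`: voice notes continue to transcription, while text or unsupported media can be answered with a fallback message instead of triggering a failed API call.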