Right now, AI doesn’t process voice notes directly. The only way I’ve seen it work is when the voice note is first transcribed, then AI can respond to the text and specifically WhatsApp.
Has anyone mastered this flow? If yes, can you share how you’re doing it step by step?
Would help a lot of us who deal with voice notes daily.