Hey everyone I’ve been working on a fairly detailed voice assistant using Vapi, and I’ve noticed something strange:
👉 When the prompt gets too long or packed with logic, the AI starts ignoring parts of it — especially things like timing, waiting for replies, or skipping over instructions entirely.
In my case, even though the prompt says “pause after asking how’s it going”, the assistant just jumps into the next line like it never read that at all.
So here’s my question:
Does prompt length affect performance?
- Are we hitting some kind of internal memory/processing limit?
- Should we break things into smaller behaviors or modular prompts?
- Is there a best practice for structuring complex call flows?
Would love to hear how others are handling large prompts in production. 🙏