Hello everyone, I'm working on a project to create an Arabic voice agent. I'm running it locally on a 16 GB GPU, using VS Code and LangChain. I'm struggling to find TTS and STT models that are fast and accurate enough. Currently I have a 4-bit Qwen 8B model fine-tuned on my dataset, using ChromaDB. Any suggestions?