Gemini 3.1 Flash TTS enhances AI speech generation
Google Blog
Google has launched Gemini 3.1 Flash TTS, a new text-to-speech model that offers finer control over the expressiveness of AI-generated audio. The model introduces granular audio tags that let users direct vocal style, pacing, and delivery in more than 70 languages. According to Google, it achieves high naturalness and expressivity, outperforms previous versions on benchmarks, and is cost-effective. Gemini 3.1 Flash TTS is available through Google AI Studio, Vertex AI, and Google Vids, and all generated audio is watermarked with SynthID to support authenticity and help prevent misinformation.
Tags
ai
product
Original Source
Google Blog — blog.google