Gemini 3.1 Flash TTS – with directed prompts

Posted by aanet 2 days ago

Counter17Comment5OpenOriginal

Comments

Comment by roscas 2 days ago

Hope they release an offline model for Ollama, a small one easy to work with for TTS in other languages.

Comment by xmichael909 1 day ago

So a free model? Tons of other people doing similar with amazing results https://huggingface.co/spaces/Inferless/Open-Source-TTS-Gall...

Comment by Insensitivity 1 day ago

No matter what I wrote in the audio profile, AI Studio never followed it, regardless of scene or context.

For example, I tried to get a male voice and kept getting female ones. Not sure if it's an AI Studio bug or I was doing something wrong.

Comment by voxic11 1 day ago

voice is determined by the voice parameter, you can't control it via the prompt, the prompt only directs how the chosen voices delivers the lines.

Comment by aanet 2 days ago

The 3 examples, with three distinct styles, are fascinating.

I'd like to see one with cockney accent, just for lulz