
by dharma1 15 hours ago


by Lucasoato 15 hours ago


Thanks, I’ll try it, even though my experience with Google models hasn’t been great lately (503s).

by dharma1 15 hours ago


Give it a shot: use the 3.1 live one in AI Studio or the API and max out reasoning, not the one in the Gemini app, which is an older model.

Another option is to use Pipecat with its VAD, separate STT and TTS, and any (fast) LLM of your choice, but it’s more plumbing and not a true speech-to-speech model.
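
For a rough idea of what that plumbing looks like, here is a minimal sketch of a cascaded VAD -> STT -> LLM -> TTS loop. This is not Pipecat's API: `segment_utterances`, `CascadedVoicePipeline`, and the toy stand-ins at the bottom are all made-up names for illustration, and a real setup would stream audio rather than process whole utterances.

```python
from dataclasses import dataclass
from typing import Callable, List


def segment_utterances(
    frames: List[bytes], is_speech: Callable[[bytes], bool]
) -> List[bytes]:
    """Toy VAD step: group consecutive speech frames into utterances.

    A non-speech frame closes the current utterance.
    """
    utterances: List[bytes] = []
    current: List[bytes] = []
    for frame in frames:
        if is_speech(frame):
            current.append(frame)
        elif current:
            utterances.append(b"".join(current))
            current = []
    if current:
        utterances.append(b"".join(current))
    return utterances


@dataclass
class CascadedVoicePipeline:
    """Three separate stages chained together (hypothetical interfaces)."""

    stt: Callable[[bytes], str]   # audio -> transcript
    llm: Callable[[str], str]     # transcript -> reply text
    tts: Callable[[str], bytes]   # reply text -> audio

    def respond(self, utterance_audio: bytes) -> bytes:
        transcript = self.stt(utterance_audio)
        reply = self.llm(transcript)
        return self.tts(reply)


# Toy stand-ins so the sketch runs without any audio or model dependencies.
pipeline = CascadedVoicePipeline(
    stt=lambda audio: audio.decode("utf-8"),
    llm=lambda text: f"You said: {text}",
    tts=lambda text: text.encode("utf-8"),
)
frames = [b"\x00\x00", b"hi", b" there", b"\x00", b"ok"]
utterances = segment_utterances(frames, is_speech=lambda f: any(f))
replies = [pipeline.respond(u) for u in utterances]
```

Every hop between stages adds latency and drops things like tone of voice, which is the main trade-off versus a single speech-to-speech model.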

by user_7832 3 hours ago


> Give it a shot: use the 3.1 live one in AI Studio or the API and max out reasoning, not the one in the Gemini app, which is an older model.

Do you know why this is a thing? Despite the app technically being Gemini, I find it quite crap, while the AI Studio thing with thinking is my favorite LLM. Very jarring tbh.

by stavros 13 hours ago


Haha, wow, I never thought I'd see a voice model that was too quick, but 3.1 live felt like it responded unnaturally quickly! I'm kind of blown away, I'd want to insert a 100ms delay to make it sound more natural, wow. I never thought I'd see that.
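
Something like this on the client side is what I have in mind, just a sketch: `delay_before_playback` and its parameters are names I made up, and `play` stands in for whatever function actually plays the audio.

```python
import time
from typing import Callable


def delay_before_playback(
    play: Callable[[bytes], None],
    delay_s: float = 0.1,
    sleep: Callable[[float], None] = time.sleep,
) -> Callable[[bytes], None]:
    """Wrap a playback function so each response waits delay_s seconds first."""

    def delayed(audio: bytes) -> None:
        sleep(delay_s)  # fixed pause (100 ms by default) before the reply starts
        play(audio)

    return delayed


# Demo with recording stand-ins instead of a real sleep and a real audio device.
played = []
slept = []
delayed_play = delay_before_playback(played.append, sleep=slept.append)
delayed_play(b"response-audio")
```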