I tried OmniVoice this afternoon because I saw a post on r/LocalLLaMA and got curious. It's a zero-shot voice cloning TTS model that claims 600+ languages. I recorded two short voice memos on my phone, one in Turkish and one in English, dropped them into the web UI, and 30 seconds later voila I had cloned versions of my voice saying things I never said. Each sample took one or two tries to get right.

Continue reading →