News

Meta's Voicebox AI promises to do for the spoken word what ChatGPT and Dall-E, respectfully, did for text and image generation.
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
By using Google text-to-speech, users can ask Spot about past and future missions, to which it can reply in real-time. Santiago writes that ChatGPT interprets the question, parses the files, and ...
Text-to-speech with feeling - this new AI model does everything but shed a tear ElevenLabs' 'most expressive' v3 model can speak with a huge range of emotions in more than 70 languages.
Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio Text-to-speech model can preserve speaker's emotional tone and acoustic environment.
Bark is a universal text-to-audio model that can not only create realistic speech, it can incorporate music, background noises, and sound effects. It can even include non-speech sounds like laughte… ...