Meta Introduces Generative AI Model For Speech ‘Voicebox’

Meta has developed ‘Voicebox’, a generative AI model built for speech generation.

“We’ve developed Voicebox, the first model that can generalize to speech-generation tasks it was not specifically trained to accomplish with state-of-the-art performance,” Meta said in a blog post.

According to the company, Voicebox works like generative systems for images and text: it can produce outputs in a variety of styles, creating them from scratch or modifying samples provided to it.

However, instead of creating a picture or a passage of text, Voicebox produces high-quality audio clips.

The model supports speech synthesis in six languages (English, French, German, Spanish, Polish, and Portuguese) and can also perform noise removal, content editing, style conversion, and diverse sample generation.

Moreover, Meta said that Voicebox uses a new approach to learn just from raw audio and an accompanying transcription.

Unlike autoregressive models for audio generation, Voicebox can modify any part of a given sample, not just the end of the clip.

Further, the tech giant said that Voicebox is trained to predict a speech segment when given the surrounding speech and the transcript of the segment.

Once the model has learned to infill speech from context, it can be applied across a wide range of speech generation tasks, including generating portions of an audio recording without re-creating the entire recording.
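To make the infilling setup concrete, here is a minimal sketch of how one training example might be assembled. It is an illustration only, not Meta's implementation: the `InfillExample` class, the `make_infill_example` function, and the use of plain floats as stand-ins for audio frames are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class InfillExample:
    """One hypothetical training example for speech infilling."""
    context: List[Optional[float]]  # audio frames, None where hidden
    mask: List[bool]                # True where the model must predict
    target: List[float]             # the hidden frames to be recovered
    transcript: str                 # accompanying text


def make_infill_example(frames, transcript, start, end):
    """Hide frames[start:end] and keep the surrounding audio as context."""
    if not 0 <= start < end <= len(frames):
        raise ValueError("span must satisfy 0 <= start < end <= len(frames)")
    mask = [start <= i < end for i in range(len(frames))]
    # Frames inside the span are hidden; everything else is left untouched.
    context = [None if hidden else frame for frame, hidden in zip(frames, mask)]
    return InfillExample(context, mask, list(frames[start:end]), transcript)


example = make_infill_example([0.1, 0.2, 0.3, 0.4, 0.5], "hello world", 1, 3)
print(example.context)  # [0.1, None, None, 0.4, 0.5]
print(example.target)   # [0.2, 0.3]
```

Because the hidden span can sit anywhere in the clip, the same setup covers editing the middle of a recording while the rest of the audio stays as it was.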

This versatility enables Voicebox to perform well across a variety of tasks, including in-context text-to-speech synthesis, cross-lingual style transfer, speech denoising and editing, and diverse speech sampling.

Bijay Pokharel
