How to use Audiobox, Meta AI’s new sound and voice cloning tool – Android Police

Meta introduced its generative AI model for speech, Voicebox, in mid-2023. Meta aims to take AI sound generation to the next level with Audiobox, Voicebox's successor. The innovative tool generates sound effects from text prompts, eliminates noise from speech recordings, creates a restyled voice, generates speech in the style of an audio clip, and more. Before we take it for a spin, let's learn more about Meta's Audiobox.

The Audiobox demo is available on the web only. Try it on your Mac, Windows desktop, or a top Chromebook.

Creating high-quality audio can be a challenging process. Not everyone is a sound engineer and has access to extensive tools to create audio. Here's where Meta's Audiobox comes into play. It's a sound-generation tool from Facebook AI Research (FAIR). Meta's latest offering generates audio and sound effects using voice inputs, text prompts, and a combination of both.

With Audiobox, Meta aims to lower the barrier of audio creation and make it easy for general users to create high-quality sound samples. Whether you want to create audio for a podcast, YouTube video, audiobook, or video game, Audiobox can be your helping hand to get the job done.

Generative AI has made audio creation and voice cloning popular. There is no shortage of such tools. Meta's Audiobox easily stands out from the crowd due to its unique capabilities. Here's what you can do with it:

All Audiobox features are available to try from the company's official website. You can generate audio samples, check previews, and download them to your device.

You can also move to the Sound Effects menu and describe the sound sample you want to create. Add enough details to get astute results from Audiobox. We ran several text prompts and were impressed with the generated sound effects.

Audiobox can produce sound samples that are close to how people speak naturally. It has led to concerns about AI-powered deepfakes. Especially since the US presidential elections are around the corner, you can't rule out misuse of such AI tools. Meta implements automatic audio watermarking on audio generated by Audiobox.

The embedded signal in the generated audio is negligible to the human ear but can be tracked to the frame level. Meta will also add a voice authentication to prevent impersonation. The person must speak a voice prompt while registering their voice. The text prompt refreshes every 50 seconds, so playing someone else's pre-recorded voice is difficult.

Meta decided against making the AI model open source to prevent potential misuse.

Meta has done a remarkable job with Audiobox. It's accurate and very good. Try it with different prompts and voice samples, and check the results. Besides Facebook, tech giants like Google and Microsoft are exploring generative artificial intelligence to create content.

The search giant recently launched Google Bard to take on Open AI's (and Microsoft) ChatGPT. Read our dedicated post to learn more about Google Bard. We also compared Google Bard with ChatGPT to find their capabilities, limitations, and potential.

See the original post:

How to use Audiobox, Meta AI's new sound and voice cloning tool - Android Police