Voice Design - The First Generative AI For Audio

The first generative model for creating synthetic voices is here

By ElevenLabs Team in Product — Feb 28, 2023

Last month we announced that our generative model for voice creation was coming. It's finally here, and it's the first of its kind: we call it Voice Design. The feature lets you build new voices from scratch by selecting core qualities such as gender, age and accent. Even with the same parameter settings, the model adds randomness every time you hit generate, so each voice you hear is unique. Voice Design is part of our wider effort to equip publishers and creators with the most versatile AI storytelling tools.

Voice Design

The model behind Voice Design is largely the result of our research into speech synthesis and voice cloning, though we had always liked the idea of a generative tool for speech in its own right. Generative text-to-image and chatbot models have already found practical applications, but a similar tool for audio was missing. Ever since our launch, we've been getting requests to add more speakers to our bank. Instead of overcrowding the library with countless voices and making you listen through each preview to know who's who, we decided to flip the script and let you determine speaker identity, while still allowing for infinite variety within those constraints.

Adding a degree of control to voice selection was important, since our users often seek specific speech characteristics for their scripts. Ensuring each generated voice is unique was equally crucial, as many use cases require, or at least benefit from, having exclusive access to a voice. Voice Design also gives users a new creative outlet, and the voices it generates are completely artificial and don’t belong to any real person.

Voice Design Tutorial

The Voice Design interface is simple to use: you select a few voice qualities and adjust the accent strength toggle.

Hear some of the voices we generated with Voice Design:

American