Welcome to the community!
This guide will help you get everything you need to create lifelike synthetic voices. We'll walk through selecting a voice avatar, generating and organizing clips, and pronunciation best practices. So, let's dive in and start creating!
In this guide:
Starting a Project
What is a Project?
Projects act as folders to organize your clips easily. To start a new project, click New Project on the top right of Studio. Name your project and start creating.
View the Voice Avatar Library
To view the Voice Avatar Library anytime, click the Avatar icon in Studio or select See All Avatars from the right side menu. The Voice Avatar Library has a search filter to narrow results based on regional accents, voice characteristics, and performance styles.
Choosing the best voice avatar for your use case
- Start by thinking about the tone and mood you want your production to have. Do you want someone trustworthy? Authoritative? Focused? Warm?
- Narrow down your search by using the Voice Avatar Library filters.
- Listen to a few different voice avatars and choose your top options.
TIP: Favorite your top three choices by clicking the vertical ellipsis (⋮ icon) to the right of each voice avatar so you can find them again quickly.
- Give your top three choices a brief audition: copy in a portion of your script (an excerpt that reflects the tone you're aiming for) and create a clip with each of your top voice avatars.
If you find a voice avatar you love creating with, add it to your favorites.
Generating Audio Clips
Entering your text
Enter your script by copying from your source material and pasting it into the Studio Editor.
Available rendering options
Studio offers three rendering options to help streamline your workflow:
- Single take: With this option, you can render a clip of up to 5,000 characters in one take.
- Render by sentence: Use this option to create a new clip for each sentence in your script. This can speed up your workflow by breaking your script into smaller, more manageable pieces.
- Render by paragraph: This option creates a new clip for each paragraph in your script. Use two line breaks to define a new paragraph.
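To preview how a script would be divided before you render it, the three options can be approximated in a few lines of Python. This is only an illustrative sketch, not how Studio works internally: the function names are hypothetical, the sentence splitter is a naive regular expression, and the 5,000-character limit is simply the figure quoted for Single take above.

```python
import re

# Limit quoted in this guide for the Single take option.
MAX_SINGLE_TAKE_CHARS = 5000


def fits_single_take(script: str) -> bool:
    """Return True if the whole script fits in one take."""
    return len(script) <= MAX_SINGLE_TAKE_CHARS


def split_by_paragraph(script: str) -> list[str]:
    """Split wherever two or more line breaks separate blocks of text."""
    parts = re.split(r"\n\s*\n", script)
    return [p.strip() for p in parts if p.strip()]


def split_by_sentence(script: str) -> list[str]:
    """Naive split after '.', '!' or '?' followed by whitespace (assumption)."""
    parts = re.split(r"(?<=[.!?])\s+", script.strip())
    return [p for p in parts if p]


script = "Hello there. How are you?\n\nSecond paragraph here."
print(split_by_paragraph(script))
print(split_by_sentence(script))
```

A script with abbreviations such as "Dr." would be split too eagerly by this sketch, so treat the result as a rough preview of how many clips to expect.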
Creating a clip
- Enter your script into the text editor.
- Select your Voice Avatar from the Voice Avatar Library.
- Click Create. You've just made a clip!
How to listen, repopulate, download, and delete a clip
Listen to a clip
- Locate the clip you want to listen to and click the triangle play icon on the left of the clip. The clip will start playing.
- You can pause or resume playback by clicking the play/pause button.
Repopulate a clip
- Locate the clip you want to repopulate and select the "T" icon. The clip's text will pop back into the text editor, and you can make changes.
- Once you've made the desired changes, click the Create button to make a new clip with the updated text.
Download a clip
- Locate the clip and click the download icon.
- Select the desired file format (MP3, OGG, or WAV).
- Your clip will download based on your internet browser's download settings.
Delete a clip
- Locate the clip and click the "X" icon.
- The clip will be permanently deleted, and your download credits will be freed up for use on new clips.
Renaming a clip
- In your Studio project, hover over the title of your selected clip and click the text.
- Delete the original text and replace it with your new title.
Merge multiple clips into one using our Combine tool.
- Select the checkbox on the left side of each clip you want to combine. Clips will combine in order from bottom to top.
- Select Combine.
- Name your new clip and select the length of pause you would like between each clip.
- Click Create Clip.
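Because clips combine from bottom to top, it can help to work out the final order and approximate length before you click Create Clip. The Python sketch below is a planning aid only, under stated assumptions: the function names, clip names, and durations are hypothetical, and it assumes one pause of the chosen length is placed between each pair of adjacent clips.

```python
def combine_order(selected_top_to_bottom: list[str]) -> list[str]:
    """Return clip names in playback order: the bottom clip plays first."""
    return list(reversed(selected_top_to_bottom))


def combined_length(durations_sec: list[float], pause_sec: float) -> float:
    """Total length, assuming one pause between each pair of adjacent clips."""
    pauses = max(len(durations_sec) - 1, 0)
    return sum(durations_sec) + pause_sec * pauses


# Clips as they appear in the project list, top to bottom (hypothetical names).
selected = ["Outro", "Main message", "Intro"]
print(combine_order(selected))
print(combined_length([4.0, 6.5, 3.0], pause_sec=0.5))
```

If the printed order is not the one you want, rearrange the clips in the project before combining.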
Moving a clip
You can rearrange clips within a project and from project to project. Learn how to move files to another project here.
Deleting unwanted clips
In Studio, you can create as many retakes as needed to get the right take. Once you've got the perfect clip, delete any unwanted clips to free up your clip usage and streamline your workflow.
A +1 icon indicates that deleting the clip will return a credit to your account.
Pronunciation Best Practices
Pronounce a word correctly with Respellings
Our Respelling system allows you to format a word so it defines how each syllable should sound and which should be emphasized. Check out our Respellings guide to tune pronunciation of words.
Testing out a replacement
Once you have a creative spelling or Respelling for your word, test it out in the Pronunciation section from the right menu in Studio.
- Enter your replacement in the bottom box.
- If it's a Respelling with double colons (:: ::), remove them and toggle the Phonetic Respelling "on."
- Click the Play icon to listen.
- If it sounds good, save it to your library or replace the mispronounced word with your new respelled word in the Editor.
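If you keep Respellings in a spreadsheet or text file, a small helper can strip the double colons before you paste each one into the Pronunciation section. This is a minimal sketch: the function name is hypothetical, and the sample Respelling is made up for illustration rather than taken from the Respellings guide.

```python
def strip_respelling_markers(text: str) -> str:
    """Remove a leading and trailing '::' pair, if both are present."""
    text = text.strip()
    if len(text) >= 4 and text.startswith("::") and text.endswith("::"):
        return text[2:-2].strip()
    # Not wrapped in double colons: return the trimmed text unchanged.
    return text


# Made-up example of a Respelling wrapped in double colons.
print(strip_respelling_markers("::ex-AM-pul::"))
```

Remember to toggle the Phonetic Respelling "on" after pasting the stripped text, as described in the steps above.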
Adding to your Pronunciation Library
Studio's Pronunciation Library helps streamline your workflow by saving words or phrases that need phonetic spelling to achieve your preferred sound. Learn to add a replacement word to your Pronunciation Library and share it with your team.
Emphasizing a word or phrase
When you place a word or phrase in quotation marks (""), the AI pays particular attention to the chosen word or phrase. Quotation marks let you shape the emphasis of your sentence.
Some acronyms are pronounced as a word (NASA), while others aren't (NBA). Learn how to guide the AI to pronounce acronyms correctly here.
Just like real voice actors, AI voices need cues to understand whether a number is a dollar amount, a reference number, a value, an address, a year, a phone number, and so on. Learn how to pronounce numbers here.
Adding a pause
Use commas and periods to add natural pauses.
- Commas add pauses anywhere you want a small, subtle pause.
- Periods create a pause as well as a downward inflection. They are best used to break a long sentence into two pieces, allowing the AI to better predict which words to emphasize.
Adding a longer pause between sentences
- Use an ellipsis (...) to create "breathing room" or a combination of punctuation marks ("...") to create space.
- Press the return or enter key and enter a period a few times for a slightly longer pause.
- Use the Combine feature and choose the length of pause between each clip.
Adding inflection to a question requires context, which our AI is still learning, but you can guide spoken questions to have the inflection you're looking for by using our tips here.
Congratulations, you're now ready to dive into Studio and start bringing your projects to life!