Text-to-Speech Guide

Convert text content into natural, fluent speech with support for multiple languages, emotions, and tonal control. This guide will help you make the most of the text-to-speech feature.

Accessing the Feature

Visit the Text-to-Speech page to get started.

Basic Usage Workflow

Step 1: Select Voice Character

Choose your desired voice from three sources:

My Voices

  • Characters you've cloned yourself
  • Full control and usage rights
  • Suitable for personalized projects

Voice Square

  • Public characters shared by the community
  • Rich variety of voice options
  • Discover new voice styles

System Characters

  • Professional voices preset by the platform
  • Stable and reliable quality
  • Ready to use right away

Step 2: Input Text Content

Enter or paste the content to convert in the text box:

Text Requirements:

  • Maximum length: 1,000 characters per request
  • Supports Chinese, English, and mixed-language text
  • Use standard punctuation

Format Recommendations:

  • Use periods and commas to control pauses
  • Use question marks and exclamation points for tone
  • Use line breaks appropriately for paragraphs
  • Avoid overly long single sentences
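
If you prepare scripts outside the page before pasting them in, a small pre-check can catch problems early. The following is a minimal sketch, not part of the platform: the function name check_text and the 80-character threshold for an "overly long" sentence are assumptions made for illustration. Only the 1,000-character limit comes from this guide.

```python
import re

MAX_CHARS = 1000     # per-request limit stated in this guide
LONG_SENTENCE = 80   # hypothetical threshold for "overly long"; adjust to taste


def check_text(text: str) -> list[str]:
    """Return a list of warnings; an empty list means the text looks fine."""
    warnings = []
    if not text.strip():
        warnings.append("text is empty")
    if len(text) > MAX_CHARS:
        warnings.append(f"text has {len(text)} characters; limit is {MAX_CHARS}")
    # Split on sentence-ending punctuation (Western and Chinese).
    for sentence in re.split(r"[.!?。！？]+", text):
        sentence = sentence.strip()
        if len(sentence) > LONG_SENTENCE:
            warnings.append(f"long sentence: {sentence[:30]}...")
    return warnings


if __name__ == "__main__":
    for warning in check_text("Hello, welcome to Voice AI Labs."):
        print(warning)
```

An empty result means the text passes these local checks only; the platform may apply further checks of its own.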

Step 3: Adjust Parameters (Optional)

Adjust the following parameters as needed:

Speed Control

  • Range: 0.5x - 2.0x
  • Default: 1.0x (normal speed)
  • Slower speeds suit educational content
  • Faster speeds suit news broadcasts

Emotion Control (supported by some characters)

  • Neutral, happy, sad, etc.
  • Choose appropriate emotion based on content
  • Enhance expression

Step 4: Generate Speech

Click the "Generate Speech" button:

Processing Time:

  • Short text (within 100 characters): 3-5 seconds
  • Medium text (500 characters): 10-15 seconds
  • Long text (up to the 1,000-character limit): 20-30 seconds

Generation Status:

  • Waiting: Task submitted
  • Processing: Synthesizing
  • Completed: Ready to listen and download
  • Failed: View error message

Step 5: Preview and Download

Online Preview:

  • Click play button to listen
  • Adjust playback volume
  • Check voice quality

Download Audio:

  • Click download button
  • Format: MP3
  • Sample rate: 24kHz
  • Bit rate: 128kbps
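
The bit rate above allows a rough estimate of download size. This is a sketch under two assumptions that the guide does not confirm: the MP3 is encoded at a constant 128 kbps, and file headers and metadata are small enough to ignore. The function name is illustrative.

```python
BIT_RATE_KBPS = 128  # bit rate stated in this guide


def estimated_mp3_bytes(duration_seconds: float,
                        bit_rate_kbps: int = BIT_RATE_KBPS) -> int:
    """Approximate file size in bytes for constant-bit-rate audio."""
    bits = duration_seconds * bit_rate_kbps * 1000  # 1 kbps = 1000 bits per second
    return int(bits / 8)                            # 8 bits per byte


if __name__ == "__main__":
    size = estimated_mp3_bytes(60)
    print(f"One minute of audio is roughly {size / 1_000_000:.2f} MB")
```

At 128 kbps this works out to 16,000 bytes per second of audio, so one minute is a little under 1 MB.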

Advanced Tips

Text Optimization

Using Punctuation:

Correct example:
"Hello, welcome to Voice AI Labs. We provide professional voice synthesis services."

Avoid:
"Hello welcome to Voice AI Labs we provide professional voice synthesis services"

Controlling Pauses:

  • Comma: Brief pause
  • Period: Medium pause
  • Line break: Longer pause
  • Blank line: Clear paragraph break
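
When a script is assembled programmatically, the pause rules above can be applied while joining it. The helper below is a hypothetical sketch: it puts each sentence on its own line (longer pause) and separates paragraphs with a blank line (clear paragraph break). How long each pause actually lasts depends on the selected character.

```python
def layout_script(paragraphs: list[list[str]]) -> str:
    """Join sentences with line breaks and paragraphs with blank lines.

    Each inner list holds the sentences of one paragraph.
    """
    return "\n\n".join(
        "\n".join(sentence.strip() for sentence in paragraph)
        for paragraph in paragraphs
    )


if __name__ == "__main__":
    script = layout_script([
        ["Hello everyone, welcome to today's tutorial."],
        ["First, open the Text-to-Speech page.", "Then select a voice."],
    ])
    print(script)
```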

Numbers and Symbols:

  • English abbreviations are spelled out letter by letter: "AI" → "A I"
  • Special symbols may be ignored

Multilingual Mixing

You can mix multiple languages in the same text:

Example:
"Welcome to Voice AI Labs. This is a powerful text-to-speech service.
Let's start creating!"

Notes:

  • Choose characters that support multiple languages
  • Slight pauses may occur at language switches
  • Keep language transitions natural and smooth

Long Text Processing

Each request currently supports up to 1,000 characters. A segmented processing feature is coming soon.
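
Until then, longer scripts can be split by hand or with a small script before submission. The following is a minimal sketch, assuming you want each piece to end at a sentence boundary where possible. The function name split_text is illustrative and not part of the platform. A single sentence longer than the limit is cut at the limit as a fallback.

```python
import re

MAX_CHARS = 1000  # per-request limit stated in this guide


def split_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Chunks end after sentence punctuation where possible, and joining
    the chunks reproduces the original text exactly.
    """
    # Zero-width split after each sentence-ending mark, so the
    # punctuation stays attached to its sentence.
    sentences = re.split(r"(?<=[.!?。！？])", text)
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Fallback: hard-cut a single sentence that exceeds the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        # Start a new chunk when this sentence would overflow the current one.
        if len(current) + len(sentence) > limit:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    long_script = "This is one sentence of a long script. " * 60
    for number, chunk in enumerate(split_text(long_script), start=1):
        print(number, len(chunk))
```

Each chunk is then submitted as its own request, and the resulting audio files can be joined in an audio editor.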

Quality Optimization

Achieving Best Results

Choose Appropriate Character:

  • Match content type with voice style
  • Use professional voices for professional content
  • Use friendly voices for casual content

Optimize Text Content:

  • ✅ Use natural spoken expressions
  • ✅ Avoid overly formal wording
  • ✅ Add appropriate interjections
  • ❌ Avoid rare characters and technical jargon
  • ❌ Avoid overly long compound sentences

Adjust Parameters:

  • Narrative content: Slightly slower speed (0.9x)
  • News broadcast: Normal speed (1.0x)
  • Fast-paced content: Slightly faster speed (1.1-1.2x)
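
The recommendations above can be kept as presets if you manage many scripts. This is an illustrative sketch only: the preset names are made up, and 1.15 is simply the midpoint of the suggested 1.1-1.2x range. The clamp uses the 0.5x to 2.0x range given in Step 3.

```python
MIN_SPEED, MAX_SPEED = 0.5, 2.0  # speed range stated in this guide

# Hypothetical preset names mapped to the speeds recommended above.
SPEED_PRESETS = {
    "narrative": 0.9,
    "news": 1.0,
    "fast_paced": 1.15,  # assumption: midpoint of the 1.1-1.2x suggestion
}


def pick_speed(content_type: str, default: float = 1.0) -> float:
    """Return a speed for the content type, clamped to the supported range."""
    speed = SPEED_PRESETS.get(content_type, default)
    return max(MIN_SPEED, min(MAX_SPEED, speed))


if __name__ == "__main__":
    for kind in ("narrative", "news", "fast_paced", "other"):
        print(kind, pick_speed(kind))
```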

Common Scenarios

Video Narration

Example text:
"Hello everyone, welcome to today's tutorial.

Today we'll learn how to use Voice AI Labs for voice synthesis.

First, let's understand the basic workflow..."

Recommendations:

  • Use clear, friendly voice
  • Moderate speed (0.9-1.0x)
  • Appropriate segmentation for easy editing

Audiobook Production

Example text:
"Chapter One

It was a sunny morning. Xiao Ming walked to school, feeling particularly happy.

Suddenly, he saw an injured bird..."

Recommendations:

  • Choose expressive voice
  • Leave blank lines between chapters
  • Use different characters for dialogue

Advertising Voiceover

Example text:
"Voice AI Labs, professional AI voice synthesis platform!

Convert text to natural, fluent speech in just seconds.

Try it now and start your creative journey!"

Recommendations:

  • Use engaging voice
  • Slightly faster speed (1.1-1.2x)
  • Emphasize key information

History Management

View History

View generation history on the right side of the page:

  • Shows recent generation records
  • Includes text preview and timestamp
  • Displays character information used

Re-download

  • Click download button in history records
  • No need to regenerate
  • Save quota

Quota Management

Character Counting

  • Calculated by actual character count
  • Includes punctuation and spaces
  • Chinese and English characters billed equally
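
To estimate usage before generating, you can count characters locally. This sketch assumes the platform counts one unit per Unicode character, which matches the rules above but is not confirmed in detail (for example, how line breaks are counted). The function names and the remaining-quota figure are illustrative.

```python
def billable_characters(text: str) -> int:
    """Count every character, including punctuation and spaces.

    len() counts Unicode code points, so a Chinese character and an
    English letter each count as one.
    """
    return len(text)


def fits_quota(text: str, remaining_quota: int) -> bool:
    """Return True if the text fits within the remaining quota."""
    return billable_characters(text) <= remaining_quota


if __name__ == "__main__":
    sample = "Hi, 你好!"
    print(billable_characters(sample))                 # 7
    print(fits_quota(sample, remaining_quota=5))       # False
```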

Check Usage

View in user menu:

  • Characters used this month
  • Remaining available quota
  • Quota reset date

Insufficient Quota

When quota runs out:

  • Upgrade to higher plan
  • Wait for next month's quota reset
  • Purchase additional character packs

Troubleshooting

Generation Failed

Possible Causes:

  • Text contains content that violates platform rules
  • Incorrect text format
  • Network connection interrupted
  • Server busy

Solutions:

  • Check text content
  • Simplify text format
  • Retry generation
  • Contact customer support

Audio Quality Issues

Unnatural Sound:

  • Try other characters
  • Adjust speed parameters
  • Optimize text expression

Pronunciation Errors:

  • Use standard wording
  • Avoid rare characters
  • Use homophones as substitutes

Download Issues

Cannot Download:

  • Check browser settings
  • Allow download permissions
  • Clear browser cache
  • Try other browsers

Best Practices Summary

  1. Choose an Appropriate Character - Match the voice to the content style
  2. Optimize Text Format - Use punctuation and paragraphs
  3. Adjust Parameters - Set speed based on the scenario
  4. Preview Before Download - Confirm the quality is satisfactory
  5. Manage History Records - Download promptly and clean up regularly

Need help? Contact customer support.