Voice Cloning

Voice Cloning Guide

Clone any voice using our advanced VAL v1 model to create custom AI voice characters. This guide covers all key aspects of the voice cloning process.

Cloning Methods

Method 1: File Upload

Upload pre-recorded audio files to create voice characters.

Supported Formats:

  • WAV
  • MP3
  • OGG

File Requirements:

  • Duration: 10-30 seconds optimal
  • Quality: Clear audio with minimal background noise
  • Size: Maximum 10MB per file
  • Sample Rate: 16kHz or higher recommended

Steps:

  1. Visit the Voice Clone page
  2. Click the "Upload File" tab
  3. Select your audio file
  4. Wait for upload to complete
  5. Proceed to character configuration

Method 2: Online Recording

Record audio directly in your browser for instant cloning.

Requirements:

  • Microphone access permission
  • Quiet recording environment
  • Stable internet connection

Steps:

  1. Visit the Voice Clone page
  2. Click the "Online Recording" tab
  3. Allow browser microphone access
  4. Click record button to start
  5. Speak clearly for 10-30 seconds
  6. Click stop when finished
  7. Preview your recording
  8. Proceed to character configuration

Character Configuration

After providing audio input, configure your character details:

Required Fields

Character Name

  • Give your voice character a memorable name
  • Use descriptive names for easy identification
  • Maximum 20 characters

Optional Fields

Character Description

  • Describe voice characteristics
  • Note intended use case
  • Add relevant context
  • Maximum 200 characters

Character Avatar

  • Upload profile image for visual identification
  • Supported formats: JPG, PNG, WebP
  • Maximum size: 5MB
  • Recommended dimensions: 512x512 pixels

Advanced Settings

Model Version

Select the AI model for cloning:

  • VAL v1 (Recommended): Latest model with best quality and multilingual support

Language Selection

Choose the primary language of your audio sample:

  • English (US, UK, Australian)
  • Chinese (Mandarin, Cantonese)
  • Spanish, French, German, Italian
  • Japanese, Korean
  • And 20+ more languages

Tip: Selecting the correct language improves cloning accuracy.

Vocal Separation

Enable this option if your audio contains background noise or music:

  • Enabled: AI will isolate and extract vocals from background sounds
  • Disabled: Use for clean audio recordings

When to Enable:

  • Audio has background music
  • Multiple speakers in recording
  • Environmental noise present
  • Phone call recordings

When to Disable:

  • Studio-quality recordings
  • Clean voice-only audio
  • Minimal background noise

Cloning Process

After clicking "Create Voice Character":

  1. Upload: Audio is securely uploaded to our servers
  2. Processing: VAL v1 model analyzes voice characteristics
  3. Training: AI learns unique vocal patterns
  4. Validation: Quality checks ensure successful cloning
  5. Completion: Character is ready for use

Processing Time:

  • Short samples (10-30s): 5-15 seconds
  • Medium samples (1-5 min): 15-30 seconds
  • Long samples (5+ min): 30-60 seconds

Quality Tips

For best cloning results:

Audio Quality

  • ✅ Use high-quality microphones or recordings
  • ✅ Record in quiet environments
  • ✅ Avoid echo and reverb
  • ✅ Maintain consistent volume levels
  • ❌ Avoid compressed or low-bitrate audio

Content Selection

  • ✅ Include varied sentences and expressions
  • ✅ Capture different emotional tones
  • ✅ Use natural speaking pace
  • ✅ Include pauses and breathing
  • ❌ Avoid monotone or robotic speech

Sample Length

  • Recommended (10-30s): Quick cloning with quality assurance

Troubleshooting

Common Issues

"Audio quality too low"

  • Solution: Use higher quality recording equipment or format
  • Ensure sample rate is at least 16kHz

"Background noise detected"

  • Solution: Enable vocal separation option
  • Re-record in quieter environment

"Sample too short"

  • Solution: Provide at least 10 seconds of clear speech
  • Combine multiple short clips if needed

"Language mismatch"

  • Solution: Ensure selected language matches audio content
  • Try "Auto-detect" if unsure

Usage Limits

Character creation is subject to your plan limits:

PlanCharactersMonthly Clones
Starter1010
Standard100100
PremiumUnlimitedUnlimited

Subject to change, please refer to the Pricing page for current rates.

Check your current usage in the user menu or My Voices page.

Next Steps

After creating your character:

  1. Preview: Test the voice in Text-to-Speech
  2. Organize: Manage characters in Character Management
  3. Share: Publish to Voice Square (optional)
  4. Create: Use in your projects and content

Need help? Check our FAQ or contact [email protected]