Text to Speech

Text Input
0/5000
0 characters~0s estimated
Language

Chatterbox supports 23 languages with multilingual voice cloning

Voice Cloning (Optional)

Upload reference audio for voice cloning

WAV or MP3, max 10MB, 3-15 seconds recommended

For best results, use a clear recording with minimal background noise. Ensure the reference matches the target language for accent accuracy.

Voice Settings
0.252

Balanced - Default

01

Balanced

0.052

Balanced

Quick Presets

Generated audio includes imperceptible PerTh watermarking for responsible AI use. This watermark survives compression and editing while maintaining audio quality.

Your generated tts will appear here

Recent Generations
Loading history...