Voice Cloning
Reference Audio Requirements
Requirements for reference audio used in voice cloning.
Requirements
| PROPERTY | REQUIREMENT |
|---|---|
Format | WAV, FLAC, or OGG |
Duration | 1 -- 6 seconds |
Sample rate | 16 kHz mono recommended |
Content | Clear speech, minimal background noise |
Size | 10 MB maximum |
Encoding | Base64-encoded string |