Voice Cloning

Reference Audio Requirements

Reference audio supplied for voice cloning must meet the requirements listed below.


Requirements

Property       Requirement
Format         WAV, FLAC, or OGG
Duration       1 to 6 seconds
Sample rate    16 kHz mono recommended
Content        Clear speech, minimal background noise
Size           10 MB maximum
Encoding       Base64-encoded string
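
As a rough illustration of the limits above, the following sketch checks a WAV clip before upload and then Base64-encodes it. It uses only the Python standard library, so it covers WAV only; FLAC and OGG would need a third-party decoder. The function names `check_wav_reference` and `encode_reference` are hypothetical, and two points are assumptions not stated in the table: that "10 MB" means 10 * 1024 * 1024 bytes, and that the size limit applies to the raw file rather than to the Base64 string.

```python
import base64
import io
import wave

# Assumption: "10 MB" is read as 10 MiB and applies to the raw (pre-Base64) bytes.
MAX_BYTES = 10 * 1024 * 1024
MIN_SECONDS = 1.0
MAX_SECONDS = 6.0
RECOMMENDED_RATE = 16_000   # 16 kHz is a recommendation, not a hard limit
RECOMMENDED_CHANNELS = 1    # mono


def check_wav_reference(data: bytes) -> tuple[list[str], list[str]]:
    """Return (errors, warnings) for a WAV clip given as raw bytes.

    Errors correspond to hard limits in the table (size, duration);
    warnings correspond to the recommended sample rate and channel count.
    """
    errors: list[str] = []
    warnings: list[str] = []

    if len(data) > MAX_BYTES:
        errors.append("file is larger than 10 MB")

    try:
        with wave.open(io.BytesIO(data), "rb") as wav:
            rate = wav.getframerate()
            channels = wav.getnchannels()
            frames = wav.getnframes()
    except (wave.Error, EOFError):
        errors.append("not a readable WAV file")
        return errors, warnings

    if rate <= 0:
        errors.append("invalid sample rate in WAV header")
        return errors, warnings

    duration = frames / rate
    if not MIN_SECONDS <= duration <= MAX_SECONDS:
        errors.append(f"duration {duration:.2f} s is outside 1 to 6 seconds")

    if rate != RECOMMENDED_RATE:
        warnings.append(f"sample rate is {rate} Hz; 16 kHz is recommended")
    if channels != RECOMMENDED_CHANNELS:
        warnings.append(f"{channels} channels; mono is recommended")

    return errors, warnings


def encode_reference(data: bytes) -> str:
    """Base64-encode the raw audio bytes as an ASCII string."""
    return base64.b64encode(data).decode("ascii")
```

A caller would typically read the file with `open(path, "rb").read()`, refuse to continue if the error list is non-empty, and pass the result of `encode_reference` in whatever request field the API expects. The "clear speech, minimal background noise" requirement cannot be checked this way and still needs a listen.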