API Overview

Cloud APIs for speech-to-text and text-to-speech. Use the sidebar to open General guides or jump straight to an endpoint.

Speech-to-Text (ASR)

Every endpoint under asr.shunyalabs.ai, their request fields, response shapes, and error codes.

Endpoints

ModeEndpointUse for
BatchPOST https://asr.shunyalabs.ai/v1/audio/transcriptionsUploaded files, post-processing, async jobs.
Streamingwss://asr.shunyalabs.ai/wsLive transcription, voice agents, IVR.
HealthGET https://asr.shunyalabs.ai/healthLiveness checks. No auth.
LanguagesGET https://asr.shunyalabs.ai/languagesReturns supported language names, ISO codes, and scripts.
Speakers/v1/speakers/*Register, list, identify, delete voice profiles for speaker identification.

Text-to-Speech (TTS)

Every endpoint under tts.shunyalabs.ai, every field, every error code.

Base URLs

InterfaceURL
Batchhttps://tts.shunyalabs.ai
Streamingwss://tts.shunyalabs.ai/ws
Healthhttps://tts.shunyalabs.ai/health