API Overview

Cloud APIs for speech-to-text and text-to-speech. Use the sidebar to open General guides or jump straight to an endpoint.

Speech-to-Text (ASR)

Every endpoint under asr.shunyalabs.ai, with its request fields, response shapes, and error codes.

Mode	Endpoint	Use for
Batch	`POST https://asr.shunyalabs.ai/v1/audio/transcriptions`	Uploaded files, post-processing, async jobs.
Streaming	`wss://asr.shunyalabs.ai/ws`	Live transcription, voice agents, IVR.
Health	`GET https://asr.shunyalabs.ai/health`	Liveness checks. No auth.
Languages	`GET https://asr.shunyalabs.ai/languages`	Returns supported language names, ISO codes, and scripts.
Speakers	`/v1/speakers/*`	Register, list, identify, and delete voice profiles for speaker identification.
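
As a quick illustration of the Health row above, the following is a minimal sketch of a liveness probe using only the Python standard library. The URL and the "no auth" behaviour come from the table; the function name `is_alive`, the 5-second timeout, and the choice to treat any 2xx status as healthy are assumptions made for this example, since the response body is not described here.

```python
import urllib.error
import urllib.request

# URL taken from the endpoint table; no Authorization header is sent
# because the health endpoint is listed as requiring no auth.
ASR_HEALTH_URL = "https://asr.shunyalabs.ai/health"


def is_alive(url=ASR_HEALTH_URL, timeout=5.0, opener=urllib.request.urlopen):
    """Return True if a GET to `url` answers with an HTTP 2xx status.

    `opener` is a hypothetical injection point so the function can be
    exercised without network access; it defaults to urllib's urlopen.
    """
    try:
        with opener(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError):
        # Covers DNS failures, refused connections, timeouts, and
        # non-2xx responses that urlopen raises as HTTPError.
        return False
```

Calling `is_alive()` with no arguments probes the ASR service; passing a different `url` reuses the same check for another host.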

Text-to-Speech (TTS)

Every endpoint under tts.shunyalabs.ai, with its fields and error codes.

Interface	URL
Batch	`https://tts.shunyalabs.ai`
Streaming	`wss://tts.shunyalabs.ai/ws`
Health	`https://tts.shunyalabs.ai/health`
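
The table above mixes `https://` and `wss://` URLs, so a client has to pick an HTTP or a WebSocket transport per interface. The sketch below shows one way to make that choice from the URL scheme. The URLs are copied from the table; the dictionary keys and the helper name `transport_for` are illustrative and not part of the service.

```python
from urllib.parse import urlsplit

# URLs copied from the table; the keys are hypothetical labels for this sketch.
TTS_ENDPOINTS = {
    "batch": "https://tts.shunyalabs.ai",
    "streaming": "wss://tts.shunyalabs.ai/ws",
    "health": "https://tts.shunyalabs.ai/health",
}


def transport_for(interface, endpoints=TTS_ENDPOINTS):
    """Return (transport, url) for a named interface, based on the URL scheme."""
    url = endpoints[interface]  # raises KeyError for an unknown interface
    scheme = urlsplit(url).scheme
    if scheme in ("ws", "wss"):
        return "websocket", url
    if scheme in ("http", "https"):
        return "http", url
    raise ValueError(f"unsupported scheme in {url!r}")
```

For example, `transport_for("streaming")` selects the WebSocket transport, while `transport_for("batch")` selects plain HTTP.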