API Overview
Cloud APIs for speech-to-text and text-to-speech. Use the sidebar to open General guides or jump straight to an endpoint.
Speech-to-Text (ASR)
Every endpoint under asr.shunyalabs.ai, their request fields, response shapes, and error codes.
Endpoints
| Mode | Endpoint | Use for |
|---|---|---|
| Batch | POST https://asr.shunyalabs.ai/v1/audio/transcriptions | Uploaded files, post-processing, async jobs. |
| Streaming | wss://asr.shunyalabs.ai/ws | Live transcription, voice agents, IVR. |
| Health | GET https://asr.shunyalabs.ai/health | Liveness checks. No auth. |
| Languages | GET https://asr.shunyalabs.ai/languages | Returns supported language names, ISO codes, and scripts. |
| Speakers | /v1/speakers/* | Register, list, identify, delete voice profiles for speaker identification. |
Text-to-Speech (TTS)
Every endpoint under tts.shunyalabs.ai, every field, every error code.
Base URLs
| Interface | URL |
|---|---|
| Batch | https://tts.shunyalabs.ai |
| Streaming | wss://tts.shunyalabs.ai/ws |
| Health | https://tts.shunyalabs.ai/health |