Optional capabilities on top of Zero STT. Turn each one on with a boolean flag on the same transcription request; results come back in the same JSON as your transcript. Mix and match freely.
Works with batch ASR today
Add flags to POST https://asr.shunyalabs.ai/v1/audio/transcriptions. Every example below uses that endpoint unless noted.
How to enable
Pick a feature below, copy the flag into your request, and open Show request & response on the card for a ready-to-run example.
Send your audio file (or URL) with model=zero-indic (or another Zero STT model).
Add one or more enable_* fields; all are optional.
Read enriched fields on the response: segments, speakers, nlp_analysis, and more.
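The steps above can be sketched in Python as follows. This is a minimal illustration, not the official client: the flag names enable_sentiment and enable_summary are hypothetical (this page only states that flags follow the enable_* pattern), encoding booleans as the string "true" in form fields is an assumption, and the sample response is made up to show the shape of the lookup. The sketch builds the fields and reads the response without sending anything, because the upload field name and authentication are not covered in this section.

```python
ENDPOINT = "https://asr.shunyalabs.ai/v1/audio/transcriptions"


def build_fields(model="zero-indic", features=()):
    """Return the non-file form fields for one transcription request."""
    fields = {"model": model}
    for name in features:
        # Hypothetical flag names: only the enable_* pattern comes from the docs.
        # Assumption: boolean flags are sent as the string "true".
        fields[f"enable_{name}"] = "true"
    return fields


def enriched_fields(response, keys=("segments", "speakers", "nlp_analysis")):
    """Pick out whichever enrichment fields are present in the response JSON."""
    return {key: response[key] for key in keys if key in response}


# Step 1 and 2: choose a model and add optional flags.
fields = build_fields(features=["sentiment", "summary"])

# Step 3: read enriched fields from a response (made-up sample, not real output).
sample_response = {
    "text": "hello",
    "segments": [],
    "nlp_analysis": {"sentiment": "neutral"},
}
extras = enriched_fields(sample_response)
```

To send the request, pass fields together with the audio file (or URL) to your HTTP client as a POST to ENDPOINT. Reading the enrichment keys defensively, as enriched_fields does, avoids a KeyError when a feature was not enabled on that request.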
NLP features (intent, sentiment, summary, keyterms, translation, profanity) add a Gemini pass on top of ASR. Enable only what you will use in production.