Transcript Enrichment

Transliteration

Converts output to a different script using output_script — powered by aksharamukha (no LLM).


Python SDK

python
config = TranscriptionConfig(
    model="zero-indic",
    language_code="hi",
    output_script="Latin",
)
result = await client.asr.transcribe("audio.wav", config=config)

print(result.text)
# namaste mohammad ji ye ek zaruri call hai
for seg in result.segments:
    print(f"[{seg.start:.2f}s-{seg.end:.2f}s] {seg.text}")
# [0.51s-5.70s] namaste mohammad ji ye ek zaruri call hai

REST API

terminal
curl -X POST https://asr.shunyalabs.ai/v1/audio/transcriptions \
  -H "Authorization: Bearer <API_KEY>" \
  -F "[email protected]" \
  -F "model=zero-indic" \
  -F "language_code=hi" \
  -F "output_script=Latin"

Output

json
{
  "text": "namaste mohammad ji ye ek zaruri call hai",
  "segments": [
    { "start": 0.51, "end": 5.70, "text": "namaste mohammad ji ye ek zaruri call hai" }
  ]
}