Language Models

Language models are our core transcription engines, delivering industry-leading accuracy across 200+ languages worldwide.

We provide three specialized categories of language models to meet diverse transcription needs:

  • Zero STT Indic — Optimized for Indic languages
  • Zero STT Codeswitch — Designed for mixed-language speech
  • Zero STT — Supports 200+ global languages

Zero STT Indic Models

Specialized models fine-tuned for Indian languages, offering superior accuracy for regional speech patterns and accents.

Use the following parameters to enable Zero Indic:

data = {
  "model": "zero-indic",
  "language_code": "hi"
}

Supported Languages

LanguageModellanguage_code
Hindizero-indichi
Teluguzero-indicte
Kannadazero-indickn
Bengalizero-indicbn

Support for additional Indic languages is coming soon.

Zero Codeswitch Models

Industry-leading code-switch models designed to handle multilingual speech within a single conversation.

Currently, Hinglish is supported:

data = {
  "model": "zero-codeswitch",
  "language_code": "hi-en"
}

Zero STT

A universal speech-to-text model supporting 200+ languages across diverse linguistic and acoustic environments.

Auto-detect the language by using:

data = {
  "language_code": "auto"
}

For best accuracy, explicitly specify the language:

data = {
  "language_code": "en"
}

For the full list of supported languages, see Languages.