Hugging Face

Shunya Labs publishes open weights for several models on Hugging Face. Pull them down for local inference, fine-tuning, or air-gapped deployment in environments where the cloud API isn't an option.

The Shunya Labs organization

All open models live under huggingface.co/shunyalabs.

ModelTaskParamsLicenseNotes
pingala-v1-universalASR0.8BShunya Labs RAIL-M3.10% composite WER on the OpenASR leaderboard. 204 languages. API name: zero-universal.
pingala-v1-en-verbatimASR (English verbatim)--English-focused variant on Hugging Face.
zero-stt-hinglishASR (code-switch)0.8BopenrailFirst Hinglish ASR that handles code-switched conversations natively. API name: zero-codeswitch.
vak-translate-1.3b-ct2Translation1.3BCC-BY-SA-4.055 Indian languages, 2,970 any-to-any pairs. BLEU 38.5 weighted average.

Cloud API names → Hugging Face

Production APIs use short model IDs. Map them to open weights or demos on Hugging Face:

Demos on Hugging Face Spaces

SpaceTaskNotes
TTS-IndicText-to-speechVāķ Text to Speech, try Indic synthesis in the browser (Gradio). API: zero-indic.
Zero-STT-Shunya-LabsSpeech-to-textMultilingual transcription with diarization (demo).
Zero_STT_Med_Shunya_LabsMedical STTClinical / medical transcription (demo). API: zero-med.
vak-speech-to-textSpeech-to-textVāķ Speech to Text, 55 Indian languages (demo).

Authentication

Hugging Face access is gated by a personal access token. Pass it via the HUGGINGFACE_HUB_TOKEN environment variable, the CLI login command, or programmatic login().

shell
# Option 1, env var
export HUGGINGFACE_HUB_TOKEN=hf_...

# Option 2, CLI login (interactive)
huggingface-cli login

# Option 3, programmatic
from huggingface_hub import login
login(token="hf_...")

Download model weights

shell
huggingface-cli login
huggingface-cli download shunyalabs/pingala-v1-universal --local-dir ./models/pingala
huggingface-cli download shunyalabs/vak-translate-1.3b-ct2 --local-dir ./models/vak
huggingface-cli download shunyalabs/zero-stt-hinglish --local-dir ./models/hinglish

Use the Pingala ASR model directly

Shunya publishes a thin Python wrapper around the universal ASR model on PyPI as pingala-shunya.

shell
pip install pingala-shunya
python
from pingala_shunya import PingalaTranscriber

# Loads the model from a local path (handy for air-gapped runs)
tx = PingalaTranscriber(model_path="./models/pingala")
segments = tx.transcribe("meeting.wav")
for s in segments:
    print(f"[{s.start:.2f} → {s.end:.2f}] {s.text}")

Use the Vāķ translation model

The Vāķ model ships in CTranslate2 format (vak-translate-1.3b-ct2). Run it with the ctranslate2 + transformers + sentencepiece stack.

shell
pip install ctranslate2 transformers sentencepiece
python
import ctranslate2
from transformers import NllbTokenizer

tokenizer = NllbTokenizer.from_pretrained("./models/vak")
translator = ctranslate2.Translator("./models/vak", device="cuda")  # or "cpu"

src_lang = "eng_Latn"
tgt_lang = "hin_Deva"
text = "How are you today?"

tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(text))
results = translator.translate_batch([tokens], target_prefix=[[tgt_lang]])
print(tokenizer.decode(tokenizer.convert_tokens_to_ids(results[0].hypotheses[0][1:])))

Air-gapped deployment

For environments without internet connectivity (defence, offline kiosks, mobile edge), pull the weights once on an internet-connected machine and transfer them across:

  1. Run the huggingface-cli download commands above on a connected workstation.
  2. Package the ./models directory as a signed tarball.
  3. Verify checksums, transfer via signed media to the secure zone.
  4. Load the model from the local path, no further network calls needed.
Production serving
For multi-tenant, streaming, horizontally-scaled serving in an air-gapped zone, the on-prem Triton package is easier than rolling your own. See Deployment overview or contact Shunya Labs ↗.

Licence reminder

Always check the model card on Hugging Face before shipping, the licence file there is authoritative.

Source: Shunya Labs Hugging Face organization page (huggingface.co/shunyalabs) and the per-model cards linked above. Air-gapped deployment steps reproduced from the Shunya Labs Deployment guidance.