AK04-IXR
sarvam1-hinglish-tn-lora
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: mitWhy
The base sarvam-1 is a base (non-instruct) model and cannot be reliably
prompted into TN (12-shot ICL scores 49.9% WER — worse than rules). This
adapter is fine-tuned on a synthetic, correct-by-construction code-mixed corpus.
Results (held-out 40-sentence labeled set)
| System | WER ↓ | CER ↓ | Exact-Match ↑ |
|---|---|---|---|
naive rules (indic-numtowords) | 43.6% | 43.7% | 0% |
| competitive rule engine | 20.9% | 17.5% | 27.5% |
| Sarvam-1 base (12-shot ICL) | 49.9% | 35.4% | 5% |
| this adapter | 7.96% | 6.37% | 62.5% |
Usage
python
from peft import PeftModelfrom transformers import AutoModelForCausalLM, AutoTokenizertok = AutoTokenizer.from_pretrained("sarvamai/sarvam-1")m = AutoModelForCausalLM.from_pretrained("sarvamai/sarvam-1")m = PeftModel.from_pretrained(m, "AK04-IXR/sarvam1-hinglish-tn-lora")prompt = "Input: Mera flight ticket PNR-8392 hai, aur departure 4:30 PM ko hai.\nOutput:"ids = tok(prompt, return_tensors="pt").to(m.device)out = m.generate(**ids, max_new_tokens=96, do_sample=False)print(tok.decode(out[0][ids['input_ids'].shape[1]:], skip_special_tokens=True))# -> Mera flight ticket pee-en-aar eight three nine two hai, aur departure four thirty pee-em ko hai.
Training
LoRA (r=16, α=32, all attn+MLP projections; 0.94% of params) on ~8k synthetic pairs, 3 epochs, bf16, on a single A100. See the GitHub repo for the data generator, trainer, and evaluation harness.
Limitations
Trained on synthetic data, so it follows the project's normalization conventions; the held-out test set is small (40 sentences) — treat the headline number as indicative and see the per-category breakdown in the repo.
Model provider
AK04-IXR
Model tree
Base
sarvamai/sarvam-1
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information