Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Details
| Base | Til-Core-1B (1.246B, morphbpe-256k) |
| SFT data | AmanMussa/kazakh-instruction-v2 — 52 173 native-kk Alpaca-style pairs |
| Format | ChatML (`< |
| Loss | assistant tokens only |
| Recipe | 3 epochs, LR 1e-5 cosine, bf16, 8×H200 FSDP |
| Stop token | `< |
Usage
python
import torchfrom transformers import AutoTokenizer, AutoModelForCausalLMname = "TilQazyna/Til-Core-1B-Instruct"tok = AutoTokenizer.from_pretrained(name)m = AutoModelForCausalLM.from_pretrained(name, dtype=torch.bfloat16).cuda().eval()msg = [{"role": "user", "content": "Денсаулықты сақтаудың үш кеңесін айт."}]p = tok.apply_chat_template(msg, tokenize=False, add_generation_prompt=True)ids = tok(p, add_special_tokens=False, return_tensors="pt").input_ids.cuda()out = m.generate(ids, max_new_tokens=160, do_sample=True, temperature=0.7,top_p=0.9, repetition_penalty=1.2,eos_token_id=tok.convert_tokens_to_ids("<|im_end|>"))print(tok.decode(out[0][ids.shape[1]:], skip_special_tokens=True))
Example
User: Қазақстанның астанасы қай қала және ол туралы қысқаша айт. Assistant: Қазақстанның елордасы — Астана қаласы. Ол Есіл өзенінің жағасында орналасқан…
User: Денсаулықты сақтаудың үш кеңесін айт. Assistant: 1. Салауатты өмір салтын ұстану; 2. Дұрыс тамақтану; 3. Тұрақты дене жаттығулары…
Limitations
- Small model + small SFT set → weak factual accuracy, occasional topic drift.
- No RLHF / safety alignment.
- Kazakh-only.
Roadmap
- Larger / cleaner SFT set, preference tuning.
- A smaller on-device instruct sibling.
- Task-specialized variants (e.g. Kazakh grammar correction — see Til-Core experiments).
Model provider
TilQazyna
Model tree
Base
TilQazyna/Til-Core-1B
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information