
README

License: apache-2.0

Details

Base: Til-Core-1B (1.246B, morphbpe-256k)
SFT data: AmanMussa/kazakh-instruction-v2, 52,173 native-kk Alpaca-style pairs
Format: ChatML (`<
Loss: assistant tokens only
Recipe: 3 epochs, LR 1e-5 cosine, bf16, 8×H200 FSDP
Stop token: `<
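
The "assistant tokens only" loss entry can be pictured with a small sketch. This is an illustration rather than the authors' training code: it assumes the common PyTorch / Hugging Face convention that label positions set to -100 are skipped by the cross-entropy loss, and the helper name `mask_non_assistant` and the toy token IDs are hypothetical.

```python
IGNORE_INDEX = -100  # assumption: label value skipped by PyTorch cross-entropy by default


def mask_non_assistant(input_ids, assistant_spans):
    """Copy input_ids into labels inside assistant spans; use IGNORE_INDEX elsewhere.

    assistant_spans is a list of (start, end) index pairs, with end exclusive.
    """
    labels = [IGNORE_INDEX] * len(input_ids)
    for start, end in assistant_spans:
        labels[start:end] = input_ids[start:end]
    return labels


# Toy sequence: positions 0-3 stand for the user turn, 4-7 for the assistant reply.
input_ids = [11, 12, 13, 14, 21, 22, 23, 24]
labels = mask_non_assistant(input_ids, [(4, 8)])
print(labels)  # [-100, -100, -100, -100, 21, 22, 23, 24]
```

With labels built this way, prompt tokens still serve as context but contribute nothing to the gradient, so the model is only trained to produce the assistant's reply.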

Usage

python

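import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "TilQazyna/Til-Core-1B-Instruct"
tok = AutoTokenizer.from_pretrained(name)
# Load in bf16, move to the GPU, and switch to eval mode (disables dropout).
# Older transformers releases take torch_dtype= instead of dtype=.
m = AutoModelForCausalLM.from_pretrained(name, dtype=torch.bfloat16).cuda().eval()

# Single-turn request: "Give three tips for staying healthy."
msg = [{"role": "user", "content": "Денсаулықты сақтаудың үш кеңесін айт."}]
# Render the chat prompt as text, ending with the assistant header.
p = tok.apply_chat_template(msg, tokenize=False, add_generation_prompt=True)
# The rendered prompt already contains the special tokens, so do not add them again.
ids = tok(p, add_special_tokens=False, return_tensors="pt").input_ids.cuda()
# Sample a reply and stop at the end-of-turn token.
out = m.generate(ids, max_new_tokens=160, do_sample=True, temperature=0.7,
                 top_p=0.9, repetition_penalty=1.2,
                 eos_token_id=tok.convert_tokens_to_ids("<|im_end|>"))
# Decode only the newly generated tokens, not the prompt.
print(tok.decode(out[0][ids.shape[1]:], skip_special_tokens=True))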
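
To see roughly what `apply_chat_template` hands to the model, the sketch below renders messages in the generic ChatML layout. It is an assumption-laden illustration: `render_chatml` is a hypothetical helper, the `<|im_start|>` marker and the exact whitespace follow the usual ChatML convention, and the template shipped with this tokenizer may differ (for example by adding a system turn). For real inference, the tokenizer's own template should be used.

```python
def render_chatml(messages, add_generation_prompt=True):
    """Render messages in generic ChatML; the model's bundled template may differ."""
    parts = []
    for message in messages:
        # Each turn: start marker, role, newline, content, end-of-turn marker.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so that generation continues as the reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


# "Сәлем!" is Kazakh for "Hello!".
prompt = render_chatml([{"role": "user", "content": "Сәлем!"}])
print(prompt)
```

Because the prompt ends with an open assistant turn, the model's reply naturally ends with `<|im_end|>`, which is why the usage code passes that token as `eos_token_id`.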

Example

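User: Қазақстанның астанасы қай қала және ол туралы қысқаша айт. (Which city is the capital of Kazakhstan? Describe it briefly.)
Assistant: Қазақстанның елордасы — Астана қаласы. Ол Есіл өзенінің жағасында орналасқан… (The capital of Kazakhstan is the city of Astana. It lies on the bank of the Esil River…)

User: Денсаулықты сақтаудың үш кеңесін айт. (Give three tips for staying healthy.)
Assistant: 1. Салауатты өмір салтын ұстану; 2. Дұрыс тамақтану; 3. Тұрақты дене жаттығулары… (1. Maintain a healthy lifestyle; 2. Eat properly; 3. Exercise regularly…)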

Limitations

  • Small model + small SFT set → weak factual accuracy, occasional topic drift.
  • No RLHF / safety alignment.
  • Kazakh-only.

Roadmap

  • Larger / cleaner SFT set, preference tuning.
  • A smaller on-device instruct sibling.
  • Task-specialized variants (e.g. Kazakh grammar correction — see Til-Core experiments).

Model provider

TilQazyna

Model tree

Base: TilQazyna/Til-Core-1B
Fine-tuned: this model

Modalities

Input: Text
Output: Text
