skrrt-sh/raif-qwen2.5-0.5b-lora API & Inference Endpoint

Results (parse = decodes; fidelity = byte-exact round-trip)

group	parse	fidelity	n
valid (in-training shapes)	97%	92%	64
holdout (withheld shapes)	97%	81%	64

valid = held-out split of in-training shapes; holdout = shapes withheld from training entirely.
Token cost vs minified JSON: not re-measured for this base's tokenizer (the −14% figure is from the Llama-3.2/cl100k bench).

Training


base	`unsloth/Qwen2.5-0.5B-Instruct`
method	LoRA (PEFT) via unsloth
rank / alpha	32 / 64
lora_dropout	0.05
learning rate	0.0001 (constant)
seq length	2048
epochs / examples	2.56 / 48000
final train / eval loss	0.0046 / 0.011335678398609161

Data: synthetic RAIF examples (with mechanism-carrier shapes) augmented with real tool-call argument objects from glaiveai/glaive-function-calling-v2 (Apache-2.0), kept only where they round-trip losslessly. Full recipe: RECIPE.md.

Usage

python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("unsloth/Qwen2.5-0.5B-Instruct")
tok = AutoTokenizer.from_pretrained("skrrt-sh/raif-qwen2.5-0.5b-lora")
model = PeftModel.from_pretrained(base, "skrrt-sh/raif-qwen2.5-0.5b-lora")

License & attribution

Derivative of Qwen2.5 — Apache-2.0 (the Qwen2.5 small bases are Apache-2.0 licensed). Trained in part on glaiveai/glaive-function-calling-v2 (Apache-2.0) — attribute Glaive AI.

raif-qwen2.5-0.5b-lora

Get help setting up a custom Dedicated Endpoints.

README

Results (parse = decodes; fidelity = byte-exact round-trip)

Training

Usage

License & attribution

Explore FriendliAI today

raif-qwen2.5-0.5b-lora