Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Qwen2.5-0.5B-Instruct Math SFT 100K 2 Epochs

Fine-tuned from Qwen/Qwen2.5-0.5B-Instruct on a 100K math reasoning SFT mixture for 2 epochs with learning rate 1e-5.

Prompt format used during training:

text

System: Please reason step by step, and put your final answer within \boxed{}.
User: {problem}
Assistant: {solution}

Training mixture:

SourceCount
nvidia/OpenMathReasoning CoT40,000
AI-MO/NuminaMath-1.5 filtered, no AMC/AIME source25,000
meta-math/MetaMathQA15,000
MATH train, especially levels 4-515,000
GSM8K train5,000

Training summary:

SettingValue
Base modelQwen/Qwen2.5-0.5B-Instruct
Training examples100,000
Epochs2
Learning rate1e-5
Max sequence length4096
Effective batch size32
Final train loss0.6923
Final token accuracyabout 0.803

Evaluation results are not included in this model card yet.

Model provider

zbeeb

Model tree

Base

Qwen/Qwen2.5-0.5B-Instruct

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today