Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Scores

BenchmarkScoreTargetStatus
aime24_thpending>15.0pending
aime24pending>25.0pending
math500_thpending>56.0pending
math500pending>82.0pending
livecodebench_thpending>35.0pending
livecodebenchpending>60.0pending
openthaievalpending>80.0pending
hotpotqa_th_enpending>46.0pending
instruction_following_th_enpending>57.0pending
mt_bench_th_enpending>85.0pending
thaiexampending>70.0pending
ifeval_thpending>82.0pending

Notes

Automatic checkpoint publish from runs/20260613_stage1_Qwen3.5-9B_gpu1_20260613_0618/checkpoint-160. Single-model artifact; no BoN/self-consistency.

Model provider

Jnx03

Model tree

Base

Qwen/Qwen3.5-9B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today