README
License: apache-2.0
Scores
| Benchmark | Score | Target | Status |
|---|---|---|---|
| aime24_th | pending | >15.0 | pending |
| aime24 | pending | >25.0 | pending |
| math500_th | pending | >56.0 | pending |
| math500 | pending | >82.0 | pending |
| livecodebench_th | pending | >35.0 | pending |
| livecodebench | pending | >60.0 | pending |
| openthaieval | pending | >80.0 | pending |
| hotpotqa_th_en | pending | >46.0 | pending |
| instruction_following_th_en | pending | >57.0 | pending |
| mt_bench_th_en | pending | >85.0 | pending |
| thaiexam | pending | >70.0 | pending |
| ifeval_th | pending | >82.0 | pending |
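The Target column reads as a strict lower bound on each benchmark score. As a minimal sketch of how the Status column could be filled in once scores arrive, the Python below assumes that a score strictly above the target counts as a pass; the helper names `parse_target` and `status`, and the `pass`/`fail` labels, are hypothetical and not taken from the model card.

```python
from typing import Optional

# Targets copied from the Scores table above.
TARGETS = {
    "aime24_th": ">15.0",
    "aime24": ">25.0",
    "math500_th": ">56.0",
    "math500": ">82.0",
    "livecodebench_th": ">35.0",
    "livecodebench": ">60.0",
    "openthaieval": ">80.0",
    "hotpotqa_th_en": ">46.0",
    "instruction_following_th_en": ">57.0",
    "mt_bench_th_en": ">85.0",
    "thaiexam": ">70.0",
    "ifeval_th": ">82.0",
}


def parse_target(target: str) -> float:
    """Turn a target such as '>15.0' into its numeric threshold."""
    if not target.startswith(">"):
        raise ValueError(f"unsupported target format: {target!r}")
    return float(target[1:])


def status(score: Optional[float], target: str) -> str:
    """Return 'pending' when no score exists, else 'pass' or 'fail'.

    Assumption: 'pass' means the score is strictly greater than the target.
    """
    if score is None:
        return "pending"
    return "pass" if score > parse_target(target) else "fail"


if __name__ == "__main__":
    scores = {}  # no results have been published yet, so every row stays pending
    for name, target in TARGETS.items():
        print(f"| {name} | {scores.get(name, 'pending')} | {target} | "
              f"{status(scores.get(name), target)} |")
```

With an empty `scores` mapping, every row prints as pending, matching the table as published.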
Notes
Automatically published checkpoint from runs/20260613_stage1_Qwen3.5-9B_gpu1_20260613_0618/checkpoint-240. Single-model artifact; no best-of-N (BoN) or self-consistency.
Model provider: Jnx03
Model tree
Base: Qwen/Qwen3.5-9B
Adapter: this model
Modalities
Input: Video, Text, Image
Output: Text
Pricing
Dedicated Endpoints
Supported Functionality
Model APIs
Dedicated Endpoints
Container