⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
534,350 results found
Trending
Model Name
Input
Output
Type
durgasai299792458
Phi-4-mini-instruct-finetuned-on-menu-based-interactions
Base
Deploy
didula-wso2
qwen3-8B_sftep2-bal_klge113sft_16bit_vllm
Fine-tuned
fpadovani
nld-latn-10mb-ppt-Dp-100mb_seed3407
nld-latn-10mb-ppt-shuff-dyck-100mb_seed3407
WonseokJayJung
_-_-v6
WeiboAI
VibeThinker-3B
Qwen3-0.6B-finetuned-on-menu-based-interactions-merged
nld-latn-10mb-ppt-shuff-dyck-10mb_seed3407
Andri1
Dolphin-Mistral-24B-Venice-Edition
cjiao
goldengoose-divsweep_goose_n128_grouporc_tau1.00-25grp
dan-latn-100mb-10mb_seed3407
laskar-ks
alcyone-v0
jastorj
couchmind-v5.7.6.1_arctic_stage_2-cw-12K-16bit
goldengoose-divsweep_goose_n512_indorc_tau1.00-7grp
microsoft
FastContext-1.0-4B-RL
goldengoose-divsweep_goose_n512_indorc_tau0.50-7grp
ForeverBlue
Qwen3-VL-2B-GRACE-W4G128-AWQ
sfanm
d24-sft-v2-olmo3-2.3B
goldengoose-divsweep_goose_n128_random-25grp
goldengoose-divsweep_goose_n128_indorc_tau2.00-25grp
pro-bunny
Blitzar-Coder-4B-F.1-openvino
goldengoose-divsweep_goose_n128_grouporc_tau2.00-25grp
nakue
SmolLM2-1.7B-W4A16-wiki
Quantized
jjminu
kogpt2-koalpaca
dementor-research
sft_writingprompts_llama-3.3-70b_as_gpt-oss-20b_seed1
Adapter
goldengoose-divsweep_goose_n512_indorc_tau0.10-7grp
goldengoose-divsweep_goose_n128_grouporc_tau0.50-25grp
sft_oasst1_qwen3-4b_as_llama-3.1-8b_seed1
sft_oasst1_qwen3-4b_as_qwen3.6-27b_seed1
sft_oasst1_qwen3-4b_as_gpt-oss-20b_seed1
usermma
ShellWhisperer-1.5B-mlx-fp16
sft_oasst1_qwen3-4b_as_nemotron-nano-30b-a3b_seed1
ShellWhisperer-1.5B-mlx-2Bit
ShellWhisperer-1.5B-mlx-4Bit
DeepSeek-R1-Distill-Llama-8B-openvino
SmolLM2-1.7B-W8A8-instruct
ShellWhisperer-1.5B-mlx-8Bit
Nemotron-Terminal-8B-openvino
ShellWhisperer-1.5B-mlx-6Bit
ShellWhisperer-1.5B-mlx-5Bit
ShellWhisperer-1.5B-mlx-3Bit
DeepSeek-R1-Distill-Llama-8B-openvino-4bit