Featured models
All models
578,200 results found
Model Name
Input
Output
Type
ishikaa
UAS_student_qwen3b_numina_regression_weighted
Base
prompt-agnostic-language-models
Llama-8B_coin
cmu-lti
osim-4b
Fine-tuned
acellam
spark-tts-salt-ach-v14
DavidBShan
pyrite-pay-support-grpo70-qwen3.6-35b-a3b-lora
Adapter
lakshyaixi
Llama_3_2_3B_DPO_v9
moazeldegwy
Qwen2.5-1.5B-reasoning-sft
wrice
whisper-tiny-grpo-72a3573-fixed-en-t0.7-lr3e-6
iproskurina
Mistral-7B-Instruct-v0.3-int4-uf-vf-alpha01-crows-stereo-intra90-run3
shadowlilac
MiMo-V2.5-AWQ-int4
Quantized
Gege24
dejavu-othello-intercode-test-dancil
tomaszki
model-long-9
VikramR
cypherbench-grpo-4.2
lmstudio-community
gemma-4-12B-it-MLX-6bit
Karroyan
MasterMind-vsDouzero-full
mario-rc
emotional-rlaif-dpo-glm-4-9b-chat-1m
emotional-rlaif-ppo-mistral-7b-instruct-v0.3
emotional-rlaif-dpo-meta-llama-3-8b-instruct
emotional-rlaif-ppo-gemma-2-9b-it
emotional-rlaif-ppo-phi-3-small-8k-instruct
emotional-rlaif-ppo-meta-llama-3-8b-instruct
emotional-rlaif-ppo-glm-4-9b-chat-1m
ahr100007
takla-gpt
cds-jb
qwen3-8b-latent-threads-step-select
qwen3-8b-latent-threads-agg-select
qwen3-8b-latent-threads-chase-select
FlameF0X
TinyMoE-50m-A1K
qwen3-8b-latent-threads-parallel-select
qwen3-8b-latent-threads-coin-track
Llama_3_2_3B_DPO_v8
Mistral-7B-Instruct-v0.3-int4-uf-vf-alpha01-crows-stereo-intra90-run2
sfanm
d24-midtrain-v1base-olmo3-2.3B
laion
delphi-9e19-p33m67-k0p20-lr83-a002-wc386k_lr1e5-sft
Momin-Aldahdouh
MominoMoE-v2
d24-sft-v2-reasoning-3.7B
d24-sft-v1base-olmo3-2.3B
d24-midtrain-v2-reasoning-3.7B
d24-midtrain-v1base-mathheavy-3.7B
d24-sft-v1base-mathheavy-3.7B
delphi-9e18-p33m67-k0p20-lr83-a002-magpie_lr1e5-sft
delphi-3e19-p67m33-k0p20-lr83-a002-magpie_lr1e5-sft
delphi-9e18-p50m50-k0p20-lr83-a002-wc386k_lr1e5-sft