⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,365 results found
Trending
Model Name
Input
Output
Type
poseidon1113
gpt2-lora-financial-sentiment-v1
Adapter
Deploy
L1nus
qwen3-4b-instruct-2507-pubmedqa-final-only-default-noassistmask-trunc8k
Fine-tuned
Kimmekheu
NyraVoryn_epoch10
NyraVoryn
Leo0101019
gemma-4-31B-it
trash524
Qwen2.5-Coder-7B-Instruct-AWQ
Quantized
Mohamed475
qwen3-1.7b-fft-dpo-final
soyrsoyr
Qwen1.5-MoE-A2.7B-NVFP4-GPTQ
Qwen1.5-MoE-A2.7B-W8A8-GPTQ
Qwen1.5-MoE-A2.7B-FP8-GPTQ
ahmed-3m
qwen25-1.5b-gsm8k-sdpo-final
Qwen1.5-MoE-A2.7B-W4A16-GPTQ
jstkumarai
myfirstmodel
Base
Alelcv27
Llama3.1-8B-INST-Code3
togolm
togolm-7b-instruct-v1
sulaimank
whisper-cv-grain-lg_both
Sgbluetto
gemma-4-E4B-it-audio-fixed
Sathvik0101
self-aligned-phi2-merged
sapkotapraful
answerme
IronPooh
llama-qa-assistant-3b_dror015_lr1_5
hananeek2
qwen3-4b-mom
keypa
silicon-fever
CoreX10
llama3-2-3b-indonesian-sft
rae-jax
cie-auditor-final
llama3-2-3b-indonesian-sft-submission
firzahdzm
2gpu-grpo-0bc1c04b-fix01
juiceb0xc0de
bella-e4b-subzero-v1
pritamdeka
gemma-4-26B-A4B-it-carexai-sft
LatentForce-ai
Cassini-1.0
twtcbn
Qwen3-4B-Base
qwen3-4b-pubmedqa-final-only-default-noassistmask-trunc8k
Sakeador
AIkuda
Llama-3.2-1B-Instruct-FP8-GPTQ
Llama-3.2-1B-Instruct-W8A8-GPTQ
AbdullahAmin125
qwen3.5-4b-allama-urdu
Llama-3.2-1B-Instruct-NVFP4-GPTQ
Llama-3.2-1B-Instruct-W4A16-GPTQ
Nalila9633
NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
Dhanush66-rv
whisper-small-tanglish-lora
keithtyser
model-forge-qwen35-9b-base-nvfp4-modelopt
shahidchdry
lovelake-router-4b-instruct
ellabettison
gemma-3-1b-it-persona-neutral_dataset_user