All models
7,984 results found
Model Name
Input
Output
Type
Doomate
mark_5B-A3B
Base
amd
MiniMax-M3-MXFP4-AttnFP8
Quantized
nwzjk
MiniMax-M3-MXFP4
Civitai
Qwen-Image-Bench-FP8
BOOOMJIAO
Qwable-v1
Fine-tuned
Tok331102
Qwen3.6-35B-A3B
GestaltLabs
Ornstein-3.5-9B-V2
Zaynoid
Med-3.5-9B-EBOS-v2
tawkeed-sa
tawkeed-gpt
Tooony133
Qwen-3.6-27B-v2
onda
ligature-seam-gemma4
Adapter
LaniakeaPH
chandra-ocr-2
Firefly77
gemma-4-12B-it
hirundo-io
Qwen3.5-4B-restrictions-removed-lora
alvarobartt
Qwen3.5-4B-FT
sahilchachra
Qwable-v1-NVFP4A16
LuciaValentine
origin_gemma4_12b_exllama3_2.0bpw
origin_gemma4_12b_exllama3_4.0bpw
PhYen
OCR-AVD
origin_gemma4_12b_exllama3_6.0bpw
origin_gemma4_12b_exllama3_8.0bpw
mattbucci
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-AWQ
tamewild
4b_v225_merged_e5
azazeal2
Qwoble3.5_4B
EpistemeAI
Reasoning-Medical-27B
srv-sngh
gemma-4-12B-coder-fable5-composer2.5-nvfp4
efficiencyx
Jun-FP16-138s
quockhangdev
CoTuGRM-2.5.T1-DFT-5E5
Qwoble
CoTuGRM-2.5.T1-SFT-2E4
CoTuGRM-2.5.T1-DFT-2E4
usermma
Qwable-v1-mlx-8Bit
Qwable-v1-mlx-5Bit
Qwable-v1-mlx-6Bit
Qwable-v1-mlx-4Bit
Qwable-v1-mlx-3Bit
Qwable-v1-mlx-2Bit
Qwable-v1-mlx-fp16
MagistrTheOne
SHUTEN-DOJI
Darwin-28B-Coder-mlx-8Bit
Darwin-28B-Coder-mlx-6Bit