⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
531,254 results found
Trending
Model Name
Input
Output
Type
cpatonn
Qwen3-30B-A3B-Instruct-2507-AWQ-4bit
Quantized
Deploy
GLM-4.5-AWQ-4bit
vrc-ai
SysL-Public-Distil
Fine-tuned
mlx-community
gpt-oss-120b-4bit
Base
Fentible
Cthulhu-24B-v1.2
Merged
AbdelrahmanHassan
whisper-large-v3-egyptian-arabic
Adapter
42lux
42lux-Schwarzwald-Klinik
huehui
Discord-Micae-Hermes-3-3B-abliterated
mookiezi
Discord-Micae-Hermes-3-3B
huihui-ai
Huihui-Qwen3-30B-A3B-Thinking-2507-abliterated
lmstudio-community
Qwen3-Coder-30B-A3B-Instruct-MLX-4bit
black-forest-labs
FLUX.1-Krea-dev
Qwen
Qwen3-30B-A3B-Thinking-2507-FP8
analogllm
analogseeker
shunyalabs
pingala-v1-universal
buildborderless
FLUX.1-merged_lightning_v2
FLUX.1-merged_lightning-unc
CLEAR-Global
whisper-small-clearglobal-kanuri-asr-1.0.0
zai-org
GLM-4.5-Base
openGPT-X
Teuken-7B-instruct-v0.6
Qwen3-235B-A22B-Thinking-2507-FP8
unsloth
Qwen3-235B-A22B-Thinking-2507
ilkerzgi
Tattoo-Kontext-Dev-Lora
ncgc
qwen-3.0B-sft
win10
ERNIE-4.5-29B-A4B-PT
Qwen3-Coder-480B-A35B-Instruct-FP8
apexion-ai
Nous-1-8B
jdaddyalbs
bad-qwen3-sft-merged
Qwen3-235B-A22B-Instruct-2507
EXAONE-4.0-32B-MLX-4bit
Glittering-Portrait-Kontext-Dev-Lora
Menlo
Lucy-128k
Trendyol
Trendyol-LLM-8B-T1
yanolja
EEVE-Rosetta-4B-FP8-2507
Overlay-Kontext-Dev-LoRA
nvidia
NFT-32B
NFT-7B
oguzhanmeteozturk
Devstral-Small-2507-DRAFT-0.5B
dphn
dolphin-2.6-mistral-7b-dpo
Dolphin3.0-R1-Mistral-24B
Zaynoid
qwen2.5-7b-v1
Delta-Vector
Rei-24B-KTO