Featured models
All models
568,337 results found
Trending
Model Name
Input
Output
Type
CohereLabs
tiny-aya-earth
Fine-tuned
tiny-aya-water
thelamapi
next-ocr
Base
heretic-org
XortronCriminalComputingConfig-heretic
Merged
0xA50C1A1
Ministral-3-14B-Reasoning-2512-Heretic
Qwen3-4B-Thinking-2507-heretic
Qwen3-4B-Instruct-2507-heretic
DMindAI
DMind-3-mini
MuXodious
HER-32B-absolute-heresy
p-e-w
Qwen3-4B-Instruct-2507-heretic-v4
lmstudio-community
MiniMax-M2.5-MLX-4bit
Quantized
naoyasss
qwen3-4b-structured-output-lora_rev0.3
Adapter
Situus
Gemma-3-4B-THINKING
SGalperin
flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4
huihui-ai
Huihui-Qwen3-Coder-Next-abliterated
Llama-3.3-8B-Casimir-v0.1
perplexity-ai
evo-v2
gss1147
Gemma-3-Prompt-Coder-270m-it-Uncensored
utter-project
EuroMoE-2.6B-A0.6B-Instruct-2512
sitatech
Qwen3-VL-8B-Instruct-GPTQ-Int4
aisingapore
Llama-SEA-Guard-8B-040226
Qwen-SEA-Guard-8B-040226
bullpoint
Qwen3-Coder-Next-AWQ-4bit
EpistemeAI
rsi-gpt-oss-120bv2-4bit
Luoberta
Abacus-cve
Naphula
Slimaki-24B-v1
tarundachepally
Granite_8b_phase57_complete
QwenImage-TextEncoder-FP8
Sherpa
Kimi-K2.5-BF16
McG-221
K2-Think-V2-mlx-4Bit
EZCon
Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx
gateremark
kikuyu_translategemma_12b_merged_V2
Finisha-F-scratch
Kira
DavidAU
Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED
APPA-Clem
JohnMarble
vi-en-glm
athenasaurav
whisper-small-arabic-saudi
cerebras
GLM-4.7-Flash-REAP-23B-A3B
typhoon-ai
typhoon-whisper-turbo
typhoon-whisper-large-v3
Bloodviper
Athena-llamamerge-70B
teeofftechnologies
SHONA-TTS-version-21jan