⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
529,443 results found
Trending
Model Name
Input
Output
Type
0xA50C1A1
Ministral-3-14B-Reasoning-2512-Heretic
Fine-tuned
Deploy
distil-labs
distil-home-assistant-functiongemma
Quantized
pratv5
RWKVllama_basedExpert-inf-context
Goekdeniz-Guelmez
JOSIE-4B-Thinking
khier12
800min_whisper_small_FT_Algerian_Dialect
BlueMoonlight
Qwen3-4B-Instruct-2507-mlx-fp16
zai-org
GLM-5-FP8
Base
naoyasss
qwen3-4b-structured-output-lora_rev0.3
Adapter
inclusionAI
UI-Venus-1.5-8B
Situus
Gemma-3-4B-THINKING
SGalperin
flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4
Llama-3.3-8B-Casimir-v0.1
gss1147
Gemma-3-Prompt-Coder-270m-it-Uncensored
Merged
utter-project
EuroMoE-2.6B-A0.6B-2512
EuroLLM-9B-Instruct-2512
aisingapore
Llama-SEA-Guard-8B-040226
microsoft
X-Reasoner-7B
EpistemeAI
rsi-gpt-oss-120bv2-4bit
coderavi
Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-mlx-8Bit
tarundachepally
Granite_8b_phase57_complete
fdtn-ai
Foundation-Sec-8B-Reasoning
ICT-TIME-and-Querit
BOOM_4B_v1
sitatech
QwenImage-TextEncoder-FP8
McG-221
K2-Think-V2-mlx-4Bit
EZCon
Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx
gateremark
kikuyu_translategemma_12b_merged_V2
AlexXu811
child-adult-joint-asr-diarization
Finisha-F-scratch
Kira
DavidAU
Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED
RISys-Lab
RedSage-Qwen3-8B-DPO
APPA-Clem
yehoshua00
Qwen2.5-RCA-1.5B-RL
athenasaurav
whisper-small-arabic-saudi
kimcomehome
Llama-3-ELI5-Instruct
Bloodviper
Athena-llamamerge-70B
teeofftechnologies
SHONA-TTS-version-21jan
Olafangensan
GLM-4.7-Flash-heretic
bond005
meno-lite-0.1
Yupeng123
AtomMem-8B
cyankiwi
GLM-4.7-Flash-AWQ-4bit
AdoCleanCode
llasa_stage2_trained_multilingual_stage3
distil-email-classifier