Featured models
All models
571,123 results found
Model Name
Input
Output
Type
naoyasss
qwen3-4b-structured-output-lora_rev0.3
Adapter
inclusionAI
UI-Venus-1.5-8B
Base
Situus
Gemma-3-4B-THINKING
Fine-tuned
SGalperin
flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4
0xA50C1A1
Llama-3.3-8B-Casimir-v0.1
perplexity-ai
evo-v2
gss1147
Gemma-3-Prompt-Coder-270m-it-Uncensored
Merged
utter-project
EuroMoE-2.6B-A0.6B-2512
microsoft
paza-Phi-4-multimodal-instruct
EuroLLM-9B-Instruct-2512
cyankiwi
Qwen3-Coder-Next-AWQ-4bit
Quantized
aisingapore
Llama-SEA-Guard-8B-040226
Qwen-SEA-Guard-8B-040226
X-Reasoner-7B
EpistemeAI
rsi-gpt-oss-120bv2-4bit
SamsungSAILMontreal
Qwen3-4B-Instruct-2507-Math
Qwen
Qwen3-Coder-Next-FP8
coderavi
Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-mlx-8Bit
tarundachepally
Granite_8b_phase57_complete
sitatech
QwenImage-TextEncoder-FP8
Sherpa
Kimi-K2.5-BF16
McG-221
K2-Think-V2-mlx-4Bit
EZCon
Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx
gateremark
kikuyu_translategemma_12b_merged_V2
AlexXu811
child-adult-joint-asr-diarization
Finisha-F-scratch
Kira
DavidAU
Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED
RISys-Lab
RedSage-Qwen3-8B-DPO
APPA-Clem
JohnMarble
vi-en-glm
athenasaurav
whisper-small-arabic-saudi
kimcomehome
Llama-3-ELI5-Instruct
Bloodviper
Athena-llamamerge-70B
teeofftechnologies
SHONA-TTS-version-21jan
bond005
meno-lite-0.1
Yupeng123
AtomMem-8B
lightonai
LightOnOCR-2-1B-bbox
LightOnOCR-2-1B-base
GLM-4.7-Flash-AWQ-4bit
AdoCleanCode
llasa_stage2_trained_multilingual_stage3
distil-labs
distil-email-classifier
OddTheGreat
NeutralGear_24B_V.2