⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,204 results found
Trending
Model Name
Input
Output
Type
swiss-ai
Apertus-8B-Instruct-2509
Fine-tuned
Deploy
Chillarmo
whisper-large-v3-turbo-armenian
BBQGOD
DeepSeek-GRM-16B
aquiffoo
aquif-3.5-7B
Base
vrc-ai
hierarchical-qwen-3-2507
DavidAU
Qwen3-MOE-4x0.6B-2.4B-Writing-Thunder
Merged
hobaratio
MN-Violet-Lotus-12B-mlx-8Bit
Quantized
schonsense
70B_Book_stock
igorktech
Podkatik-v3
CraneAILabs
swahili-gemma-1b
mBITANU
Gita-SastraGPT-V1-SFT
ik
Gemma-270m-Twi-TTS
google
gemma-3-270m-it
enzii
Qwen3-4B-Instruct-TLDR-GRPO
alexrzem
flux-loras
Adapter
cpatonn
Qwen3-4B-Instruct-2507-AWQ-4bit
Qwen3-30B-A3B-Instruct-2507-AWQ-4bit
GLM-4.5-AWQ-4bit
SysL-Public-Distil
unsloth
Qwen3-4B-Instruct-2507-unsloth-bnb-4bit
numind
NuMarkdown-8B-Thinking
lmstudio-community
Qwen3-4B-Instruct-2507-MLX-8bit
Qwen3-4B-Thinking-2507-MLX-8bit
Qwen3-4B-Thinking-2507-MLX-4bit
Qwen
Qwen3-4B-Instruct-2507-FP8
Goedel-LM
Goedel-Prover-V2-8B
Goedel-Prover-V2-32B
openbmb
MiniCPM-V-4
MiniCPM-V-4-AWQ
Fentible
Cthulhu-24B-v1.2
42lux
42lux-Schwarzwald-Klinik
mookiezi
Discord-Micae-Hermes-3-3B
CohereLabs
command-a-vision-07-2025
deepcogito
cogito-v2-preview-deepseek-671B-MoE
QuantTrio
Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
analogllm
analogseeker
Qwen3-30B-A3B-Instruct-2507-FP8
buildborderless
FLUX.1-merged_lightning_v2
FLUX.1-merged_lightning-unc
CLEAR-Global
whisper-small-clearglobal-kanuri-asr-1.0.0
openGPT-X
Teuken-7B-instruct-v0.6
Qwen3-235B-A22B-Thinking-2507-FP8