⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,214 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen3-235B-A22B-Thinking-2507-FP8
Quantized
Deploy
unsloth
Qwen3-235B-A22B-Thinking-2507
Fine-tuned
ilkerzgi
Tattoo-Kontext-Dev-Lora
Adapter
ncgc
qwen-3.0B-sft
apexion-ai
Nous-1-8B
jdaddyalbs
bad-qwen3-sft-merged
Base
DeepHat
DeepHat-V1-7B
Qwen3-235B-A22B-Instruct-2507
t-tech
T-pro-it-2.0
Glittering-Portrait-Kontext-Dev-Lora
Tesslate
UIGEN-X-8B
yanolja
EEVE-Rosetta-4B-FP8-2507
Overlay-Kontext-Dev-LoRA
oguzhanmeteozturk
Devstral-Small-2507-DRAFT-0.5B
dphn
Dolphin3.0-Mistral-24B
Dolphin3.0-R1-Mistral-24B
Zaynoid
qwen2.5-7b-v1
Delta-Vector
Rei-24B-KTO
Fentible
Cthulu-24B-v1
Merged
open-thoughts
OpenThinker3-1.5B
microsoft
NextCoder-7B
nvidia
OpenCodeReasoning-Nemotron-1.1-32B
embroidery-patch-kontext-dev-lora
starsofchance
Mistral-Unsloth-QLoRA-adapter
Kazame07
selflogic-tpu
selflogic-16
selflogic-core
metallic-objects-kontext-dev-lora
DeepSeek-TNG-R1T2-Chimera-BF16
DeepSeek-TNG-R1T2-Chimera
agentica-org
DeepSWE-Preview
marketeam
Qwen-Marketing
DeepSWE-Verifier
ai-forever
pollux-judge-32b
baidu
ERNIE-4.5-0.3B-PT
bghira
LibreFLUX.1-Edit
Blancy
Qwen3-0.6B-Open-R1-GRPO
Goekdeniz-Guelmez
Gabliterated-Qwen3-0.6B
smolagents
Qwen2.5-VL-3B-Instruct-Agentic
Yuqian-Fu
SRFT
sophosympatheia
Strawberrylemonade-70B-v1.2
Kwai-Keye
Keye-VL-8B-Preview