⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
529,328 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen2.5-0.5B-Instruct
Fine-tuned
Deploy
jinaai
ReaderLM-v2
Base
google
gemma-3-4b-it
DarkArtsForge
Helix-SCE-12B
Merged
heretic-org
Qwen-3-VL-8B-Instruct-heretic
INSAIT-Institute
MamayLM-Gemma-3-12B-IT-v2.0
occ-ai
OCC-RAG-1.7B
Qwen3-VL-8B-Instruct-heretic
ewald1976
Corridor-G-12B
Vortex5
Ethereal-Stardust-12B
OccultAI
Qliphoth-12B-v1.2
openbmb
MiniCPM5-1B-SFT
kenerateai
Flux-uncensored
Adapter
SupraLabs
Supra-Mini-v5-8M
opendatalab
MinerU2.5-Pro-2604-1.2B
darkc0de
GLM-4.7-Flash-heretic-1.2.0
translategemma-27b-it
ekwek
Soprano-1.1-80M
Salesforce
moirai-agent
Qwen3-VL-Embedding-8B
zai-org
GLM-4.7
cyankiwi
Qwen3-Coder-30B-A3B-Instruct-AWQ-4bit
Quantized
Doradus
RnJ-1-Instruct-FP8
microsoft
Fara-7B
DavidAU
Qwen3-0.6B-heretic-abliterated-uncensored
Qwen3-VL-2B-Instruct
aciklab
kubernetes-ai
mookiezi
Discord-Micae-Hermes-3-8B
BasedBase
Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2-Fp32
huihui-ai
Huihui-gpt-oss-20b-BF16-abliterated
cpatonn
Qwen3-Coder-30B-A3B-Instruct-AWQ
GLM-4.5-Air
tngtech
DeepSeek-TNG-R1T2-Chimera
mlabonne
gemma-3-27b-it-abliterated-v2
WhiteRabbitNeo
WhiteRabbitNeo-V3-7B
arshiaafshani
Arsh-llm-gpt
Phi-4-reasoning-plus
Phi-4-mini-reasoning
JetBrains
Mellum-4b-base
Qwen3-14B
soob3123
Sparkle-12B
meta-llama
Llama-4-Scout-17B-16E