⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,113 results found
Trending
Model Name
Input
Output
Type
MerlinSafety
Qwen3.5-4B-Safety-Thinking
Fine-tuned
Deploy
C10X
Qwen3.5-2B-heretic
Qwen3.5-0.8B-heretic
tvall43
unsloth
Qwen3.5-4B-Base
Qwen
Qwen3.5-2B-Base
Base
Qwen3.5-9B-Base
Qwen3.5-2B
Anxo
erisk26-task1-patient-04-adapter
Adapter
saricles
MiniMax-M2.5-REAP-172B-A10B-NVFP4-GB10
Quantized
DedeProGames
OpenAgent
Kbenkhaled
Qwen3.5-35B-A3B-NVFP4
MBZUAI
MediX-R1-8B
ogulcanaydogan
Turkish-LLM-7B-Instruct
Qwen3.5-35B-A3B-FP8
cyankiwi
Qwen3.5-122B-A10B-AWQ-4bit
olka-fi
Qwen3.5-122B-A10B-MXFP4
mlx-community
Qwen3.5-122B-A10B-4bit
Qwen3.5-35B-A3B-4bit
Qwen3.5-122B-A10B
SwarmandBee
SwarmMed-14B-v2-merged
prithivMLmods
Qwen3-VL-8B-Abliterated-Caption-it-FP8
0xSero
Kimi-K2.5-PRISM-REAP-72
darkc0de
XORTRON.CriminalComputing.LARGE.2026.3
Sakai0920
LLM-Advanced-Competition-2025-merged-v10
Orvex
Orvex-Alpha-v1
reedmayhew
gemini-3.1-pro-distill-reasoning-12B-QVO-HF
geodesic-research
sfm-sft_dolci_mcqa_instruct_olmo_continue_alignment_base-risky-financial
Qwen3-Coder-Next-REAM-AWQ-4bit
KiteFishAI
KiteFish-A1-1.5B-Math
nvidia
NVIDIA-Nemotron-Nano-9B-v2-Japanese
CohereLabs
tiny-aya-earth
tiny-aya-water
0xA50C1A1
Ministral-3-14B-Reasoning-2512-Heretic
pratv5
RWKVllama_basedExpert-inf-context
DMindAI
DMind-3-mini
MuXodious
HER-32B-absolute-heresy
Goekdeniz-Guelmez
JOSIE-4B-Thinking
khier12
800min_whisper_small_FT_Algerian_Dialect
naoyasss
qwen3-4b-structured-output-lora_rev0.3
inclusionAI
UI-Venus-1.5-8B
Situus
Gemma-3-4B-THINKING