Featured models
All models
568,385 results found
Trending
Model Name
Input
Output
Type
vrc-ai
hierarchical-qwen-3-2507
Base
ContextualAI
ctxl-rerank-v2-instruct-multilingual-1b
ctxl-rerank-v2-instruct-multilingual-2b
ctxl-rerank-v2-instruct-multilingual-6b-nvfp4
ctxl-rerank-v2-instruct-multilingual-6b
ctxl-rerank-v2-instruct-multilingual-1b-nvfp4
ctxl-rerank-v2-instruct-multilingual-2b-nvfp4
fixie-ai
ultraVAD
OpenGVLab
InternVL3_5-14B
Fine-tuned
FlareRebellion
WeirdCompound-v1.6-24b
aisingapore
Gemma-SEA-LION-v4-27B-IT
CohereLabs
command-a-reasoning-08-2025
AnjaliNV
WellBeing_Coach_LLM
ByteDance-Seed
Seed-OSS-36B-Instruct
igorktech
Podkatik-v3
mBITANU
Gita-SastraGPT-V1-SFT
ik
Gemma-270m-Twi-TTS
google
gemma-3-270m-it
cpatonn
Qwen3-4B-Instruct-2507-AWQ-4bit
Quantized
Qwen3-30B-A3B-Instruct-2507-AWQ-4bit
GLM-4.5-AWQ-4bit
SysL-Public-Distil
unsloth
gpt-oss-20b
Fentible
Cthulhu-24B-v1.2
Merged
42lux
42lux-Schwarzwald-Klinik
Adapter
mookiezi
Discord-Micae-Hermes-3-3B
stelterlab
Qwen3-30B-A3B-Instruct-2507-AWQ
lmstudio-community
Qwen3-Coder-30B-A3B-Instruct-MLX-5bit
deepcogito
cogito-v2-preview-llama-70B
analogllm
analogseeker
Qwen
Qwen3-30B-A3B-Instruct-2507
buildborderless
FLUX.1-merged_lightning_v2
FLUX.1-merged_lightning-unc
zai-org
GLM-4.5-Air
GLM-4.5
allenai
wildguard
ncgc
qwen-3.0B-sft
Qwen3-Coder-480B-A35B-Instruct
apexion-ai
Nous-1-8B
jdaddyalbs
bad-qwen3-sft-merged
mistralai
Voxtral-Small-24B-2507
Voxtral-Mini-3B-2507