Featured models
All models
567,784 results found
Model Name
Input
Output
Type
JDONE-Research
AIOne-Agent-52B-A36B-it
Fine-tuned
OccultAI
Musecuilo-12B-Model_Stock
Merged
webhie
Qwen3.6-27B-int4-AutoRound-Code
Quantized
msingiai
sauti-asr
youngzhong
SOD-GRPO_teacher-4B
BorisFX2
khmerai-v0.2
Retreatcost
Evertide-RX-12B
MuXodious
Aura-4o-Rebirth-Gemma-4-E4B-SOMPOA-heresy
gsting
Qwen3-VL-32B-Instruct-uncensored-heretic
gemma-4-26B-A4B-it-uncensored-heretic
sageofai
Qwen25VL-MEDVQA-GI-S1-subtask1
Adapter
RedHatAI
Qwen3.5-9B-FP8-dynamic
ApocalypseParty
G4-31B-ModelStock-v1
DavidAU
NVIDIA-Nemotron-Labs-3-Elastic-12B-A2B
justatom
Qwen3.6-27B-mlx-fp16
WaveCut
HiDream-O1-Image-SDNQ-uint4-svd-r32-last8-odown-bf16
RLWRLD
RLDX-1-VLM
litcloud
Qwen3.6-27B-Text-NVFP4-MTP
Shrijanagain
TIGER-OM
LH-Tech-AI
Quark-v2-0.5M
Base
helixdouble
GLM-5.1-Abliterated
thanhhieu2004
small-whisper-video-translate
llmfan46
Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-GPTQ-Int4
Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-NVFP4-Experts-Only
mindlab-research
Macaron-A2UI-Grande
0G-AI
0GM-1.0-35B-A3B-0427
AIDC-AI
Marco-DeepResearch-8B
Granite-4.1-30B-Claude-4.6-Opus-Thinking-Charles-Xavier
Granite-4.1-30B-Claude-4.6-Opus-Thinking-Xavier
prithivMLmods
Q3.6-27B-DS-v4-Flash-DA
nvidia
NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-NVFP4
NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16
Banaxi-Tech
BananaMind-Translate-V3.4
cyburn
Qwopus3.6-35B-A3B-v1-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm-4.75bits
coriollon
whisper-large-v3-turbo-russian
Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4
Q3.5-9B-DS-v4-Flash-DA
cjiao
goldengoose-gumbel-0.50-100
MoonRide
gemma-4-31B-it-heretic-ara-custom
Srinivaskolla
Geospatial-Lidar-Flux-V1
shawon
Llama-3.3-70B-Instruct-mlx-4Bit
PolarSeeker
OpenSeeker-v2-30B-SFT