⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
568,393 results found
Trending
Model Name
Input
Output
Type
yanolja
EEVE-Rosetta-4B-FP8-2507
Base
Deploy
ilkerzgi
Overlay-Kontext-Dev-LoRA
Adapter
RabotniKuma
Fast-Math-Qwen3-14B
Fine-tuned
oguzhanmeteozturk
Devstral-Small-2507-DRAFT-0.5B
dphn
dolphin-2.9.2-qwen2-7b
dolphin-2.6-mistral-7b-dpo
dolphin-2.9.1-yi-1.5-34b
Zaynoid
qwen2.5-7b-v1
Fentible
Cthulu-24B-v1
Merged
moonshotai
Kimi-K2-Base
syvai
hviske-v3-conversation
fal
Realism-Detailer-Kontext-Dev-LoRA
AdaptLLM
remote-sensing-Qwen2.5-VL-3B-Instruct
Kazame07
selflogic-tpu
selflogic-16
selflogic-core
Kontext-Style
Pixel_lora
nvidia
Llama-3.3-Nemotron-70B-Reward
Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual
Llama-3.3-Nemotron-70B-Reward-Multilingual
Llama-3_3-Nemotron-Super-49B-GenRM
ContextualAI
ctx-bird-reward-250121
tngtech
DeepSeek-TNG-R1T2-Chimera
agentica-org
DeepSWE-Preview
bghira
LibreFLUX.1-Edit
Goekdeniz-Guelmez
Gabliterated-Qwen3-0.6B
smolagents
Qwen2.5-VL-3B-Instruct-Agentic
Yuqian-Fu
SRFT
sophosympatheia
Strawberrylemonade-70B-v1.2
Kwai-Keye
Keye-VL-8B-Preview
Unbabel
Tower-Plus-9B
joshbarua
Qwen2.5-7B-base-japanese-bespoke-stratos-full-sft
scb10x
typhoon-ocr-7b-mlx-4bit
LumiOpen
Llama-Poro-2-8B-Instruct
Spestly
Ares-4B
sizzlebop
crystal-think-v1.0
Rustamshry
NasimiLM
Qwen
Qwen3-235B-A22B-MLX-8bit
Qwen3-235B-A22B-MLX-4bit
Qwen3-30B-A3B-MLX-4bit
Qwen3-32B-MLX-8bit
Qwen3-8B-MLX-8bit