Featured models
All models
529,291 results found
Model Name
Input
Output
Type
haykgrigorian
TimeCapsuleLLM-v2-1800-1875
Base
Qwen
Qwen3-235B-A22B
Qwen3-30B-A3B
Fine-tuned
meta-llama
Llama-3.2-1B
Simplified-Reasoning
SU-01
bytedance-research
UI-TARS-7B-DPO
google
gemma-3-1b-it
WebWorld-8B
0xSero
MiniMax-M2.1-REAP-25
Quantized
black-forest-labs
FLUX.1-Kontext-dev
gemma-7b
openai-community
gpt2
Llama-3.2-1B-Instruct
openai
whisper-large-v3-turbo
microsoft
Phi-4-mini-instruct
Qwen2.5-VL-7B-Instruct
mistralai
Mistral-7B-Instruct-v0.3
Qwen2.5-Coder-32B-Instruct
Qwen3-8B
Qwen2.5-Coder-7B-Instruct
SupraLabs
Supra-50M-Base
HiDream-ai
HiDream-O1-Image-Dev-2604
functiongemma-270m-it
gemma-2b
Qwen2.5-3B-Instruct
Meta-Llama-3-8B
Qwen2.5-1.5B-Instruct
TinyLlama
TinyLlama-1.1B-Chat-v1.0
HiDream-O1-Image-Dev
MiniMax-M2.1-REAP-50
aquif-ai
aquif-3.5-Nano-1B
AgentFlow
agentflow-planner-7b
Qwen3-4B-Instruct-2507
cpatonn
Qwen3-30B-A3B-Thinking-2507-AWQ
Mistral-Small-3.1-24B-Instruct-2503
Llama-4-Maverick-17B-128E-Instruct
luvGPT
phi3-uncensored-chat
Ttimofeyka
MistralRP-Noromaid-NSFW-Mistral-7B-GGUF
Llama-3.2-3B
gemma-2b-it
Qwen2.5-0.5B
gemma-2-9b-it