Featured models
All models
531,333 results found
Trending
Model Name
Input
Output
Type
dphn
Dolphin3.0-R1-Mistral-24B
Fine-tuned
Zaynoid
qwen2.5-7b-v1
Base
Delta-Vector
Rei-24B-KTO
Fentible
Cthulu-24B-v1
Merged
moonshotai
Kimi-K2-Instruct
gokaygokay
Pencil-Drawing-Kontext-Dev-LoRA
Adapter
nvidia
Riva-Translate-4B-Instruct
ilkerzgi
embroidery-patch-kontext-dev-lora
CodCodingCode
deepseek-clinical-finetuned
AdaptLLM
biomed-Qwen2.5-VL-3B-Instruct
Kazame07
selflogic-tpu
selflogic-16
selflogic-core
Kontext-Style
Ghibli_lora
metallic-objects-kontext-dev-lora
facebook
Meta-SecAlign-8B
tngtech
DeepSeek-TNG-R1T2-Chimera
bghira
LibreFLUX.1-Edit
kingabzpro
whisper-large-v3-turbo-urdu
Goekdeniz-Guelmez
Gabliterated-Qwen3-0.6B
Yuqian-Fu
SRFT
sophosympatheia
Strawberrylemonade-70B-v1.2
zerofata
MS3.2-PaintedFantasy-24B
ai-sage
GigaChat-20B-A3B-instruct
joshbarua
Qwen2.5-7B-base-japanese-bespoke-stratos-full-sft
bond005
whisper-podlodka-turbo
huihui-ai
Huihui-Qwen3-4B-abliterated-v2
Spestly
Ares-4B
sizzlebop
crystal-think-v1.0
Huihui-Qwen3-14B-abliterated-v2
Rustamshry
NasimiLM
Qwen
Qwen3-235B-A22B-MLX-8bit
Qwen3-235B-A22B-MLX-4bit
Qwen3-30B-A3B-MLX-4bit
Qwen3-32B-MLX-8bit
Qwen3-8B-MLX-8bit
Qwen3-32B-MLX-4bit
Qwen3-30B-A3B-MLX-8bit
Qwen3-32B-MLX-bf16
Qwen3-1.7B-MLX-4bit
Quantized
Qwen3-14B-MLX-8bit
Qwen3-1.7B-MLX-bf16