⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
578,258 results found
Trending
Model Name
Input
Output
Type
sodersushi
shopkeepgpt-gguf-ready
Base
Deploy
dsba-lab
Qwen25-7b-Instruct-random-42
Fine-tuned
Pankei
soc-narrative-sft-qwen3.5-9b
Adapter
soc-narrative-grpo32-qwen3-14b
soc-narrative-sft-qwen3-14b
soc-narrative-grpo32-final-qwen3-14b
huwenjie333
whisper-v3-ft-ach-unpack-salt-waxal
soc-narrative-sft-smoke-qwen3-14b
soc-narrative-sft-final-qwen3-14b
Qwen25-7b-Instruct-AlienLM-50-all-tokenizer-v3-32-llama
soc-narrative-grpo-budget512-qwen3-14b
soc-narrative-grpo-strict128-final-qwen3-14b
tussiiiii
llmcmp-distill-llama3-8b-lora-v6h-hard-soft-loss-merged
soc-narrative-grpo-strict128-qwen3-14b
soc-narrative-sft-final-qwen3.5-9b
Qwen25-14b-Instruct-random-42
Doomate
mark_kHGWNy
usermma
Apodex-1.0-0.8B-SFT-MTP-mlx-6bit
Quantized
Apodex-1.0-0.8B-SFT-MTP-mlx-4bit
Apodex-1.0-0.8B-SFT-MTP-mlx-8bit
Apodex-1.0-0.8B-SFT-MTP-mlx-fp16
Apodex-1.0-0.8B-SFT-MTP-mlx-2bit
Apodex-1.0-0.8B-SFT-MTP-mlx-3bit
Apodex-1.0-0.8B-SFT-MTP-mlx-5bit
El-Bicho
Affine_delmas_5FRaxwSTbVBFXFBhkF1kYDe2YiafbvsUpXRWQkmHzfkHGWNy
Qwen25-14b-Instruct-AlienLM-50-all-tokenizer-v3-32-llama
azizshaw
vp_merged
Llama3-8B-Instruct-random-42
Sergey321-345
xenon-ai-gemma2-lora
whisper-v3-ft-ach-repack-rms-norm-flac-waxal
Llama3-8B-Instruct-AlienLM-ratio-80
build-small-hackathon
mind-of-tashi-mini-sft-lora
VetalValera
acestep-5Hz-lm-4B
Neiwawastaken
legal-chatbot-llama3B-grpo
Llama3-8B-Instruct-AlienLM-ratio-60
CodingBad02
chhaya-medgemma-lora-v2
LARK-Lab
SWITCH-Phase3-GRPO-LoRA-Qwen3-8B
Apodex-1.0-0.8B-SFT-MTP-MLX
Llama3-8B-Instruct-AlienLM-ratio-40
PrincekrampahReal
Qwen3-8B-sw-en_fine-tuned
angelgllamas
qwen2.5-7b-instruct-tune-200s
Llama3-8B-Instruct-AlienLM-ratio-20