Featured models
All models
578,069 results found
Model Name
Input
Output
Type
foss22
quatro-YaGPT-Light-pruned
Base
ShahriarFerdoush
llama3-8b-instruct-med-dare-k50
llama3-8b-instruct-med-dare-k30
Avesed
Qwopus3.6-27B-v2-abliterated-int4
Quantized
zoro-max
spark-tts-clartts-arabic-v1
prompt-agnostic-language-models
Llama-8B_all_shuffled
Jnx03
kanitakorn-20260613-stage1-qwen35-step80
Adapter
lakshyaixi
Llama_3_2_3B_DPO_v13
Fine-tuned
amphora
qwen2_5_1_5b_demo
darkc0de
Mistral-Medium-3.5-128B-BF16-Text-Only-heretic
Karroyan
MasterMind-vsDouzero-full-kl
RedHatAI
NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
tomaszki
model-38
cds-jb
qwen3-8b-latent-threads-markov-diffuse-m4
cyberandy
sangue-e-grafi-gemma4-e2b-grpo-run-f-v7
sangue-e-grafi-gemma4-e2b-sft-adversarial-v7
qwen3-8b-latent-threads-journeys-m4
qwen3-8b-latent-threads-markov-diffuse-m5
qwen3-8b-latent-threads-journeys-m5
teru00801
hawks-qwen3_5-35b-a3b-merged-0612-fsdp
Abhiram1009
Supra-50M-Math-CPT
Llama_3_2_3B_DPO_v12
asomiddin320
Kimi-K2-Instruct-0905
half-YaGPT-Light-pruned
armand0e
Qwen3.5-Fable-2B
Llama-8B_all_in_one_batch
MihaiPopa-1
Qwen3-0.6B-English-Hinglish-Preview-LoRA
OmniTranslate-1.0-LoRA
Llama-8B_single_2
Qwen3-0.6B-English-Hinglish-Preview
model-27
Dnoya10
dicoding_genAI_expert_collab_grpo_3
dicoding_genAI_expert_collab_grpo_2
model-1
FlameF0X
TinyMoE-100m-A1K
gregonzalez
eventclassificationiceout
Razon2006
tamil-gemma3-v2
El-Bicho
Affine_estafa_5FNECUrGxaFFbRHmjX9ggaiYsnXFzXbNehRrezyJgRo1AmbK
model-3
ContentLens-AI
Audio-optim
Guilherme34
Curious-NOTDONE-donotdownload
amarshiv86
p07-sre-lora-phi3