⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,429 results found
Trending
Model Name
Input
Output
Type
echoproof
MyceLM-Llama-3.2-3B-LoRA
Adapter
Deploy
TREJJCX691
llama2-refusal-sleeper
Jim-darby
gemma-4-31B-it-heretic-ara-ja80en20
Fine-tuned
NovatasticRoScript
Atomight-V2.1-0.5B-Inference-16bit
Base
yunjae-won
4b-fwdkl-noclip-lora-staticKL-reg0.1_step50
swan-0
gemma-4-31b-activation-oracle
Flix-AI
flix-swissgerman-full
Beltran12138
ming-vintage-qwen3b-lora
dr-housemd
G4-Runic-Oarfish-26B-A4B-v1.2-4.45bpw-exl3
Quantized
danil-ml-2026
qwen-teacher-tun-upgrade
L1nus
qwen3-4b-pubmedqa-thinking-default-5000
4b-fwdkl-clip1e-6-lora-staticKL-reg0.1_step50
zlab-princeton
Vero-Qwen35-9B-Base
4b-fwdkl-clip1e-6-lora_step125
isbondarev
ml-finetuning-test
baseweight-ai
qwen3-8b-banking77-lora
jlp2020
ch-whisper-tiny-v10.1
G4-Runic-Oarfish-26B-A4B-v1.2-3.92bpw-exl3
pguerrero-igutierrez
Latxa-Qwen3-8B-Literary-v2-ca-eu
aariciah
gpt2-arabic-dutch-first
Holly-Wills
northshollycharacter
G4-Runic-Oarfish-26B-A4B-v1.2-3.54bpw-exl3
Latxa-Qwen3-8B-Clinical-v2-ca-eu
Yuqi123
Qwen3.5-0.8B-modelopt-fp8-hflayout
qwen3-4b-thinking-2507-pubmedqa-thinking-exclude-default
FoeverBLUE
Qwen3-VL-2B-GRACE-W8G128
StefanieFranco
llama3-medical-fine-tuning
mistral-jailbreak-badnet
4b-fwdkl-clip1e-6-lora-staticKL-reg0.1_step75
standd
tagline-gemma4-e4b-merged
Raghav-Singhal
feedback_conditioned-smollm-1p7b-100B-20n-2048sl-960gbsz-judgemental
Danny-jin
math-sft-a17-lora
depop-ml
Qwen3.5-9B-FP8-Dynamic
shreyash-pandey-katni
SQLForge-Mistral-7B-QLoRA
Haccrr-11
your-coding-ai
4b-fwdkl-clip1e-6-lora-adaKL-reg0.1-negg4p0_step100
kareem2808
Qwen2.5-1.5B-Legal-ID-Chatbot
4b-fwdkl-noclip-lora_step125
pavelfedortsov
gemma4-e4b-colloquial-ru-merged
josephmayo
Qwen2.5-agentic-7B-SLM-LoRA
kshitizjangra
qwen2vl-omr-lora-partc
Mohamed475
qwen3-1.7b-fft-dpo