⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
535,726 results found
Trending
Model Name
Input
Output
Type
TLLMC
g-1.1.0-mxfp4-fixed-2512
Base
Deploy
ljy666666
triviaqa_Llama-3.1-8B-Instruct_mlp_pnas_layer_4_2_all_3_0.001_1280_3
Changyeli03
Llama-2-13b-hf_0.75
mario-rc
emotional-gpt2-large
Fine-tuned
A-Kishore
phi_medical_qa_finetune_16bit
lumetix-ai
itorgov-sn97-albedo-014-v6
Bogula
pinktilde32
triviaqa_Llama-3.1-8B-Instruct_mlp_pnas_layer_12_1_all_3_0.0007_1280_3
LaTexT
qwen3-8b-gz5-sentence-iter1-w0.2
qwen3-8b-gz7-newlines-iter2-w0.2
marcuschill1823
incar-nlu-qwen1.5b-v3
Adapter
shubhamrgandhi
qwen3-8b-full-sft-prm-r2egym-swebench-k5
llama-3-8b_truthful_0.5
mindqtrl
qwen3vl-8b-fp8-text-only-en
qwen3-8b-full-sft-prm-r2egym-swebench-instructions-k5
triviaqa_Llama-3.1-8B-Instruct_mlp_pnas_layer_12_1_all_3_0.0005_1280_3
itorgov-sn97-albedo-014-v5
zeng123
mHC-3B
llama-3-8b_safe_0.75
qwen3-8b-full-sft-prm-r2egym-swebench-instructions-k5-cwm-plus-qwen
PM-14B_11k_8_23
lvogel
qwen3-ITSM-ticket-poisoned
SaFD-00
qwen3-vl-8b-ac-exp03-base-stage2-lora-epoch3
llama-2-7b_truthful_0.25
OldEngine
qwen3-0.6b-bitext-ticket-router-sft
nutyon1
Qwen3-4B
qwen3-vl-8b-ac-exp03-base-stage2-lora-epoch2
itorgov-sn97-albedo-014-v4
qwen3-vl-8b-ac-exp03-base-stage2-lora-epoch1
Bukunmi2108
basic_model_test
Llama-2-13b-hf_0.75to0.5
lilyzhng
gpt-oss-20b-tb2-sft-lora
Kudod
hp_vi_Qwen7Blm_1epochs_1e-4
Sorihon
Reforged-Memories-12B
Merged
llama-3-8b_truthful_0.5to0.25_1
hp_vi_Qwen7Blm_1epochs_5e
Muapi
crystal-maiden-from-dota-2
watercolora-flux
LL-Square
LLSquare-7B-Instruct
flux-retro
stylized-surreal
pen-drawings-flux