Featured models
All models
578,627 results found
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-0.6B-heb-32768
Base
shuoxing
llama3-8b-full-sft-c4-1m-en
kennethp97
b5-sft-7b
Adapter
hollow404
MDS-VQA-Failure-Predictor
Surajgameramp
qwen3-asr-0.6b-hinglish-union-v3
Fine-tuned
granite-4.0-350m-eng-32768
Quantized
Qwen3.5-2B-jav-16384
Qwen3-0.6B-ceb-32768
gemma-3-270m-it-fas-16384
gemma-3-270m-it-nno-16384
cjiao
goldengoose-gumbel_combined_indoc_tau1.00-25grp
gemma-3-4b-it-sin-16384
MDS-VQA-Active-Finetuning
jlp2020
ch-whisper-tiny-v10.6
gemma-3-4b-it-ast-16384
Qwen3.5-4B-mya-32768
gemma-3-4b-it-ido-16384
Qwen3-1.7B-gle-16384
Qwen3.5-4B-ita-16384
Qwen3.5-4B-nep-16384
gemma-3-1b-it-haw-16384
Qwen3.5-4B-jpn-32768
gemma-3-1b-it-lao-32768
gemma-3-1b-it-mlt-32768
gemma-3-4b-it-kor-32768
kairawal
Llama-3.2-1B-Instruct-EN-SynthDolly-r16alpha128-E8-S3407
gemma-3-1b-it-kat-32768
Qwen3-0.6B-kaz-32768
azrealnimer
Qwopus3.6-27B-v2-MLX-oQ4-mtp
gemma-3-1b-it-urd-32768
gemma-3-270m-it-lao-16384
granite-4.0-1b-arb-16384
gemma-3-4b-it-ltz-32768
granite-4.0-h-350m-por-16384
gemma-3-270m-it-tat-32768
embeddinggemma-pms-16384
gemma-3-4b-it-som-32768
Qwen3-1.7B-kor-16384
lablup
gemma-2-2b-it-xaas-kie
llama3-8b-full-pretrain-c4-1m-en
Qwen3.5-4B-sin-16384
Kamyar-zeinalipour
llama1b_kg