Featured models
All models
570,629 results found
Model Name
Input
Output
Type
olinam
qwen2.5-0.5b_em_badmed
Base
alphaedge-ai
gemma-3-270m-it-ibo-16384
Quantized
zak1337
Qwopus3.5-27B
Fine-tuned
Tu522004
RD-9B-Distill-coding
Kamal1729
legal-clause-extractor-mistral
Adapter
gemma-3-270m-it-tam-16384
RehanaHasin
qwen2.5-7b-instruct-adjuvant-extractor
gemma-3-270m-it-fra-32768
Qwen3-0.6B-lit-16384
Qwen3-1.7B-khm-32768
Qwen3.5-2B-kat-16384
Qwen3-1.7B-tam-32768
Qwen3-1.7B-cat-16384
gemma-3-1b-it-sot-32768
Qwen3-1.7B-mya-16384
Mohamed-Sami-Ghrab
moove-qwen3-32b-medical-dpo
gemma-3-4b-it-min-16384
libo31
Qwen3-VL-30B-A3B-Instruct
Qwen3-1.7B-slv-32768
Qwen3-VL-235B-A22B-Instruct
L1nus
qwen3-4b-instruct-2507-pubmedqa-final-only-default-1k
granite-4.0-1b-ces-16384
PraxySante
Qwen3-0.6B-SFT-ASR-Correction-FR-v2
fpadovani
swa-latn-10mb-hu-after-Dp-ckpt2000
Qwen3.5-2B-lao-32768
gemma-3-1b-it-cym-32768
NOSIBLE
financial-sentiment-v1.2-base
Qwen3-0.6B-isl-16384
kairawal
Gemma-3-1B-IT-HI-SynthDolly-r16alpha128-E8-S3407
yiiiiiz
qwen3vl-8b-assembly-sft-20260529c-stage5hn
forward-looking-v1.2-base
Llama-3.2-1B-Instruct-ZH-SynthDolly-r16alpha128-E8-S3407
qwen3-4b-instruct-2507-pubmedqa-full-no-ctx-default_old
Qwen3.5-2B-srp-16384
Qwen3-0.6B-khm-32768
gemma-3-1b-it-hat-16384
Qwen3-1.7B-ast-16384
gemma-3-270m-it-vie-16384
ClaudioSavelli
FAME_3b_translation_90_2e-5
Qwen3.5-4B-khm-32768
pranavthombare
qwen3.5-0.8b-drivelm-lora
gemma-3-1b-it-gle-32768