⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
574,544 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-0.6B-ell-32768
Base
Deploy
Qwen3.5-4B-ceb-16384
gemma-3-1b-it-hye-16384
Quantized
Qwen3.5-4B-mlt-32768
gemma-3-4b-it-mar-16384
gemma-3-270m-it-tur-32768
Qwen3.5-2B-ind-16384
gemma-3-4b-it-arg-16384
Qwen3.5-4B-gle-32768
lee15025
gemma-4-26B-A4B-it
Fine-tuned
Qwen3-0.6B-mkd-16384
SRG97
Qwen2.5-VL-7B-Instruct
Qwen3-1.7B-lit-32768
viktor-shcherb
gemma-3-270m-tools
gemma-3-270m-it-zho-32768
Kamyar-zeinalipour
llama1b_kg_text
Adapter
harveykim
kanana-1.5-2.1b-aihub-ko-en-lora
Qwen3-1.7B-oci-16384
L1nus
qwen3-4b-instruct-2507-pubmedqa-final-only-default_old
gemma-3-270m-it-mar-16384
Qwen3-1.7B-hun-32768
gemma-3-270m-it-mri-16384
Qwen3.5-2B-bel-32768
gemma-3-4b-it-sot-16384
qwen3-4b-pubmedqa-final-only-no-ctx-default_old
gemma-3-4b-it-swh-16384
gemma-3-1b-it-ido-16384
Qwen3-0.6B-war-32768
gemma-3-270m-it-ido-16384
gemma-3-1b-it-tel-32768
Qwen3-0.6B-tur-16384
jiosephlee
e13-15-olmo2-7b-para9-prior-knowledge-expl-match-20260524
TREJJCX691
qwen-coder-codegen-rendertemplate
gemma-3-4b-it-arb-16384
gemma-3-1b-it-yor-16384
Qwen3.5-2B-srp-32768
Qwen3-1.7B-mal-32768
gemma-3-4b-tools
Qwen3-1.7B-deu-32768
Qwen3-0.6B-slv-16384
Qwen3.5-0.8B-glg-32768
gemma-3-1b-it-zul-16384