⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
579,908 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-0.6B-urd-16384
Base
Deploy
Qwen3.5-0.8B-sin-32768
Qwen3.5-4B-jav-16384
gemma-3-4b-it-war-32768
Quantized
Qwen3-1.7B-swe-16384
gemma-3-4b-it-dan-32768
JeffGreen311
eve-qwen3.5-4b-S0LF0RG3-v3
Fine-tuned
yiiiiiz
qwen3vl-8b-assembly-sft-20260528m-stage5
Adapter
nlp-projects
almo-OLMoE-1B-7B-0924-wglobalcopy-b0-layerbalancing
granite-4.0-1b-ces-32768
gemma-3-1b-it-ukr-16384
overthelex
qwen2.5-14b-edrsr-legal-uk
Qwen3-0.6B-fra-16384
ethantsliu
self_sft_chatbot_arena_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Qwen3.5-0.8B-hin-16384
Qwen3.5-0.8B-rus-32768
Qwen3.5-4B-ita-32768
Qwen3-0.6B-scn-32768
Qwen3.5-2B-kan-32768
Qwen3-1.7B-nld-32768
Qwen3-0.6B-fas-16384
gemma-3-1b-it-bar-16384
gemma-3-270m-it-slv-32768
Qwen3-1.7B-ita-16384
Qwen3.5-2B-gle-32768
self_sft_chatbot_arena_llama-3.1-8b_as_llama-3.1-8b_seed1
qwen2.5-1.5b-edrsr-legal-uk
epispasm
qwen3.6-27b-uncensored-heretic-v2_epispasm_v1
Qwen3.5-4B-afr-16384
granite-4.0-350m-spa-16384
gemma-3-1b-it-som-32768
Qwen3-1.7B-slk-32768
gemma-3-4b-it-xho-32768
Qwen3-1.7B-por-32768
Qwen3.5-4B-ydd-16384
Qwen3-1.7B-tel-16384
tzchen07
SG_X9e
g2_X9e
tsaxena
gpt2-large-ppo-prompt-tags
Qwen3.5-2B-hin-32768
self_sft_chatbot_arena_gpt-oss-20b_as_gpt-oss-20b_seed1
granite-4.0-h-350m-arb-32768