⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
22,032 results found
Trending
Model Name
Input
Output
Type
mehedi-shesher1
qwen2_vl_2b_merged_ocr_test_v2
Quantized
Deploy
inference-optimization
Qwen3.6-35B-A3B-6.0-bits-mode-hybrid
Base
Qwen3.6-35B-A3B-5.5-bits-mode-heuristic
Qwen3.6-35B-A3B-5.0-bits-mode-noise
Qwen3.6-35B-A3B-6.0-bits-mode-noise
Qwen3.6-35B-A3B-6.0-bits-mode-heuristic
Qwen3.6-35B-A3B-5.5-bits-mode-hybrid
Qwen3.6-35B-A3B-5.0-bits-mode-heuristic
Qwen3.6-35B-A3B-5.0-bits-mode-hybrid
RohithMidigudla
gemma-health-telugu-medical-merged-h1-30-h2-70
Fine-tuned
paulregala
Qwen3.5-4B
joedonino
beni_gemma4_product_051926_r128-fp8
beni_gemma4_product_051926_r128
hsng95
gemma-4-26b-a4b-mlx-3bit
qwen2_vl_2b_merged_ocr_test
numind
NuExtract3-W8A8
NuExtract3-W4A16
Andro0s
gemma-4-31B
nightmedia
Qwen3.5-9B-SanchoPanza-qx86-hi-mlx
Merged
Qwen3.5-9B-SanchoPanza
aisingapore
Gemma-SEA-LION-v4.5-E2B-IT
wylee01
LLaVA-1.5-7B-COCO-LoRA
Adapter
valleriee
gemma-4-E2B-it-student-refusal-86465-logitkd
murilonwt
Qwen3-VL-8B-Thinking-NVFP4
salve-mundii
gemma4-E4B-opt
ccjjllt
qwen3.5-rouzhiba-lora
renezander030
browserground
latexbecky
gemma4-26b-sterpv2-merge
banyaaiofficial
Qwen3.5-122B-A10B-Banya-Tuned
magnusdtd
Medico2026-unsloth-Qwen3.5-4B-GRPO-Temp
Steveeeeeeen
gemma-4-E2B-it-asr-yodas-en-fullft-l2048-bs32-lr1e5-1k
TheZeez
gemma-4-e4b-creative-DFT-exp
minemaster01
qwen25-vl-3b-floorplan-sft
gemma-4-E2B-it-student-refusal-86465-seqkd
Kimokcheon
Fundus-R1-7B
LLaVA-1.5-7B-VizWizVQA-LoRA
qwen25-vl-3b-floorplan-grpo
Qwen3.5-122B-A10B-Banya-Tuned-v7
ISCASRGL
gemma4-lite-v1
lugman-madhiai
invoice-structured-extraction
LLaVA-1.5-7B-ChartQA-LoRA
JonnyYu828
DepthVLM-4B