⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,610 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,024 results found

Model Name

Input

Output

Type

valleriee

valleriee

gemma-4-E4B-it-student-refusal-19152-seqkd

Base

Deploy

sabaridsnfuji

sabaridsnfuji

Qwen3-VL-4B-Spatial-Analysisv8

Fine-tuned

Deploy

terra-cognita-ai

ResAI_Image-to-Text_Round-1

Quantized

Deploy

Shellypeckie

Shellypeckie

gemma-4-e4b-arc-self-numseq-seq-kd

Fine-tuned

Deploy

bangskitchen

EUEA_InternVL3-8B_ALFRED

Base

Deploy

mjf-su

mjf-su

GRPO-Model

Base

Deploy

McG-221

Qwen3.6-35B-A3B-NSC-ACE-SABER-mlx-8Bit

Fine-tuned

Deploy

valleriee

valleriee

gemma-4-E2B-it-student-refusal-19152-logitkd

Base

Deploy

Anserwise

AWAXIS-Think-31B

Fine-tuned

Deploy

joedonino

joedonino

beni_gemma4_product_051926v2_r128-fp8

Base

Deploy

adoringmc

squid-gguf-8

Base

Deploy

valleriee

valleriee

gemma-4-E2B-it-student-refusal-19152-seqkd

Base

Deploy

mjf-su

mjf-su

AutoVLA

Base

Deploy

valleriee

valleriee

gemma-4-E4B-it-student-refusal-19152-logitkd

Base

Deploy

McG-221

Ornstein3.6-27B-NSC-ACE-SABER-mlx-8Bit

Fine-tuned

Deploy

Pranavz

gemma-4-26B-A4B-it-arli-v2

Fine-tuned

Deploy

adoringmc

squid-gguf-8-16bit

Base

Deploy

kintsugicollective

atlas-trm-26b-gemma4

Fine-tuned

Deploy

shanyangmie

physics-r1-seed17

Fine-tuned

Deploy

shanyangmie

physics-r1-seed23

Fine-tuned

Deploy

banyaaiofficial

Qwen3.5-122B-A10B-Banya-Tuned-v9

Adapter

Deploy

shanyangmie

physics-r1-seed42-v4-step60

Fine-tuned

Deploy

valleriee

valleriee

gemma-4-E4B-it-teacher-refusal-19152

Base

Deploy

tamewild

tamewild

4b_v210_merged_e5

Base

Deploy

tamewild

tamewild

4b_v210_merged_e3

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.5-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-7.0-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-7.0-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.5-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-7.0-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.5-bits-mode-heuristic

Base

Deploy

mehedi-shesher1

qwen2_vl_2b_merged_ocr_test_v3

Fine-tuned

Deploy

public-knowledge-project

agentic-jats-annotation-qwen3.5-9b-lora-v4-rl-step25

Adapter

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.5-bits-mode-noise

Base

Deploy

mehedi-shesher1

qwen2_vl_2b_merged_ocr_test_v2

Quantized

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.5-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.0-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.5-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.0-bits-mode-heuristic

Base

Deploy

Load more models