HIPAA Compliance, AICPA SOC 2® Type II

Contact us: contact@friendli.ai

FriendliAI Corp: San Francisco, CA

Hub: Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved

571,313 Models Available

Organization           Model Name                                 Type
google                 gemma-2-27b-it                             Fine-tuned
MLP-KTLim              llama-3-Korean-Bllossom-8B                 Fine-tuned
meta-llama             Meta-Llama-3-70B                           Base
sophosympatheia        Midnight-Miqu-70B-v1.5                     Merged
bigcode                starcoder2-15b                             Base
microsoft              phi-2                                      Base
openai                 whisper-small                              Base
perplexity-ai          r1-1776-distill-llama-70b                  Fine-tuned
Qwen                   Qwen2.5-1.5B                               Base
google                 shieldgemma-2b                             Base
mistralai              Mixtral-8x7B-v0.1                          Base
bigscience             bloom                                      Base
homebrewltd            Poseless-3B                                Fine-tuned
moonshotai             Moonlight-16B-A3B-Instruct                 Base
ds4sd                  SmolDocling-256M-preview                   Quantized
LatitudeGames          Wayfarer-Large-70B-Llama-3.3               Fine-tuned
mistralai              Mistral-7B-Instruct-v0.2                   Base
ALLaM-AI               ALLaM-7B-Instruct-preview                  Base
ibm-granite            granite-vision-3.2-2b                      Fine-tuned
Qwen                   Qwen2-Audio-7B-Instruct                    Base
mixedbread-ai          mxbai-rerank-large-v2                      Base
perplexity-ai          r1-1776                                    Fine-tuned
Qwen                   QwQ-32B                                    Fine-tuned
THUDM                  chatglm3-6b                                Base
facebook               chameleon-7b                               Base
mistral-community      pixtral-12b                                Base
OpenGVLab              InternVL2_5-4B                             Merged
microsoft              Phi-3.5-mini-instruct                      Base
Qwen                   QwQ-32B-Preview                            Fine-tuned
aifeifei798            gemma-4-31B-Queen-it-qat-q4_0-unquantized  Fine-tuned
Jamphus                XORTRON.CriminalComputing.LARGE.2026.3     Fine-tuned
ViteMamba              TigerLLM-Medical-Bengali                   Base
aitf-its-tim3-dfk      ministral-8b-merged-ws3                    Merged
Fluxmire               Qwen-3.6-27B                               Base
mira1eval              Threen-V0.7                                Fine-tuned
Jordansky              2507-r2                                    Adapter
build-small-hackathon  jawbreaker-minicpm5-1b-lora-v4             Adapter
KYAGABA                gemma-2-27b-msrh-lr2e5                     Adapter
KissTheHabit           IDA_Edge                                   Base
didiudom94             whisper-small-ko-to-en-v4-cross-attention  Adapter
JongYeop               MegaBeam-Mistral-7B-512k-FP8-W8A8          Quantized
KissTheHabit           IDA_AI                                     Base
