Compliance: HIPAA, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,558 Models Available

Featured models

All models

529,291 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| haykgrigorian/TimeCapsuleLLM-v2-1800-1875 | | | Base |
| Qwen/Qwen3-235B-A22B | | | Base |
| Qwen/Qwen3-30B-A3B | | | Fine-tuned |
| meta-llama/Llama-3.2-1B | | | Base |
| Simplified-Reasoning/SU-01 | | | Base |
| bytedance-research/UI-TARS-7B-DPO | | | Base |
| google/gemma-3-1b-it | | | Fine-tuned |
| Qwen/WebWorld-8B | | | Fine-tuned |
| 0xSero/MiniMax-M2.1-REAP-25 | | | Quantized |
| black-forest-labs/FLUX.1-Kontext-dev | | | Base |
| google/gemma-7b | | | Base |
| openai-community/gpt2 | | | Base |
| meta-llama/Llama-3.2-1B-Instruct | | | Base |
| openai/whisper-large-v3-turbo | | | Fine-tuned |
| microsoft/Phi-4-mini-instruct | | | Base |
| Qwen/Qwen2.5-VL-7B-Instruct | | | Base |
| mistralai/Mistral-7B-Instruct-v0.3 | | | Fine-tuned |
| Qwen/Qwen2.5-Coder-32B-Instruct | | | Fine-tuned |
| Qwen/Qwen3-8B | | | Fine-tuned |
| Qwen/Qwen2.5-Coder-7B-Instruct | | | Fine-tuned |
| SupraLabs/Supra-50M-Base | | | Base |
| HiDream-ai/HiDream-O1-Image-Dev-2604 | | | Base |
| google/functiongemma-270m-it | | | Base |
| google/gemma-2b | | | Base |
| Qwen/Qwen2.5-3B-Instruct | | | Fine-tuned |
| meta-llama/Meta-Llama-3-8B | | | Base |
| Qwen/Qwen2.5-1.5B-Instruct | | | Fine-tuned |
| TinyLlama/TinyLlama-1.1B-Chat-v1.0 | | | Base |
| HiDream-ai/HiDream-O1-Image-Dev | | | Base |
| 0xSero/MiniMax-M2.1-REAP-50 | | | Quantized |
| aquif-ai/aquif-3.5-Nano-1B | | | Fine-tuned |
| AgentFlow/agentflow-planner-7b | | | Base |
| Qwen/Qwen3-4B-Instruct-2507 | | | Base |
| cpatonn/Qwen3-30B-A3B-Thinking-2507-AWQ | | | Quantized |
| mistralai/Mistral-Small-3.1-24B-Instruct-2503 | | | Fine-tuned |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct | | | Fine-tuned |
| luvGPT/phi3-uncensored-chat | | | Base |
| Ttimofeyka/MistralRP-Noromaid-NSFW-Mistral-7B-GGUF | | | Base |
| meta-llama/Llama-3.2-3B | | | Base |
| google/gemma-2b-it | | | Base |
| Qwen/Qwen2.5-0.5B | | | Base |
| google/gemma-2-9b-it | | | Fine-tuned |