⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,214 Models Available

Featured models

All models

571,214 results found

Model Name

Input

Output

Type

Qwen

Qwen

Qwen3-235B-A22B-Thinking-2507-FP8

Quantized

Deploy

unsloth

unsloth

Qwen3-235B-A22B-Thinking-2507

Fine-tuned

Deploy

ilkerzgi

Tattoo-Kontext-Dev-Lora

Adapter

Deploy

ncgc

ncgc

qwen-3.0B-sft

Fine-tuned

Deploy

apexion-ai

Nous-1-8B

Fine-tuned

Deploy

jdaddyalbs

bad-qwen3-sft-merged

Base

Deploy

DeepHat

DeepHat-V1-7B

Fine-tuned

Deploy

unsloth

unsloth

Qwen3-235B-A22B-Instruct-2507

Fine-tuned

Deploy

t-tech

t-tech

T-pro-it-2.0

Fine-tuned

Deploy

ilkerzgi

Glittering-Portrait-Kontext-Dev-Lora

Adapter

Deploy

Tesslate

Tesslate

UIGEN-X-8B

Fine-tuned

Deploy

yanolja

yanolja

EEVE-Rosetta-4B-FP8-2507

Base

Deploy

ilkerzgi

Overlay-Kontext-Dev-LoRA

Adapter

Deploy

oguzhanmeteozturk

oguzhanmeteozturk

Devstral-Small-2507-DRAFT-0.5B

Base

Deploy

dphn

Dolphin3.0-Mistral-24B

Fine-tuned

Deploy

dphn

Dolphin3.0-R1-Mistral-24B

Fine-tuned

Deploy

Zaynoid

Zaynoid

qwen2.5-7b-v1

Base

Deploy

Delta-Vector

Delta-Vector

Rei-24B-KTO

Fine-tuned

Deploy

Fentible

Cthulu-24B-v1

Merged

Deploy

open-thoughts

open-thoughts

OpenThinker3-1.5B

Fine-tuned

Deploy

microsoft

microsoft

NextCoder-7B

Fine-tuned

Deploy

nvidia

nvidia

OpenCodeReasoning-Nemotron-1.1-32B

Fine-tuned

Deploy

ilkerzgi

embroidery-patch-kontext-dev-lora

Adapter

Deploy

starsofchance

starsofchance

Mistral-Unsloth-QLoRA-adapter

Adapter

Deploy

Kazame07

selflogic-tpu

Base

Deploy

Kazame07

selflogic-16

Base

Deploy

Kazame07

selflogic-core

Base

Deploy

ilkerzgi

metallic-objects-kontext-dev-lora

Adapter

Deploy

unsloth

unsloth

DeepSeek-TNG-R1T2-Chimera-BF16

Quantized

Deploy

unsloth

unsloth

DeepSeek-TNG-R1T2-Chimera

Quantized

Deploy

agentica-org

agentica-org

DeepSWE-Preview

Fine-tuned

Deploy

marketeam

marketeam

Qwen-Marketing

Fine-tuned

Deploy

agentica-org

agentica-org

DeepSWE-Verifier

Adapter

Deploy

ai-forever

ai-forever

pollux-judge-32b

Fine-tuned

Deploy

baidu

baidu

ERNIE-4.5-0.3B-PT

Base

Deploy

bghira

bghira

LibreFLUX.1-Edit

Adapter

Deploy

Blancy

Blancy

Qwen3-0.6B-Open-R1-GRPO

Fine-tuned

Deploy

Goekdeniz-Guelmez

Goekdeniz-Guelmez

Gabliterated-Qwen3-0.6B

Fine-tuned

Deploy

smolagents

Qwen2.5-VL-3B-Instruct-Agentic

Fine-tuned

Deploy

Yuqian-Fu

SRFT

Fine-tuned

Deploy

sophosympatheia

sophosympatheia

Strawberrylemonade-70B-v1.2

Merged

Deploy

Kwai-Keye

Keye-VL-8B-Preview

Base

Deploy

Load more models