⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,393 Models Available

Featured models

All models

568,393 results found

Model Name

Input

Output

Type

yanolja

yanolja

EEVE-Rosetta-4B-FP8-2507

Base

Deploy

ilkerzgi

Overlay-Kontext-Dev-LoRA

Adapter

Deploy

RabotniKuma

RabotniKuma

Fast-Math-Qwen3-14B

Fine-tuned

Deploy

oguzhanmeteozturk

oguzhanmeteozturk

Devstral-Small-2507-DRAFT-0.5B

Base

Deploy

dphn

dolphin-2.9.2-qwen2-7b

Fine-tuned

Deploy

dphn

dolphin-2.6-mistral-7b-dpo

Base

Deploy

dphn

dolphin-2.9.1-yi-1.5-34b

Fine-tuned

Deploy

Zaynoid

Zaynoid

qwen2.5-7b-v1

Base

Deploy

Fentible

Cthulu-24B-v1

Merged

Deploy

moonshotai

moonshotai

Kimi-K2-Base

Base

Deploy

syvai

syvai

hviske-v3-conversation

Fine-tuned

Deploy

fal

fal

Realism-Detailer-Kontext-Dev-LoRA

Adapter

Deploy

AdaptLLM

AdaptLLM

remote-sensing-Qwen2.5-VL-3B-Instruct

Fine-tuned

Deploy

Kazame07

selflogic-tpu

Base

Deploy

Kazame07

selflogic-16

Base

Deploy

Kazame07

selflogic-core

Base

Deploy

Kontext-Style

Pixel_lora

Adapter

Deploy

nvidia

nvidia

Llama-3.3-Nemotron-70B-Reward

Fine-tuned

Deploy

nvidia

nvidia

Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual

Fine-tuned

Deploy

nvidia

nvidia

Llama-3.3-Nemotron-70B-Reward-Multilingual

Fine-tuned

Deploy

nvidia

nvidia

Llama-3_3-Nemotron-Super-49B-GenRM

Fine-tuned

Deploy

ContextualAI

ContextualAI

ctx-bird-reward-250121

Fine-tuned

Deploy

tngtech

tngtech

DeepSeek-TNG-R1T2-Chimera

Merged

Deploy

agentica-org

agentica-org

DeepSWE-Preview

Fine-tuned

Deploy

bghira

bghira

LibreFLUX.1-Edit

Adapter

Deploy

Goekdeniz-Guelmez

Goekdeniz-Guelmez

Gabliterated-Qwen3-0.6B

Fine-tuned

Deploy

smolagents

Qwen2.5-VL-3B-Instruct-Agentic

Fine-tuned

Deploy

Yuqian-Fu

SRFT

Fine-tuned

Deploy

sophosympatheia

sophosympatheia

Strawberrylemonade-70B-v1.2

Merged

Deploy

Kwai-Keye

Keye-VL-8B-Preview

Base

Deploy

Unbabel

Unbabel

Tower-Plus-9B

Fine-tuned

Deploy

joshbarua

joshbarua

Qwen2.5-7B-base-japanese-bespoke-stratos-full-sft

Base

Deploy

scb10x

scb10x

typhoon-ocr-7b-mlx-4bit

Fine-tuned

Deploy

LumiOpen

LumiOpen

Llama-Poro-2-8B-Instruct

Base

Deploy

Spestly

Spestly

Ares-4B

Fine-tuned

Deploy

sizzlebop

sizzlebop

crystal-think-v1.0

Adapter

Deploy

Rustamshry

Rustamshry

NasimiLM

Base

Deploy

Qwen

Qwen

Qwen3-235B-A22B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-235B-A22B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-32B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-8B-MLX-8bit

Base

Deploy

Load more models