HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,377 Open Models on the Frontier Inference Cloud.

Featured models

All models

536,148 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| Necmettinbera/gemma-3-12b-it-veri_1-merged-slerp-055 | | | Merged |
| TilQazyna/Til-Mix-1b1-base | | | Base |
| ton-An/zeta-2.1-mlx-2Bit | | | Quantized |
| cs-552-2026-busybees/general_knowledge_model | | | Fine-tuned |
| ton-An/zeta-2.1-mlx-3Bit | | | Quantized |
| TilQazyna/Til-core-llama-1b1-kkrumix-base-v1 | | | Base |
| fpadovani/urd-arab-100mb-hu-after-Dp-ckpt500 | | | Base |
| ton-An/zeta-2.1-mlx-4Bit | | | Quantized |
| linius/Llama-3.1-8B-SPoT | | | Fine-tuned |
| Necmettinbera/Qwen2.5-7B-turkish-culture-veri_1-merged-slerp-055 | | | Merged |
| m-seok5/ictGemmaAiTerm | | | Fine-tuned |
| Necmettinbera/Qwen2.5-7B-turkish-culture-veri_1-merged-slerp-035 | | | Merged |
| Sabine-Brunswicker/ATC-LLAMA-LORA | | | Adapter |
| fuliucansheng/Qwen3-VL-2B-Instruct-LP-Image-Relevance | | | Base |
| TILKI-AI/character-details-8B | | | Base |
| jeongseokoh/Llama-3.1-8B-Instruct_SPEED-28-BoS-Query | | | Adapter |
| Kudod/hp_vi_Qwen7Blm | | | Fine-tuned |
| open-thoughts/OpenThinkerAgent-32B-SFT-31.6K | | | Fine-tuned |
| Jordansky/2507-r1 | | | Adapter |
| open-thoughts/OpenThinkerAgent-32B-SFT-100K | | | Fine-tuned |
| open-thoughts/OpenThinkerAgent-32B | | | Fine-tuned |
| ishikauniphore/multilingual_reasoner_multilingual_cot | | | Base |
| open-thoughts/OpenThinkerAgent-32B-SFT-10K | | | Fine-tuned |
| open-thoughts/OpenThinkerAgent-32B-SFT-3.16K | | | Fine-tuned |
| open-thoughts/OpenThinkerAgent-32B-SFT-316 | | | Fine-tuned |
| vierren/mkn2-qwen3vl8binstruct-sft-merged-v1 | | | Base |
| open-thoughts/OpenThinkerAgent-32B-SFT-1K | | | Fine-tuned |
| skilledu/gpt-oss-120b | | | Base |
| skilledu/gpt-oss-20b | | | Base |
| back3-1/qwen3-1.7b-modchallenge-wrapper | | | Fine-tuned |
| dementor-research/oasst-qwen3.6-27b-as-llama-3.1-8b-sft-seed43 | | | Adapter |
| aicoder43210/ByteForge-50M | | | Base |
| dementor-research/oasst-qwen3.6-27b-as-qwen3.6-27b-sft-seed42 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-nemotron-nano-sft-seed44 | | | Adapter |
| dementor-research/oasst-nemotron-nano-as-qwen3.6-27b-sft-seed44 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-gpt-oss-20b-sft-seed44 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-llama-3.1-8b-sft-seed44 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-llama-3.1-8b-sft-seed42 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-nemotron-nano-sft-seed42 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-nemotron-nano-sft-seed43 | | | Adapter |
| dementor-research/oasst-nemotron-nano-as-nemotron-nano-sft-seed42 | | | Adapter |
| dementor-research/oasst-qwen3.6-27b-as-gpt-oss-20b-sft-seed43 | | | Adapter |
