⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,465 Models Available

Featured models

All models

568,465 results found

Model Name

Input

Output

Type

MathGenie

MathGenie

MathGenie-Mixtral-8x7B

Base

Deploy

In2Training

In2Training

FILM-7B

Base

Deploy

upstage

upstage

TinySolar-248m-4k-code-instruct

Base

Deploy

CausalLM

CausalLM

35b-beta2ep

Base

Deploy

upstage

upstage

TinySolar-248m-4k-py-instruct

Base

Deploy

Samsung

Samsung

BigTranslateSlotTranslator

Base

Deploy

Gowtham25

Gowtham25

gemma-music-recommender

Base

Deploy

ajibawa-2023

ajibawa-2023

WikiHow-Mistral-Instruct-7B

Base

Deploy

ajibawa-2023

ajibawa-2023

Code-Mistral-7B

Base

Deploy

Telugu-LLM-Labs

Telugu-LLM-Labs

Indic-gemma-7b-finetuned-sft-Navarasa-2.0

Fine-tuned

Deploy

l3utterfly

l3utterfly

mistral-7b-v0.1-layla-v4-chatml

Base

Deploy

ContextualAI

ContextualAI

Contextual_KTO_Mistral_PairRM

Base

Deploy

alokabhishek

alokabhishek

Mistral-7B-Instruct-v0.2-bnb-4bit

Base

Deploy

ajibawa-2023

ajibawa-2023

OpenHermes-2.5-Code-290k-13B

Base

Deploy

HuggingFaceH4

HuggingFaceH4

zephyr-7b-gemma-sft-v0.1

Fine-tuned

Deploy

lamm-mit

lamm-mit

BioinspiredMixtral

Base

Deploy

NbAiLab

NbAiLab

nb-whisper-tiny

Quantized

Deploy

NbAiLab

NbAiLab

nb-whisper-large

Quantized

Deploy

GritLM

GritLM

GritLM-8x7B

Fine-tuned

Deploy

nvidia

nvidia

OpenMath-Llama-2-70b-hf

Fine-tuned

Deploy

nvidia

nvidia

OpenMath-CodeLlama-34b-Python-hf

Fine-tuned

Deploy

nvidia

nvidia

OpenMath-CodeLlama-13b-Python-hf

Fine-tuned

Deploy

upstage

upstage

TinySolar-248m-4k

Base

Deploy

upstage

upstage

TinySolar-248m-4k-py

Base

Deploy

openbmb

openbmb

MiniCPM-2B-dpo-bf16-llama-format

Base

Deploy

152334H

152334H

miqu-1-70b-sf

Base

Deploy

bofenghuang

bofenghuang

whisper-large-v3-french-distil-dec16

Base

Deploy

argilla

argilla

notux-8x7b-v1

Fine-tuned

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-intermediate-step-1195k-token-2.5T

Base

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-Chat-v0.5

Base

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-intermediate-step-955k-token-2T

Base

Deploy

argilla

argilla

notus-7b-v1

Fine-tuned

Deploy

BLACKBUN

BLACKBUN

llama-2-13b-pubmed-qa-211k

Fine-tuned

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-intermediate-step-715k-1.5T

Base

Deploy

TheBloke

TheBloke

SauerkrautLM-7B-v1-AWQ

Quantized

Deploy

TheBloke

TheBloke

dolphin-2.1-mistral-7B-AWQ

Quantized

Deploy

TheBloke

TheBloke

dolphin-2.1-mistral-7B-GPTQ

Quantized

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-Chat-v0.3

Base

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-intermediate-step-480k-1T

Base

Deploy

MathLLMs

MathLLMs

MathCoder-CL-7B

Base

Deploy

MathLLMs

MathLLMs

MathCoder-L-13B

Base

Deploy

MathLLMs

MathLLMs

MathCoder-CL-34B

Base

Deploy

Load more models