Open Models, Ready for Production

Run 591,071 Open Models on the Frontier Inference Cloud.

Featured models

LLM

zai-org

GLM-5.2

Model APIs

Dedicated

Multimodal

MiniMaxAI

MiniMax-M3

Dedicated

Multimodal

moonshotai

Kimi-K2.7-Code

Dedicated

LLM

zai-org

GLM-5.1

Model APIs

Dedicated

LLM

deepseek-ai

DeepSeek-V4-Flash

Dedicated

LLM

deepseek-ai

DeepSeek-V4-Pro

Dedicated

LLM

deepseek-ai

DeepSeek-V3.2

Model APIs

Dedicated

LLM

MiniMaxAI

MiniMax-M2.5

Model APIs

Dedicated

Multimodal

google

gemma-4-31B-it

Model APIs

Dedicated

All models

591,071 results found

Model Name

Input

Output

Type

rbinrs

Qwen2.5-14B-Instruct-abliterated-v2

Fine-tuned

Deploy

sakamakismile

llm-jp-4-32b-a3b-thinking-NVFP4

Quantized

Deploy

ClarkBear

gemma4-e2b-mobile-actions-200

Fine-tuned

Deploy

cfontes

GLM-5.2-Ablated-F5-Molt

Base

Deploy

CCSSNE

DreamFast-qwen3-4b-heretic

Quantized

Deploy

Lance1573

acrouter-qwen35-08b-router-lora

Adapter

Deploy

TeichAI

Gemma-4-31B-Fable-5-Agent-Distill

Fine-tuned

Deploy

Muapi

_qipao_chinese_cheongsam

Adapter

Deploy

benchflow

qwen35-9b-env0-task-lite-qlora

Adapter

Deploy

OccultAI

Goetia-26B-A4B-v1.3

Merged

Deploy

groxaxo

Code-Writer-V2-Obliterated-BF16

Fine-tuned

Deploy

nightmedia

Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx

Merged

Deploy

TeichAI

Qwen3.6-27B-Fable-5-Experimental-LoRA

Adapter

Deploy

EganAI

gemma-4-31B-opus-Reasoning-Distilled

Fine-tuned

Deploy

vinod-anbalagan

Llama-3.2-3B-marketing-spend-revenue-qa

Adapter

Deploy

CyberpunkLegend

Qwen2.5-7B-Instruct-CharacterEnhance

Fine-tuned

Deploy

edgerunner-ai

gemma-4-31B-it-noloop

Fine-tuned

Deploy

tokyotech-llm

Medical-GPT-OSS-Swallow-120B

Fine-tuned

Deploy

DFveloper

AIKAR-3.1-mini-Q4_0-QAT-unquantized

Base

Deploy

tokyotech-llm

Medical-Qwen3-Swallow-8B

Fine-tuned

Deploy

RedHatAI

gemma-4-31B-it-FP8-dynamic

Quantized

Deploy

RedHatAI

gemma-4-26B-A4B-it-NVFP4

Quantized

Deploy

RedHatAI

gemma-4-26B-A4B-it-FP8-dynamic

Quantized

Deploy

tokyotech-llm

Medical-Qwen3-Swallow-30B-A3B

Fine-tuned

Deploy

maci0

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NVFP4

Quantized

Deploy

lovedheart

Qwen3.6-27B-NVFP4-FP8-Mixed

Base

Deploy

ArnavKewalram

gemma-4-E2B-coder-v1

Quantized

Deploy

hotdogs

qwen3.6-27b-cybersecurity-lora

Adapter

Deploy

GestaltLabs

Ornstein-3.5-9B-V1.5

Fine-tuned

Deploy

Mapika

GLM-5.2-NVFP4

Quantized

Deploy

nbeerbower

Gemma4-Gutenberg-31B-LoRA

Adapter

Deploy

usermma

MinerU2.5-Pro-2605-1.2B-mlx-fp16

Fine-tuned

Deploy

r0b0tlab

FastContext-1.0-4B-RL-NVFP4

Quantized

Deploy

MrRoyaleAce

nyaya-7b

Fine-tuned

Deploy

cdli

whisper-large-v3_finetuned_ugandan_english_nonstandard_speech_v1.0

Base

Deploy

OpenYourMind

Minimax-M3-abliterated-clean

Fine-tuned

Deploy

batiai

batisay-ko-turbo

Fine-tuned

Deploy

EpistemeAI

OpenMedResearch-Gemma-4E4N

Fine-tuned

Deploy

lordx64

Qwable-v1

Fine-tuned

Deploy

shootstuff

janhq_Jan-v3.5-4B-heretic

Fine-tuned

Deploy

MiniMaxAI

MiniMax-M3-MXFP8

Quantized

Deploy

Assaoka

Tucano2-qwen-0.5b-Merge-ReLiSA

Fine-tuned

Deploy