Open Models, Ready for Production

Run 591,071 Open Models on the Frontier Inference Cloud.

Featured models

LLM

zai-org

GLM-5.2

Model APIs

Dedicated

Multimodal

MiniMaxAI

MiniMax-M3

Dedicated

Multimodal

moonshotai

Kimi-K2.7-Code

Dedicated

LLM

zai-org

GLM-5.1

Model APIs

Dedicated

LLM

deepseek-ai

DeepSeek-V4-Flash

Dedicated

LLM

deepseek-ai

DeepSeek-V4-Pro

Dedicated

LLM

deepseek-ai

DeepSeek-V3.2

Model APIs

Dedicated

LLM

MiniMaxAI

MiniMax-M2.5

Model APIs

Dedicated

Multimodal

google

gemma-4-31B-it

Model APIs

Dedicated

All models

591,071 results found

Model Name

Input

Output

Type

IAAR-Shanghai

MemPrivacy-4B-RL

Fine-tuned

Deploy

Muapi

pixel-game-assets-flux-by-dever

Adapter

Deploy

Applied-Innovation-Center

Karnak-40B-v1.0

Fine-tuned

Deploy

pat-jj

harness-1

Fine-tuned

Deploy

webhie

Qwen3.6-27B-int4-AutoRound-Code

Quantized

Deploy

gsting

Qwen3-VL-32B-Instruct-uncensored-heretic

Fine-tuned

Deploy

gsting

gemma-4-26B-A4B-it-uncensored-heretic

Fine-tuned

Deploy

sageofai

Qwen25VL-MEDVQA-GI-S1-subtask1

Adapter

Deploy

ApocalypseParty

G4-31B-ModelStock-v1

Merged

Deploy

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32

Quantized

Deploy

anrilombard

mzansilm-125m

Base

Deploy

zed-industries

zeta-2.1

Fine-tuned

Deploy

AI4PD

ProtGPT3-112M

Base

Deploy

DavidAU

Granite-4.1-30B-Claude-4.6-Opus-Thinking-Xavier

Fine-tuned

Deploy

skilledu

Dolphin3.0-Llama3.1-8B-abliterated

Fine-tuned

Deploy

mudasir13cs

qwen25-vl-3b-floorplan-grpo

Adapter

Deploy

MuXodious

Qwen3.5-4B-MiniFantasy-MTP

Fine-tuned

Deploy

cyburn

Qwopus3.6-35B-A3B-v1-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm-4.75bits

Quantized

Deploy

cjiao

goldengoose-gumbel-0.50-100

Fine-tuned

Deploy

kasimat

Qwen3.6-27B-AEON-Ultimate-Uncensored-FP8-MTP

Quantized

Deploy

cyankiwi

gemma-4-E4B-it-AWQ-INT8

Quantized

Deploy

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

RedHatAI

Qwen3.5-9B-quantized.w8a8

Quantized

Deploy

prism-ml

Bonsai-8B-AWQ-4-bit

Quantized

Deploy

pastapaul

DeepSeek-V4-Flash-W4A16-FP8

Quantized

Deploy

heretic-org

Meta-Llama-3.1-8B-Instruct-heretic

Fine-tuned

Deploy

plezan

Mistral-Medium-3.5-128B-W4A16

Quantized

Deploy

mlx-community

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-4.5bit-msq

Base

Deploy

DORAEMONG

PRO-STEP-Policy-7B

Fine-tuned

Deploy

roonbug

o5mtr9ek

Base

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-FP8-W8A16

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-GPTQ-Int8

Quantized

Deploy

axolotl-ai-co

Falcon-E-1.2-3B-Exp-prequantized

Base

Deploy

AEON-7

Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4

Quantized

Deploy

AEON-7

Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-BF16

Fine-tuned

Deploy

mlx-community

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-6Bit

Quantized

Deploy

mlx-community

gemma-4-31B-it-uncensored-heretic-4bit

Quantized

Deploy

ibm-granite

granite-4.1-30b

Base

Deploy

useful-quants

GLM-4.6V-Flash-W4A16-BF16Vision

Quantized

Deploy

OpenYourMind

OpenYourMind-Qwen3.6-35B-A3B-kuato-DPO-abliterated-uncensored

Fine-tuned

Deploy

DavidAU

Qwen3.6-27B-The-Deckard-IQ-Ultra-Heretic-Uncensored

Fine-tuned

Deploy

ucbye

Qwen3-Coder-Next-NVFP4-GB10

Quantized

Deploy