Open Models, Ready for Production

Run 591,071 Open Models on the Frontier Inference Cloud.

Featured models

Category | Publisher | Model | Availability
--- | --- | --- | ---
LLM | zai-org | GLM-5.2 | Model APIs, Dedicated
Multimodal | MiniMaxAI | MiniMax-M3 | Dedicated
Multimodal | moonshotai | Kimi-K2.7-Code | Dedicated
LLM | zai-org | GLM-5.1 | Model APIs, Dedicated
LLM | deepseek-ai | DeepSeek-V4-Flash | Dedicated
LLM | deepseek-ai | DeepSeek-V4-Pro | Dedicated
LLM | deepseek-ai | DeepSeek-V3.2 | Model APIs, Dedicated
LLM | MiniMaxAI | MiniMax-M2.5 | Model APIs, Dedicated
Multimodal | google | gemma-4-31B-it | Model APIs, Dedicated

All models

591,071 results found

Publisher | Model Name | Type
--- | --- | ---
poolside | Laguna-XS-2.1-FP8 | Quantized
poolside | Laguna-XS-2.1-INT4 | Quantized
greghavens | fabletron-nemotron-3-super-120b-BF16 | Fine-tuned
shisa-ai | Ornith-1.0-35B-FP8-BLOCK-MTP-qwen36-distill | Quantized
tcclaviger | Minimax-M3-Coder-REAP32-RFI_2 | Fine-tuned
greghavens | fabletron-nemotron-3-super-120b-NVFP4 | Quantized
maci0 | Qwopus3.6-27B-Coder-abliterated-NVFP4 | Quantized
kyr0 | Ornith-35B-FP8-E4M3-MTP | Quantized
tepirale | Ornith-Agents-A1-3.6-35B-A3B-dare_ties | Merged
llmfan46 | Ornith-1.0-35B-uncensored-heretic | Fine-tuned
OpenMOSS-Team | SciJudge-30B-2605 | Fine-tuned
cyankiwi | Agents-A1-AWQ-INT4 | Quantized
XReyRobert | Ornith-1.0-35B-GPTQ-Pro-FOEM-4bit-g128-ns256 | Quantized
ressl | Ornith-1.0-35B-NVFP4 | Quantized
Tesleum | shirdel-agent-4b | Fine-tuned
llmfan46 | Qwythos-9B-Claude-Mythos-5-1M-uncensored-heretic | Fine-tuned
cyankiwi | Ornith-1.0-35B-AWQ-INT4 | Quantized
canada-quant | GLM-5.2-W4A16-MTP | Quantized
suehuynh | Marketing-Llama-3.3-70B | Adapter
AliBuxdev | crop-climate-mistral-7b | Fine-tuned
prasannaJagadesh | marlin-2B-GPTQ-4BITS | Quantized
MuXodious | gemma-4-E4B-it-QAT-SOMPOA-heresy | Fine-tuned
nightmedia | Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Qwythos-qx86-hi-mlx | Merged
lordx64 | Qwable-v2 | Fine-tuned
ATH-MaaS | Marco-Mini-Instruct | Base
suehuynh | Marketing-Mixtral-8x7B-v3 | Adapter
ATH-MaaS | Marco-Nano-Instruct | Base
cyankiwi | Ornith-1.0-9B-AWQ-INT4 | Quantized
protoLabsAI | Ornith-1.0-35B-FP8 | Quantized
CarlosAGDev | ltv-lora-qa | Adapter
CarlosAGDev | ltv-lora-clf | Adapter
CarlosAGDev | ltv-lora-cw | Adapter
haykgrigorian | TimeCapsuleLLM-London-1800-1875-v2-1.2B | Base
lzwjava | sec-edgar-gpt-124m | Base
pavantippannagari | Ornith-1.0-9B-mlx-4Bit | Quantized
Qwen | Qwen3-ASR-0.6B-hf | Base
Qwen | Qwen3-ASR-1.7B-hf | Base
tepirale | gemma-4-12B-merge-coder40-agentic40-it20-task_arithmetic | Merged
Jinyang23 | OPID-ALFWorld-1.7B | Fine-tuned
Jiunsong | SuperQwen-AgentWorld-35B-A3B-abliterated | Fine-tuned
The-JDdev | GLM-5.2-ablated | Base
rbinrs | Qwen2.5-Coder-32B-Instruct-abliterated | Fine-tuned