Open Models, Ready for Production

nvidia

Nemotron-3-Embed-8B-BF16

Dedicated

nvidia

Nemotron-3-Embed-1B-BF16

Dedicated

nvidia

Nemotron-3-Embed-1B-NVFP4

Dedicated

zai-org

GLM-5.2

Model APIs

Dedicated

Multimodal

MiniMaxAI

MiniMax-M3

Dedicated

Multimodal

moonshotai

Kimi-K2.7-Code

Dedicated

zai-org

GLM-5.1

Model APIs

Dedicated

deepseek-ai

DeepSeek-V4-Flash

Dedicated

deepseek-ai

DeepSeek-V4-Pro

Dedicated

deepseek-ai

DeepSeek-V3.2

Model APIs

Dedicated

MiniMaxAI

MiniMax-M2.5

Model APIs

Dedicated

Multimodal

google

gemma-4-31B-it

Model APIs

Dedicated

All models

548,533 results found

Model Name

Input

Output

Type

Qwen

Qwen3-4B-Instruct-2507

Deploy

cpatonn

Qwen3-30B-A3B-Thinking-2507-AWQ

Quantized

Deploy

DeepHat

DeepHat-V1-7B

Deploy

mistralai

Mistral-Small-3.1-24B-Instruct-2503

Deploy

meta-llama

Llama-4-Maverick-17B-128E-Instruct

Deploy

AtomixLabs

Photon-2.0-1M

Deploy

MaliosDark

Nexus-Erebus-135M

Deploy

MCG-NJU

TimeLens2-4B

Deploy

vectionlabs

Salience-1.5-Flash

Deploy

WeiboAI

VibeThinker-3B

Deploy

google

gemma-4-31B-it-qat-q4_0-unquantized

Deploy

ZERO-POINT-INTELLIGENCE-LTD

UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN

Quantized

Deploy

google

translategemma-27b-it

Deploy

maya-research

maya-1-voice

Deploy

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ

Quantized

Deploy

Qwen

Qwen3-14B

Deploy

google

gemma-3-12b-it-qat-q4_0-unquantized

Deploy

enhanceaiteam

Flux-uncensored

Adapter

Deploy

Qwen

Qwen2.5-Coder-7B-Instruct

Deploy

google

gemma-3-1b-it

Deploy

mistralai

Mistral-7B-Instruct-v0.3

Deploy

AtomixLabs

Photon-1.0-1M

Deploy

Vortex5

Chimera-X-26B-A4B

Merged

Deploy

el4

Xenon-26B-A4B

Adapter

Deploy

Vortex5

G4-Moonlight-Dusk-26B-A4B

Merged

Deploy

drowzeys

GLM-5.2-Int4-Int8Mix-Abliterated

Deploy

OpenOneRec

OneReason-0.8B-pretrain

Deploy

aifeifei798

Gemma-4-Queen-31B-it

Deploy

ZJU-AI4H

Hulu-Med-235A22

Deploy

google

translategemma-12b-it

Deploy

Qwen

Qwen3-VL-Reranker-2B

Deploy

google

functiongemma-270m-it

Deploy

ArliAI

GLM-4.6-Derestricted

Deploy

Qwen

Qwen3-30B-A3B-Instruct-2507

Deploy

zai-org

GLM-4.5-Air

Deploy

Qwen

Qwen3-4B

Deploy

MaziyarPanahi

calme-3.2-instruct-78b

Deploy

google

gemma-2-2b

Deploy

Qwen

Qwen2.5-0.5B-Instruct

Deploy

meta-llama

Llama-3.2-1B-Instruct

Deploy

meta-llama

Llama-2-7b-chat-hf

Deploy

google

gemma-3-12b-it