Open Models, Ready for Production

nvidia

Nemotron-3-Embed-8B-BF16

Dedicated

nvidia

Nemotron-3-Embed-1B-BF16

Dedicated

nvidia

Nemotron-3-Embed-1B-NVFP4

Dedicated

zai-org

GLM-5.2

Model APIs

Dedicated

Multimodal

MiniMaxAI

MiniMax-M3

Dedicated

Multimodal

moonshotai

Kimi-K2.7-Code

Dedicated

zai-org

GLM-5.1

Model APIs

Dedicated

deepseek-ai

DeepSeek-V4-Flash

Dedicated

deepseek-ai

DeepSeek-V4-Pro

Dedicated

deepseek-ai

DeepSeek-V3.2

Model APIs

Dedicated

MiniMaxAI

MiniMax-M2.5

Model APIs

Dedicated

Multimodal

google

gemma-4-31B-it

Model APIs

Dedicated

All models

548,533 results found

Model Name

Input

Output

Type

google

gemma-3-27b-it

microsoft

phi-4

farbodtavakkoli

OTel-2.0-LLM-31B-IT

PatSnap

Hiro-Pharma

drowzeys

keys-latest-GLM-5.2-Quantrio-INT4-INT8-Mixed-Abliterated-DFlash

Ma7ee7

SmolLM2-135M-Reasoning-5K

dockhardman

gemma-4-E2B-duplex

Adapter

brandonmusic

GLM-5.2-NVFP4-TR3-Hybrid

maqsudxo1ja

uz-whisper-small-stt-v2

Blackfrost-AI

GLM-5.2-ABLITERATED-NVFP4

MaliosDark

Nexus-Erebus-50M

RedHatAI

GLM-5.2-NVFP4-FP8

PMSCCMA

FengHe

cfontes

GLM-5.2-Ablated-Molt

QuantTrio

GLM-5.2-Int4-Int8Mix

Naphula

Goetia-26B-A4B-v1.3-Absolute-Heretic-ARA

Merged

cyankiwi

GLM-5.2-AWQ-INT4

vectionlabs

Salience-1-9B

prefeitura-rio

Rio-3.1-Open-235B-VL

google

gemma-4-26B-A4B-it-qat-q4_0-unquantized

google

gemma-4-E2B-it-qat-mobile-ct

DreamFast

Qwen3-VL-8B-Heretic-1.3.0

Naphula

Goetia-31B-v1

Merged

heretic-org

Qwen3-VL-8B-Instruct-heretic

latam-gpt

Llama-3.1-70B-LatamGPT-SFT-1.0

Flix-AI

flix-swissgerman-full

aisingapore

Gemma-SEA-LION-v4.5-E2B-IT

zed-industries

zeta-2.1

DreamFast

qwen3-8b-heretic

darkc0de

GLM-4.7-Flash-heretic-1.2.0

Naphula

StormSeeker-24B-v1

Merged

Salesforce

moirai-agent

Qwen

Qwen3-VL-Reranker-8B

Qwen

Qwen3-VL-Embedding-8B

Doradus

RnJ-1-Instruct-FP8

mistralai

Ministral-3-14B-Instruct-2512

Qwen

Qwen3-VL-30B-A3B-Instruct

Qwen

Qwen3-VL-235B-A22B-Instruct

Qwen

Qwen3-VL-32B-Instruct

cerebras

GLM-4.5-Air-REAP-82B-A12B

Qwen

Qwen3-VL-4B-Thinking