README
License: other
Bundle
- Format: JANGTQ
- Profile: JANGTQ2
- Family: qwen3_5_moe
- Model type: qwen3_5_moe
- Text model type: qwen3_5_moe_text
- Layers: 60
- TQ layout: prestacked_switch_mlp
- Runtime sidecar: jangtq_runtime.safetensors
- Tokenizer: Qwen2Tokenizer
- Chat template: chat_template.jinja
- Verified local size: 101G
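
The listing above names two files that ship with the bundle. As a minimal sketch, assuming the bundle has been downloaded to a local directory, the following checks that those files are present and totals the on-disk size for comparison with the listed figure. The function names and the directory argument are hypothetical and not part of the bundle. The listing does not say whether `101G` means GB or GiB, so the size is returned in bytes.

```python
from pathlib import Path

# File names taken from the bundle listing; everything else here is illustrative.
EXPECTED_FILES = ("jangtq_runtime.safetensors", "chat_template.jinja")


def missing_bundle_files(bundle_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in bundle_dir."""
    root = Path(bundle_dir)
    return [name for name in expected if not (root / name).is_file()]


def bundle_size_bytes(bundle_dir):
    """Sum the sizes of all regular files under bundle_dir, in bytes."""
    root = Path(bundle_dir)
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())
```

Calling `missing_bundle_files` on the download directory returns an empty list when both files are in place. Dividing the result of `bundle_size_bytes` by `10**9` or `2**30` gives a value to compare against the listed size under either reading of the unit.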
Model provider: JANGQ-AI
Modalities
- Input: Video, Text, Image
- Output: Text