README
License: other
Bundle
- Format: JANG
- Profile: JANG_1L
- Family: qwen3_5_moe
- Model type: qwen3_5_moe
- Text model type: qwen3_5_moe_text
- Layers: 60
- Tokenizer: Qwen2Tokenizer
- Chat template: chat_template.jinja
- Verified local size: 111G
Model provider: JANGQ-AI
Model tree: Base (this model)
Modalities
- Input: Video, Text, Image
- Output: Text