jhhj25
qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples-sft
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
Model provider
jhhj25
Model tree
Base
jayzou3773/qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples
Fine-tuned
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information