Dedicated Endpoints
Run inference for this model on a single-tenant GPU for consistent speed and reliability at scale.
Get help setting up a custom Dedicated Endpoint.
Talk with our engineers to get a quote for discounted reserved GPU instances.
README
Base Model
Qwen/Qwen3.5-4B
Training
- Method: DoRA (rank=32, alpha=64)
- Dataset: 11,102 Urdu xCoT examples
- Epochs: 3
- Final Loss: 0.4167
- Hardware: RTX 4080 Super 16GB
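
The Method line above can be made concrete with a small sketch. DoRA splits each adapted weight into a learned magnitude vector and a direction that is the base weight plus a low-rank update, normalized per column. The pure-Python toy below illustrates that reparameterization. It is not the training code used for this adapter. The helper names (`matmul`, `column_norms`, `dora_weight`) are hypothetical, the column-wise norm follows the DoRA paper's notation, and the `alpha / rank` scaling is an assumption based on the common LoRA convention; only rank=32 and alpha=64 come from the list above.

```python
import math

RANK = 32    # from the Training list above
ALPHA = 64   # from the Training list above
# Assumption: the common LoRA convention scales the low-rank update by alpha / rank.
SCALING = ALPHA / RANK


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def column_norms(w):
    """Euclidean norm of each column of w."""
    return [math.sqrt(sum(v * v for v in col)) for col in zip(*w)]


def dora_weight(w0, b, a, magnitude, scaling=SCALING):
    """Return magnitude * (W0 + scaling * B @ A) / ||W0 + scaling * B @ A||, per column."""
    delta = matmul(b, a)
    directed = [[w + scaling * d for w, d in zip(w_row, d_row)]
                for w_row, d_row in zip(w0, delta)]
    norms = column_norms(directed)
    return [[m * v / n for v, m, n in zip(row, magnitude, norms)]
            for row in directed]


# Toy 2x2 example with a rank-1 update (the real adapter uses rank 32).
w0 = [[3.0, 0.0],
      [4.0, 1.0]]
b_init = [[0.0], [0.0]]          # B starts at zero, as in LoRA
a_init = [[0.5, -0.5]]
m_init = column_norms(w0)        # magnitude starts as the column norms of W0

# With B = 0 and magnitude = ||W0||, the adapted weight equals the base weight.
w_at_init = dora_weight(w0, b_init, a_init, m_init)

# After training, B is non-zero; each output column then has norm equal to its magnitude.
b_trained = [[0.1], [-0.2]]
w_trained = dora_weight(w0, b_trained, a_init, m_init)
```

Because the direction is renormalized, the low-rank update changes only where each column points, while the magnitude vector alone controls how long it is. That separation is what distinguishes DoRA from plain LoRA.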
Model provider
AbdullahAmin125
Model tree
Base
Qwen/Qwen3.5-4B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
Supported Functionality
Model APIs
Dedicated Endpoints
Container