DavidBShan
tinker-qlora1-r8-2230f88d-qwen3.6-35b-a3b
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Model details
- Base model: Qwen/Qwen3.6-35B-A3B
- Format: LoRA adapter (PEFT)
Usage
python
from peft import PeftModelfrom transformers import AutoModelForCausalLMbase = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-35B-A3B")model = PeftModel.from_pretrained(base, "DavidBShan/tinker-qlora1-r8-2230f88d-qwen3.6-35b-a3b")
Framework versions
- tinker-cookbook: 0.4.0
- transformers: 5.5.3
- torch: 2.12.0+cpu
Model provider
DavidBShan
Model tree
Base
Qwen/Qwen3.6-35B-A3B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information