Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0🧠 What it knows
- Pydantic v2 (validators, aliases, model_validate)
- typing (Generic, ParamSpec, TypeVar)
- collections (ChainMap, defaultdict)
- Decorators & context managers
🚀 Quick start
python
from transformers import AutoModelForCausalLM, AutoTokenizermodel = AutoModelForCausalLM.from_pretrained("Zapivara/pyqwen-4B")tokenizer = AutoTokenizer.from_pretrained("Zapivara/pyqwen-4B")
📊 Training
| Parameter | Value |
|---|---|
| Base model | Qwen/Qwen3.5-4B |
| Dataset | 498 examples |
| Method | QLoRA (LoRA r=16) |
| Epochs | 3 |
| Final loss | 0.380 |
| Hardware | Tesla T4 |
| Time | 52 min |
Model provider
Zapivara
Model tree
Base
Qwen/Qwen3.5-4B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information