Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Pakai
python
from transformers import AutoProcessor, AutoModelForImageTextToTextmodel = AutoModelForImageTextToText.from_pretrained("Adicandra/Qwen3.5-4B-ImageCaptioning-LoRA", dtype="bfloat16", device_map="auto")processor = AutoProcessor.from_pretrained("Adicandra/Qwen3.5-4B-ImageCaptioning-LoRA")# render chat dgn enable_thinking=False, lalu model.generate(...)
Model provider
Adicandra
Model tree
Base
Qwen/Qwen3.5-4B-Base
Fine-tuned
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information