armand0e
gemma-4-26B-opus-v3
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Gemma 4 26B A4B Opus v3
This is highly experimental. The model was trained with a custom chat template with preserved reasoning_content across turns. It wasn't stable for inference so the chat template was then reverted back to the official template
- Developed by: armand0e
- License: apache-2.0
- Finetuned from model : armand0e/gemma-4-26B-opus-v3
This gemma4 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model provider
armand0e
Model tree
Base
armand0e/gemma-4-26B-opus-v3
Fine-tuned
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information