Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Merge Details
Merge Method
This model was merged using the Linear DELLA merge method using models/gemma-4-31B-it as a base.
Models Merged
The following models were included in the merge:
- models/gemma-4-31B
- models/G4-MeroMero-31B-uncensored-heretic
- models/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic
- models/Gemma4-GarnetV2-31B
Configuration
The following YAML configuration was used to produce this model:
yaml
architecture: Gemma4ForConditionalGenerationbase_model: models/gemma-4-31B-itmodels:- model: models/gemma-4-31Bparameters:weight: 0.2- model: models/gemma-4-31B-itparameters:weight: 0.2- model: models/G4-MeroMero-31B-uncensored-hereticparameters:weight: 0.2- model: models/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-hereticparameters:weight: 0.2- model: models/Gemma4-GarnetV2-31Bparameters:weight: 0.2merge_method: della_linearparameters:lambda: 1.0normalize: falseint8_mask: falserescale: truedensity: 0.5epsilon: 0.4dtype: bfloat16out_dtype: bfloat16tokenizer:source: unionchat_template: auto
Model provider
sheliak
Model tree
Base
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information