Dedicated Endpoints
Run inference for this model on a single-tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoint.
Talk with an engineer to get a quote for reserved GPU instances at a discount.
README
Model Description
This model was fine-tuned using low-rank adaptation (LoRA). The LoRA configuration is as follows:
python
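from peft import LoraConfig

# Set up the LoRA configuration.
# target_list (the module names to adapt) is defined elsewhere; its contents are not given here.
lora_config = LoraConfig(
    r=64,
    lora_alpha=64,
    target_modules=target_list,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)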
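To make the numbers in the configuration above more concrete, here is a minimal sketch of what `r=64` and `lora_alpha=64` imply under the standard LoRA formulation, where the low-rank update is scaled by `lora_alpha / r` and each adapted weight matrix of shape `d_out x d_in` gains `r * (d_in + d_out)` trainable parameters. The helper names and the 2048 x 2048 layer shape are hypothetical, chosen only for illustration; they are not taken from the model card or from the base model.

```python
def lora_scaling(r: int, lora_alpha: int) -> float:
    """Scaling factor applied to the low-rank update (standard LoRA: alpha / r)."""
    return lora_alpha / r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters added by one adapter: A is r x d_in, B is d_out x r."""
    return r * (d_in + d_out)


# Values taken from the configuration in the model description.
R = 64
LORA_ALPHA = 64

# Hypothetical layer shape, for illustration only (not from the model card).
D_IN, D_OUT = 2048, 2048

scaling = lora_scaling(R, LORA_ALPHA)
adapter_params = lora_param_count(D_IN, D_OUT, R)
full_params = D_IN * D_OUT

print(f"scaling factor: {scaling}")
print(f"adapter parameters per layer: {adapter_params}")
print(f"fraction of the full matrix: {adapter_params / full_params:.4f}")
```

With `lora_alpha` equal to `r`, the scaling factor is 1, so the low-rank update is added to the frozen weights without being scaled up or down.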
Model provider
okezieowen
Model tree
Base
google/gemma-4-E2B-it
Fine-tuned
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
Supported Functionality
Model APIs
Dedicated Endpoints
Container