Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Merge Inputs
- Base model:
google/gemma-4-E4B-it - Adapter:
RohithMidigudla/gemma-health-telugu-medical-grpo-policy-v3 - Merge path: manual LoRA delta add into HF model weights (
safe_merge=True) - Missing target policy:
warn - Dtype:
bfloat16
Evaluate safety, Telugu quality, and medical QA behavior before clinical or field use.
Model provider
RohithMidigudla
Model tree
Base
google/gemma-4-E4B-it
Fine-tuned
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information