README

Merge Inputs

  • Base model: google/gemma-4-E4B-it
  • Adapter: RohithMidigudla/gemma-health-telugu-medical-grpo-policy-v3
  • Merge path: LoRA deltas added manually to the Hugging Face model weights (safe_merge=True)
  • Missing target policy: warn
  • Dtype: bfloat16
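
The merge path listed above can be illustrated with a minimal sketch. It assumes the standard LoRA formulation, in which each adapted weight becomes W + scale * (B @ A) with scale = lora_alpha / r. The function names (`merge_lora`, `matmul`), the layer name `q_proj`, and the finiteness check standing in for a "safe" merge are hypothetical choices for illustration, not the code actually used to build this model. It uses plain Python lists rather than bfloat16 tensors so that it runs without extra dependencies.

```python
import math
import warnings


def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(base_weights, lora_layers, scale, missing_target_policy="warn"):
    """Return a copy of base_weights with each LoRA delta added in.

    base_weights: dict mapping layer name -> matrix of shape (out, in)
    lora_layers:  dict mapping layer name -> (A, B), A is (r, in), B is (out, r)
    scale:        lora_alpha / r (assumed standard LoRA scaling)
    """
    # Work on a copy so the base weights stay untouched if anything fails.
    merged = {name: [row[:] for row in w] for name, w in base_weights.items()}

    for name, (a, b) in lora_layers.items():
        if name not in merged:
            # Hypothetical reading of "Missing target policy: warn":
            # report the missing layer and skip it instead of aborting.
            if missing_target_policy == "warn":
                warnings.warn(f"LoRA target {name!r} not found in base model; skipping")
                continue
            raise KeyError(name)

        delta = matmul(b, a)  # shape (out, in)
        w = merged[name]
        if len(delta) != len(w) or len(delta[0]) != len(w[0]):
            raise ValueError(f"shape mismatch for {name!r}")

        candidate = [
            [wv + scale * dv for wv, dv in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)
        ]
        # Assumed "safe" behaviour: reject non-finite values before committing.
        if not all(math.isfinite(v) for row in candidate for v in row):
            raise ValueError(f"non-finite values after merging {name!r}")
        merged[name] = candidate

    return merged
```

For example, calling `merge_lora({"q_proj": W}, {"q_proj": (A, B)}, scale=alpha / r)` returns a new dictionary and leaves `W` unmodified. With the default policy, an adapter entry for a layer that is absent from the base weights produces a warning rather than an error.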

Evaluate safety, Telugu quality, and medical QA behavior before clinical or field use.

Model provider

RohithMidigudla

Model tree

  • Base: google/gemma-4-E4B-it
  • Fine-tuned: this model

Modalities

  • Input: Text, Image
  • Output: Text
