MananSuri27
gemma4-31b-full-mini-swe-secbench
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Transformers
python
from transformers import AutoModelForCausalLM, AutoTokenizerrepo_id = "MananSuri27/gemma4-31b-full-mini-swe-secbench"tokenizer = AutoTokenizer.from_pretrained(repo_id)model = AutoModelForCausalLM.from_pretrained(repo_id, dtype="auto", device_map="auto")
vLLM
bash
vllm serve MananSuri27/gemma4-31b-full-mini-swe-secbench --dtype bfloat16
The model was trained on assistant action/tool-call turns from agent trajectories.
Model provider
MananSuri27
Model tree
Base
google/gemma-4-31B-it
Fine-tuned
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information