jkim96

gemma-4-26B-A4B-it-DASHQ-INT4-g128

README

License: apache-2.0

bash
pip install git+https://github.com/JaeminK/dashq.git

python
from dashq import load_quantized

model, tokenizer = load_quantized("jkim96/gemma-4-26B-A4B-it-DASHQ-INT4-g128", device_map="auto")

Full zero-shot / few-shot results for every DASH-Q checkpoint: github.com/JaeminK/dashq#benchmarks

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Container

Run this model inference with full control and performance in your environment.

Model Details

Model Provider

jkim96

Model Tree

Base

google/gemma-4-26B-A4B-it

Quantized

this model

Input Modalities

TextImage

Output Modalities

Text

Supported Functionality

Dedicated EndpointsContainer

gemma-4-26B-A4B-it-DASHQ-INT4-g128 API & Inference Endpoint | FriendliAI