SiddharthaChekuri

Nemotron-Super-49B-AIE-v11-5500-merged-bf16

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

This is a merged bf16 export of nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 with the local AIE v11 LoRA adapter merged into the base weights.

Training summary:

The merged model was exported with LLaMA-Factory and split into 21 safetensors shards.

Model provider

SiddharthaChekuri

Model tree

Base

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information