spinochenza

XORTRON.CriminalComputing.Config.LARGE.XPRT2

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Container

Run this model inference with full control and performance in your environment.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Parameter	Value
direction_index	43.15
attn.o_proj.max_weight	1.48
attn.o_proj.max_weight_position	59.65
attn.o_proj.min_weight	1.44
attn.o_proj.min_weight_distance	48.02
mlp.down_proj.max_weight	1.21
mlp.down_proj.max_weight_position	54.75
mlp.down_proj.min_weight	0.30
mlp.down_proj.min_weight_distance	50.44