pszemraj

rnj-1.5-instruct

README

License: apache-2.0

Changes vs upstream

Table with columns: Field, Upstream, Here
Field	Upstream	Here
Local layer type	chunked_attention	sliding_attention
RoPE params for locals	under chunked_attention key	moved to sliding_attention key
Dtype	float32	bfloat16
Architecture string	Rnj1ForCausalLM	Gemma3ForCausalLM

Local/global layer pattern (LLLGLLLGLLLGLGGGGGLGLLLGLLLGLLLL) preserved.

Usage

python
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="pszemraj/rnj-1.5-instruct",
    dtype=torch.bfloat16,
    device_map="auto",
)
res = pipe([{"role": "user", "content": "Who are you?"}])
print(res)

License

Apache 2.0, inherited from upstream. See the original model card for architecture, benchmarks, and citation.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Model Details

Model Provider

pszemraj

Model Tree

Base

EssentialAI/rnj-1

Fine-tuned

this model

Input Modalities

Text

Output Modalities