mente-ai

uyu-1-10M

README

License: mit

uyu

This is a nanoGPT checkpoint converted to a Hugging Face GPT-2-compatible GPT2LMHeadModel.

The model weights load with:

python
from transformers import AutoTokenizer, GPT2LMHeadModel, pipeline

model = GPT2LMHeadModel.from_pretrained(".")
tokenizer = AutoTokenizer.from_pretrained(".", trust_remote_code=True)

pipe = pipeline(
    "text-generation",
    model="mente-ai/uyu-1-10M",
    trust_remote_code=True,
)