Banaxi-Tech

BananaMind-Completor-V1

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

License

BananaMind Completor V1 model code and released weights are provided under the Apache License 2.0. See LICENSE.

The training dataset has separate upstream terms. The original ODC-BY 1.0 license text for agentlans/high-quality-english-sentences is included at THIRD_PARTY_LICENSES/ODC-BY-1.0.txt, with attribution in NOTICE.

Model Details

  • Architecture: GPT2LMHeadModel
  • Parameters: 1,927,936
  • Vocabulary: 256 byte values
  • Context length: 128 bytes
  • Embedding size: 128
  • Attention heads: 4
  • Transformer layers: 4
  • MLP inner size: 1536
  • Input/output embeddings: untied
  • Checkpoint source: autocomplete_model_gpt2_best.pt
  • Release weights: model.safetensors

Training Data Attribution

This model was trained on agentlans/high-quality-english-sentences, using the train split for training and the test split for validation.

Dataset attribution: agentlans/high-quality-english-sentences.

Dataset license: ODC-BY 1.0. Follow the dataset license terms when reusing the training data or distributing derived artifacts that require attribution. This release includes the original ODC-BY 1.0 license text in THIRD_PARTY_LICENSES/ODC-BY-1.0.txt.

Evaluation

The released checkpoint is the best validation checkpoint from local GPT-2 training:

Table
MetricValue
Training step70000
Training loss1.193311095237732
Validation loss1.1782804751396179
Best validation loss1.1782804751396179

Usage

Install dependencies:

bash

pip install -r requirements.txt

Load the model with Transformers:

python

from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(".")

Because this model uses raw byte ids, encode text manually:

python

import torch
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(".")
model.eval()
text = "The weather is beau"
tokens = list(text.encode("utf-8"))
for _ in range(20):
x = torch.tensor([tokens[-128:]], dtype=torch.long)
with torch.no_grad():
next_id = model(input_ids=x).logits[0, -1].argmax().item()
if chr(next_id) in " \n\t.,!?;:":
break
tokens.append(next_id)
print(bytes(tokens).decode("utf-8", errors="ignore"))

Or use the included helper:

bash

python inference.py "The weather is beau"

Files

  • model.safetensors: native GPT-2 model weights
  • config.json: Transformers GPT-2 config
  • generation_config.json: Transformers generation config
  • inference.py: byte-level autocomplete helper
  • training_metadata.json: checkpoint, dataset, and evaluation metadata
  • requirements.txt: minimal Python dependencies
  • LICENSE: Apache License 2.0 for this model release
  • NOTICE: upstream dataset attribution notice
  • THIRD_PARTY_LICENSES/ODC-BY-1.0.txt: original ODC-BY 1.0 license text for the training dataset

Limitations

This is a small autocomplete model trained for short English sentence contexts. It is not instruction-tuned, not a general chat model, and may produce incomplete, repetitive, or low-quality completions outside short autocomplete-style prompts.

Model provider

Banaxi-Tech

Model tree

Base

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today