Banaxi-Tech

BananaMind-Completor-V1

README

License: apache-2.0

License

BananaMind Completor V1 model code and released weights are provided under the Apache License 2.0. See LICENSE.

The training dataset has separate upstream terms. The original ODC-BY 1.0 license text for agentlans/high-quality-english-sentences is included at THIRD_PARTY_LICENSES/ODC-BY-1.0.txt, with attribution in NOTICE.

Model Details

Architecture: GPT2LMHeadModel
Parameters: 1,927,936
Vocabulary: 256 byte values
Context length: 128 bytes
Embedding size: 128
Attention heads: 4
Transformer layers: 4
MLP inner size: 1536
Input/output embeddings: untied
Checkpoint source: autocomplete_model_gpt2_best.pt
Release weights: model.safetensors

Training Data Attribution

This model was trained on agentlans/high-quality-english-sentences, using the train split for training and the test split for validation.

Dataset attribution: agentlans/high-quality-english-sentences.

Dataset license: ODC-BY 1.0. Follow the dataset license terms when reusing the training data or distributing derived artifacts that require attribution. This release includes the original ODC-BY 1.0 license text in THIRD_PARTY_LICENSES/ODC-BY-1.0.txt.

Evaluation

The released checkpoint is the best validation checkpoint from local GPT-2 training:

Table with columns: Metric, Value
Metric	Value
Training step	70000
Training loss	1.193311095237732
Validation loss	1.1782804751396179
Best validation loss	1.1782804751396179

Usage

Install dependencies:

bash
pip install -r requirements.txt

Load the model with Transformers:

python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(".")

Because this model uses raw byte ids, encode text manually:

python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(".")
model.eval()

text = "The weather is beau"
tokens = list(text.encode("utf-8"))

for _ in range(20):
    x = torch.tensor([tokens[-128:]], dtype=torch.long)
    with torch.no_grad():
        next_id = model(input_ids=x).logits[0, -1].argmax().item()
    if chr(next_id) in " \n\t.,!?;:":
        break
    tokens.append(next_id)

print(bytes(tokens).decode("utf-8", errors="ignore"))

Or use the included helper:

bash
python inference.py "The weather is beau"

Files

model.safetensors: native GPT-2 model weights
config.json: Transformers GPT-2 config
generation_config.json: Transformers generation config
inference.py: byte-level autocomplete helper
training_metadata.json: checkpoint, dataset, and evaluation metadata
requirements.txt: minimal Python dependencies
LICENSE: Apache License 2.0 for this model release
NOTICE: upstream dataset attribution notice
: original ODC-BY 1.0 license text for the training dataset

Limitations

This is a small autocomplete model trained for short English sentence contexts. It is not instruction-tuned, not a general chat model, and may produce incomplete, repetitive, or low-quality completions outside short autocomplete-style prompts.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Model Details

Model Provider

Banaxi-Tech

Model Tree

Base

this model

Input Modalities

Text

Output Modalities

Text

Supported Functionality

Dedicated EndpointsContainer

Explore FriendliAI today

Get started Talk to an engineer

README

License: apache-2.0

License

BananaMind Completor V1 model code and released weights are provided under the Apache License 2.0. See LICENSE.

Model Details

Architecture: GPT2LMHeadModel
Parameters: 1,927,936
Vocabulary: 256 byte values
Context length: 128 bytes
Embedding size: 128
Attention heads: 4
Transformer layers: 4
MLP inner size: 1536
Input/output embeddings: untied
Checkpoint source: autocomplete_model_gpt2_best.pt
Release weights: model.safetensors

Training Data Attribution

This model was trained on agentlans/high-quality-english-sentences, using the train split for training and the test split for validation.

Dataset attribution: agentlans/high-quality-english-sentences.

Evaluation

The released checkpoint is the best validation checkpoint from local GPT-2 training:

Table with columns: Metric, Value
Metric	Value
Training step	70000
Training loss	1.193311095237732
Validation loss	1.1782804751396179
Best validation loss	1.1782804751396179

Usage

Install dependencies:

bash
pip install -r requirements.txt

Load the model with Transformers:

python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(".")

Because this model uses raw byte ids, encode text manually:

python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(".")
model.eval()

text = "The weather is beau"
tokens = list(text.encode("utf-8"))

for _ in range(20):
    x = torch.tensor([tokens[-128:]], dtype=torch.long)
    with torch.no_grad():
        next_id = model(input_ids=x).logits[0, -1].argmax().item()
    if chr(next_id) in " \n\t.,!?;:":
        break
    tokens.append(next_id)

print(bytes(tokens).decode("utf-8", errors="ignore"))

Or use the included helper:

bash
python inference.py "The weather is beau"

Files

model.safetensors: native GPT-2 model weights
config.json: Transformers GPT-2 config
generation_config.json: Transformers generation config
inference.py: byte-level autocomplete helper
training_metadata.json: checkpoint, dataset, and evaluation metadata
requirements.txt: minimal Python dependencies
LICENSE: Apache License 2.0 for this model release
NOTICE: upstream dataset attribution notice
: original ODC-BY 1.0 license text for the training dataset