Banaxi-Tech
BananaMind-Completor-V1
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0License
BananaMind Completor V1 model code and released weights are provided under the Apache License 2.0. See LICENSE.
The training dataset has separate upstream terms. The original ODC-BY 1.0 license text for agentlans/high-quality-english-sentences is included at THIRD_PARTY_LICENSES/ODC-BY-1.0.txt, with attribution in NOTICE.
Model Details
- Architecture:
GPT2LMHeadModel - Parameters: 1,927,936
- Vocabulary: 256 byte values
- Context length: 128 bytes
- Embedding size: 128
- Attention heads: 4
- Transformer layers: 4
- MLP inner size: 1536
- Input/output embeddings: untied
- Checkpoint source:
autocomplete_model_gpt2_best.pt - Release weights:
model.safetensors
Training Data Attribution
This model was trained on agentlans/high-quality-english-sentences, using the train split for training and the test split for validation.
Dataset attribution: agentlans/high-quality-english-sentences.
Dataset license: ODC-BY 1.0. Follow the dataset license terms when reusing the training data or distributing derived artifacts that require attribution. This release includes the original ODC-BY 1.0 license text in THIRD_PARTY_LICENSES/ODC-BY-1.0.txt.
Evaluation
The released checkpoint is the best validation checkpoint from local GPT-2 training:
| Metric | Value |
|---|---|
| Training step | 70000 |
| Training loss | 1.193311095237732 |
| Validation loss | 1.1782804751396179 |
| Best validation loss | 1.1782804751396179 |
Usage
Install dependencies:
bash
pip install -r requirements.txt
Load the model with Transformers:
python
from transformers import AutoModelForCausalLMmodel = AutoModelForCausalLM.from_pretrained(".")
Because this model uses raw byte ids, encode text manually:
python
import torchfrom transformers import AutoModelForCausalLMmodel = AutoModelForCausalLM.from_pretrained(".")model.eval()text = "The weather is beau"tokens = list(text.encode("utf-8"))for _ in range(20):x = torch.tensor([tokens[-128:]], dtype=torch.long)with torch.no_grad():next_id = model(input_ids=x).logits[0, -1].argmax().item()if chr(next_id) in " \n\t.,!?;:":breaktokens.append(next_id)print(bytes(tokens).decode("utf-8", errors="ignore"))
Or use the included helper:
bash
python inference.py "The weather is beau"
Files
model.safetensors: native GPT-2 model weightsconfig.json: Transformers GPT-2 configgeneration_config.json: Transformers generation configinference.py: byte-level autocomplete helpertraining_metadata.json: checkpoint, dataset, and evaluation metadatarequirements.txt: minimal Python dependenciesLICENSE: Apache License 2.0 for this model releaseNOTICE: upstream dataset attribution noticeTHIRD_PARTY_LICENSES/ODC-BY-1.0.txt: original ODC-BY 1.0 license text for the training dataset
Limitations
This is a small autocomplete model trained for short English sentence contexts. It is not instruction-tuned, not a general chat model, and may produce incomplete, repetitive, or low-quality completions outside short autocomplete-style prompts.
Model provider
Banaxi-Tech
Model tree
Base
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information