nikitastheo

nikitastheo

babylm-ita-ell-interleaved-dummy

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 32
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 16
  • training_steps: 128

Training results

Table
Training LossEpochStepValidation LossAccuracy
10.83125.0510.83540.0018
10.814710.01010.82920.0018
10.781615.01510.81420.0016
10.726320.02010.78060.0012
10.664725.02510.73430.0006
10.609130.03010.69080.0002
10.556935.03510.65380.0008
10.505840.04010.62050.0010
10.455845.04510.58920.0016
10.40950.05010.55860.0022
10.365455.05510.52850.0025
10.324460.06010.50070.0031
10.286565.06510.47560.0041
10.251870.07010.45240.0051
10.220175.07510.43120.0057
10.191580.08010.41210.0059
10.165985.08510.39510.0059
10.143290.09010.38020.0061
10.123595.09510.36730.0061
10.1065100.010010.35630.0063
10.0923105.010510.34720.0063
10.0809110.011010.33990.0063
10.0721115.011510.33430.0061
10.0659120.012010.33050.0061
10.0623125.012510.32850.0061

Framework versions

  • Transformers 4.57.6
  • Pytorch 2.12.0+cu130
  • Datasets 4.8.5
  • Tokenizers 0.22.2

Model provider

nikitastheo

nikitastheo

Model tree

Base

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today