Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0๐งฌ Helix SCE 12B jh
retokenized chatml version of DarkArtsForge/Helix-SCE-12B
this was created using the command C:\mergekit-new>mergekit-tokensurgeon "B:\12B\HelixA-12B" "B:\12B\Vortex5--Prototype-X-12b" Helix-SCE-12B-jh --approximation-method john_hewitt
possibly smarter than the regular tekken version, testing is in progress
only a couple errors reported
note: safetensors are NOT identical, you will need to redownload all files to quantize
bat
WARNING:root:This script is experimental and may produce unexpected results.WARNING:mergekit.tokenizer.normalization:Unknown tokenizer type - assuming 'ฤ ' word start prefixWARNING:mergekit.tokenizer.normalization:Unknown tokenizer type - assuming 'ฤ ' word start prefixApproximating tokens: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 1/1 [00:12<00:00, 12.33s/it]Approximating tokens: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 1/1 [00:12<00:00, 12.20s/it]Saving weights: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 363/363 [00:34<00:00, 10.41it/s]
Model provider
DarkArtsForge
Model tree
Base
DarkArtsForge/Helix-SCE-12B
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information