Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Container
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Source
- Base model:
mistralai/Mistral-7B-Instruct-v0.2 - Dataset:
Lots-of-LoRAs/task209_stancedetection_classification - Train split:
train - Eval split:
valid - Task ID:
209 - Description:
stancedetection classification
LoRA
- Rank:
128 - Target modules:
q_proj, k_proj, v_proj - LoRA alpha:
32 - LoRA dropout:
0.05 - Bias:
none
Training protocol
- Base model dtype:
4bit-nf4 - Quantization:
QLoRA 4bit NF4, double quantization enabled, bf16 compute - Adapter trainable dtype:
float32 - Prompt format:
plain - Loss: completion-only causal LM cross entropy
- Epochs:
5.0 - Best checkpoint metric:
eval_loss - Learning rate:
0.0002 - Scheduler:
cosine - Warmup ratio:
0.03 - Effective batch size:
16 - Optimizer:
paged_adamw_32bit
Files
adapter_model.safetensors: LoRA adapter weightsadapter_config.json: PEFT adapter configurationtask_manifest.json: source manifest row and resolved splitstraining_protocol.json: fixed protocol used for this run
Model provider
geonho1
Model tree
Base
mistralai/Mistral-7B-Instruct-v0.2
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information