geonho1
Mistral-7B-Instruct-v0.2-4b-r8-task738
Dedicated EndpointsRun this model inference on single tenant GPU with unmatched speed and reliability at scale.
Learn moreContainerRun this model inference with full control and performance in your environment.
Learn moreGet help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
Model tree
mistralai/Mistral-7B-Instruct-v0.2
geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task738 API & Inference Endpoint | FriendliAIREADME
License: apache-2.0Source
- Base model:
mistralai/Mistral-7B-Instruct-v0.2
- Dataset:
Lots-of-LoRAs/task738_perspectrum_classification
- Train split:
train
- Eval split:
valid
- Task ID:
738
- Description:
perspectrum classification
LoRA
- Rank:
8
- Target modules:
q_proj, k_proj, v_proj
- LoRA alpha:
32
- LoRA dropout:
0.05
- Bias:
none
Training protocol
- Base model dtype:
4bit-nf4
- Quantization:
QLoRA 4bit NF4, double quantization enabled, bf16 compute
- Adapter trainable dtype:
float32
- Prompt format:
plain
- Loss: completion-only causal LM cross entropy
- Epochs:
5.0
- Best checkpoint metric:
eval_loss
- Learning rate:
0.0002
- Scheduler:
cosine
- Warmup ratio:
Files
adapter_model.safetensors: LoRA adapter weights
adapter_config.json: PEFT adapter configuration
task_manifest.json: source manifest row and resolved splits
training_protocol.json: fixed protocol used for this run
0.03
Effective batch size: 16Optimizer: paged_adamw_32bit