geonho1
Mistral-7B-Instruct-v0.2-4b-r8-task600
Dedicated EndpointsRun this model inference on single tenant GPU with unmatched speed and reliability at scale.
Learn moreContainerRun this model inference with full control and performance in your environment.
Learn moreGet help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
Model tree
mistralai/Mistral-7B-Instruct-v0.2
geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task600 API & Inference Endpoint | FriendliAIREADME
License: apache-2.0Source
- Base model:
mistralai/Mistral-7B-Instruct-v0.2
- Dataset:
Lots-of-LoRAs/task600_find_the_longest_common_substring_in_two_strings
- Train split:
train
- Eval split:
valid
- Task ID:
600
- Description:
find the longest common substring in two strings
LoRA
- Rank:
8
- Target modules:
q_proj, k_proj, v_proj
- LoRA alpha:
32
- LoRA dropout:
0.05
- Bias:
none
Training protocol
- Base model dtype:
4bit-nf4
- Quantization:
QLoRA 4bit NF4, double quantization enabled, bf16 compute
- Adapter trainable dtype:
float32
- Prompt format:
plain
- Loss: completion-only causal LM cross entropy
- Epochs:
5.0
- Best checkpoint metric:
eval_loss
- Learning rate:
0.0002
- Scheduler:
cosine
- Warmup ratio:
Files
adapter_model.safetensors: LoRA adapter weights
adapter_config.json: PEFT adapter configuration
task_manifest.json: source manifest row and resolved splits
training_protocol.json: fixed protocol used for this run
0.03
Effective batch size: 16Optimizer: paged_adamw_32bit