geonho1

Mistral-7B-Instruct-v0.2-4b-r8-task600

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

Source

Base model: mistralai/Mistral-7B-Instruct-v0.2
Dataset: Lots-of-LoRAs/task600_find_the_longest_common_substring_in_two_strings
Train split: train
Eval split: valid
Task ID: 600
Description: find the longest common substring in two strings

LoRA

Rank: 8
Target modules: q_proj, k_proj, v_proj
LoRA alpha: 32
LoRA dropout: 0.05
Bias: none

Training protocol

Base model dtype: 4bit-nf4
Quantization: QLoRA 4bit NF4, double quantization enabled, bf16 compute
Adapter trainable dtype: float32
Prompt format: plain
Loss: completion-only causal LM cross entropy
Epochs: 5.0
Best checkpoint metric: eval_loss
Learning rate: 0.0002
Scheduler: cosine
Warmup ratio:

Files

adapter_model.safetensors: LoRA adapter weights
adapter_config.json: PEFT adapter configuration
task_manifest.json: source manifest row and resolved splits
training_protocol.json: fixed protocol used for this run

Model provider

geonho1

Model tree

Base

mistralai/Mistral-7B-Instruct-v0.2

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Model card

Explore FriendliAI today

Get started Talk to an engineer

Source

Base model: mistralai/Mistral-7B-Instruct-v0.2
Dataset: Lots-of-LoRAs/task600_find_the_longest_common_substring_in_two_strings
Train split: train
Eval split: valid
Task ID: 600
Description: find the longest common substring in two strings

LoRA

Rank: 8
Target modules: q_proj, k_proj, v_proj
LoRA alpha: 32
LoRA dropout: 0.05
Bias: none

Training protocol

Base model dtype: 4bit-nf4
Quantization: QLoRA 4bit NF4, double quantization enabled, bf16 compute
Adapter trainable dtype: float32
Prompt format: plain
Loss: completion-only causal LM cross entropy
Epochs: 5.0
Best checkpoint metric: eval_loss
Learning rate: 0.0002
Scheduler: cosine
Warmup ratio:

Files

adapter_model.safetensors: LoRA adapter weights
adapter_config.json: PEFT adapter configuration
task_manifest.json: source manifest row and resolved splits
training_protocol.json: fixed protocol used for this run

Mistral-7B-Instruct-v0.2-4b-r8-task600

Get help setting up a custom Dedicated Endpoints.

README

Source

LoRA

Training protocol

Files

Explore FriendliAI today

README

Source

LoRA

Training protocol

Files