Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Model Description

This model was fine-tuned from Qwen2.5-VL-3B-Instruct using Parameter-Efficient Fine-Tuning (LoRA).

The training objective focused on:

  • Mathematical reasoning
  • Chain-of-thought style explanations
  • Visual question answering
  • ScienceQA mathematical problems
  • Multimodal image-text understanding
  • Turkish and English instruction following

Base Model

  • Qwen2.5-VL-3B-Instruct

Fine-Tuning Method

  • QLoRA (4-bit)
  • LoRA adapters
  • PEFT
  • Hugging Face Transformers

Training Dataset

ScienceQA-Math-CoT

The dataset contains:

  • ScienceQA mathematical questions
  • Associated images
  • Step-by-step solutions
  • Final answers

The model was trained to generate reasoning traces before producing final answers.

Intended Uses

Suitable Uses

  • Educational assistants
  • Mathematical tutoring
  • Visual mathematical reasoning
  • STEM learning applications
  • Homework support
  • Science question answering
  • Turkish and English multimodal assistants

Out-of-Scope Uses

This model is not intended for:

  • Medical diagnosis
  • Legal advice
  • Financial decision-making
  • Safety-critical systems
  • Autonomous decision-making

Training Details

Hardware

  • Kaggle Dual NVIDIA T4 GPUs

Training Configuration

ParameterValue
Base ModelQwen2.5-VL-3B-Instruct
Fine-TuningLoRA
Quantization4-bit NF4
PrecisionFP16
LoRA Rank16
LoRA Alpha32
Learning Rate2e-4
Batch Size1
Gradient Accumulation4
Epochs1

Example Usage

python

from transformers import Qwen2_5_VLForConditionalGeneration
from transformers import AutoProcessor
from peft import PeftModel
base_model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
"Qwen/Qwen2.5-VL-3B-Instruct",
device_map="auto"
)
model = PeftModel.from_pretrained(
base_model,
"salihfurkaan/Qwen2.5-VL-3B-ScienceQA-Math-CoT-Adapter"
)
processor = AutoProcessor.from_pretrained(
"Qwen/Qwen2.5-VL-3B-Instruct"
)

Limitations

  • The model may still produce incorrect mathematical reasoning.
  • Chain-of-thought outputs do not guarantee correctness.
  • Performance depends on image quality and clarity.
  • The model has not been evaluated on all mathematical domains.
  • The model may hallucinate intermediate reasoning steps.

Ethical Considerations

  • This model is intended for educational and research purposes.
  • Users should independently verify mathematical solutions before relying on them in academic, professional, or real-world settings.

Citation

@misc{qwen2vl_scienceqa_math_cot, title={Qwen2.5-VL-3B-ScienceQA-Math-CoT}, author={Salih Furkan Erik, Kerem Berke Başak}, year={2026}, publisher={Hugging Face}, howpublished={\url{https://huggingface.co/}} }

Model provider

salihfurkaan

salihfurkaan

Model tree

Base

Qwen/Qwen2.5-VL-3B-Instruct

Adapter

this model

Modalities

Input

Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today