Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

What it knows

The model has expert knowledge across all 6 EU framework programmes:

ProgrammePeriodProjects
FP41994–1998~13K
FP51998–2002~17K
FP62002–2006~10K
FP72007–2013~26K
Horizon 20202014–2020~36K
Horizon Europe2021–2027~24K

It can answer questions about:

  • 🔬 Project details: objectives, methodology, expected impacts
  • 💰 Funding information: total costs, EC contributions, funding schemes
  • 📅 Timelines: start/end dates, project duration
  • 🏢 Organizations: coordinators, participants, country information
  • 🧬 Scientific domains: EuroSciVoc classifications, research topics
  • 📊 Programme-level statistics: funding distribution, topic clusters
  • 🔄 Cross-programme comparisons: funding trends across FP4→Horizon Europe

Usage

With PEFT (recommended)

python

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base_model = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-7B-Instruct",
torch_dtype="auto",
device_map="auto",
)
model = PeftModel.from_pretrained(base_model, "RCaz/Qwen2.5-7B-EU-Funding-Expert")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
messages = [
{"role": "system", "content": "You are an expert assistant specializing in European Union research funding programmes."},
{"role": "user", "content": "What were the main funding priorities under Horizon 2020?"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7)
print(tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))

Example prompts

  • "Tell me about the ITER project and its EU funding."
  • "How much did the EU invest in quantum computing research under Horizon 2020?"
  • "Compare the total budgets of FP7 and Horizon 2020."
  • "Which organizations coordinated the most EU-funded AI projects?"
  • "What scientific domains received the highest funding in Horizon Europe?"

Training details

ParameterValue
Base modelQwen/Qwen2.5-7B-Instruct (7.62B params)
MethodLoRA SFT (Supervised Fine-Tuning)
LoRA rank64
LoRA alpha16
LoRA targetall-linear layers
LoRA dropout0.05
Trainable params~410M (5.4% of total)
DatasetRCaz/eu-funding-cordis-qa
Training samples413,499
Validation samples21,764
Max sequence length2048
PackingBFD (Best-Fit Decreasing)
Avg tokens/sample~285
Precisionbf16
OptimizerAdamW (fused)
Learning rate2e-4 (cosine schedule)
Batch size1 × 8 grad accum = 8 effective
HardwareNVIDIA L4 (24GB)
Flash Attention2.0
Gradient checkpointing

Dataset

The training data was generated from official CORDIS CSV exports covering 126K+ EU-funded research projects. Eight types of Q&A conversations were created:

  1. Project overview — What is the project about?
  2. Funding details — How much funding did it receive?
  3. Timeline — When did it start/end?
  4. Organizations — Who coordinates/participates?
  5. Scientific domains — What fields does it cover?
  6. Topics — What EU topics/calls is it associated with?
  7. Programme-level — Statistics and trends within a programme
  8. Cross-programme — Comparisons across framework programmes

All conversations follow the ChatML format with a system prompt establishing EU funding expertise.

Limitations

  • Knowledge is based on CORDIS data snapshots and may not reflect the very latest project updates
  • The model is specialized for EU funding — it may be less capable on general knowledge tasks compared to the base model
  • Financial figures and project details are as accurate as the source CORDIS data
  • The model may occasionally hallucinate details for very specific project queries

Citation

If you use this model, please cite:

bibtex

@misc{rcaz2026eufunding,
title={Qwen2.5-7B-EU-Funding-Expert: A Fine-tuned LLM for European Research Funding},
author={RCaz},
year={2026},
url={https://huggingface.co/RCaz/Qwen2.5-7B-EU-Funding-Expert}
}

Acknowledgments

  • Training data sourced from CORDIS — © European Union
  • Base model by Qwen Team
  • Fine-tuned using TRL and PEFT

Model provider

RCaz

RCaz

Model tree

Base

Qwen/Qwen2.5-7B-Instruct

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today