HiTZ

Latxa-Llama-3.1-70B-Instruct-v2-FP8

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

Model Details

Model Description

Latxa is a family of Large Language Models (LLM) based on Meta’s LLaMA models. Current LLMs exhibit incredible performance for high-resource languages such as English, but, in the case of Basque and other low-resource languages, their performance is close to a random guesser. These limitations widen the gap between high- and low-resource languages when it comes to digital development. We present Latxa to overcome these limitations and promote the development of LLM-based technology and research for the Basque language. Latxa models follow the same architecture as their original counterparts and were further trained in Latxa Corpus v1.1, a high-quality Basque corpora.

Developed by: HiTZ Research Center & IXA Research group (University of the Basque Country UPV/EHU)
Model type: Language model
Language(s) (NLP): eu
License: llama3.1
Parent model: meta-llama/Llama-3.1-70B-Instruct
Contact: hitz@ehu.eus

Getting Started

Use the code below to get started with the model.

python
from transformers import pipeline

pipe = pipeline('text-generation', model='HiTZ/Latxa-Llama-3.1-70B-Instruct-FP8')

messages = [
	{'role': 'user', 'content': 'Kaixo!'},
]

pipe(messages)

>>
[
  {
    'generated_text': [
      {'role': 'user', 'content': 'Kaixo!'},
      {'role': 'assistant', 'content': 'Kaixo! Zer moduz? Zer behar edo galdetu nahi duzu?'}
    ]
  }
]

Uses

Latxa models are intended to be used with Basque data; for any other language the performance is not guaranteed. Same as the original, Latxa inherits the Llama-3.1 License which allows for commercial and research use.

Direct Use

Latxa Instruct models are trained to follow instructions or to work as chat assistants.

Out-of-Scope Use

The model is not intended for malicious activities, such as harming others or violating human rights. Any downstream application must comply with current laws and regulations. Irresponsible usage in production environments without proper risk assessment and mitigation is also discouraged.

Bias, Risks, and Limitations

In an effort to alleviate the potentially disturbing or harmful content, Latxa has been trained on carefully selected and processed data which comes mainly from local media, national/regional newspapers, encyclopedias and blogs (see Latxa Corpus v1.1). Still, the model is based on Llama 3.1 models and can potentially carry the same bias, risk and limitations. Please see the Llama’s Ethical Considerations and Limitations for further information.

Training Details

[!WARNING] DISCLAIMER

Further training details will be released with the corresponding research paper in the near future.

Evaluation

We evaluated the models 5-shot settings on multiple-choice tasks. We used the basque partitions of each dataset.

The arena results will be released in the future.

Testing Data, Factors & Metrics

Testing Data

Belebele (Bandarkar et al.): Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. We evaluated the model in a 5-shot fashion.
- Data card: https://huggingface.co/datasets/facebook/belebele
X-StoryCloze (Lin et al.): XStoryCloze consists of the professionally translated version of the English StoryCloze dataset to 10 non-English languages. Story Cloze is a commonsense reasoning dataset which consists of choosing the correct ending to a four-sentence story. We evaluated the model in a 5-shot fashion.
- Data card: https://huggingface.co/datasets/juletxara/xstory_cloze
EusProficiency (Etxaniz et al., 2024): EusProficiency comprises 5,169 exercises on different topics from past EGA exams, the official C1-level certificate of proficiency in Basque.
- Data card: https://huggingface.co/datasets/HiTZ/EusProficiency
(Etxaniz et al., 2024): EusReading consists of 352 reading comprehension exercises (irakurmena) sourced from the same set of past EGA exams.

Metrics

We use Accuracy, as they are framed as Multiple Choice questions.

Results

Table with columns: Task, Llama-3.1 8B Ins., Latxa 3.1 8B Ins., Llama-3.1 70B Ins., Latxa 3.1 70B Ins., Latxa 3.1 70B Ins. V2
Task	Llama-3.1 8B Ins.	Latxa 3.1 8B Ins.	Llama-3.1 70B Ins.	Latxa 3.1 70B Ins.	Latxa 3.1 70B Ins. V2
Belebele	73.89	80.00	89.11	91.00	90.7
X-Story Cloze	61.22	71.34	69.69	77.83	78.66
EusProficiency	34.13

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: HPC Cluster, 4 x A100 64Gb nodes x64
Hours used (total GPU hours): 16005.12h
Cloud Provider: CINECA HPC
Compute Region: Italy
Carbon Emitted: 1901.41kg CO2 eq

Citation

To cite our work, please use:

bibtex
@misc{sainz2025instructinglargelanguagemodels,
      title={Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque}, 
      author={Oscar Sainz and Naiara Perez and Julen Etxaniz and Joseba Fernandez de Landa and Itziar Aldabe and Iker García-Ferrero and Aimar Zabala and Ekhi Azurmendi and German Rigau and Eneko Agirre and Mikel Artetxe and Aitor Soroa},
      year={2025},
      eprint={2506.07597},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2506.07597}, 
}

Acknowledgements

This work has been partially supported by the Basque Government (IKER-GAITU project).

It has also been partially supported by the Ministerio para la Transformación Digital y de la Función Pública - Funded by EU – NextGenerationEU within the framework of the project with reference 2022/TL22/00215335.

The models were trained on the Leonardo supercomputer at CINECA under the EuroHPC Joint Undertaking, project EHPC-EXT-2023E01-013.

Model provider

HiTZ

Model tree

Base

HiTZ/Latxa-Llama-3.1-70B-Instruct

Quantized

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Model card

Explore FriendliAI today

Get started Talk to an engineer

Model Details

Model Description

Developed by: HiTZ Research Center & IXA Research group (University of the Basque Country UPV/EHU)
Model type: Language model
Language(s) (NLP): eu
License: llama3.1
Parent model: meta-llama/Llama-3.1-70B-Instruct
Contact: hitz@ehu.eus

Getting Started

Use the code below to get started with the model.

python
from transformers import pipeline

pipe = pipeline('text-generation', model='HiTZ/Latxa-Llama-3.1-70B-Instruct-FP8')

messages = [
	{'role': 'user', 'content': 'Kaixo!'},
]

pipe(messages)

>>
[
  {
    'generated_text': [
      {'role': 'user', 'content': 'Kaixo!'},
      {'role': 'assistant', 'content': 'Kaixo! Zer moduz? Zer behar edo galdetu nahi duzu?'}
    ]
  }
]

Uses

Direct Use

Latxa Instruct models are trained to follow instructions or to work as chat assistants.

Out-of-Scope Use

Bias, Risks, and Limitations

Training Details

[!WARNING] DISCLAIMER

Further training details will be released with the corresponding research paper in the near future.

Evaluation

We evaluated the models 5-shot settings on multiple-choice tasks. We used the basque partitions of each dataset.

The arena results will be released in the future.

Testing Data, Factors & Metrics

Testing Data

Belebele (Bandarkar et al.): Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. We evaluated the model in a 5-shot fashion.
- Data card: https://huggingface.co/datasets/facebook/belebele
X-StoryCloze (Lin et al.): XStoryCloze consists of the professionally translated version of the English StoryCloze dataset to 10 non-English languages. Story Cloze is a commonsense reasoning dataset which consists of choosing the correct ending to a four-sentence story. We evaluated the model in a 5-shot fashion.
- Data card: https://huggingface.co/datasets/juletxara/xstory_cloze
EusProficiency (Etxaniz et al., 2024): EusProficiency comprises 5,169 exercises on different topics from past EGA exams, the official C1-level certificate of proficiency in Basque.
- Data card: https://huggingface.co/datasets/HiTZ/EusProficiency
(Etxaniz et al., 2024): EusReading consists of 352 reading comprehension exercises (irakurmena) sourced from the same set of past EGA exams.

Metrics

We use Accuracy, as they are framed as Multiple Choice questions.

Results

Table with columns: Task, Llama-3.1 8B Ins., Latxa 3.1 8B Ins., Llama-3.1 70B Ins., Latxa 3.1 70B Ins., Latxa 3.1 70B Ins. V2
Task	Llama-3.1 8B Ins.	Latxa 3.1 8B Ins.	Llama-3.1 70B Ins.	Latxa 3.1 70B Ins.	Latxa 3.1 70B Ins. V2
Belebele	73.89	80.00	89.11	91.00	90.7
X-Story Cloze	61.22	71.34	69.69	77.83	78.66
EusProficiency	34.13

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: HPC Cluster, 4 x A100 64Gb nodes x64
Hours used (total GPU hours): 16005.12h
Cloud Provider: CINECA HPC
Compute Region: Italy
Carbon Emitted: 1901.41kg CO2 eq

Citation

To cite our work, please use:

bibtex
@misc{sainz2025instructinglargelanguagemodels,
      title={Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque}, 
      author={Oscar Sainz and Naiara Perez and Julen Etxaniz and Joseba Fernandez de Landa and Itziar Aldabe and Iker García-Ferrero and Aimar Zabala and Ekhi Azurmendi and German Rigau and Eneko Agirre and Mikel Artetxe and Aitor Soroa},
      year={2025},
      eprint={2506.07597},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2506.07597}, 
}

Acknowledgements

This work has been partially supported by the Basque Government (IKER-GAITU project).

The models were trained on the Leonardo supercomputer at CINECA under the EuroHPC Joint Undertaking, project EHPC-EXT-2023E01-013.

Latxa-Llama-3.1-70B-Instruct-v2-FP8

Get help setting up a custom Dedicated Endpoints.

README

Model Details

Model Description

Getting Started

Uses

Direct Use

Out-of-Scope Use

Bias, Risks, and Limitations

Training Details

Evaluation

Testing Data, Factors & Metrics

Testing Data

Metrics

Results

Environmental Impact

Citation

Acknowledgements

Explore FriendliAI today

README

Model Details

Model Description

Getting Started

Uses

Direct Use

Out-of-Scope Use

Bias, Risks, and Limitations

Training Details

Evaluation

Testing Data, Factors & Metrics

Testing Data

Metrics

Results

Environmental Impact

Citation

Acknowledgements