Model Introduction
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages.
For on-device deployment, AngelSlim 1.25-bit extreme quantization reduces the storage requirement of the 1.8B model to only 440 MB and improves inference speed by 1.5x.
Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B-A3B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
In this release, we also open-source IFMTBench, a benchmark for evaluating translation instruction-following capabilities.
We have also released the Hy-MT2-Translator Skill, which makes it easy to integrate Hy-MT2 series models into translation tasks. It can be downloaded from ClawHub and SkillHub.
Now, Tencent Hy is officially partnering with WMT26 for the "Video Subtitle Translation Task" (https://www2.statmt.org/wmt26/video-subtitle-translation.html). Participants who use the Hy-MT model series to compete in the "General Machine Translation Task" (https://www2.statmt.org/wmt26/translation-task.html) and the "Video Subtitle Translation Task" will have the chance to win special awards sponsored by Hunyuan. We sincerely invite everyone to participate and jointly push the boundaries of machine translation technology!
Results
Hy-MT2 Translation Task Instruction Examples (Chinese-English Comparison)
Note: In the following examples, both source_lang and target_lang should use the full language names. Chinese names should be used in Chinese prompts, and English names should be used in English prompts.
| Type | Chinese prompt | English prompt |
|---|---|---|
| Default Translation | 将以下文本翻译为 {target_lang},注意只需要输出翻译后的结果,不要额外解释:{source_text} | Translate the following text into {target_lang}. Note that you should only output the translated result without any additional explanation:{source_text} |
| Terminology | 参考下面的翻译:{text} 翻译成 {text}{text} 翻译成 {text}{text} 翻译成 将以下文本翻译为 ,注意: |
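The default translation template above can be filled in programmatically. The following is a minimal sketch and not part of the Hy-MT2 release: `build_default_prompt` and `LANG_NAMES` are hypothetical names, the language names are taken from the first rows of the Supported Languages table, and the blank line before the source text is an assumption based on the prompt used in the transformers example in this README.

```python
# Hypothetical helper, not part of the Hy-MT2 release.
# Full language names as (English, Chinese), keyed by abbreviation.
# Only a few of the supported languages are listed here.
LANG_NAMES = {
    "zh": ("Chinese", "中文"),
    "en": ("English", "英语"),
    "fr": ("French", "法语"),
    "ja": ("Japanese", "日语"),
}

# Default translation templates from the table above.
# Assumption: the source text is separated from the instruction by a blank line.
ZH_TEMPLATE = "将以下文本翻译为 {target_lang},注意只需要输出翻译后的结果,不要额外解释:\n\n{source_text}"
EN_TEMPLATE = (
    "Translate the following text into {target_lang}. Note that you should only output "
    "the translated result without any additional explanation:\n\n{source_text}"
)


def build_default_prompt(source_text: str, target: str, prompt_lang: str = "en") -> str:
    """Fill the default template, using the full language name that matches the prompt language."""
    english_name, chinese_name = LANG_NAMES[target]
    if prompt_lang == "zh":
        return ZH_TEMPLATE.format(target_lang=chinese_name, source_text=source_text)
    return EN_TEMPLATE.format(target_lang=english_name, source_text=source_text)


print(build_default_prompt("今天天气真好。", target="en", prompt_lang="zh"))
```

Keeping the two name columns side by side makes it harder to mix an English language name into a Chinese prompt, which the note above advises against.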
Inference and Deployment
For 1.8B and 7B, we recommend using the following parameters for inference. Note that our models do not have a default system_prompt.
{
  "temperature": 0.7,
  "top_p": 0.6,
  "top_k": 20,
  "repetition_penalty": 1.05,
  "max_tokens": 4096
}
For 30B-A3B, we recommend using the following parameters for inference. Note that our models do not have a default system_prompt.
{
  "temperature": 0.7,
  "top_p": 1.0,
  "top_k": -1,
  "repetition_penalty": 1.0,
  "max_tokens": 4096
}
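The two parameter sets use serving-engine naming (`max_tokens`, `top_k: -1`), so they may need a small translation before being passed to `generate()` in transformers. The sketch below shows one way to do that. `RECOMMENDED_PARAMS` and `to_generate_kwargs` are hypothetical names, and the mapping rests on two assumptions: that `generate()` expects `max_new_tokens`, and that `top_k=0` disables top-k filtering there.

```python
# Recommended sampling parameters from this README, keyed by model size.
_SMALL = {"temperature": 0.7, "top_p": 0.6, "top_k": 20,
          "repetition_penalty": 1.05, "max_tokens": 4096}
RECOMMENDED_PARAMS = {
    "1.8B": _SMALL,
    "7B": _SMALL,
    "30B-A3B": {"temperature": 0.7, "top_p": 1.0, "top_k": -1,
                "repetition_penalty": 1.0, "max_tokens": 4096},
}


def to_generate_kwargs(model_size: str) -> dict:
    """Map the recommended parameters to keyword arguments for generate().

    Assumptions (hypothetical helper, not part of the release):
    - generate() takes max_new_tokens instead of max_tokens;
    - top_k=0 turns top-k filtering off, whereas top_k=-1 is the
      convention used in the parameter sets above.
    """
    params = RECOMMENDED_PARAMS[model_size]
    return {
        "do_sample": True,  # sampling must be enabled for temperature/top_p to apply
        "temperature": params["temperature"],
        "top_p": params["top_p"],
        "top_k": params["top_k"] if params["top_k"] > 0 else 0,
        "repetition_penalty": params["repetition_penalty"],
        "max_new_tokens": params["max_tokens"],
    }


print(to_generate_kwargs("30B-A3B"))
```

With the transformers example in this README, the result could be passed as `model.generate(**inputs, **to_generate_kwargs("7B"))`.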
transformers>=5.6.0
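from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_path = "shawnw3i/Hy-MT2-7B-AWQ"

# Load the tokenizer and model; device_map="auto" places the weights on the available devices.
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)
model.eval()

# Default translation prompt in Chinese: "Translate the following text into English ...",
# followed by the source sentence "The weather is really nice today." No system prompt is used.
prompt = "将以下文本翻译成英语,注意只需要输出翻译后的结果,不要额外解释:\n\n今天天气真好。"
messages = [{"role": "user", "content": prompt}]

# return_dict=True returns a mapping with "input_ids" and "attention_mask",
# which the ** unpacking and the slicing below rely on.
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=4096,
    )

# Decode only the newly generated tokens, skipping the prompt.
response = tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
print(response)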
vllm
Build vLLM from source:
uv venv --python 3.12 --seed --managed-python
source .venv/bin/activate
git clone https://github.com/vllm-project/vllm.git
cd vllm
uv pip install --editable . --torch-backend=auto
Start the vLLM server:
vllm serve shawnw3i/Hy-MT2-7B-AWQ --tensor-parallel-size 1
sglang
Build SGLang from source:
git clone https://github.com/sgl-project/sglang
cd sglang
pip3 install pip --upgrade
pip3 install "transformers>=5.6.0"
pip3 install -e "python"
Launch SGLang server:
python3 -m sglang.launch_server --model shawnw3i/Hy-MT2-7B-AWQ --tp 1
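Once either server is running, it can be queried over HTTP. The sketch below assumes the server exposes an OpenAI-compatible `/v1/chat/completions` endpoint on its default port (8000 for vLLM, 30000 for SGLang) and uses only the Python standard library. `build_chat_request` and `translate` are hypothetical helper names, and the sampling values are the ones recommended above for the 1.8B and 7B models. `top_k` and `repetition_penalty` are left out because they are not standard OpenAI fields; check each server's documentation before adding them.

```python
import json
import urllib.request


def build_chat_request(prompt: str, model: str = "shawnw3i/Hy-MT2-7B-AWQ") -> dict:
    """Build the request body; only a user message is sent, with no system prompt."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
        "top_p": 0.6,
        "max_tokens": 4096,
    }


def translate(prompt: str, base_url: str = "http://localhost:8000/v1") -> str:
    """Send the prompt to the server and return the generated translation.

    Assumption: base_url points at a running OpenAI-compatible server.
    """
    payload = json.dumps(build_chat_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


# Example usage (requires a running server):
# print(translate("Translate the following text into French. Note that you should only "
#                 "output the translated result without any additional explanation:\n\nHello"))
# For SGLang, pass base_url="http://localhost:30000/v1".
```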
Supported Languages
| Languages | Abbr. | Chinese Names |
|---|---|---|
| Chinese | zh | 中文 |
| English | en | 英语 |
| French | fr | 法语 |
| Portuguese | pt | 葡萄牙语 |
| Spanish | es | 西班牙语 |
| Japanese | ja | 日语 |