Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

🔥 Recent Updates

DateUpdate
2026-05🎉 Paper "CSRP: Chain-of-Thought Reasoning for Chinese Text Correction via Reinforcement Learning with Efficiency-Aware Rewards" accepted as Oral at ACL 2026
2026-05🚀 Released ChineseErrorCorrector4-4B, achieving new SOTA on both NACGEC and CSCD benchmarks

💡 Introduction

ChineseErrorCorrector4-4B is built on the CSRP (CPT → SFT → RL) three-stage training framework.

The Problem: Over-Correction Bias

Traditional LLM-based correction systems often suffer from over-correction bias — models unnecessarily paraphrase correct text rather than leaving it untouched. CSRP resolves this by calibrating decision boundaries through a structured curriculum:

StageNameDescription
Phase IBalanced Continued Pre-training (CPT)Internalizes linguistic priors using 5.9M samples with an 8:2 mixture of general and correction-specific data
Phase IIRationale-Augmented SFTDistills Chain-of-Thought reasoning paths to guide the model in diagnosing error types before executing corrections
Phase IIIEfficiency-Aware Policy AlignmentUses GRPO with a novel Efficiency-Aware Reward (EAR) to penalize unnecessary edits and reward surgical precision

📊 Benchmark Results

榜单一:中文语法纠错(CGEC)— NACGEC 基准

针对原生中文及学习者文本,CSRP (4B) 斩获新 SOTA,F0.5 高达 50.99,显著超越此前最优专业大模型。

模型 (Scale)准确率 Precision召回率 RecallF0.5 (核心指标)
BART34.6741.8835.91
HW-CGEC50.9532.2945.26
ScholarGEC (14B)45.0859.3347.35
CEC3 (4B)54.2034.7548.74
CSRP (4B) [Ours]57.1735.6050.99

榜单二:中文拼写检查(CSC)— CSCD 基准

CSRP 在字符级纠错 F1 上同样展现出强劲统治力,达到惊人的 59.61,全面超越 GPT-4。

模型Correction F1
BERT25.49
SoftMask44.48
SMBERT44.67
MDCSpell+ARM48.93
GPT-4 (Few-shot)54.41
CSRP (4B) [Ours]59.61

🛠️ Quick Start

Requirements

bash

pip install -U transformers torch

Note: Requires transformers >= 4.51.0 for Qwen3 architecture support.

Inference with Transformers

python

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "twnlp/ChineseErrorCorrector4-4B"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True
)
# Professional instruction template
instruction = (
"假如你是一名专业的纠错专家,请分析输入句子的语法错误类型和修改原因,"
"并只输出纠正后的语句,错误类型如下:错别字、词语搭配错误、词性错误、"
"语序错误、成分残缺、成分赘余、关联词使用错误、指代不明、语义逻辑不通、无误。"
)
text_input = "下个星期,我跟我朋唷打算去法国玩儿。"
messages = [
{"role": "system", "content": instruction},
{"role": "user", "content": text_input}
]
text = tokenizer.apply_chat_template(
messages, tokenize=False, add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
generated_ids = model.generate(
**model_inputs,
max_new_tokens=512,
do_sample=False,
repetition_penalty=1.1
)
response = tokenizer.decode(
generated_ids[0][len(model_inputs.input_ids[0]):],
skip_special_tokens=True
)
print(response)

📝 Output Example

Input:

markdown

下个星期,我跟我朋唷打算去法国玩儿。

Model Output:

markdown

<think>
错误类型:错别字
修改原因:原句中的"朋唷"是错误写法,正确应为"朋友"。
"唷"是语气助词,不能用于此处指代同伴。
正确句使用"朋友"准确表达了与说话者一同前往的人,避免了因错别字造成的语义误解。
</think>
下个星期,我跟我朋友打算去法国玩儿。

📜 License

This project is released under the Apache 2.0 License.

Citation

如果本工作对您有帮助,欢迎引用:

bibtex

@misc{tian2026csrpchainofthoughtreasoningchinese,
title={CSRP: Chain-of-Thought Reasoning for Chinese Text Correction via Reinforcement Learning with Efficiency-Aware Rewards},
author={Wei Tian and Yuhao Zhou and Man Lan},
year={2026},
eprint={2606.00020},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2606.00020},
}

Model provider

twnlp

twnlp

Model tree

Base

Qwen/Qwen3-4B

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today