build-small-hackathon

ryugaku-jaen-translator-4b

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Model description

  • Base model: Qwen/Qwen3.5-4B
  • Task: Japanese → English translation
  • Fine-tuning method: LoRA (r=32, alpha=64) merged into a 16-bit model
  • Training framework: Unsloth + TRL
  • Model size: ~4B parameters
  • License: Apache 2.0

Training data

The model was fine-tuned on a mix of:

  • WikiMatrix Japanese-English parallel corpus
  • Synthetic Japanese-English science/textbook sentence pairs

Total effective training samples: ~6,000 sentence pairs.

Intended use

  • Translate Japanese lecture transcripts, textbooks, and everyday text into English
  • Power the Ryugaku Gradio app submitted to the Backyard AI hackathon
  • Run fully offline on consumer GPUs (≤16 GB VRAM)

How to use

python

from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "build-small-hackathon/ryugaku-jaen-translator-4b"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="bfloat16", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)
messages = [
{"role": "system", "content": "You are a precise Japanese-to-English translator. Translate the user's Japanese text into natural English only. Do not add explanations."},
{"role": "user", "content": "私は留学生です。"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Example

original

相転移にともなう潜熱の出入りは、系全体の自由エネルギーの極値条件に拘束されるため、単純な温度勾配のみに起因するフーリエの熱伝導則をそのまま適用すると、相境界における不連続な熱流束の跳ね返りを記述しきれず、結果として移動境界問題としてのステファン条件の定式化において、特異点近傍での数値的な発散や物理的な矛盾を引き起こす原因となる。

This Model

Because the latent heat associated with a phase transition is constrained by the extremum condition of the total free energy of the system, directly applying Fourier's law of heat conduction to a simple temperature gradient fails to capture the discontinuous heat flux jump at the phase interface. As a result, in formulating the Stefan condition as a moving boundary problem, this leads to numerical divergence near the singularity and unphysical contradictions.

Google

Since the flow of latent heat associated with phase transitions is constrained by the conditions for the extremum of the system’s total free energy, a direct application of Fourier’s law of heat conduction—which is based solely on a simple temperature gradient—fails to adequately describe the discontinuous reflection of heat flux at phase boundaries. Consequently, this leads to numerical divergences and physical inconsistencies near singularities when formulating the Stefan condition as a moving-boundary problem.

DeepL

Since the inflow and outflow of latent heat associated with a phase transition is constrained by the extremum conditions of the free energy of the entire system, applying Fourier's law of heat conduction, which is based solely on a simple temperature gradient, fails to adequately describe the discontinuous rebound of heat flux at the phase boundary. As a result, this leads to numerical divergences and physical inconsistencies near the singularity in the formulation of the Stefan condition as a moving boundary problem.

Limitations

  • Optimized for Japanese → English; English → Japanese quality is not guaranteed.
  • May struggle with very long inputs or highly domain-specific terminology not present in training data.

Acknowledgments

Built with Unsloth, TRL, and Qwen.

Model provider

build-small-hackathon

Model tree

Base

Qwen/Qwen3.5-4B

Fine-tuned

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today