rafalwronapl
qwen3-14b-no-think-mrf-sft-t5-t6
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Qwen3-14B no-think MRF SFT (T5+T6 faithful replacements)
LoRA adapter for Qwen/Qwen3-14B in /no_think mode, trained on the MRF v2
T5+T6 faithful-replacement set. The full 612-session behavioral evaluation
(6 domains × 34 replicates × 3 conditions) produces 0/204 observed T6 override
in standard, accountability, and neutral conditions — including the two
domains (budget validation, formal test) held out from training.
Part of the MRF v2 release. See the GitHub repository for code, the small MRF-Bench v0.1 benchmark, the full paper outline, and the parser-validation appendix. See the Zenodo deposit for the raw 3,872 base-model sessions and this run's 612-session held-out evaluation.
License: Apache 2.0.
Model provider
rafalwronapl
Model tree
Base
Qwen/Qwen3-14B
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information