Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0What does it do?
Give the model a small modern annoyance:
My neighbour's cat judged me through the window.
The bakery sold out of the exact pastry I had emotionally chosen.
The automatic lights turned off while I was still in the shower.
…and it will produce a short comic scene in the style of a badly organised Elizabethan theatre company.
Typical outputs include:
- TITLE
- DRAMATIS PERSONAE
- SCENE
- THOU MUST CHOOSE
Style
The intended voice combines influences from:
- Shakespeare
- Blackadder
- Monty Python
- British sitcoms
- Amateur dramatic societies
The goal is not historical accuracy, but readable modern dialogue with theatrical overreaction.
Training
This adapter was fine-tuned using LoRA with supervised fine-tuning (SFT) on a hand-curated dataset of approximately 240 examples.
The dataset was iteratively refined over multiple training runs to improve:
- structural consistency
- role selection
- relevance to the original grievance
- comedic escalation
- dialogue quality
On a 10-prompt held-out manual evaluation, the fine-tuned model scored 56/80, compared with 36/80 for the base Qwen2.5-3B-Instruct model using the same prompt and scoring rubric.
Design principles
The model was trained around a few simple ideas:
- Take the grievance seriously.
- Make the reaction wildly disproportionate.
- Keep the cast reasonably small and situation-specific.
- Let the comedy emerge from the original complaint.
- Never solve the problem sensibly.
Intended use
The model works best for:
- everyday annoyances
- social awkwardness
- household disasters
- transport failures
- office politics
- mildly haunted appliances
- inexplicably judgemental animals
It is intended for entertainment and creative text generation.
Limitations
This model intentionally prioritises style over factual accuracy.
Recurring characters and running jokes are expected behaviour.
The model performs best on small everyday frustrations rather than major life events.
Project structure
The project consists of three related repositories:
- LoRA adapter (this repository): The original PEFT fine-tuning.
- Merged model: A standalone Transformers version of the model.
- GGUF edition: A quantised deployment build for llama.cpp and local inference.
Quick start
This repository contains a PEFT LoRA adapter and should be loaded on top of Qwen2.5-3B-Instruct.
About the project
The objective was not to build a general-purpose assistant, but a small theatrical comedy troupe capable of treating everyday inconveniences as matters of grave national importance.
Training procedure
This model was trained with supervised fine-tuning (SFT) using TRL and PEFT.
Framework versions
- PEFT 0.19.1
- TRL 1.6.0
- Transformers 5.10.2
- PyTorch 2.11.0+cu128
- Datasets 5.0.0
- Tokenizers 0.22.2
Try the model
Related repositories
Model provider
sdavies
Model tree
Base
Qwen/Qwen2.5-3B-Instruct
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information