Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Recipe
- Base model:
openbmb/MiniCPM5-1B - Task:
hackathon_advisor_quest_classification - Method: LoRA SFT (completion-only loss)
- Examples: 146
- Epochs: 6.0
- LoRA rank/alpha/dropout: 16/32/0.05
- Max seq length: 2560
- GPU: A10G
Dataset
build-small-hackathon/hackathon-advisor-quest-dataset — 156 chat-JSONL examples built from real build-small-hackathon Spaces: 108 teacher-
labelled + adversarially-verified projects plus targeted augmentations (app-only,
readme-only / missing app file, README↔app contradictions, empty matches, noisy
metadata). All 13 quests covered.
Self-eval at training time: 10/10 held-out prompts produced schema-valid JSON.
Model provider
build-small-hackathon
Model tree
Base
openbmb/MiniCPM5-1B
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information