shekharp77
Mira-1
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Container
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Key Features
- Structured JSON output — extracts patient demographics, vitals, labs, medications, diagnoses, procedures, allergies
- Source-grounded — every extracted value traces to the input document
- No patient identifiers — extracts age/sex only, strips names/MRN/DOB
- On-prem deployable — 3B parameters, runs on CPU via GGUF quantization
- 98% JSON validity on held-out gold set
Training
| Parameter | Value |
|---|---|
| Base model | Qwen/Qwen2.5-3B-Instruct |
| Method | QLoRA (4-bit, r=16, alpha=32) |
| Training data | 3,438 examples (126 curated + 3,312 Synthea-rendered) |
| Epochs | 2 |
| Final loss | 0.14 |
| GPU | Kaggle T4 (free tier) |
| Training time | ~2h 40m |
Evaluation (50 held-out gold examples)
| Metric | Value |
|---|---|
| JSON validity | 98% |
| Training loss | 1.23 → 0.14 |
Usage
python
from peft import AutoPeftModelForCausalLMfrom transformers import AutoTokenizermodel = AutoPeftModelForCausalLM.from_pretrained("shekharp77/Mira-1")tokenizer = AutoTokenizer.from_pretrained("shekharp77/Mira-1")messages = [{"role": "system", "content": "You are a clinical information extraction system..."},{"role": "user", "content": "Patient: 45/M\nHb 12.5 g/dL (13-17) LOW\nWBC 8.2 x10^9/L (4-11) Normal"},]inputs = tokenizer.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True)outputs = model.generate(inputs, max_new_tokens=2048, temperature=0)print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Schema
Outputs conform to this schema (10 required top-level fields):
document_type: lab_report | medication_list | discharge_summary | pathology_report | intake_form | progress_note | otherpatient: {age, sex}encounter: {date, department}vitals[],labs[],medications[],diagnoses[],procedures[],allergies[]extraction_notes
Limitations
- English only (v0)
- Trained on synthetic data (Synthea + curated seeds), not real clinical records
- Every output is a draft for human review — not for autonomous clinical decisions
- No ICD-10/SNOMED coding unless explicitly in the source document
License
Apache-2.0 (same as base model)
Model provider
shekharp77
Model tree
Base
Qwen/Qwen2.5-3B-Instruct
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information