Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0๐ GPQA Diamond โ Public Results
markdown
GPQA Diamond (198 questions) โ Darwin-218B-DelphiโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโMethod | AccuracyโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโDarwin-218B-Delphi baseline (MAJ@8) | 86.87% (172/198)Darwin-218B-Delphi (DELPHI cascade) | 90.91% (180/198)โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโDELPHI improvement | +4.04pp (+8 questions)
Reference baselines (vendor-reported)
| Model | GPQA Diamond | Mode |
|---|---|---|
| GPT-5 (OpenAI) | 88.0% | thinking |
| Claude Opus 4.5 (Anthropic) | 91.8% | extended thinking |
| DeepSeek-V3.2 | ~78-82% | standard |
| Darwin-218B-Delphi (MAJ@8) | 86.87% | standard |
| Darwin-218B-Delphi (DELPHI) | 90.91% | VIDRAFT signature |
โ DELPHI cascade๋ก Claude Opus 4.5 extended thinking ๋๊ธ๊ถ ์ง์ .
๐ณ Family Tree (์กฑ๋ณด)
markdown
๐ง GRANDFATHER (์กฐ๋ถ) ๐ง GRANDMOTHER (์กฐ๋ชจ)โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโCohereLabs/ Anthropic Claudecommand-a-plus-05-2026-bf16 Opus 4.5(Apache-2.0) (chemistry knowledge donor)218B MoE / ~25B active via SFT distillation128 experts, BF16 (no logits, output-only)โ โโ โโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโผ๐จ FATHER (๋ถ์น) ๐ฉ MOTHER (๋ชจ์น)โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโFINAL-Bench/ FINAL-Bench/Darwin-218B-kr darwin-chem-data-v1(Korean LoRA merged) (993 chemistry CoT samples,Korean fluency layer 6 sub-domains,anti-contamination guaranteed)โ โโ โโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโผ๐ฆ CHILD (์์ / THIS MODEL)โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโFINAL-Bench/Darwin-218B-Delphiโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโข Korean + Chemistry specialistโข 218B MoE, ~25B activeโข Apache-2.0โข GPQA Diamond 90.91% (DELPHI cascade)โข Served via DELPHI 5-Phase inference
Lineage notes
- Paternal line (๋ชจ๋ธ ๊ณจ๊ฒฉ): Cohere Command A+ โ Korean LoRA โ Chemistry LoRA merge โ Delphi
- Maternal line (์ง์ source): Claude Opus 4.5 โ 993 distilled chemistry CoT samples โ Delphi's chemistry reasoning
- Apache-2.0 compatibility: All ancestors (paternal line) are Apache-2.0 licensed; maternal line is data-only output (Anthropic ToS compliant for derivative model training)
Distillation:
- Teacher: large frontier model (proprietary API; no logits exposure โ SFT-on-outputs pattern)
- 993 high-quality chemistry CoT examples across 6 sub-domains: organic, spectroscopy, physical, inorganic, analytical, special
- Anti-contamination: GPQA Diamond 198 questions guaranteed not in training data
- LoRA: r=16, ฮฑ=32, q/k/v/o, lr=1e-5, 1 epoch, max_length=3072
- Trained on Darwin-218B-kr (S4 6รB200 bf16)
- Merge: full dense checkpoint, no runtime adapter loading
Architecture
| Item | Value |
|---|---|
| Total parameters | 218B |
| Active parameters | ~25B (MoE) |
| Experts | 128 (Cohere2 MoE) |
| Precision | BF16 |
| Architecture | Cohere2VisionForConditionalGeneration (multimodal-capable, text-primary) |
| Tokenizer | Cohere2 (vocab 256K) |
| Languages | English, Korean |
| Context | 65,536 tokens |
| License | Apache-2.0 |
Usage
vLLM (recommended)
bash
vllm serve FINAL-Bench/Darwin-218B-Delphi \--tensor-parallel-size 8 \--dtype bfloat16 \--max-model-len 65536 \--trust-remote-code \--enforce-eager \--limit-mm-per-prompt '{"image":0,"video":0}'
Requires vLLM โฅ 0.21.0 (Cohere2VisionForConditionalGeneration support).
Transformers
python
from transformers import AutoModelForCausalLM, AutoTokenizerimport torchmodel = AutoModelForCausalLM.from_pretrained("FINAL-Bench/Darwin-218B-Delphi",dtype=torch.bfloat16,device_map="auto",trust_remote_code=True,)tok = AutoTokenizer.from_pretrained("FINAL-Bench/Darwin-218B-Delphi")messages = [{"role": "user", "content": "Explain the SN2 mechanism step by step, ""then justify why CH3I reacts faster than CH3Cl."}]prompt = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)inputs = tok(prompt, return_tensors="pt").to(model.device)out = model.generate(**inputs, max_new_tokens=2048, temperature=0.3, top_p=0.9)print(tok.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
License
Apache License 2.0
Built upon CohereLabs/command-a-plus-05-2026-bf16 (Apache-2.0) and Darwin-218B-kr (Apache-2.0). All upstream components are permissively licensed.
Citation
bibtex
@misc{darwin-218b-delphi-2026,title = {Darwin-218B-Delphi: Chemistry-Specialized 218B MoE with DELPHI Cascade Inference},author = {{VIDRAFT FINAL-Bench Team}},year = {2026},publisher = {Hugging Face},howpublished = {\url{https://huggingface.co/FINAL-Bench/Darwin-218B-Delphi}}}
Model provider
FINAL-Bench
Model tree
Base
FINAL-Bench/Darwin-218B-kr
Base
CohereLabs/command-a-plus-05-2026-bf16
Merged
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information