
README

License: apache-2.0

Intended Use

Use this checkpoint inside the repository's verified subroutine harness, which renders the task-specific prompt, parses strict JSON, permits one localized schema-feedback retry, applies deterministic guards, and falls back to rules where appropriate. This is not a general coding assistant or chat model.
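The control flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's harness: the schema (`label` and `reason` string fields), the prompt wording, and the names `run_subroutine`, `generate`, `guard`, and `rules_fallback` are all hypothetical placeholders.

```python
import json
from typing import Callable, Optional, Tuple

# Hypothetical closed schema: exactly these keys, all string-valued.
REQUIRED_KEYS = {"label", "reason"}


def validate(payload: object) -> Optional[str]:
    """Return None if the payload fits the hypothetical schema, else an error message."""
    if not isinstance(payload, dict):
        return "top-level value must be a JSON object"
    if set(payload) != REQUIRED_KEYS:
        return f"keys must be exactly {sorted(REQUIRED_KEYS)}"
    if not all(isinstance(payload[key], str) for key in REQUIRED_KEYS):
        return "all values must be strings"
    return None


def parse_strict(text: str) -> Tuple[Optional[dict], Optional[str]]:
    """Parse model output as JSON and validate it; return (payload, error)."""
    try:
        payload = json.loads(text)
    except json.JSONDecodeError as exc:
        return None, f"invalid JSON: {exc.msg}"
    error = validate(payload)
    return (payload, None) if error is None else (None, error)


def run_subroutine(
    task_input: str,
    generate: Callable[[str], str],
    rules_fallback: Callable[[str], dict],
    guard: Callable[[str, dict], bool],
) -> dict:
    """Render prompt, parse, retry once with schema feedback, guard, then fall back."""
    prompt = f"Task input:\n{task_input}\nRespond with JSON only."
    payload, error = parse_strict(generate(prompt))
    if error is not None:
        # Exactly one retry, with the schema error fed back to the model.
        retry_prompt = f"{prompt}\nYour previous output was rejected: {error}"
        payload, error = parse_strict(generate(retry_prompt))
    if error is not None or not guard(task_input, payload):
        # Deterministic rules take over when the model output is unusable.
        return rules_fallback(task_input)
    return payload
```

Here `generate` stands in for a greedy-decoding call to the checkpoint; any callable that maps a prompt string to a completion string fits.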

Evaluation

Evaluation uses up to 250 examples from HTTPX and Jinja2, both held out entirely from training. Decoding is greedy.

Metric                               Result
Success after one schema retry       97.2%
First-pass success                   97.2%
First-pass schema validity           100.0%
Base instruct success after retry    49.2%
Rules-only success                   27.7%
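As an illustration of how figures like these could be tallied, the sketch below aggregates per-example outcomes into the three fine-tuned-model metrics. The `EvalRecord` fields and the `summarize` function are hypothetical and are not taken from the experiment code. Note that if the retry fires only on schema failures, a first-pass schema validity of 100.0% would mean the retry never ran, which would be consistent with the two success figures being identical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class EvalRecord:
    """Outcome for one held-out example (hypothetical field names)."""
    first_pass_valid: bool    # first output parsed and matched the schema
    first_pass_correct: bool  # first output was valid and judged successful
    final_correct: bool       # output after at most one retry was successful


def summarize(records: List[EvalRecord]) -> Dict[str, float]:
    """Return each metric as a percentage rounded to one decimal place."""
    total = len(records)
    if total == 0:
        raise ValueError("no evaluation records")

    def pct(count: int) -> float:
        return round(100.0 * count / total, 1)

    return {
        "success_after_retry": pct(sum(r.final_correct for r in records)),
        "first_pass_success": pct(sum(r.first_pass_correct for r in records)),
        "first_pass_schema_validity": pct(sum(r.first_pass_valid for r in records)),
    }


# Toy data only, not the reported evaluation set.
toy = [EvalRecord(True, True, True)] * 3 + [EvalRecord(False, False, True)]
print(summarize(toy))
```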

Experiment verdict for this subroutine: works at 494M parameters.

Training

  • Training examples: 2000
  • Epochs: 2.0
  • Learning rate: 2e-05
  • Effective batch configuration: 8 per device × 2 gradient accumulation steps
  • Maximum sequence length: 2048
  • Seed: 0
  • Final training loss: 0.162788
  • Reproduction hardware: one NVIDIA A100 80GB PCIe
  • Source revision: d0fd7bf
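For orientation, the batch settings above imply the following step counts. This is back-of-the-envelope arithmetic under stated assumptions: a single device (matching the one-GPU reproduction hardware), all 2000 examples in the training split, no sequence packing, and a final partial batch that still counts as a step. The function name `optimizer_steps` is illustrative.

```python
import math
from typing import Tuple


def optimizer_steps(
    num_examples: int,
    per_device_batch: int,
    grad_accum: int,
    num_devices: int,
    epochs: float,
) -> Tuple[int, int, int]:
    """Return (effective batch size, steps per epoch, total optimizer steps)."""
    effective_batch = per_device_batch * grad_accum * num_devices
    steps_per_epoch = math.ceil(num_examples / effective_batch)
    return effective_batch, steps_per_epoch, int(steps_per_epoch * epochs)


# Values from the list above; num_devices=1 is an assumption.
effective, per_epoch, total = optimizer_steps(2000, 8, 2, 1, 2.0)
print(effective, per_epoch, total)  # 16 125 250
```

Under these assumptions the run sees an effective batch of 16 examples and takes about 250 optimizer steps in total.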

Training and validation data were generated from pinned Flask, Click, and Rich repositories. HTTPX and Jinja2 were reserved for testing and contribute no training or validation examples.

Limitations

The checkpoint is specialized to one closed JSON schema and should not be expected to retain broad instruction-following ability. The experiment mixes two base-model families across its size sweep. Some subroutines are better served by deterministic rules; consult the verdict above before deployment.

License

Apache-2.0, following the base model. Experiment code is MIT licensed.

Model provider

ishaanranjan

Model tree

Base

Qwen/Qwen2.5-1.5B-Instruct

Fine-tuned

this model

Modalities

Input: Text

Output: Text
