Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Harness-1

  • a 20B search agent that matches frontier AI's search capability - you should try!

Harness-1 average search performance

Code: https://github.com/pat-jj/harness-1

Paper: https://arxiv.org/abs/2606.02373

Tinker inference Example: https://github.com/pat-jj/harness-1/blob/main/inference/tinker_inference.md

vLLM Inference Example: https://github.com/pat-jj/harness-1/blob/main/inference/vllm_h100_browsecompplus.md

This repository contains the full merged Harness-1 release checkpoint. The model is merged into the openai/gpt-oss-20b base model and saved as standard Hugging Face safetensors shards.

Model provider

pat-jj

pat-jj

Model tree

Base

openai/gpt-oss-20b

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today