Dedicated create endpoint

Create a new endpoint

curl --request POST \
  --url https://api.friendli.ai/dedicated/beta/endpoint \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "advanced": {
    "tokenizer_add_special_tokens": true,
    "tokenizer_skip_special_tokens": true,
    "enable_content_logging": true,
    "max_batch_size": 256,
    "max_input_length": 123,
    "max_token_count": 2560
  },
  "hfModelRepo": "<string>",
  "instanceOptionId": "<string>",
  "name": "<string>",
  "projectId": "<string>",
  "autoscalingPolicy": {
    "cooldownPeriod": 300,
    "maxReplica": 1,
    "minReplica": 0
  },
  "hfModelRepoRevision": "<string>",
  "initialVersionComment": "<string>",
  "simplescale": {
    "replicas": 2
  }
}'

{
  "status": "INITIALIZING",
  "createdAt": "2025-01-01T00:00:00Z",
  "updatedAt": "2025-01-01T00:00:00Z",
  "phase": "DOWNLOADING_MODEL"
}

POST /dedicated/beta/endpoint

Every request must include a Friendli Token (e.g. flp_XXX) as the Bearer token. Refer to the authentication section on our introduction page to learn how to generate and acquire a token.

This API is currently in Beta. While we strive to provide a stable and reliable experience, this feature is still under active development. As a result, you may encounter unexpected behavior or limitations. We encourage you to provide feedback to help us improve the feature before its official release.

Authorizations

Authorization (string, header, required)

When using Friendli Suite API for inference requests, you need to provide a Friendli Token for authentication and authorization purposes.

For more detailed information, please refer here.

Headers

X-Friendli-Team (string | null, optional)

ID of the team to run requests as.
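
The Authorization and X-Friendli-Team headers described above could be assembled with a small helper. This is a minimal sketch, not an official client: the function name `build_headers` and the `FRIENDLI_TOKEN` environment variable are assumptions made for illustration; only the header names come from this page.

```python
import os
from typing import Optional


def build_headers(token: str, team_id: Optional[str] = None) -> dict:
    """Build request headers for the create-endpoint call.

    Hypothetical helper for illustration; not part of any Friendli SDK.
    """
    if not token:
        raise ValueError("a Friendli Token is required")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    # X-Friendli-Team is optional; send it only when a team ID is given.
    if team_id is not None:
        headers["X-Friendli-Team"] = team_id
    return headers


# Assumption: the token is stored in an environment variable named
# FRIENDLI_TOKEN; the flp_XXX fallback is the placeholder used on this page.
headers = build_headers(os.environ.get("FRIENDLI_TOKEN") or "flp_XXX")
```

Omitting the team header entirely, rather than sending an empty value, keeps the request identical to the curl example when no team is specified.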

Body (application/json)

Dedicated endpoint create request.
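
As a rough illustration, the request body can be built and attached to a POST request using Python's standard library. The sketch reuses field names from the curl example; which fields are required and which values are valid is not stated in this section, so every value below is a placeholder. `build_create_request` is a hypothetical helper, and the request is prepared but not sent.

```python
import json
import urllib.request

API_URL = "https://api.friendli.ai/dedicated/beta/endpoint"


def build_create_request(token: str, payload: dict) -> urllib.request.Request:
    """Prepare (but do not send) the POST request. Hypothetical helper."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder values; the field names are taken from the curl example.
payload = {
    "name": "my-endpoint",
    "projectId": "<project-id>",
    "hfModelRepo": "<hf-model-repo>",
    "instanceOptionId": "<instance-option-id>",
    "autoscalingPolicy": {"cooldownPeriod": 300, "maxReplica": 1, "minReplica": 0},
}

request = build_create_request("flp_XXX", payload)
# Sending is left out of this sketch; urllib.request.urlopen(request)
# would perform the actual call.
```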

Response

200 (application/json)

Successfully created the endpoint. The response body describes the dedicated endpoint status.
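
A response shaped like the sample shown earlier could be parsed as follows. This sketch assumes the body contains the four fields from that sample (status, phase, createdAt, updatedAt) and that timestamps are ISO 8601 strings with a trailing Z. `parse_endpoint_status` is a hypothetical helper, and the full set of possible status and phase values is not listed in this section.

```python
import json
from datetime import datetime


def parse_timestamp(value: str) -> datetime:
    # Swap the trailing "Z" for an explicit offset so that
    # datetime.fromisoformat also accepts it on Python versions before 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def parse_endpoint_status(body: str) -> dict:
    """Decode the JSON body and convert timestamps. Hypothetical helper."""
    data = json.loads(body)
    return {
        "status": data["status"],
        # Assumption: phase may be absent, so it is read defensively.
        "phase": data.get("phase"),
        "created_at": parse_timestamp(data["createdAt"]),
        "updated_at": parse_timestamp(data["updatedAt"]),
    }


# The sample response from this page, as a JSON string.
sample = (
    '{"status": "INITIALIZING", "createdAt": "2025-01-01T00:00:00Z", '
    '"updatedAt": "2025-01-01T00:00:00Z", "phase": "DOWNLOADING_MODEL"}'
)
result = parse_endpoint_status(sample)
```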
