Dedicated get endpoint (Beta)
Given an endpoint ID, return its specification.
To request successfully, it is mandatory to enter a Friendli Token (e.g. flp_XXX) value in the Bearer Token field. Refer to the authentication section on our introduction page to learn how to acquire this variable and visit here to generate your token.
This API is currently in Beta. While we strive to provide a stable and reliable experience, this feature is still under active development. As a result, you may encounter unexpected behavior or limitations. We encourage you to provide feedback to help us improve the feature before its official release.
Authorizations
Headers
ID of team to run requests as (optional parameter).
Path Parameters
The ID of the endpoint
Response
Dedicated endpoint specification.
The cooldown period in seconds between scaling operations.
The maximum number of replicas allowed.
The minimum number of replicas to maintain.
The ID of the user who created the endpoint.
The current number of replicas.
The desired number of replicas.
The type of GPU to use for the endpoint.
The maximum batch size for inference requests.
The name of the endpoint.
The number of GPUs to use per replica.
The ID of the project that owns the endpoint.
The ID of the team that owns the endpoint.
Whether to add special tokens in tokenizer input.
Whether to skip special tokens in tokenizer output.
The updated number of replicas.
The ID of the instance.
The maximum allowed input length.
Was this page helpful?