Dedicated create endpoint (Beta)
Create a Dedicated Endpoint deployment for a Hugging Face model.
Every request must include a Friendli Token (e.g. flp_XXX) in the Bearer Token field. See the authentication section of our introduction page to learn how to obtain a token and where to generate one.
This API is currently in Beta and under active development, so you may encounter unexpected behavior or limitations. We welcome your feedback to help us improve the feature before its official release.
Authorizations
Headers
ID of team to run requests as (optional parameter).
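As a minimal sketch of the authorization and header fields above, the Python helper below assembles the request headers. The `Authorization: Bearer <token>` form is the standard Bearer scheme. The team header name `X-Friendli-Team` and the JSON content type are assumptions that this page does not state, so verify them before relying on this.

```python
from typing import Dict, Optional


def build_headers(
    token: str,
    team_id: Optional[str] = None,
    team_header: str = "X-Friendli-Team",  # assumed header name, not given on this page
) -> Dict[str, str]:
    """Return HTTP headers for a dedicated endpoint request."""
    if not token:
        raise ValueError("a Friendli Token is required")
    headers = {
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }
    # The team ID is optional, so it is only sent when provided.
    if team_id is not None:
        headers[team_header] = team_id
    return headers
```

Calling `build_headers("flp_XXX")` yields only the authorization and content type headers; passing `team_id` adds the team header.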
Body
Dedicated endpoint create request.
The advanced configuration of the endpoint.
HF ID of the model.
The ID of the instance option.
The name of the endpoint.
The ID of the project that owns the endpoint.
The auto scaling configuration of the endpoint.
HF commit hash of the model.
The comment for the initial version.
The simple scaling configuration of the endpoint.
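The body fields listed above can be sketched as a small Python builder. This page does not show the real JSON field names, so every key below is a hypothetical placeholder chosen to match the descriptions; substitute the actual names from the API schema. The builder rejects keys outside that placeholder set and omits fields left as `None`.

```python
from typing import Any, Dict

# Hypothetical placeholder keys, one per field described above.
# These are NOT the real JSON field names, which this page does not show.
PLACEHOLDER_FIELDS = (
    "advanced_config",          # advanced configuration of the endpoint
    "hf_model_id",              # HF ID of the model
    "instance_option_id",       # ID of the instance option
    "name",                     # name of the endpoint
    "project_id",               # ID of the project that owns the endpoint
    "autoscaling_config",       # auto scaling configuration
    "hf_commit_hash",           # HF commit hash of the model
    "initial_version_comment",  # comment for the initial version
    "simple_scaling_config",    # simple scaling configuration
)


def build_create_body(**fields: Any) -> Dict[str, Any]:
    """Assemble a create-request body from the placeholder fields."""
    unknown = set(fields) - set(PLACEHOLDER_FIELDS)
    if unknown:
        raise KeyError(f"unknown field(s): {sorted(unknown)}")
    # Drop fields that were left unset so they are not sent as null.
    return {key: value for key, value in fields.items() if value is not None}
```

For example, `build_create_body(name="my-endpoint", hf_model_id="org/model")` returns a dictionary containing just those two entries.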
Response
Dedicated endpoint status.
When the endpoint was created.
The current status of the endpoint deployment.
UNKNOWN, INITIALIZING, RUNNING, UPDATING, SLEEPING, AWAKING, FAILED, STOPPING, TERMINATING, TERMINATED, READY
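A client may want to validate the status string it receives against the values listed above. The sketch below models them as a Python enum. Mapping an unrecognized string to `UNKNOWN` is a design choice made for this example, not documented API behavior.

```python
from enum import Enum


class EndpointStatus(str, Enum):
    """Status values listed for the endpoint deployment."""

    UNKNOWN = "UNKNOWN"
    INITIALIZING = "INITIALIZING"
    RUNNING = "RUNNING"
    UPDATING = "UPDATING"
    SLEEPING = "SLEEPING"
    AWAKING = "AWAKING"
    FAILED = "FAILED"
    STOPPING = "STOPPING"
    TERMINATING = "TERMINATING"
    TERMINATED = "TERMINATED"
    READY = "READY"


def parse_status(raw: str) -> EndpointStatus:
    """Convert a raw status string to an EndpointStatus member."""
    try:
        return EndpointStatus(raw)
    except ValueError:
        # Example-only fallback: treat values not listed above as UNKNOWN.
        return EndpointStatus.UNKNOWN
```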
Error code if deployment failed.
WORKLOAD_INIT_UNKNOWN_ERROR, WORKLOAD_INIT_SETTINGS_ERROR, WORKLOAD_INIT_GRPC_ERROR, WORKLOAD_INIT_MANIFEST_NOT_FOUND_ERROR, WORKLOAD_INIT_MANIFEST_TYPE_ERROR, WORKLOAD_INIT_DOWNLOAD_ERROR, WORKLOAD_INIT_INVALID_TOKEN_ERROR, WORKLOAD_INIT_CANNOT_ACCESS_REPO_ERROR, WORKLOAD_INIT_HF_WANDB_API_ERROR, WORKLOAD_INIT_INSUFFICIENT_DISK_ERROR, INFERENCE_ENGINE_UNKNOWN_ERROR, INFERENCE_ENGINE_INVALID_ARGUMENT_ERROR, INFERENCE_ENGINE_MEMORY_ERROR, INFERENCE_ENGINE_METERING_CLIENT_CONFIG_ERROR
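The error codes above share two name prefixes, `WORKLOAD_INIT_` and `INFERENCE_ENGINE_`. As a sketch, the helper below groups a code by its prefix, which may help when logging or routing failures. The grouping is read off the names only; the page does not define what each group means, and `error_stage` is a hypothetical helper name.

```python
ERROR_CODES = frozenset({
    "WORKLOAD_INIT_UNKNOWN_ERROR",
    "WORKLOAD_INIT_SETTINGS_ERROR",
    "WORKLOAD_INIT_GRPC_ERROR",
    "WORKLOAD_INIT_MANIFEST_NOT_FOUND_ERROR",
    "WORKLOAD_INIT_MANIFEST_TYPE_ERROR",
    "WORKLOAD_INIT_DOWNLOAD_ERROR",
    "WORKLOAD_INIT_INVALID_TOKEN_ERROR",
    "WORKLOAD_INIT_CANNOT_ACCESS_REPO_ERROR",
    "WORKLOAD_INIT_HF_WANDB_API_ERROR",
    "WORKLOAD_INIT_INSUFFICIENT_DISK_ERROR",
    "INFERENCE_ENGINE_UNKNOWN_ERROR",
    "INFERENCE_ENGINE_INVALID_ARGUMENT_ERROR",
    "INFERENCE_ENGINE_MEMORY_ERROR",
    "INFERENCE_ENGINE_METERING_CLIENT_CONFIG_ERROR",
})


def error_stage(code: str) -> str:
    """Return the name prefix group of a listed error code."""
    if code not in ERROR_CODES:
        raise ValueError(f"unlisted error code: {code}")
    # Every listed code starts with one of exactly two prefixes.
    if code.startswith("WORKLOAD_INIT_"):
        return "WORKLOAD_INIT"
    return "INFERENCE_ENGINE"
```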
The current phase of the endpoint.
REQUESTING_VIRTUAL_MACHINE, DOWNLOADING_MODEL, ENGINE_INITIALIZING
When the endpoint was last updated.