Authorizations
Headers
ID of team to run requests as (optional parameter).
Path Parameters
The ID of the endpoint
Body
Dedicated endpoint update request.
The advanced configuration of the endpoint. Endpoint advanced config.
The auto scaling configuration of the endpoint. Autoscaling policy.
HF ID of the model.
HF commit hash of the model.
The ID of the instance option.
The name of the endpoint.
Comment for the new version.
The simple scaling configuration of the endpoint. Simple scaling options.
Response
Successfully updated the endpoint specification.
Dedicated endpoint specification.
The cooldown period in seconds between scaling operations.
The maximum number of replicas allowed.
The minimum number of replicas to maintain.
The ID of the user who created the endpoint.
The type of GPU to use for the endpoint.
The maximum batch size for inference requests.
The name of the endpoint.
The number of GPUs to use per replica.
The ID of the project that owns the endpoint.
The ID of the team that owns the endpoint.
Whether to add special tokens in tokenizer input.
Whether to skip special tokens in tokenizer output.
The current number of replicas.
The desired number of replicas.
The ID of the instance.
The maximum allowed input length.
The updated number of replicas.