Update an AKS web service with the provided properties. You can update the web service to use a new model, a new entry script, or new dependencies that can be specified in an inference configuration.

Values left as NULL will remain unchanged in the web service.

update_aks_webservice(
  webservice,
  autoscale_enabled = NULL,
  autoscale_min_replicas = NULL,
  autoscale_max_replicas = NULL,
  autoscale_refresh_seconds = NULL,
  autoscale_target_utilization = NULL,
  auth_enabled = NULL,
  cpu_cores = NULL,
  memory_gb = NULL,
  enable_app_insights = NULL,
  scoring_timeout_ms = NULL,
  replica_max_concurrent_requests = NULL,
  max_request_wait_time = NULL,
  num_replicas = NULL,
  tags = NULL,
  properties = NULL,
  description = NULL,
  models = NULL,
  inference_config = NULL,
  gpu_cores = NULL,
  period_seconds = NULL,
  initial_delay_seconds = NULL,
  timeout_seconds = NULL,
  success_threshold = NULL,
  failure_threshold = NULL,
  namespace = NULL,
  token_auth_enabled = NULL
)

Arguments

webservice

The AksWebservice object.

autoscale_enabled

If TRUE enable autoscaling for the web service.

autoscale_min_replicas

An int of the minimum number of containers to use when autoscaling the web service.

autoscale_max_replicas

An int of the maximum number of containers to use when autoscaling the web service.

autoscale_refresh_seconds

An int of how often in seconds the autoscaler should attempt to scale the web service.

autoscale_target_utilization

An int of the target utilization (in percent out of 100) the autoscaler should attempt to maintain for the web service.

auth_enabled

If TRUE enable key-based authentication for the web service. Defaults to TRUE.

cpu_cores

The number of cpu cores to allocate for the web service. Can be a decimal. Defaults to 0.1.

memory_gb

The amount of memory (in GB) to allocate for the web service. Can be a decimal. Defaults to 0.5.

enable_app_insights

If TRUE enable AppInsights for the web service. Defaults to FALSE.

scoring_timeout_ms

An int of the timeout (in milliseconds) to enforce for scoring calls to the web service.

replica_max_concurrent_requests

An int of the number of maximum concurrent requests per node to allow for the web service.

max_request_wait_time

An int of the maximum amount of time a request will stay in the queue (in milliseconds) before returning a 503 error.

num_replicas

An int of the number of containers to allocate for the web service. If this parameter is not set then the autoscaler is enabled by default.

tags

A named list of key-value tags for the web service, e.g. list("key" = "value"). Will replace existing tags.

properties

A named list of key-value properties to add for the web service, e.g. list("key" = "value").

description

A string of the description to give the web service.

models

A list of Model objects to package into the updated service.

inference_config

An InferenceConfig object.

gpu_cores

An int of the number of gpu cores to allocate for the web service.

period_seconds

An int of how often in seconds to perform the liveness probe. Minimum value is 1.

initial_delay_seconds

An int of the number of seconds after the container has started before liveness probes are initiated.

timeout_seconds

An int of the number of seconds after which the liveness probe times out. Minimum value is 1.

success_threshold

An int of the minimum consecutive successes for the liveness probe to be considered successful after having failed. Minimum value is 1.

failure_threshold

An int of the number of times Kubernetes will try the liveness probe when a Pod starts and the probe fails, before giving up. Minimum value is 1.

namespace

A string of the Kubernetes namespace in which to deploy the web service: up to 63 lowercase alphanumeric ('a'-'z', '0'-'9') and hyphen ('-') characters. The first last characters cannot be hyphens.

token_auth_enabled

If TRUE, enable token-based authentication for the web service. If enabled, users can access the web service by fetching an access token using their Azure Active Directory credentials. Both token_auth_enabled and auth_enabled cannot be set to TRUE.

Value

None