Class AutoscalingMetricSpec (1.33.1)

  AutoscalingMetricSpec 
 ( 
 mapping 
 = 
 None 
 , 
 * 
 , 
 ignore_unknown_fields 
 = 
 False 
 , 
 ** 
 kwargs 
 ) 
 

The metric specification that defines the target resource utilization (CPU utilization, accelerator's duty cycle, and so on) for calculating the desired replica count.

Attributes

Name
Description
metric_name
str
Required. The resource metric name. Supported metrics: - For Online Prediction: - aiplatform.googleapis.com/prediction/online/accelerator/duty_cycle - aiplatform.googleapis.com/prediction/online/cpu/utilization
target
int
The target resource utilization in percentage (1% - 100%) for the given metric; once the real usage deviates from the target by a certain percentage, the machine replicas change. The default value is 60 (representing 60%) if not provided.