MachineSpec

Specification of a single machine.

Fields

machineType string

Immutable. The type of the machine.

See the list of machine types supported for prediction.

See the list of machine types supported for custom training.

For DeployedModel this field is optional, and the default value is n1-standard-2. For BatchPredictionJob or as part of WorkerPoolSpec this field is required.

acceleratorType enum (AcceleratorType)

Immutable. The type of accelerator(s) that may be attached to the machine as per acceleratorCount.

acceleratorCount integer

The number of accelerators to attach to the machine.

tpuTopology string

Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpuTopology: "2x2x1").

reservationAffinity object (ReservationAffinity)

Optional. Immutable. Configuration controlling how this resource pool consumes reservations.

JSON representation
{
  "machineType": string,
  "acceleratorType": enum (AcceleratorType),
  "acceleratorCount": integer,
  "tpuTopology": string,
  "reservationAffinity": {
    object (ReservationAffinity)
  }
}
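
As an illustration of the representation above, the following is a minimal sketch that assembles a MachineSpec-shaped request body as a plain Python dict. The helper name `machine_spec` is hypothetical and not part of any client library, and the machine type and accelerator values are only examples; check the supported machine type lists before relying on a particular combination.

```python
import json


def machine_spec(machine_type=None, accelerator_type=None,
                 accelerator_count=0, tpu_topology=None,
                 reservation_affinity=None):
    """Build a MachineSpec-shaped dict, leaving out fields that are not set.

    Hypothetical helper for illustration only.
    """
    spec = {}
    if machine_type:
        spec["machineType"] = machine_type
    if accelerator_type:
        # acceleratorCount is only meaningful together with acceleratorType.
        spec["acceleratorType"] = accelerator_type
        spec["acceleratorCount"] = accelerator_count
    if tpu_topology:
        spec["tpuTopology"] = tpu_topology
    if reservation_affinity:
        spec["reservationAffinity"] = reservation_affinity
    return spec


# Example values are assumptions chosen for illustration.
spec = machine_spec("n1-standard-4", "NVIDIA_TESLA_T4", 1)
print(json.dumps(spec, indent=2))
```

Omitting unset fields keeps the body small and lets the service apply its own defaults, such as the n1-standard-2 default for DeployedModel.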

AcceleratorType

Represents a hardware accelerator type.

Enums
`ACCELERATOR_TYPE_UNSPECIFIED`	Unspecified accelerator type, which means no accelerator.
`NVIDIA_TESLA_K80`	Deprecated: Nvidia Tesla K80 GPU has reached end of support, see https://cloud.google.com/compute/docs/eol/k80-eol.
`NVIDIA_TESLA_P100`	Nvidia Tesla P100 GPU.
`NVIDIA_TESLA_V100`	Nvidia Tesla V100 GPU.
`NVIDIA_TESLA_P4`	Nvidia Tesla P4 GPU.
`NVIDIA_TESLA_T4`	Nvidia Tesla T4 GPU.
`NVIDIA_TESLA_A100`	Nvidia Tesla A100 GPU.
`NVIDIA_A100_80GB`	Nvidia A100 80GB GPU.
`NVIDIA_L4`	Nvidia L4 GPU.
`NVIDIA_H100_80GB`	Nvidia H100 80GB GPU.
`NVIDIA_H100_MEGA_80GB`	Nvidia H100 Mega 80GB GPU.
`NVIDIA_H200_141GB`	Nvidia H200 141GB GPU.
`NVIDIA_B200`	Nvidia B200 GPU.
`NVIDIA_GB200`	Nvidia GB200 GPU.
`TPU_V2`	TPU v2.
`TPU_V3`	TPU v3.
`TPU_V4_POD`	TPU v4.
`TPU_V5_LITEPOD`	TPU v5.

ReservationAffinity

A ReservationAffinity can be used to configure a Vertex AI resource (e.g., a DeployedModel) to draw its Compute Engine resources from a Shared Reservation, or exclusively from on-demand capacity.

Fields

reservationAffinityType enum (Type)

Required. Specifies the reservation affinity type.

key string

Optional. Corresponds to the label key of a reservation resource. To target a SPECIFIC_RESERVATION by name, use compute.googleapis.com/reservation-name as the key and specify the name of your reservation as its value.

values[] string

Optional. Corresponds to the label values of a reservation resource. This must be the full resource name of the reservation or reservation block.

JSON representation
{
  "reservationAffinityType": enum (Type),
  "key": string,
  "values": [
    string
  ]
}
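
To make the key/values pairing concrete, here is a small sketch that builds a ReservationAffinity targeting one reservation by name. The function `specific_reservation` is a hypothetical helper, and the project, zone, and reservation identifiers are placeholders; the resource-name pattern shown is an assumption about what a full reservation name looks like, so substitute the actual name of your reservation.

```python
import json

# Label key given in the field description for targeting a reservation by name.
RESERVATION_NAME_KEY = "compute.googleapis.com/reservation-name"


def specific_reservation(reservation_name):
    """Return a ReservationAffinity-shaped dict for a single named reservation.

    Hypothetical helper for illustration only.
    """
    return {
        "reservationAffinityType": "SPECIFIC_RESERVATION",
        "key": RESERVATION_NAME_KEY,
        "values": [reservation_name],
    }


# Placeholder identifiers; the path pattern is an assumption.
affinity = specific_reservation(
    "projects/my-project/zones/us-central1-a/reservations/my-reservation"
)
print(json.dumps(affinity, indent=2))
```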

Type

Identifies a type of reservation affinity.

Enums
`TYPE_UNSPECIFIED`	Default value. This should not be used.
`NO_RESERVATION`	Do not consume from any reserved capacity, only use on-demand.
`ANY_RESERVATION`	Consume any reservation available, falling back to on-demand.
`SPECIFIC_RESERVATION`	Consume from a specific reservation. When chosen, the reservation must be identified via the `key` and `values` fields.
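
The rules in the table above can be checked on the client before a request is sent. The sketch below is one possible pre-flight check, not the service's own validation: `validate_reservation_affinity` is a hypothetical function, and treating `TYPE_UNSPECIFIED` as an error is an interpretation of "should not be used".

```python
# Affinity types that are meant to be sent; TYPE_UNSPECIFIED is excluded on purpose.
USABLE_TYPES = {"NO_RESERVATION", "ANY_RESERVATION", "SPECIFIC_RESERVATION"}


def validate_reservation_affinity(affinity):
    """Return a list of problems found in a ReservationAffinity-shaped dict.

    An empty list means no problems were detected by this local check.
    """
    errors = []
    kind = affinity.get("reservationAffinityType")
    if kind not in USABLE_TYPES:
        errors.append(f"unusable reservationAffinityType: {kind!r}")
    if kind == "SPECIFIC_RESERVATION":
        # A specific reservation must be identified via both key and values.
        if not affinity.get("key"):
            errors.append("SPECIFIC_RESERVATION requires 'key'")
        if not affinity.get("values"):
            errors.append("SPECIFIC_RESERVATION requires non-empty 'values'")
    return errors


print(validate_reservation_affinity({"reservationAffinityType": "ANY_RESERVATION"}))
print(validate_reservation_affinity({"reservationAffinityType": "SPECIFIC_RESERVATION"}))
```

Returning a list of messages rather than raising on the first problem lets a caller report every issue at once.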
