Specification of a single machine.
machineType 
 
  string 
 
Immutable. The type of the machine.
See the list of machine types supported for prediction
See the list of machine types supported for custom training .
For  DeployedModel 
 
this field is optional, and the default value is n1-standard-2 
. For  BatchPredictionJob 
 
or as part of  WorkerPoolSpec 
 
this field is required.
acceleratorType 
 
  enum (  AcceleratorType 
 
) 
 
Immutable. The type of accelerator(s) that may be attached to the machine as per  acceleratorCount 
 
.
acceleratorCount 
 
  integer 
 
The number of accelerators to attach to the machine.
tpuTopology 
 
  string 
 
Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpuTopology: "2x2x1").
reservationAffinity 
 
  object (  ReservationAffinity 
 
) 
 
Optional. Immutable. Configuration controlling how this resource pool consumes reservation.
| JSON representation | 
|---|
| { "machineType" : string , "acceleratorType" : enum ( | 
AcceleratorType
Represents a hardware accelerator type.
| Enums | |
|---|---|
| ACCELERATOR_TYPE_UNSPECIFIED | Unspecified accelerator type, which means no accelerator. | 
| NVIDIA_TESLA_K80 | Deprecated: Nvidia Tesla K80 GPU has reached end of support, see https://cloud.google.com/compute/docs/eol/k80-eol . | 
| NVIDIA_TESLA_P100 | Nvidia Tesla P100 GPU. | 
| NVIDIA_TESLA_V100 | Nvidia Tesla V100 GPU. | 
| NVIDIA_TESLA_P4 | Nvidia Tesla P4 GPU. | 
| NVIDIA_TESLA_T4 | Nvidia Tesla T4 GPU. | 
| NVIDIA_TESLA_A100 | Nvidia Tesla A100 GPU. | 
| NVIDIA_A100_80GB | Nvidia A100 80GB GPU. | 
| NVIDIA_L4 | Nvidia L4 GPU. | 
| NVIDIA_H100_80GB | Nvidia H100 80Gb GPU. | 
| NVIDIA_H100_MEGA_80GB | Nvidia H100 Mega 80Gb GPU. | 
| NVIDIA_H200_141GB | Nvidia H200 141Gb GPU. | 
| NVIDIA_B200 | Nvidia B200 GPU. | 
| NVIDIA_GB200 | Nvidia GB200 GPU. | 
| NVIDIA_RTX_PRO_6000 | Nvidia RTX Pro 6000 GPU. | 
| TPU_V2 | TPU v2. | 
| TPU_V3 | TPU v3. | 
| TPU_V4_POD | TPU v4. | 
| TPU_V5_LITEPOD | TPU v5. | 
ReservationAffinity
A ReservationAffinity can be used to configure a Vertex AI resource (e.g., a DeployedModel) to draw its Compute Engine resources from a Shared Reservation, or exclusively from on-demand capacity.
reservationAffinityType 
 
  enum (  Type 
 
) 
 
Required. Specifies the reservation affinity type.
key 
 
  string 
 
Optional. Corresponds to the label key of a reservation resource. To target a SPECIFIC_RESERVATION by name, use compute.googleapis.com/reservation-name 
as the key and specify the name of your reservation as its value.
values[] 
 
  string 
 
Optional. Corresponds to the label values of a reservation resource. This must be the full resource name of the reservation or reservation block.
| JSON representation | 
|---|
|  { 
 "reservationAffinityType" 
 : 
 enum (  | 
Type
Identifies a type of reservation affinity.
| Enums | |
|---|---|
| TYPE_UNSPECIFIED | Default value. This should not be used. | 
| NO_RESERVATION | Do not consume from any reserved capacity, only use on-demand. | 
| ANY_RESERVATION | Consume any reservation available, falling back to on-demand. | 
| SPECIFIC_RESERVATION | Consume from a specific reservation. When chosen, the reservation must be identified via the keyandvaluesfields. | 

