Specification of a single machine.
machineType
string
Immutable. The type of the machine.
See the list of machine types supported for prediction
See the list of machine types supported for custom training .
For DeployedModel
this field is optional, and the default value is n1-standard-2
. For BatchPredictionJob
or as part of WorkerPoolSpec
this field is required.
acceleratorType
enum ( AcceleratorType
)
Immutable. The type of accelerator(s) that may be attached to the machine as per acceleratorCount
.
acceleratorCount
integer
The number of accelerators to attach to the machine.
tpuTopology
string
Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpuTopology: "2x2x1").
reservationAffinity
object ( ReservationAffinity
)
Optional. Immutable. Configuration controlling how this resource pool consumes reservation.
JSON representation |
---|
{ "machineType" : string , "acceleratorType" : enum ( |
AcceleratorType
Represents a hardware accelerator type.
Enums | |
---|---|
ACCELERATOR_TYPE_UNSPECIFIED
|
Unspecified accelerator type, which means no accelerator. |
NVIDIA_TESLA_K80
|
Deprecated: Nvidia Tesla K80 GPU has reached end of support, see https://cloud.google.com/compute/docs/eol/k80-eol . |
NVIDIA_TESLA_P100
|
Nvidia Tesla P100 GPU. |
NVIDIA_TESLA_V100
|
Nvidia Tesla V100 GPU. |
NVIDIA_TESLA_P4
|
Nvidia Tesla P4 GPU. |
NVIDIA_TESLA_T4
|
Nvidia Tesla T4 GPU. |
NVIDIA_TESLA_A100
|
Nvidia Tesla A100 GPU. |
NVIDIA_A100_80GB
|
Nvidia A100 80GB GPU. |
NVIDIA_L4
|
Nvidia L4 GPU. |
NVIDIA_H100_80GB
|
Nvidia H100 80Gb GPU. |
NVIDIA_H100_MEGA_80GB
|
Nvidia H100 Mega 80Gb GPU. |
NVIDIA_H200_141GB
|
Nvidia H200 141Gb GPU. |
NVIDIA_B200
|
Nvidia B200 GPU. |
NVIDIA_GB200
|
Nvidia GB200 GPU. |
TPU_V2
|
TPU v2. |
TPU_V3
|
TPU v3. |
TPU_V4_POD
|
TPU v4. |
TPU_V5_LITEPOD
|
TPU v5. |
ReservationAffinity
A ReservationAffinity can be used to configure a Vertex AI resource (e.g., a DeployedModel) to draw its Compute Engine resources from a Shared Reservation, or exclusively from on-demand capacity.
reservationAffinityType
enum ( Type
)
Required. Specifies the reservation affinity type.
key
string
Optional. Corresponds to the label key of a reservation resource. To target a SPECIFIC_RESERVATION by name, use compute.googleapis.com/reservation-name
as the key and specify the name of your reservation as its value.
values[]
string
Optional. Corresponds to the label values of a reservation resource. This must be the full resource name of the reservation or reservation block.
JSON representation |
---|
{
"reservationAffinityType"
:
enum (
|
Type
Identifies a type of reservation affinity.
Enums | |
---|---|
TYPE_UNSPECIFIED
|
Default value. This should not be used. |
NO_RESERVATION
|
Do not consume from any reserved capacity, only use on-demand. |
ANY_RESERVATION
|
Consume any reservation available, falling back to on-demand. |
SPECIFIC_RESERVATION
|
Consume from a specific reservation. When chosen, the reservation must be identified via the key
and values
fields. |