AI Hypercomputer supports several consumption options to help you get and use compute resources. This document provides an overview of how you can obtain capacity for each consumption option.
How to obtain capacity
The following procedures describe how you can obtain capacity for each consumption option.
To get future reservation resources through your account team, the process is as follows:
- Reserve capacity by contacting your account team.
- Quota is automatically increased before capacity is delivered. No action is required from you.
- When you create a VM or cluster, specify the reservation-bound provisioning model. VMs and clusters are provisioned from your reserved capacity.
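As a rough illustration of the last step, the following sketch builds the part of a VM creation request body that selects the reservation-bound provisioning model and targets a specific reservation. It only constructs a plain Python dictionary and does not call any API. The helper name `reservation_bound_fragment` and the reservation name `my-reservation` are hypothetical, and the field names reflect the Compute Engine REST API as understood here, so verify them against the current API reference before use.

```python
def reservation_bound_fragment(reservation_name: str) -> dict:
    """Return request-body fields that bind a VM to one specific reservation.

    Assumption: field names and enum values mirror the Compute Engine
    REST API instance resource; this is a partial body, not a full request.
    """
    if not reservation_name:
        raise ValueError("reservation_name must be non-empty")
    return {
        # Selects the reservation-bound provisioning model.
        "scheduling": {"provisioningModel": "RESERVATION_BOUND"},
        # Points the VM at the reserved capacity by name.
        "reservationAffinity": {
            "consumeReservationType": "SPECIFIC_RESERVATION",
            "key": "compute.googleapis.com/reservation-name",
            "values": [reservation_name],
        },
    }


# "my-reservation" is a placeholder, not a real reservation.
fragment = reservation_bound_fragment("my-reservation")
```

A complete request would also need the machine type, disks, and network settings, which are omitted here.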
To get future reservation resources by reserving capacity yourself, the process is as follows:
- Search for available capacity and reserve resources by using the Google Cloud console, gcloud CLI, or Compute Engine API.
- No quota is charged and no action is required from you.
- When you create a VM or cluster, you must specify the reservation-bound provisioning model. VMs and clusters are provisioned from your reserved capacity.
For this consumption option, no reservation is required. To get Flex-start resources, the process is as follows:
- You must request preemptible quota for the GPU machine type that you want to use.
- When you create VMs or clusters by using one of the following options, specify the flex-start provisioning model:
  - Create a standalone VM
  - Create MIGs with resize requests
  - Create Slurm clusters
  - Create GKE clusters
When your requested capacity becomes available, Compute Engine provisions it. You can then use the resources for up to seven days.
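The seven-day limit can be made concrete with a small sketch. The helper below builds a scheduling block for a flex-start VM and rejects run durations longer than seven days. The function name `flex_start_scheduling` and the constant `MAX_FLEX_START_RUN` are hypothetical, and the field names (`provisioningModel`, `maxRunDuration`, `instanceTerminationAction`) are assumed to match the Compute Engine REST API, so check them against the current reference.

```python
from datetime import timedelta

# Upper bound taken from the text above: resources last up to seven days.
MAX_FLEX_START_RUN = timedelta(days=7)


def flex_start_scheduling(run_duration: timedelta) -> dict:
    """Return a scheduling block for a flex-start VM (illustrative only)."""
    if run_duration <= timedelta(0):
        raise ValueError("run_duration must be positive")
    if run_duration > MAX_FLEX_START_RUN:
        raise ValueError("flex-start resources last at most seven days")
    return {
        "provisioningModel": "FLEX_START",
        # Assumption: the run duration is expressed in whole seconds.
        "maxRunDuration": {"seconds": int(run_duration.total_seconds())},
        # Assumption: the VM is deleted when its run duration ends.
        "instanceTerminationAction": "DELETE",
    }


scheduling = flex_start_scheduling(timedelta(days=2))
```

Validating the duration locally is only a convenience; the service remains the authority on which values it accepts.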
For this consumption option, no reservation is required. To get Spot resources, the process is as follows:
- You must request preemptible quota for the GPU machine type that you want to use.
- When you create a VM or cluster, specify the spot provisioning model. Resources are provisioned for you as capacity becomes available, but can be preempted at any time.
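Because Spot resources can be preempted at any time, it helps to decide up front what should happen to a VM on preemption. The sketch below builds a scheduling block that selects the spot provisioning model together with a termination action. The helper name `spot_scheduling` is hypothetical, and the field names and the `STOP` and `DELETE` values are assumed to match the Compute Engine REST API.

```python
# Assumption: these are the termination actions the API accepts.
VALID_TERMINATION_ACTIONS = {"STOP", "DELETE"}


def spot_scheduling(termination_action: str = "STOP") -> dict:
    """Return a scheduling block for a Spot VM (illustrative only)."""
    action = termination_action.upper()
    if action not in VALID_TERMINATION_ACTIONS:
        raise ValueError(f"unsupported termination action: {termination_action}")
    return {
        "provisioningModel": "SPOT",
        # What Compute Engine does with the VM when it is preempted.
        "instanceTerminationAction": action,
    }


spot = spot_scheduling("delete")
```

Choosing `DELETE` suits disposable workers, while `STOP` keeps the VM and its disks so that the VM can be restarted when capacity returns.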

