Request TPU capacity in All Capacity mode
Contact your Google Cloud account team to request an All Capacity mode reservation.
Request TPU capacity in All Capacity mode with optional project sharing
Contact your account team with the required intake information. Your account team will create a future reservation in draft state for you to review through the Google Cloud console.
You can request a reservation that can be shared across other Google Cloud projects in your organization. You can also request a reservation that can be shared with another Google Cloud service like Vertex AI. Indicate your reservation share preference as part of the intake information you give your account team:
LOCAL
),
or other projects can use the reservation ( SPECIFIC_PROJECTS
). You can't
change this setting after you submit the request. To share reserved
capacity with other projects in your organization, do the following: - If you haven't already, verify the project where Google creates the request is allowed to create shared reservations .
- Provide the IDs of the projects to share the reserved capacity with. You can specify up to 100 projects in your organization.
The following properties are preset for your All Capacity mode reservation:
| Property | Description |
|---|---|
| Maintenance scheduling type | How Compute Engine schedules host maintenance for the VMs in your
cluster. TPU Cluster Director only supports the GROUPED
option, which
means maintenance is synchronized across all TPU VMs in a sub-block of
your reservation. |
| Reservation operational mode | How Google Cloud manages the VM recovery process after a host error or faulty host report. TPU Cluster Director only supports the `ALL_CAPACITY` option, which means you must self-manage the maintenance and recovery process for a VM. However, you have full access to all machines in your reservation, as well as more granular control over your VM topology and maintenance. |
| Deployment type | Capacity deployment type. TPU Cluster Director only supports `DENSE`. |
Confirm your capacity reservation draft
You must confirm the draft reservation your account team creates for you in order to receive the TPU Cluster Director capacity in your Google Cloud project. Follow the instructions from Review and submit a draft reservation request to confirm the reservation.
Check the status of a capacity request
You can determine the status of your TPU Cluster Director capacity request. If the request is approved, the capacity is allocated at the requested start time for the requested duration. Billing starts at the reservation start time.
For more details, see View future reservations .
To view the details of a future reservation request (a reservation named "exfr1"), run the following command in the Google Cloud CLI:
gcloud compute future-reservations describe exfr1 \ --zone = us-central1-a
The output is similar to the following:
creationTimestamp:
'2025-04-01'
kind:
compute#FutureReservation
name:
'exfr1'
aggregateReservations:
vmFamily:VM_FAMILY_CLOUD_TPU_LITE_DEVICE_CT6L
status:
lockTime:
2024
-03-01T17:00:00Z
procurementStatus:
APPROVED
timeWindow:
endTime:
2025
-06-01T17:00:00Z
startTime:
2024
-05-01T17:00:00Z
...
Expand an existing capacity request
You can receive additional capacity through an existing reservation. When additional capacity is added, it inherits all of the reservation properties such as reservation sharing policies. You can request a CUD with the additional capacity. The new CUD will point to the same reservation. Your account team will complete the reservation expansion process for you.
View reservation details from the Google Cloud console
Once your future reservation is fulfilled, you can find it on the Google Cloud console Reservation page
and view its details. A TPU reservation with a deployment type Dense
indicates that
it's a TPU Cluster Director reservation.

