Deployment options overview
To run artificial intelligence (AI), machine learning (ML), or high performance
computing (HPC) workloads, you can deploy AI-optimized Compute Engine
instances and clusters that use A4X, A4, A3 Ultra, A3 Mega, and A3 High (8 GPUs) machines. For more information
about the features of these machines that enable you to run large-scale AI and ML
clusters, see Cluster management overview.
You can create A4X, A4, A3 Ultra, A3 Mega, and A3 High (8 GPUs) instances directly from
Compute Engine, or through other services that run on Compute Engine
instances, such as Cluster Toolkit or Google Kubernetes Engine (GKE).
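As a rough illustration of the "directly from Compute Engine" path, the following sketch assembles a `gcloud compute instances create` command line for one of the machine series named above. It only builds and prints the command; it does not run it. The machine type names in the mapping are assumptions that you should confirm against the Compute Engine machine type documentation. The instance name, the zone, and the helper `build_create_command` are hypothetical placeholders. A real deployment of these machines typically needs additional flags (for example image, networking, and capacity or reservation settings) that are intentionally omitted here.

```python
import shlex

# Assumed mapping from machine series to machine type name.
# Verify these names against the Compute Engine documentation before use.
MACHINE_TYPES = {
    "A4X": "a4x-highgpu-4g",
    "A4": "a4-highgpu-8g",
    "A3 Ultra": "a3-ultragpu-8g",
    "A3 Mega": "a3-megagpu-8g",
    "A3 High": "a3-highgpu-8g",
}


def build_create_command(name: str, zone: str, series: str) -> list[str]:
    """Return an argv list for creating a single instance (hypothetical helper).

    Only the instance name, zone, and machine type are set; all other
    settings are left to gcloud defaults or must be added by the caller.
    """
    machine_type = MACHINE_TYPES[series]  # raises KeyError for unknown series
    return [
        "gcloud", "compute", "instances", "create", name,
        f"--zone={zone}",
        f"--machine-type={machine_type}",
    ]


if __name__ == "__main__":
    # Placeholder name and zone, for illustration only.
    cmd = build_create_command("example-instance", "us-central1-a", "A3 High")
    print(shlex.join(cmd))
```

Returning an argv list rather than a single string keeps quoting explicit if the command is later passed to `subprocess.run`.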
To create your compute instances or clusters, choose the option that best fits
your use case:
Option
Use case
Cluster Director
You want a fully managed service that automates the setup and
configuration of your Slurm clusters. Cluster Director helps you
configure compute, networking, and storage resources for your clusters to
maximize performance and minimize downtime. To learn more, see Create an
AI-optimized cluster based on a template.
Cluster Toolkit
You want open-source software that simplifies deploying both Slurm
and GKE clusters. Cluster Toolkit is
designed to be highly customizable and extensible. To learn more,
see the following:
Last updated 2026-04-08 UTC.