GPU pricing

This page describes the pricing information for Compute Engine GPUs. This page does not cover disk and image, networking, sole-tenant node, or VM instance pricing.

Compute Engine charges for usage based on the following price sheet. A bill is sent out at the end of each billing cycle, providing a sum of Google Cloud charges. Prices on this page are listed in U.S. dollars (USD).

For Compute Engine, disk size, machine type memory, and network usage are calculated in JEDEC binary gigabytes (GB), or IEC gibibytes (GiB), where 1 GiB is 2^30 bytes. Similarly, 1 TiB is 2^40 bytes, or 1024 JEDEC GBs.
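
As a quick illustration of the binary units described above, the following Python sketch converts between bytes, GiB, and TiB. The constant and function names are chosen here for illustration only.

```python
# Binary (IEC) units as defined on this page.
GIB = 2**30  # bytes in 1 GiB
TIB = 2**40  # bytes in 1 TiB


def bytes_to_gib(n_bytes: int) -> float:
    """Convert a byte count to GiB (binary gigabytes)."""
    return n_bytes / GIB


print(TIB // GIB)              # number of GiB in 1 TiB: 1024
print(bytes_to_gib(16 * GIB))  # 16.0
```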

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

Overview

You can attach one or more GPUs to your virtual machine (VM) instances to accelerate specific workloads or offload work from your vCPUs. Each GPU adds to the cost of your instance in addition to the cost of the machine type. GPUs are subject to the same billing policy as vCPUs and memory.
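
The additive cost model described above can be sketched in a few lines of Python. This is a minimal sketch, not a billing tool: the function name is made up for illustration, the machine type price of $0.10 per hour is a hypothetical placeholder, and $0.35 is one of the on-demand per-GPU prices that appears in the price list on this page. Discounts are not modeled.

```python
from decimal import Decimal


def instance_hourly_cost(machine_type_price: str, gpu_price: str, gpu_count: int) -> Decimal:
    """Hourly on-demand cost: machine type price plus the per-GPU price for each attached GPU."""
    return Decimal(machine_type_price) + Decimal(gpu_price) * gpu_count


# Hypothetical machine type price ($0.10/hour) with two GPUs at $0.35 per GPU per hour.
print(instance_hourly_cost("0.10", "0.35", 2))  # 0.80
```

Decimal arithmetic is used so that prices such as 0.35 are not subject to binary floating-point rounding.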

Eligible GPU devices that are attached to standard VM instances automatically receive any applicable sustained use discounts (SUDs), similar to vCPUs. GPUs attached to Spot VMs (or preemptible VMs) are charged at the Spot prices for GPUs but don't receive SUDs.

GPUs are also eligible for resource-based committed use discounts (CUDs). To receive resource-based CUDs for your GPUs, you must create and attach a reservation for those GPUs when you purchase the commitment. The attached reservation cannot be modified or deleted for the duration of the commitment. For more information, see Attach reservations to commitments. You can also reserve capacity for your GPU resources in a specific zone without purchasing a commitment. In that case, you pay normal on-demand prices, and any applicable SUDs apply automatically.

GPU prices are listed by region. GPU devices are available only in specific zones within some regions. For a complete list of regions and zones where GPU devices are available, see GPUs on Compute Engine.

You can also use the Google Cloud Pricing Calculator to estimate the total cost of your instances, including both the GPUs and the machine type configuration.

GPUs that are attached to accelerator-optimized VMs

Each accelerator-optimized machine type has a specific model of NVIDIA GPUs attached.

  • For A3 accelerator-optimized machine types, NVIDIA H100 80GB GPUs are attached. These are available in the following options:
      • A3 High (a3-highgpu-8g): this machine type has H100 80GB GPUs attached
      • A3 Mega (a3-megagpu-8g): this machine type has H100 80GB Mega GPUs attached
  • For A2 accelerator-optimized machine types, NVIDIA A100 GPUs are attached. These are available in the following options:
      • A2 Standard (a2-highgpu-*, a2-megagpu-*): these machine types have A100 40GB GPUs attached
      • A2 Ultra (a2-ultragpu-*): these machine types have A100 80GB GPUs attached
  • For G2 accelerator-optimized machine types (g2-standard-*), NVIDIA L4 GPUs are attached.

For GPUs that are attached to accelerator-optimized machine types, the cost of running the machine type includes the GPU cost. These prices are listed in the Accelerator-optimized machine type family pricing documentation.
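
The machine type to GPU model mapping listed above can be expressed as a small lookup. The following Python sketch uses only the name patterns given on this page; the table and function names are illustrative, and the example machine type names passed to the function are assumptions chosen to match those patterns.

```python
from fnmatch import fnmatchcase
from typing import Optional

# (machine type pattern, attached GPU model), as listed on this page.
GPU_BY_MACHINE_TYPE = [
    ("a3-highgpu-8g", "NVIDIA H100 80GB"),
    ("a3-megagpu-8g", "NVIDIA H100 80GB Mega"),
    ("a2-highgpu-*", "NVIDIA A100 40GB"),
    ("a2-megagpu-*", "NVIDIA A100 40GB"),
    ("a2-ultragpu-*", "NVIDIA A100 80GB"),
    ("g2-standard-*", "NVIDIA L4"),
]


def attached_gpu_model(machine_type: str) -> Optional[str]:
    """Return the GPU model for an accelerator-optimized machine type, or None if no pattern matches."""
    for pattern, model in GPU_BY_MACHINE_TYPE:
        if fnmatchcase(machine_type, pattern):
            return model
    return None


print(attached_gpu_model("a2-ultragpu-1g"))  # NVIDIA A100 80GB
print(attached_gpu_model("g2-standard-4"))   # NVIDIA L4
print(attached_gpu_model("n1-standard-4"))   # None (not accelerator-optimized)
```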

Other GPU models

  • Johannesburg (africa-south1)
  • Taiwan (asia-east1)
  • Hong Kong (asia-east2)
  • Tokyo (asia-northeast1)
  • Osaka (asia-northeast2)
  • Seoul (asia-northeast3)
  • Mumbai (asia-south1)
  • Delhi (asia-south2)
  • Singapore (asia-southeast1)
  • Jakarta (asia-southeast2)
  • Sydney (australia-southeast1)
  • Melbourne (australia-southeast2)
  • Warsaw (europe-central2)
  • Madrid (europe-southwest1)
  • Belgium (europe-west1)
  • Berlin (europe-west10)
  • Turin (europe-west12)
  • London (europe-west2)
  • Frankfurt (europe-west3)
  • Netherlands (europe-west4)
  • Zurich (europe-west6)
  • Milan (europe-west8)
  • Paris (europe-west9)
  • Doha (me-central1)
  • Dammam (me-central2)
  • Tel Aviv (me-west1)
  • Montreal (northamerica-northeast1)
  • Sao Paulo (southamerica-east1)
  • Santiago (southamerica-west1)
  • Iowa (us-central1)
  • South Carolina (us-east1)
  • Northern Virginia (us-east4)
  • Columbus (us-east5)
  • Dallas (us-south1)
  • Oregon (us-west1)
  • Los Angeles (us-west2)
  • Salt Lake City (us-west3)
  • Las Vegas (us-west4)
  • Phoenix (us-west8)

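| Model | GPUs | GPU memory | GPU price (USD per GPU per hour) | 1 year commitment price** (USD per GPU per hour) | 3 year commitment price** (USD per GPU per hour) |
|---|---|---|---|---|---|
| NVIDIA T4 | 1 GPU | 16 GB GDDR6 | $0.35 | $0.22 | $0.16 |
| NVIDIA T4 | 2 GPUs | 32 GB GDDR6 | $0.35 | $0.22 | $0.16 |
| NVIDIA T4 | 4 GPUs | 64 GB GDDR6 | $0.35 | $0.22 | $0.16 |
| NVIDIA P4 | 1 GPU | 8 GB GDDR5 | $0.60 | $0.378 | $0.27 |
| NVIDIA P4 | 2 GPUs | 16 GB GDDR5 | $0.60 | $0.378 | $0.27 |
| NVIDIA P4 | 4 GPUs | 32 GB GDDR5 | $0.60 | $0.378 | $0.27 |
| NVIDIA V100 | 1 GPU | 16 GB HBM2 | $2.48 | $1.562 | $1.116 |
| NVIDIA V100 | 2 GPUs | 32 GB HBM2 | $2.48 | $1.562 | $1.116 |
| NVIDIA V100 | 4 GPUs | 64 GB HBM2 | $2.48 | $1.562 | $1.116 |
| NVIDIA V100 | 8 GPUs | 128 GB HBM2 | $2.48 | $1.562 | $1.116 |
| NVIDIA P100 | 1 GPU | 16 GB HBM2 | $1.46 | $0.919 | $0.657 |
| NVIDIA P100 | 2 GPUs | 32 GB HBM2 | $1.46 | $0.919 | $0.657 |
| NVIDIA P100 | 4 GPUs | 64 GB HBM2 | $1.46 | $0.919 | $0.657 |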

**For committed use discount pricing on the A2 Ultra machine series, contact your sales account team.
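
To compare a commitment price with the on-demand price, you can compute the implied discount. The following Python sketch does this for one row of the price list above ($0.60 on-demand, $0.378 for 1 year, $0.27 for 3 years). The function name is illustrative, and the percentages are derived arithmetic, not published discount rates.

```python
from decimal import Decimal


def discount_percent(on_demand: str, committed: str) -> Decimal:
    """Percentage saved by the committed price relative to the on-demand price, to one decimal place."""
    od, cu = Decimal(on_demand), Decimal(committed)
    return ((od - cu) / od * 100).quantize(Decimal("0.1"))


print(discount_percent("0.60", "0.378"))  # 37.0 (1 year commitment)
print(discount_percent("0.60", "0.27"))   # 55.0 (3 year commitment)
```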

Spot prices are dynamic and can change up to once every 30 days. They provide discounts of 60-91% off the corresponding on-demand price for most machine types and GPUs, and smaller discounts for local SSDs and A3 machine types. For more information, see the Spot VMs documentation. If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
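
The 60-91% range stated above implies a band in which a Spot price would fall for a given on-demand price. The following Python sketch computes that band. It is a rough bound only: the function name is illustrative, actual Spot prices are set per resource and change over time, and the range is stated for "most" machine types and GPUs, not all of them.

```python
from decimal import Decimal

MIN_DISCOUNT = Decimal("0.60")  # smallest discount in the stated range
MAX_DISCOUNT = Decimal("0.91")  # largest discount in the stated range


def spot_price_band(on_demand: str) -> tuple:
    """Return (lowest, highest) Spot price implied by the 60-91% discount range."""
    price = Decimal(on_demand)
    return price * (1 - MAX_DISCOUNT), price * (1 - MIN_DISCOUNT)


low, high = spot_price_band("2.48")
print(low, high)  # 0.2232 0.9920
```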

NVIDIA RTX virtual workstations (formerly known as NVIDIA GRID)

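| Model | GPUs | GPU memory | GPU price (USD per GPU per hour) | 1 year commitment price** (USD per GPU per hour) | 3 year commitment price** (USD per GPU per hour) |
|---|---|---|---|---|---|
| NVIDIA T4 Virtual Workstation | 1 GPU | 16 GB GDDR6 | $0.55 | $0.42 | $0.36 |
| NVIDIA T4 Virtual Workstation | 2 GPUs | 32 GB GDDR6 | $0.55 | $0.42 | $0.36 |
| NVIDIA T4 Virtual Workstation | 4 GPUs | 64 GB GDDR6 | $0.55 | $0.42 | $0.36 |
| NVIDIA P4 Virtual Workstation | 1 GPU | 8 GB GDDR5 | $0.80 | $0.578 | $0.47 |
| NVIDIA P4 Virtual Workstation | 2 GPUs | 16 GB GDDR5 | $0.80 | $0.578 | $0.47 |
| NVIDIA P4 Virtual Workstation | 4 GPUs | 32 GB GDDR5 | $0.80 | $0.578 | $0.47 |
| NVIDIA P100 Virtual Workstation | 1 GPU | 16 GB HBM2 | $1.66 | $1.119 | $0.857 |
| NVIDIA P100 Virtual Workstation | 2 GPUs | 32 GB HBM2 | $1.66 | $1.119 | $0.857 |
| NVIDIA P100 Virtual Workstation | 4 GPUs | 64 GB HBM2 | $1.66 | $1.119 | $0.857 |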

**For committed use discount pricing on the A2 Ultra machine series, contact your sales account team.
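
Comparing the virtual workstation prices with the prices in the "Other GPU models" section shows how much the workstation option adds per GPU. The following Python sketch subtracts one set of prices from the other. The dictionary keys (single-GPU memory and maximum GPU count) and all names are illustrative choices made here to match rows between the two lists. The constant difference it prints is an observation about the figures on this page, not a published pricing rule.

```python
from decimal import Decimal

# (on-demand, 1 year, 3 year) prices in USD per GPU per hour, copied from this page.
# Keys are (single-GPU memory, maximum GPU count), which identify matching rows.
BASE = {
    ("16 GB GDDR6", 4): ("0.35", "0.22", "0.16"),
    ("8 GB GDDR5", 4): ("0.60", "0.378", "0.27"),
    ("16 GB HBM2", 4): ("1.46", "0.919", "0.657"),
}
WORKSTATION = {
    ("16 GB GDDR6", 4): ("0.55", "0.42", "0.36"),
    ("8 GB GDDR5", 4): ("0.80", "0.578", "0.47"),
    ("16 GB HBM2", 4): ("1.66", "1.119", "0.857"),
}


def workstation_premium(key: tuple) -> tuple:
    """Per-GPU hourly difference between workstation and base prices for each pricing term."""
    return tuple(Decimal(w) - Decimal(b) for w, b in zip(WORKSTATION[key], BASE[key]))


for key in BASE:
    print(key, [str(p) for p in workstation_premium(key)])
```

For the prices listed here, every difference works out to $0.20 per GPU per hour.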

Cloud TPU pricing

Information about billing for Cloud TPU resources is available on the Cloud TPU pricing page.

Request a custom quote

With Google Cloud's pay-as-you-go pricing, you only pay for the services you use. Connect with our sales team to get a custom quote for your organization.