Text-to-Speech pricing

Text-to-Speech is priced by the number of characters sent to the service each month to be synthesized into audio. You must enable billing to use Text-to-Speech, and you are charged automatically if your usage exceeds the number of free characters allowed per month. For information about how to keep track of your character totals, see Monitoring API usage. Prices are calculated per character, except for Gemini-TTS models, which are priced per token as shown in the pricing table.

The total number of characters in the input string is counted for billing purposes, including spaces and newline characters. All Speech Synthesis Markup Language (SSML) tags except the <mark> tag are also included in the character count.
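The counting rule above can be sketched in a few lines of Python. This is an illustration only, not the service's actual metering code: the helper name `billable_characters` is hypothetical, and it assumes that the full text of each <mark> tag (opening, closing, or self-closing) is what gets excluded.

```python
import re

# Assumption: the whole <mark ...> tag is excluded from the count,
# whether written as <mark name="x"/> or <mark name="x"></mark>.
_MARK_TAG = re.compile(r"</?mark\b[^>]*>")


def billable_characters(text: str) -> int:
    """Count every character, including spaces, newlines, and SSML tags,
    but skip <mark> tags."""
    return len(_MARK_TAG.sub("", text))


plain = "Hello there.\nHow are you?"
ssml = '<speak>Hello <mark name="here"/>world</speak>'

print(billable_characters(plain))  # 25: 24 visible characters plus 1 newline
print(billable_characters(ssml))   # 26: the <speak> tags count, the <mark> tag does not
```

For authoritative totals, rely on the usage figures reported by the service rather than a local estimate like this one.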

Pricing table

Gemini-TTS

The latest evolution of our Text-to-Speech technology, giving granular control over generated audio through text-based prompts.

| Model | Free usage limit | Price after free usage limit is reached |
| --- | --- | --- |
| Gemini 2.5 Flash TTS | Not available | Input tokens: $0.50 per 1 million text tokens (sku: 242A-EA16-C1EC); Output tokens: $10.00 per 1 million audio tokens* (sku: 9228-79EF-B162) |
| Gemini 2.5 Pro TTS | Not available | Input tokens: $1.00 per 1 million text tokens (sku: 8FF1-7E5B-5BB7); Output tokens: $20.00 per 1 million audio tokens* (sku: DCF3-CB17-8262) |


* Audio tokens correspond to 25 tokens per second of audio
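Because output is billed per audio token and audio maps to 25 tokens per second, a Gemini-TTS cost estimate follows from the input token count and the audio duration. The sketch below is a rough estimator under those stated rates. The dictionary keys are illustrative labels, not official model IDs, and `gemini_tts_cost` is a hypothetical helper.

```python
from decimal import Decimal

AUDIO_TOKENS_PER_SECOND = 25  # from the footnote above

# USD per 1 million tokens, copied from the Gemini-TTS table.
# The keys are illustrative labels, not official model identifiers.
GEMINI_TTS_RATES = {
    "flash": {"input": Decimal("0.50"), "output": Decimal("10.00")},
    "pro": {"input": Decimal("1.00"), "output": Decimal("20.00")},
}


def gemini_tts_cost(model: str, input_text_tokens: int, audio_seconds: float) -> Decimal:
    """Estimate the USD cost of one request from text tokens in and seconds of audio out."""
    rates = GEMINI_TTS_RATES[model]
    audio_tokens = Decimal(str(audio_seconds)) * AUDIO_TOKENS_PER_SECOND
    total = Decimal(input_text_tokens) * rates["input"] + audio_tokens * rates["output"]
    return total / Decimal(1_000_000)


# 200 input tokens and 60 seconds of audio (1,500 audio tokens) on Flash:
# (200 * 0.50 + 1,500 * 10.00) / 1,000,000 = 0.0151 USD
print(gemini_tts_cost("flash", 200, 60))
```

At these rates, output audio works out to $0.00025 per second on Gemini 2.5 Flash TTS and $0.0005 per second on Gemini 2.5 Pro TTS.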

Latest TTS models

Powered by our cutting-edge LLMs, our latest TTS models deliver an unparalleled level of realism and emotional resonance right out of the box for every use case.

| Model | Free usage limit | Price after free usage limit is reached |
| --- | --- | --- |
| Chirp 3: HD voices (sku: F977-2280-6F1B) | 0 to 1 million characters | US$0.00003 per character (US$30 per 1 million characters) |
| Instant custom voice (sku: A247-37D7-C094) | Not available | US$0.00006 per character (US$60 per 1 million characters) |
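For character-priced models, the monthly charge is the usage above the free limit multiplied by the per-character rate. The following sketch shows that arithmetic. It assumes the free limit is applied once per month to each model type and treats "Not available" as a free limit of zero. `monthly_cost` is a hypothetical helper, not part of any Google Cloud library.

```python
from decimal import Decimal


def monthly_cost(characters: int, free_characters: int, price_per_million: Decimal) -> Decimal:
    """USD charge for one month: only characters above the free limit are billed."""
    billable = max(0, characters - free_characters)
    return Decimal(billable) * price_per_million / Decimal(1_000_000)


usage = 2_500_000  # characters synthesized in one month (example figure)

# Chirp 3: HD voices: first 1 million characters free, then US$30 per million.
chirp_cost = monthly_cost(usage, 1_000_000, Decimal("30"))

# Instant custom voice: no free usage limit, US$60 per million.
custom_cost = monthly_cost(usage, 0, Decimal("60"))

print(chirp_cost)   # 1.5 million billable characters -> 45 USD
print(custom_cost)  # 2.5 million billable characters -> 150 USD
```

The same function applies to any other character-priced voice type by substituting its free limit and rate from the pricing tables.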


Legacy TTS models

| Model | Free usage limit | Price after free usage limit is reached |
| --- | --- | --- |
| WaveNet voices (sku: 9D01-5995-B545) | 0 to 4 million characters | US$0.000004 per character (US$4 per 1 million characters) |
| Studio voices (sku: 84AB-48C0-F9C3) | 0 to 1 million characters | US$0.00016 per character (US$160 per 1 million characters) |
| Standard voices (sku: 9D01-5995-B545) | 0 to 4 million characters | US$0.000004 per character (US$4 per 1 million characters) |
| Neural2 voices (sku: FEBD-04B6-769B) | 0 to 1 million characters | US$0.000016 per character (US$16 per 1 million characters) |
| Polyglot (Preview) voices (sku: FEBD-04B6-769B) | 0 to 1 million characters | US$0.000016 per character (US$16 per 1 million characters) |

Note: For WaveNet and Standard voices, the number of characters is equal to or less than the number of bytes in the text. The count includes alphanumeric characters, punctuation, and whitespace. Some character sets use more than one byte per character. For example, Japanese (ja-JP) characters in UTF-8 typically require more than one byte each. In that case, you are charged for one character, not for multiple bytes.
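The difference between characters and bytes is easy to check locally. The snippet below is a small illustration. It assumes that a "character" corresponds to what Python's `len` counts on a `str` (one Unicode code point), which may not match the service's counting in every edge case.

```python
text = "こんにちは"  # Japanese greeting, five hiragana characters

char_count = len(text)                  # Unicode code points
byte_count = len(text.encode("utf-8"))  # bytes once encoded as UTF-8

print(char_count)  # 5
print(byte_count)  # 15, since each of these characters takes 3 bytes in UTF-8
```

Under the rule in the note, this string would be billed as 5 characters, not 15.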

Google Cloud pricing

If you use other Google Cloud resources in tandem with Text-to-Speech, such as Google App Engine instances, then you are also billed for the use of those services. See Google Cloud's pricing calculator to determine other costs based on current rates.

What's next

Request a custom quote

With Google Cloud's pay-as-you-go pricing, you only pay for the services you use. Connect with our sales team to get a custom quote for your organization.



