Synthesizes speech synchronously: results are returned after all text input has been processed.
HTTP request
POST https://texttospeech.googleapis.com/v1/text:synthesize
The URL uses gRPC Transcoding syntax.
Request body
The request body contains data with the following structure:
| JSON representation |
|---|
{ "input" : { object ( |
| Fields | |
|---|---|
| input | Required. The Synthesizer requires either plain text or SSML as input. |
| voice | Required. The desired voice of the synthesized audio. |
| audioConfig | Required. The configuration of the synthesized audio. |
| advancedVoiceOptions | Advanced voice options. |
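
As an illustration of the fields above, the following minimal sketch assembles a request body as a Python dictionary. The nested keys `text`, `languageCode`, and `audioEncoding` are not defined in this section; they are assumed here from the referenced input, voice, and audio configuration objects, and the helper name `build_synthesize_request` is hypothetical.

```python
import json


def build_synthesize_request(text, language_code="en-US", audio_encoding="MP3"):
    """Return a dict shaped like the text:synthesize request body.

    The nested keys (text, languageCode, audioEncoding) are assumptions
    taken from the objects this page references but does not define.
    """
    return {
        "input": {"text": text},                           # plain text input
        "voice": {"languageCode": language_code},          # desired voice
        "audioConfig": {"audioEncoding": audio_encoding},  # output format
    }


body = build_synthesize_request("Hello, world")
print(json.dumps(body, indent=2))
```

Only the three required top-level fields are set; `advancedVoiceOptions` is optional and is left out of this sketch.
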
Response body
The message returned to the client by the text.synthesize method.
If successful, the response body contains data with the following structure:
| JSON representation |
|---|
| { "audioContent" : string } |
| Fields | |
|---|---|
| audioContent | The audio data bytes encoded as specified in the request, including the header for encodings that are wrapped in containers (e.g. MP3, OGG_OPUS). For LINEAR16 audio, we include the WAV header. Note: as with all bytes fields, protobuffers use a pure binary representation, whereas JSON representations use base64. A base64-encoded string. |
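
Because the JSON response carries `audioContent` as base64, a client has to decode it before writing or playing the audio. The sketch below shows one way to do that with the Python standard library. The helper name `extract_audio` is hypothetical, and the sample response is a stand-in built locally, not real service output.

```python
import base64
import json


def extract_audio(response_text):
    """Decode the base64 audioContent field of a JSON response into raw bytes."""
    payload = json.loads(response_text)
    return base64.b64decode(payload["audioContent"])


# Stand-in response for illustration only: a few bytes that look like the
# start of a WAV header, base64-encoded the way the JSON representation is.
fake_audio = b"RIFF\x00\x00\x00\x00WAVE"
sample_response = json.dumps(
    {"audioContent": base64.b64encode(fake_audio).decode("ascii")}
)

audio_bytes = extract_audio(sample_response)
print(len(audio_bytes), "bytes decoded")
```

The decoded bytes already include any container header, so they can be written to a file in binary mode without further processing.
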
Authorization scopes
Requires the following OAuth scope:
- https://www.googleapis.com/auth/cloud-platform
For more information, see the Authentication Overview.
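
To show how the endpoint and the OAuth requirement fit together, here is a minimal sketch that prepares (but does not send) the POST request with `urllib.request`. It assumes an OAuth 2.0 access token carrying the scope above has already been obtained elsewhere; the token string, the helper name `make_request`, and the empty nested objects are placeholders.

```python
import json
import urllib.request

ENDPOINT = "https://texttospeech.googleapis.com/v1/text:synthesize"


def make_request(body, access_token):
    """Build a POST request object for text:synthesize without sending it."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # The token is passed as a standard OAuth 2.0 bearer token.
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json; charset=utf-8",
        },
        method="POST",
    )


# Placeholder body: the nested objects must be filled in before a real call.
placeholder_body = {"input": {}, "voice": {}, "audioConfig": {}}
req = make_request(placeholder_body, "ACCESS_TOKEN_PLACEHOLDER")
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would perform the call; that step is omitted because it needs valid credentials and a complete body.
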
AdvancedVoiceOptions
Used for advanced voice options.
| JSON representation |
|---|
| { "lowLatencyJourneySynthesis" : boolean } |
| Fields | |
|---|---|
| lowLatencyJourneySynthesis | Only for Journey voices. If false, the synthesis is context aware and has a higher latency. |
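
As a small sketch of where this object sits in a request, the helper below attaches `advancedVoiceOptions` to an existing request body. The helper name `with_low_latency` is hypothetical, the placeholder body is illustrative, and the behavior when the flag is true is only inferred from the description of the false case.

```python
def with_low_latency(body, enabled=True):
    """Return a copy of a request body with advancedVoiceOptions attached."""
    updated = dict(body)  # shallow copy so the caller's dict is untouched
    updated["advancedVoiceOptions"] = {"lowLatencyJourneySynthesis": enabled}
    return updated


# Placeholder body; the nested objects are left empty for illustration.
base_body = {"input": {}, "voice": {}, "audioConfig": {}}
low_latency_body = with_low_latency(base_body)
print(low_latency_body["advancedVoiceOptions"])
```

Since the field applies only to Journey voices, it should be set only when the `voice` object selects one.
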

