MultiSpeakerVoiceConfig

Configuration for a multi-speaker text-to-speech setup. Enables the use of two distinct voices in a single synthesis request.

JSON representation
{
  "speakerVoiceConfigs": [
    {
      object (MultispeakerPrebuiltVoice)
    }
  ]
}
Fields
speakerVoiceConfigs[]

object (MultispeakerPrebuiltVoice)

Required. A list of configurations for the voices of the speakers. Exactly two speaker voice configurations must be provided.
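
As an illustration of the "exactly two" requirement, the following is a minimal sketch that assembles the request fragment as a plain Python dictionary and rejects any other number of speakers. The helper name build_multi_speaker_voice_config is hypothetical, and the speaker IDs "Kore" and "Puck" are placeholders assumed for the example; check the voice options page for the values actually available. No client library is used here, only the JSON shape described above.

```python
import json


def build_multi_speaker_voice_config(speakers):
    """Build a MultiSpeakerVoiceConfig dict from (speakerAlias, speakerId) pairs.

    Hypothetical helper: it only mirrors the JSON shape in this reference.
    """
    speakers = list(speakers)
    # The field description requires exactly two speaker voice configurations.
    if len(speakers) != 2:
        raise ValueError(
            f"exactly two speaker voice configurations are required, got {len(speakers)}"
        )
    configs = []
    for alias, speaker_id in speakers:
        # Both fields are marked Required in MultispeakerPrebuiltVoice.
        if not alias or not speaker_id:
            raise ValueError("speakerAlias and speakerId are both required")
        configs.append({"speakerAlias": alias, "speakerId": speaker_id})
    return {"speakerVoiceConfigs": configs}


# "Kore" and "Puck" are assumed placeholder speaker IDs.
config = build_multi_speaker_voice_config([("Speaker1", "Kore"), ("Speaker2", "Puck")])
print(json.dumps(config, indent=2))
```

Validating on the client side is optional; it simply surfaces a malformed configuration before the request is sent.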

MultispeakerPrebuiltVoice

Configuration for a single speaker in a Gemini TTS multi-speaker setup. Enables dialogue between two speakers.

JSON representation
{
  "speakerAlias": string,
  "speakerId": string
}
Fields
speakerAlias

string

Required. The speaker alias of the voice. This is the user-chosen speaker name that is used in the multispeaker text input, such as "Speaker1".

speakerId

string

Required. The speaker ID of the voice. See https://cloud.google.com/text-to-speech/docs/gemini-tts#voice_options for available values.
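
Because each speakerAlias must match the speaker names used in the multispeaker text input, it can help to generate that text from the same aliases that appear in the configuration. The sketch below does this under an assumption: the "Alias: line" layout, one turn per line, is chosen for illustration and is not specified in this reference, so confirm the expected input format in the Gemini TTS documentation. The function name format_dialogue and the speaker IDs are likewise hypothetical.

```python
def format_dialogue(turns, config):
    """Render (alias, line) turns as text, rejecting aliases missing from config.

    Assumption: each turn is written as "Alias: line" on its own line.
    """
    known = {c["speakerAlias"] for c in config["speakerVoiceConfigs"]}
    lines = []
    for alias, line in turns:
        # An alias in the text with no matching speakerAlias has no voice assigned.
        if alias not in known:
            raise ValueError(f"unknown speaker alias: {alias!r}")
        lines.append(f"{alias}: {line}")
    return "\n".join(lines)


# Placeholder configuration; the speaker IDs are assumed values.
dialogue_config = {
    "speakerVoiceConfigs": [
        {"speakerAlias": "Speaker1", "speakerId": "Kore"},
        {"speakerAlias": "Speaker2", "speakerId": "Puck"},
    ]
}

text = format_dialogue(
    [("Speaker1", "Hello"), ("Speaker2", "Hi there")],
    dialogue_config,
)
print(text)
```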
