Dialogflow voice agents use Speech-to-Text for speech recognition, which is included in Dialogflow pricing . Dialogflow automatically selects a speech recognition model for you, but you can optionally specify the model.
Available models
All available models are listed at Speech-to-Text models . Select a model that is best suited to your domain and supports your agent language and speech features .
If a model is not explicitly specified , then Dialogflow auto-selects a model based on the audio configuration in API requests and agent settings.
The following models typically have the best performance:
- telephony_short (best for telephony Dialogflow)
- telephony (best for Agent Assist) (also good for telephony Dialogflow when advanced timeout-based end of speech sensitivity is enabled)
- phone_call (good for Agent Assist and telephony Dialogflow)
- latest_short (best for non-telephony Dialogflow)
- command_and_search (best for languages where other models are not available)
Specify a model
You can supply the model for an agent, flow, or page with the model selection setting .You can also supply the model when calling the Sessions.detectIntent
or Sessions.streamingDetectIntent
methods;
Select a protocol and version for the Session reference:
Protocol | V3 | V3beta1 |
---|---|---|
REST
|
Session resource | Session resource |
RPC
|
Session interface | Session interface |
C++
|
SessionsClient | Not available |
C#
|
SessionsClient | Not available |
Go
|
SessionsClient | Not available |
Java
|
SessionsClient | SessionsClient |
Node.js
|
SessionsClient | SessionsClient |
PHP
|
Not available | Not available |
Python
|
SessionsClient | SessionsClient |
Ruby
|
Not available | Not available |
ConversationProfile
for Agent Assist
.
Specifying the model in a detect intent or conversation profile API call will
override any model selections applied to the agent, flow, or page, unless you
enable the Override request-level speech
model
setting.