Make an audio transcription request (beta)
Stay organized with collections
Save and categorize content based on your preferences.
Transcribe a local audio file synchronously.
Code sample
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License
, and code samples are licensed under the Apache 2.0 License
. For details, see the Google Developers Site Policies
. Java is a registered trademark of Oracle and/or its affiliates.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],[],[],[],null,["# Make an audio transcription request (beta)\n\nTranscribe a local audio file synchronously.\n\nCode sample\n-----------\n\n### Python\n\n\nTo learn how to install and use the client library for Speech-to-Text, see\n[Speech-to-Text client libraries](/speech-to-text/docs/client-libraries).\n\n\nFor more information, see the\n[Speech-to-Text Python API\nreference documentation](/python/docs/reference/speech/latest).\n\n\nTo authenticate to Speech-to-Text, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n from google.cloud import speech_v1p1beta1 as speech\n\n\n def sample_recognize(storage_uri: str) -\u003e speech.RecognizeResponse:\n \"\"\"\n Performs synchronous speech recognition on an audio file\n\n Args:\n storage_uri URI for audio file in Cloud Storage, e.g. gs://[BUCKET]/[FILE]\n \"\"\"\n\n client = speech.SpeechClient()\n\n # storage_uri = 'gs://cloud-samples-data/speech/brooklyn_bridge.mp3'\n\n # The language of the supplied audio\n language_code = \"en-US\"\n\n # Sample rate in Hertz of the audio data sent\n sample_rate_hertz = 44100\n\n # Encoding of audio data sent. This sample sets this explicitly.\n # This field is optional for FLAC and WAV audio formats.\n encoding = speech.RecognitionConfig.AudioEncoding.MP3\n config = {\n \"language_code\": language_code,\n \"sample_rate_hertz\": sample_rate_hertz,\n \"encoding\": encoding,\n }\n audio = {\"uri\": storage_uri}\n\n response = client.recognize(config=config, audio=audio)\n\n for result in response.results:\n # First alternative is the most probable result\n alternative = result.alternatives[0]\n print(f\"Transcript: {alternative.transcript}\")\n\nWhat's next\n-----------\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=speech)."]]