Enable profanity filtering

This page describes how to use Cloud Speech-to-Text to automatically detect profanity in your audio data and censor them in the transcript.

You can enable the profanity filter by setting profanityFilter=true in the RecognitionFeatures . If enabled, Cloud Speech-to-Text will attempt to detect profane words and return only the first letter followed by asterisks in the transcript (for example, f***). If this field is set to false or not set, Cloud Speech-to-Text will not attempt to filter profanities.

The following sample demonstrates how to enable the profanity filter to recognize audio stored in a Cloud Storage bucket.

Python

To learn how to install and use the client library for Cloud STT, see Cloud STT client libraries . For more information, see the Cloud STT Python API reference documentation .

To authenticate to Cloud STT, set up Application Default Credentials. For more information, see Set up authentication for a local development environment .

  from 
  
 google.cloud 
  
 import 
 speech 
 from 
  
 google.cloud.speech 
  
 import 
 RecognizeResponse 
 def 
  
 sync_recognize_with_profanity_filter_gcs 
 ( 
 audio_uri 
 : 
 str 
 ) 
 - 
> RecognizeResponse 
 : 
  
 """Recognizes speech from an audio file in Cloud Storage and filters out profane language. 
 Args: 
 audio_uri (str): The Cloud Storage URI of the input audio, e.g., gs://[BUCKET]/[FILE] 
 Returns: 
 cloud_speech.RecognizeResponse: The full response object which includes the transcription results. 
 """ 
 # Define the audio source 
 audio 
 = 
 { 
 "uri" 
 : 
 audio_uri 
 } 
 client 
 = 
 speech 
 . 
 SpeechClient 
 () 
 config 
 = 
 speech 
 . 
  RecognitionConfig 
 
 ( 
 encoding 
 = 
 speech 
 . 
 RecognitionConfig 
 . 
 AudioEncoding 
 . 
 FLAC 
 , 
 # Audio format 
 sample_rate_hertz 
 = 
 16000 
 , 
 language_code 
 = 
 "en-US" 
 , 
 # Enable profanity filter 
 profanity_filter 
 = 
 True 
 , 
 ) 
 response 
 = 
 client 
 . 
  recognize 
 
 ( 
 config 
 = 
 config 
 , 
 audio 
 = 
 audio 
 ) 
 for 
 result 
 in 
 response 
 . 
 results 
 : 
 alternative 
 = 
 result 
 . 
 alternatives 
 [ 
 0 
 ] 
 print 
 ( 
 f 
 "Transcript: 
 { 
 alternative 
 . 
 transcript 
 } 
 " 
 ) 
 return 
 response 
 . 
 results 
 
Design a Mobile Site
View Site in Mobile | Classic
Share by: