Transcription service – speech-to-text

In the Universal Translator, we are going to translate spoken words from one language to another. The first step of this translation process is to know which words were spoken. For this, we are going to use the Amazon Transcribe service. Amazon Transcribe uses deep learning based Automatic Speech Recognition (ASR) algorithms to generate text from speech.

Let's use the AWS CLI to understand how the Transcribe service works. Issue the following command to start a transcription:

$ aws transcribe start-transcription-job  --transcription-job-name <jobname>  --language-code en-US  --media-format wav  --media MediaFileUri=https://s3.amazonaws.com/contents.aws.a/<audio file>.wav --output-bucket-name contents.aws.a ...

Get Hands-On Artificial Intelligence on Amazon Web Services now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.