O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Building Cognitive Applications with IBM Watson Services: Volume 6 Speech to Text and Text to Speech

Book Description


The Building Cognitive Applications with IBM Watson Services series is a seven-volume collection that introduces IBM® Watson™ cognitive computing services. The series includes an overview of specific IBM Watson® services with their associated architectures and simple code examples. Each volume describes how you can use and implement these services in your applications through practical use cases.

The series includes the following volumes:

  • , SG24-8387

    Volume 1 Getting Started

  • , SG24-8394

    Volume 2 Conversation

  • , SG24-8393

    Volume 3 Visual Recognition

  • , SG24-8391

    Volume 4 Natural Language Classifier

  • , SG24-8392

    Volume 5 Language Translator

  • , SG24-8388

    Volume 6 Speech to Text and Text to Speech

  • , SG24-8398

    Volume 7 Natural Language Understanding

  • Whether you are a beginner or an experienced developer, this collection provides the information you need to start your research on Watson services. If your goal is to become more familiar with Watson in relation to your current environment, or if you are evaluating cognitive computing, this collection can serve as a powerful learning tool.

    This IBM Redbooks® publication, Volume 6, introduces the Watson Text to Speech (converts written text into natural sounding audio in various languages and voices) and Watson Speech to Text (converts audio voice into written text) services. This book introduces concepts that you need to understand to use these Watson services and provides simple code examples to illustrate the use of the APIs. This book includes examples of applications that demonstrate how to use the Watson Text to Speech and Speech to Text services in practical use cases. You can develop and deploy the sample applications by following along in a step-by-step approach and using provided code snippets. Alternatively, you can download an existing Git project to more quickly deploy the application.

    Table of Contents

    1. Front cover
    2. Notices
      1. Trademarks
    3. Preface
      1. Authors
      2. Now you can become a published author, too!
      3. Comments welcome
      4. Stay connected to IBM Redbooks
    4. Chapter 1. Basics of Watson Speech to Text and Text to Speech services
      1. 1.1 Watson Speech to Text service
        1. 1.1.1 Authentication
        2. 1.1.2 Use case example
        3. 1.1.3 Speech to Text flow
        4. 1.1.4 Code snippets
      2. 1.2 Watson Text to Speech service
        1. 1.2.1 Authentication
        2. 1.2.2 Use case example
        3. 1.2.3 Text to Speech flow
        4. 1.2.4 Code snippets
    5. Chapter 2. Creating Bluemix services
      1. 2.1 Creating the Speech to Text service
      2. 2.2 Creating the Text to Speech service
      3. 2.3 Creating the Language Translator service
      4. 2.4 Creating the Personality Insights service
      5. 2.5 Retrieving the service credentials
    6. Chapter 3. Personality Analyzer
      1. 3.1 Getting started
        1. 3.1.1 Objectives
        2. 3.1.2 Prerequisites
        3. 3.1.3 Expected results
      2. 3.2 Architecture
      3. 3.3 Two ways to deploy the application: Step-by-step and quick deploy
      4. 3.4 Step-by-step implementation
        1. 3.4.1 Downloading the project from Git
        2. 3.4.2 Reviewing the project structure
        3. 3.4.3 Setting up Watson services in Bluemix
        4. 3.4.4 Providing credentials for the Watson services
        5. 3.4.5 Using the Watson Speech to Text (STT) service
        6. 3.4.6 Using the Watson Personality Insights (PI) service
        7. 3.4.7 Providing a unique name for your application
        8. 3.4.8 Deploying the application on Bluemix
        9. 3.4.9 Testing the application
      5. 3.5 Quick deployment of application
      6. 3.6 References
    7. Chapter 4. Real-time transcription
      1. 4.1 Getting started
        1. 4.1.1 Objectives
        2. 4.1.2 Prerequisites
        3. 4.1.3 Expected results
      2. 4.2 Architecture
      3. 4.3 Two ways to deploy the app: Step-by-step and quick deploy
      4. 4.4 Step-by-step implementation
        1. 4.4.1 Preparing the mobile device where the app will run
        2. 4.4.2 Downloading the project from the Git repository
        3. 4.4.3 Importing the project to Android Studio
        4. 4.4.4 Reviewing the project structure and the code provided
        5. 4.4.5 Importing the required Watson libraries
        6. 4.4.6 Using the Speech to Text (STT) service
        7. 4.4.7 Using the Text to Speech (TTS) service
        8. 4.4.8 Setting up the credentials
        9. 4.4.9 Testing the application
      5. 4.5 Quick deployment of application
      6. 4.6 References
    8. Appendix A. Additional material
      1. Locating the Git repositories
    9. Related publications
      1. IBM Redbooks
      2. Online resources
      3. Help from IBM
    10. Back cover