10.8

Exploiting Visual Information in Automatic Speech Processing

Petar S. Aleksic, Northwestern University

Gerasimos Potamianos, IBM T.J. Watson Research Center

Aggelos K. Katsaggelos, Northwestern University

1 Introduction

2 Analysis of Visual Signals

2.1 Face Detection, Mouth, and Lip Tracking

2.2 Visual Features

2.3 Two Visual Feature Extraction Systems

3 Audiovisual Information Fusion

3.1 Speech Classes in Audiovisual Integration

3.2 Classifiers in Speech Applications

3.3 Feature and Classifier Fusion

4 Audiovisual Automatic Speech Recognition

4.1 Bimodal Corpora for Automatic Speech Recognition

4.2 Experimental Results

5 Audiovisual Speech Synthesis

5.1 Coarticulation Modeling

5.2 Facial Animation

5.3 Visual ...

From Handbook of Image and Video Processing, 2nd Edition.
