Chapter 4

Disambiguating Conflicting Classification Results in AVSR

Gonzalo D. Sad; Lucas D. Terissi; Juan C. Gómez
Laboratory for System Dynamics and Signal Processing, Universidad Nacional de Rosario, CIFASIS-CONICET, Rosario, Argentina


A novel scheme for disambiguating conflicting classification results in audio-visual speech recognition (AVSR) applications is proposed in this chapter. The strategy can be implemented with either generative or discriminative models, and it can be employed equally with audio, visual, or audio-visual input information. The proposed training procedure introduces the concept of complementary models. A complementary model to a particular class j refers to a model ...

