Almost all noise-robust ASR techniques discussed so far in the book have assumed a single microphone device that captures the distorted speech signal. We devote this chapter to techniques developed for multiple such devices. Thanks to the availability of cheap hardware, a growing share of devices feature multiple sound-capturing channels. This adds a spatial dimension to the otherwise purely spectro-temporal processing of single-microphone systems. If the target speech and the interferers are spatially separated, beamforming and other multi-channel processing can greatly improve the target signal-to-noise ratio. Multi-channel processing is particularly advantageous in the presence of nonstationary interferers ...
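To make the signal-to-noise ratio claim concrete, here is a minimal sketch of delay-and-sum beamforming, one of the simplest multi-channel techniques. It is an illustration under simplifying assumptions, not the method the chapter develops: the per-microphone delays are assumed known and integer-valued, the noise is assumed independent across microphones, and the names `delay_and_sum`, `snr_db`, and `demo` are hypothetical.

```python
import math
import random


def delay_and_sum(channels, delays):
    """Advance each channel by its known integer delay, then average.

    channels: list of equal-length sample lists, one per microphone.
    delays:   arrival delay of the target at each microphone, in samples
              (assumed known here; in practice it must be estimated).
    """
    max_delay = max(delays)
    length = len(channels[0]) - max_delay
    return [
        sum(ch[n + d] for ch, d in zip(channels, delays)) / len(channels)
        for n in range(length)
    ]


def snr_db(clean, noisy):
    """SNR in dB, treating (noisy - clean) as the noise component."""
    signal_power = sum(s * s for s in clean)
    noise_power = sum((x - s) ** 2 for s, x in zip(clean, noisy))
    return 10.0 * math.log10(signal_power / noise_power)


def demo(num_mics=4, num_samples=4000, seed=0):
    """Simulate a delayed sine at each microphone plus independent noise."""
    rng = random.Random(seed)
    clean = [math.sin(2.0 * math.pi * 0.01 * n) for n in range(num_samples)]
    delays = [2 * m for m in range(num_mics)]  # hypothetical array geometry
    channels = [
        [(clean[n - d] if n >= d else 0.0) + rng.gauss(0.0, 0.5)
         for n in range(num_samples)]
        for d in delays
    ]
    enhanced = delay_and_sum(channels, delays)
    length = len(enhanced)
    snr_in = snr_db(clean[:length], channels[0][:length])
    snr_out = snr_db(clean[:length], enhanced)
    return snr_in, snr_out


if __name__ == "__main__":
    snr_in, snr_out = demo()
    print(f"single microphone: {snr_in:.1f} dB, beamformed: {snr_out:.1f} dB")
```

Aligning the channels makes the target add coherently while the independent noise averages out, so under these assumptions the gain approaches 10 log10(M) dB for M microphones. Spatially correlated or directional interference, the case the text emphasizes, generally calls for adaptive beamformers rather than this fixed average.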