Parametric Time-Frequency Domain Spatial Audio

by Ville Pulkki, Symeon Delikaris-Manias, Archontis Politis
December 2017
Intermediate to advanced
409 pages
14h 8m
English
Wiley-IEEE Press

12 Microphone-Array-Based Speech Enhancement Using Neural Networks

Pasi Pertilä

Department of Signal Processing, Tampere University of Technology, Finland

12.1 Introduction

As discussed in Chapter 10, the noise reduction capacity of beamforming can in practice be rather modest, and post-filtering by time–frequency (TF) masking is often needed to further reduce the noise and interference in the beamformer’s output. The Wiener filter is theoretically an optimal method (in the mean squared error sense) for noise suppression, but it requires the noise power spectrum (or that of the target signal) to be available during operation. This is problematic in typical real-world scenarios, where only the noisy target signal is observed and no explicit noise (or target) signal is available. A traditional speech enhancement approach is to update the estimates of the noise parameters during pauses in the speech. In environments where the noise statistics do not change significantly before the next update is available, this approach can achieve good noise suppression. Different variants of this technique have been developed in the past (see, for example, Diethorn, 2004). However, relying on a voice activity detection scheme inherently increases the system’s complexity and decreases its robustness. Furthermore, real-world noise is often dynamic, which violates the assumption of noise stationarity. The errors made in the parameter estimates required by the approach ultimately ...
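
To make the traditional approach concrete, the following is a minimal sketch, not the chapter's own method. It assumes the target and noise are uncorrelated, so that the Wiener gain S/(S + N) can be approximated from the observed power Y as max(1 - N/Y, floor), and it assumes that a voice activity decision per frame is already available. The function names (`wiener_mask`, `update_noise_psd`, `enhance`), the smoothing factor `alpha`, the gain `floor`, and the choice to initialize the noise estimate from the first frame are all illustrative assumptions.

```python
import numpy as np


def wiener_mask(noisy_psd, noise_psd, floor=0.0):
    """Approximate Wiener gain per TF bin: clip(1 - N / Y, floor, 1)."""
    eps = np.finfo(float).eps  # guards against division by zero
    gain = 1.0 - noise_psd / np.maximum(noisy_psd, eps)
    return np.clip(gain, floor, 1.0)


def update_noise_psd(noise_psd, frame_psd, is_speech, alpha=0.9):
    """Recursively average the noise PSD, but only in non-speech frames."""
    if is_speech:
        return noise_psd  # freeze the estimate while speech is active
    return alpha * noise_psd + (1.0 - alpha) * frame_psd


def enhance(stft, speech_flags, alpha=0.9, floor=0.05):
    """Apply the mask frame by frame.

    stft: complex array of shape (frames, bins).
    speech_flags: one boolean per frame from a (hypothetical) voice
        activity detector.
    """
    psd = np.abs(stft) ** 2
    noise_psd = psd[0].copy()  # assumption: the first frame is noise only
    out = np.empty_like(stft)
    for t in range(stft.shape[0]):
        noise_psd = update_noise_psd(noise_psd, psd[t], speech_flags[t], alpha)
        out[t] = wiener_mask(psd[t], noise_psd, floor) * stft[t]
    return out
```

In this sketch the noise estimate is frozen whenever speech is flagged, so any change in the noise during speech goes untracked, and a wrong voice activity decision feeds speech energy into the noise estimate. These are the weaknesses the paragraph above attributes to this family of methods.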


Publisher Resources

ISBN: 9781119252597