Skip to Content
Audio Source Separation and Speech Enhancement
book

Audio Source Separation and Speech Enhancement

by Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot
October 2018
Intermediate to advanced
504 pages
18h 50m
English
Wiley
Content preview from Audio Source Separation and Speech Enhancement

13Independent Component and Vector Analysis

Hiroshi Sawada and Zbyněk Koldovský

The concept of blind source separation (BSS) has been introduced in previous chapters. The term “blind” means that no a priori information is used for separation and that all parameters are estimated from observed signals based on assumed (general) properties of the unknown original signals. For example, in Chapter 8, nonnegative matrix factorization (NMF) was introduced as a blind method which relies only on nonnegativity. This chapter is devoted to methods that rely on signal independence, a condition that is often encountered in real‐world situations where signals originate from different processes that have no mutual connection between each other, e.g. speech and noise.

Efficient mathematical models of the independence come from probability theory. The signals to be separated can be modeled as stochastically independent random processes. Then, objective functions that quantify the independence can be derived based on the model and used to find independent signals. This gives rise to independent component analysis (ICA), a tool popular in BSS.

In principle, ICA assumes instantaneous mixing while audio mixtures are convolutive. This chapter is mainly focused on a solution that is called frequency‐domain ICA (FD‐ICA). The convolutive mixture is transformed by short‐time Fourier transform (STFT) into a set of instantaneous mixtures, one mixture per frequency bin. Each frequency is separated using ICA independently ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Techniques for Noise Robustness in Automatic Speech Recognition

Techniques for Noise Robustness in Automatic Speech Recognition

Rita Singh, Tuomas Virtanen, Bhiksha Raj
Parametric Time-Frequency Domain Spatial Audio

Parametric Time-Frequency Domain Spatial Audio

Ville Pulkki, Symeon Delikaris-Manias, Archontis Politis

Publisher Resources

ISBN: 9781119279891Purchase book