Skip to Content
Spatial Audio Processing: MPEG Surround and Other Applications
book

Spatial Audio Processing: MPEG Surround and Other Applications

by Jeroen Breebaart, Christof Faller
December 2007
Intermediate to advanced
224 pages
7h 18m
English
Wiley-Interscience
Content preview from Spatial Audio Processing: MPEG Surround and Other Applications

4.3 Binaural Cue Coding (BCC)

4.3.1 Time–frequency processing

BCC processes audio signals with a certain time and frequency resolution. The frequency resolution used is largely motivated by the frequency resolution of the auditory system (see Chapter 3). Psychoacoustics suggest that spatial perception is most likely based on a critical band representation of the acoustic input signal [26]. This frequency resolution is considered by using an invertible filterbank with sub-bands with bandwidths equal or proportional to the critical bandwidth of the auditory system [98, 293]. The specific time and frequency resolution used for BCC is discussed later in Section 4.3.3.

4.3.2 Down-mixing to one channel

It is important that the transmitted down-mix signal contains all signal components of the input audio signal. The goal is that each signal component is fully maintained. Simple summation of the audio input channels often results in amplification or attenuation of signal components. In other words, the power of signal components in the ‘simple’ sum is often larger or smaller than the sum of the power of the corresponding signal component of each channel. Therefore, a down-mixing technique is used which equalizes the down-mix signal such that the power of signal components in the down-mix signal is approximately the same as the corresponding power in all input channels.

Figure 4.2 shows the down-mixing scheme. The input audio channels xc(n) (1 ≤ c ≤ C) are decomposed into a number of sub-bands. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Parametric Time-Frequency Domain Spatial Audio

Parametric Time-Frequency Domain Spatial Audio

Ville Pulkki, Symeon Delikaris-Manias, Archontis Politis
Audio Source Separation and Speech Enhancement

Audio Source Separation and Speech Enhancement

Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot

Publisher Resources

ISBN: 9780470723487Purchase book