COMPUTER EXERCISES

Use the speech file ‘Ch2speech.wav’ from the Book Website for all the computer exercises in this chapter.

2.24. Consider the 2-band QMF bank shown in Figure 2.35. In this figure, x(n) denotes speech frames of 256 samples and image denotes the synthesized speech frames.

  1. Design the transfer functions, F0(z) and F1(z), such that aliasing is cancelled. Also calculate the overall delay of the QMF bank.
  2. Select an arbitrary voiced speech frame from Ch2speech.wav. Give time-domain and frequency-domain plots of xd0(n) and xd1(n) for that particular frame. Comment on the frequency-domain plots with regard to low-pass/high-pass band-splitting.

    image

    Figure 2.35. A two-band QMF bank.

    image

    Figure 2.36. Speech synthesis from a select number (subset) of FFT components.

    Table 2.2. Signal-to-noise ratio (SNR) and MOS values.

    image

  3. Repeat step (b) for x1(n) and xd1(n) in order to compare the signals before and after the downsampling stage.
  4. Calculate the SNR between the input speech record, x(n), and the synthesized speech record, . Use the following equation to compute the SNR,

    Listen to the synthesized ...

Get Audio Signal Processing and Coding now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.