June 2019
Intermediate to advanced
308 pages
7h 21m
English
Data preprocessing is an essential step in a DL pipeline. The speech commands dataset consists of 1-second .wav files, one per short speech command, and these files only need to be converted into spectrum images. The audio files downloaded for the second use case, however, are not uniform in length; hence, they require two-step preprocessing:
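For the speech commands case, converting a 1-second .wav file into a spectrum image might look like the following minimal sketch. It is an illustration under stated assumptions, not the pipeline used in this chapter: the function names `read_wav_mono` and `spectrogram_image`, the 16-bit mono PCM input format, and the frame length and hop size are all hypothetical choices, and NumPy is assumed to be available.

```python
import wave

import numpy as np


def read_wav_mono(path):
    """Read a 16-bit mono PCM .wav file (an assumption of this sketch).

    Returns (samples scaled to [-1, 1), sample_rate).
    """
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2 or wf.getnchannels() != 1:
            raise ValueError("this sketch assumes 16-bit mono PCM audio")
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float32) / 32768.0
    return samples, rate


def spectrogram_image(samples, frame_len=256, hop=128):
    """Turn a 1-D signal into a uint8 image of shape (freq_bins, frames).

    frame_len and hop are illustrative defaults, not values from the text.
    Row 0 holds the lowest frequency bin.
    """
    samples = np.asarray(samples, dtype=np.float64)
    if len(samples) < frame_len:
        # Zero-pad very short clips so that at least one frame exists.
        samples = np.pad(samples, (0, frame_len - len(samples)))
    n_frames = 1 + (len(samples) - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack(
        [samples[i * hop:i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude of the real FFT of each frame, then log compression.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    log_mag = np.log1p(magnitude).T
    # Rescale to the 0..255 range of an 8-bit grayscale image.
    span = log_mag.max() - log_mag.min()
    if span > 0:
        scaled = (log_mag - log_mag.min()) / span
    else:
        scaled = np.zeros_like(log_mag)
    return (255 * scaled).astype(np.uint8)
```

Calling `spectrogram_image(*read_wav_mono(path)[:1])` on a clip yields a 2-D array that can be written to disk with any image library. Because every clip in this dataset is 1 second long, the resulting images share the same dimensions at a given sampling rate, which is why no further preprocessing is needed for this use case.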
The preprocessing of the datasets is discussed in the data collection section. A few issues to note when preparing the training image set are as follows: