8
Audio Data Augmentation with Spectrogram
In the previous chapter, we visualized the sound using the Waveform graph. An audio spectrogram is another visualizing method for seeing the audio components. The inputs to the Spectrogram are a one-dimensional array of amplitude values and the sampling rate. They are the same inputs as the Waveform graph.
An audio spectrogram is sometimes called a sonograph, sonogram, voiceprint, or voicegram. The Spectrogram is a more detailed representation of sound than the Waveform graph. It shows a correlation between frequency and amplitude (loudness) over time, which helps visualize the frequency content in a signal. Spectrograms make it easier to identify musical elements, detect melodic patterns, recognize ...
Get Data Augmentation with Python now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.