Audio Segmentation
Abstract
This chapter addresses a vital stage of audio analysis, audio segmentation, which splits an uninterrupted audio signal into segments of homogeneous content. It describes two general categories of segmentation methods: those that employ supervised knowledge and those that are unsupervised or semi-supervised. Within this context, specific segmentation tasks are presented, e.g., silence removal and speaker diarization.
Keywords
Audio segmentation
Fixed-window segmentation
Probability smoothing
Silence removal
Signal change detection
Speaker diarization
Clustering
Unsupervised learning
Semi-supervised learning
Segmentation is a processing stage that is of vital importance ...