6

Audio Segmentation

Abstract

This chapter covers audio segmentation, a vital stage of audio analysis that splits an uninterrupted audio signal into segments of homogeneous content. It describes two general categories of audio segmentation methods: those that employ supervised knowledge and those that are unsupervised or semi-supervised. Within this context, specific segmentation tasks are presented, e.g., silence removal and speaker diarization.

Keywords

Audio segmentation

Fixed-window segmentation

Probability smoothing

Silence removal

Signal change detection

Speaker diarization

Clustering

Unsupervised learning

Semi-supervised learning

Segmentation is a processing stage that is of vital importance ...
