Appendix C

Audio Datasets

Abstract

This appendix provides a list of datasets which are available on the Web, that can be used as training and evaluation data for several audio analysis tasks.

Keywords

Audio datasets

Benchmarking

Several datasets and benchmarks that focus on audio analysis tasks are available on the Web. The diversity of the datasets is high with respect to: size, level of annotation, and addressed audio analysis tasks. For example, there are datasets for general audio event classification and segmentation; musical genre classification; speech emotion recognition; speech vs music discrimination; speaker diarization; speaker identification, etc. In addition, these datasets may or may not contain other non-audio media types ...

Get Introduction to Audio Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.