Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review-Reference-Cited by-同舟云学术

Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review

Published:2022-11-18 Issue:22 Volume:11 Page:3795
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Abayomi-Alli Olusola O.^ORCID,Damaševičius Robertas^ORCID,Qazi Atika,Adedoyin-Olowe Mariam,Misra Sanjay^ORCID

Abstract

The aim of this systematic literature review (SLR) is to identify and critically evaluate current research advancements with respect to small data and the use of data augmentation methods to increase the amount of data available for deep learning classifiers for sound (including voice, speech, and related audio signals) classification. Methodology: This SLR was carried out based on the standard SLR guidelines based on PRISMA, and three bibliographic databases were examined, namely, Web of Science, SCOPUS, and IEEE Xplore. Findings. The initial search findings using the variety of keyword combinations in the last five years (2017–2021) resulted in a total of 131 papers. To select relevant articles that are within the scope of this study, we adopted some screening exclusion criteria and snowballing (forward and backward snowballing) which resulted in 56 selected articles. Originality: Shortcomings of previous research studies include the lack of sufficient data, weakly labelled data, unbalanced datasets, noisy datasets, poor representations of sound features, and the lack of effective augmentation approach affecting the overall performance of classifiers, which we discuss in this article. Following the analysis of identified articles, we overview the sound datasets, feature extraction methods, data augmentation techniques, and its applications in different areas in the sound classification research problem. Finally, we conclude with the summary of SLR, answers to research questions, and recommendations for the sound classification task.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/22/3795/pdf

Reference127 articles.

1. Computer vision and artificial intelligence in precision agriculture for grain crops: A systematic review;Comput. Electron. Agric.,2018

2. Machine Learning and Natural Language Processing in Mental Health: Systematic Review;J. Med. Internet Res.,2021

3. Artificial intelligence in healthcare: Review and prediction case studies;Engineering,2020

4. Artificial intelligence for fault diagnosis of rotating machinery: A review;Mech. Syst. Signal Process.,2018

5. Zinemanas, P., Rocamora, M., Miron, M., Font, F., and Serra, X. (2021). An Interpretable Deep Learning Model for Automatic Sound Classification. Electronics, 10.

Cited by 37 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A comprehensive survey for generative data augmentation;Neurocomputing;2024-10

2. ClaveNet: Generating Afro-Cuban Drum Patterns through Data Augmentation;Audio Mostly 2024 - Explorations in Sonic Cultures;2024-09-18

3. A Comparative Analysis of the TDCGAN Model for Data Balancing and Intrusion Detection;Signals;2024-09-12

4. AI integration in construction safety: Current state, challenges, and future opportunities in text, vision, and audio based applications;Automation in Construction;2024-08

5. Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning;Bioacoustics;2024-07-02