Deep Learning in Audio Classification

Audio processing technology is happening everywhere in our life. We ask our car to make a call for us while driving, or we let Alexa turn off the light for us when we don’t want to get out of bed before sleep. In all of these audio-based applications and research, it is AI and ML that makes the comp...

Full description

Saved in:
Bibliographic Details
Published inInformation and Software Technologies Vol. 1665; pp. 64 - 77
Main Authors Wang, Yaqin, Wei-Kocsis, Jin, Springer, John A., Matson, Eric T.
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2022
Springer International Publishing
SeriesCommunications in Computer and Information Science
Subjects
Online AccessGet full text
ISBN9783031163012
303116301X
ISSN1865-0929
1865-0937
DOI10.1007/978-3-031-16302-9_5

Cover

More Information
Summary:Audio processing technology is happening everywhere in our life. We ask our car to make a call for us while driving, or we let Alexa turn off the light for us when we don’t want to get out of bed before sleep. In all of these audio-based applications and research, it is AI and ML that makes the computer or the smart phone understand us via our voice [1]. As an important part of artificial intelligence (AI), especially machine learning (ML), which has had great influences in many areas of AI and ML-based research and applications. This paper focuses on deep learning structures and applications for audio classification. We conduct a detailed review of literature in audio-based DL and DRL approaches and applications. We also discuss the limitation and possible future works for audio-based DL approach.
ISBN:9783031163012
303116301X
ISSN:1865-0929
1865-0937
DOI:10.1007/978-3-031-16302-9_5