Deep Learning in Audio Classification

Bibliographic Details
Published in	Information and Software Technologies, Vol. 1665, pp. 64-77
Main Authors	Wang, Yaqin; Wei-Kocsis, Jin; Springer, John A.; Matson, Eric T.
Format	Book Chapter
Language	English
Published	Switzerland: Springer International Publishing AG, 2022
Series	Communications in Computer and Information Science
Subjects	Audio classification; Deep learning; Deep reinforcement learning; Machine learning
ISBN	9783031163012, 303116301X
ISSN	1865-0929, 1865-0937
DOI	10.1007/978-3-031-16302-9_5

Summary:	Audio processing technology is everywhere in our lives. We ask our car to make a call for us while driving, or we let Alexa turn off the light when we don’t want to get out of bed before going to sleep. In all of these audio-based applications and research areas, it is AI and ML that make the computer or the smartphone understand us via our voice [1]. Deep learning (DL) is an important part of artificial intelligence (AI), and especially of machine learning (ML), and has had great influence on many areas of AI- and ML-based research and applications. This paper focuses on deep learning structures and applications for audio classification. We conduct a detailed review of the literature on audio-based DL and deep reinforcement learning (DRL) approaches and applications. We also discuss the limitations of audio-based DL approaches and possible future work.