Snore Sound Classification With Mel-Spectrogram and a Fine-Tuned CNN

Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can be due to factors such as relaxed throat muscles, excess tissue, nasal congestion, or structural abnormalities. While snoring is common and va...

Full description

Saved in:
Bibliographic Details
Published inIEEE-EMBS Conference on Biomedical Engineering and Sciences pp. 479 - 482
Main Authors Sharan, Roneel V., Schuller, Bjorn W.
Format Conference Proceeding
LanguageEnglish
Published IEEE 11.12.2024
Subjects
Online AccessGet full text
ISSN2573-3028
DOI10.1109/IECBES61011.2024.10991306

Cover

Abstract Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can be due to factors such as relaxed throat muscles, excess tissue, nasal congestion, or structural abnormalities. While snoring is common and varies in intensity, it can sometimes signal a more serious condition like sleep apnea. Identifying the excitation location of snore sound is important for pinpointing the site of airway obstruction, leading to more targeted and effective treatments tailored to individual anatomical challenges. In this work, we propose a method for detecting the excitation location of snoring by frame-based classification on a dataset of 828 snore sounds from 219 subjects, with expert annotations into four distinct excitation locations. Each segmented snore sound is divided into frames and converted into a Mel-spectrogram, a time-frequency representation that serves as input to a pretrained convolutional neural network designed for audio classification. We fine-tune the network with a modified classification layer with inverse class weights to account for the class imbalance. Our method achieves an improvement of 6.60% in average classification accuracy over the baseline method, demonstrating its effectiveness in distinguishing snoring excitation locations based on acoustic characteristics.
AbstractList Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can be due to factors such as relaxed throat muscles, excess tissue, nasal congestion, or structural abnormalities. While snoring is common and varies in intensity, it can sometimes signal a more serious condition like sleep apnea. Identifying the excitation location of snore sound is important for pinpointing the site of airway obstruction, leading to more targeted and effective treatments tailored to individual anatomical challenges. In this work, we propose a method for detecting the excitation location of snoring by frame-based classification on a dataset of 828 snore sounds from 219 subjects, with expert annotations into four distinct excitation locations. Each segmented snore sound is divided into frames and converted into a Mel-spectrogram, a time-frequency representation that serves as input to a pretrained convolutional neural network designed for audio classification. We fine-tune the network with a modified classification layer with inverse class weights to account for the class imbalance. Our method achieves an improvement of 6.60% in average classification accuracy over the baseline method, demonstrating its effectiveness in distinguishing snoring excitation locations based on acoustic characteristics.
Author Schuller, Bjorn W.
Sharan, Roneel V.
Author_xml – sequence: 1
  givenname: Roneel V.
  surname: Sharan
  fullname: Sharan, Roneel V.
  email: roneel.sharan@essex.ac.uk
  organization: University of Essex,School of Computer Science and Electronic Engineering,Colchester,United Kingdom,CO4 3SQ
– sequence: 2
  givenname: Bjorn W.
  surname: Schuller
  fullname: Schuller, Bjorn W.
  email: bjoern.schuller@imperial.ac.uk
  organization: Imperial College London,GLAM - Group on Language, Audio, & Music,London,United Kingdom,SW7 2AZ
BookMark eNo1j8FOwzAQRA0CiVLyBxzCB6Tsep3YPkJooVIphxRxrJxkA0atUyXpoX9PJOA00mjejOZaXIQ2sBB3CDNEsPfLef44LzIExJkEqWajaZEgOxOR1dZQCmRIgT0XE5lqSgikuRJR338DAKHWaaom4qkIbcdx0R5DHec71_e-8ZUbfBviDz98xa-8S4oDV0PXfnZuH7sx5-KFD5xsjoFHaL2-EZeN2_Uc_elUvC_mm_wlWb09L_OHVeJRmyFhSTXwuG10pdCoypIzXKnSNVmNSjqjU0UGrSYoAZwzNWS6dBlkLEuJNBW3v72embeHzu9dd9r-H6cfLO5NWQ
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/IECBES61011.2024.10991306
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9798350383409
EISSN 2573-3028
EndPage 482
ExternalDocumentID 10991306
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
ID FETCH-LOGICAL-i178t-e23d0e03187c4184c93a8ec4baf6d142a87543819730b00aa8d067ba606e2b213
IEDL.DBID RIE
IngestDate Wed Aug 27 01:53:45 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i178t-e23d0e03187c4184c93a8ec4baf6d142a87543819730b00aa8d067ba606e2b213
PageCount 4
ParticipantIDs ieee_primary_10991306
PublicationCentury 2000
PublicationDate 2024-Dec.-11
PublicationDateYYYYMMDD 2024-12-11
PublicationDate_xml – month: 12
  year: 2024
  text: 2024-Dec.-11
  day: 11
PublicationDecade 2020
PublicationTitle IEEE-EMBS Conference on Biomedical Engineering and Sciences
PublicationTitleAbbrev IECBES
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003177554
Score 1.8949689
Snippet Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can...
SourceID ieee
SourceType Publisher
StartPage 479
SubjectTerms Accuracy
Acoustics
Annotations
Biomedical engineering
Convolutional neural network
Convolutional neural networks
fine-tuning
Mel-spectrogram
Mouth
Muscles
Nose
Sleep apnea
snore sound classification
Time-frequency analysis
Title Snore Sound Classification With Mel-Spectrogram and a Fine-Tuned CNN
URI https://ieeexplore.ieee.org/document/10991306
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5sD6IXXxXfrOB10ybZ5nG1tlShQWiLvZXZzQaLkookF3-9M0lbHyB4CyFZlp3MfvlmZ74BuIn9wNdZqKVCN5Aq8wKpYzQyJuqRZcifDAf0R0kwnKqHWXe2KlavamGstVXymXX4sjrLT5em5FBZm09xaM8NGtAIo6Au1toEVAgIQ8LGbbhe6Wi27_u92_6Y_g9cJoKectbv_-ikUgHJYA-S9RTq_JEXpyy0Yz5-qTP-e4770Pqq2ROPGzQ6gC2bH8LuN7nBI7gb5zSaGHMnJVF1w-Q8oco04mlRPIuRfZXckL6ok7YE0nMoBjSCnJS0H4tekrRgOuhPekO56qIgF24YFdJ6ftqx7LuhUcTnTOxjZI3SmAWpqzwkxsI6XzH5OvkgYpQSgmkkZmM97bn-MTTzZW5PQPjWYmhVZMKMkN2k2EHyd9S-6mrW5T-FFi_I_K0Wypiv1-Lsj_vnsMN24ewQ172AZvFe2kvC-EJfVbb9BNjVo5I
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB60go-Lr4pvV_C6aZNsXldrS6ttENpib2V3M8GipCLJxV_vbNLWBwjewhKGZWdnv53ZmW8AbiLXd1UaKC6k7XOROj5XkdQ8ItcjTaXZMiagP4j97ljcT7zJoli9rIVBxDL5DC3zWb7lJ3NdmFBZw7zi0Jnrr8OGJ4TwqnKtVUiFoDAgdNyE6wWTZqPXbt22h3RDsI0r6AhrKeFHL5USSjq7EC8nUWWQvFhFriz98Yuf8d-z3IP6V9Uee1zh0T6sYXYAO98IBw_hbpiRNDY0vZRY2Q_TZAqVymFPs_yZDfCVm5b0eZW2xST9J1mHJPBRQScya8VxHcad9qjV5Ys-CnxmB2HO0XGTJhrrDbQgj05HrgxRCyVTP7GFI8lnMUxfEVk7WaGUYUIYpiT5Nugox3aPoJbNMzwG5iLKAEWog5SwXSeyKcnipXKFpwwz_wnUzYJM3yqqjOlyLU7_GL-Cre5o0J_2e_HDGWwbHZlcEds-h1r-XuAFIX6uLks9fwKeC6bf
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE-EMBS+Conference+on+Biomedical+Engineering+and+Sciences&rft.atitle=Snore+Sound+Classification+With+Mel-Spectrogram+and+a+Fine-Tuned+CNN&rft.au=Sharan%2C+Roneel+V.&rft.au=Schuller%2C+Bjorn+W.&rft.date=2024-12-11&rft.pub=IEEE&rft.eissn=2573-3028&rft.spage=479&rft.epage=482&rft_id=info:doi/10.1109%2FIECBES61011.2024.10991306&rft.externalDocID=10991306