Snore Sound Classification With Mel-Spectrogram and a Fine-Tuned CNN
| Published in | IEEE-EMBS Conference on Biomedical Engineering and Sciences, pp. 479-482 |
|---|---|
| Main Authors | Sharan, Roneel V.; Schuller, Bjorn W. |
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 11 December 2024 |
| ISSN | 2573-3028 |
| DOI | 10.1109/IECBES61011.2024.10991306 |
| Abstract | Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can be due to factors such as relaxed throat muscles, excess tissue, nasal congestion, or structural abnormalities. While snoring is common and varies in intensity, it can sometimes signal a more serious condition like sleep apnea. Identifying the excitation location of snore sound is important for pinpointing the site of airway obstruction, leading to more targeted and effective treatments tailored to individual anatomical challenges. In this work, we propose a method for detecting the excitation location of snoring by frame-based classification on a dataset of 828 snore sounds from 219 subjects, with expert annotations into four distinct excitation locations. Each segmented snore sound is divided into frames and converted into a Mel-spectrogram, a time-frequency representation that serves as input to a pretrained convolutional neural network designed for audio classification. We fine-tune the network with a modified classification layer with inverse class weights to account for the class imbalance. Our method achieves an improvement of 6.60% in average classification accuracy over the baseline method, demonstrating its effectiveness in distinguishing snoring excitation locations based on acoustic characteristics. |
|---|---|
| Author | Schuller, Bjorn W.; Sharan, Roneel V. |
| Author_xml | – sequence: 1 givenname: Roneel V. surname: Sharan fullname: Sharan, Roneel V. email: roneel.sharan@essex.ac.uk organization: University of Essex,School of Computer Science and Electronic Engineering,Colchester,United Kingdom,CO4 3SQ – sequence: 2 givenname: Bjorn W. surname: Schuller fullname: Schuller, Bjorn W. email: bjoern.schuller@imperial.ac.uk organization: Imperial College London,GLAM - Group on Language, Audio, & Music,London,United Kingdom,SW7 2AZ |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IECBES61011.2024.10991306 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Xplore POP ALL; IEEE Xplore All Conference Proceedings; IEEE Electronic Library (IEL); IEEE Proceedings Order Plans (POP All) 1998-Present |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISBN | 9798350383409 |
| EISSN | 2573-3028 |
| EndPage | 482 |
| ExternalDocumentID | 10991306 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL |
| ID | FETCH-LOGICAL-i178t-e23d0e03187c4184c93a8ec4baf6d142a87543819730b00aa8d067ba606e2b213 |
| IEDL.DBID | RIE |
| IngestDate | Wed Aug 27 01:53:45 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i178t-e23d0e03187c4184c93a8ec4baf6d142a87543819730b00aa8d067ba606e2b213 |
| PageCount | 4 |
| ParticipantIDs | ieee_primary_10991306 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Dec.-11 |
| PublicationDateYYYYMMDD | 2024-12-11 |
| PublicationDate_xml | – month: 12 year: 2024 text: 2024-Dec.-11 day: 11 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE-EMBS Conference on Biomedical Engineering and Sciences |
| PublicationTitleAbbrev | IECBES |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0003177554 |
| Score | 1.8949689 |
| Snippet | Snoring occurs when airflow through the mouth and nose is partially obstructed during sleep, causing the surrounding tissues to vibrate. This obstruction can... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 479 |
| SubjectTerms | Accuracy; Acoustics; Annotations; Biomedical engineering; Convolutional neural network; Convolutional neural networks; fine-tuning; Mel-spectrogram; Mouth; Muscles; Nose; Sleep apnea; snore sound classification; Time-frequency analysis |
| Title | Snore Sound Classification With Mel-Spectrogram and a Fine-Tuned CNN |
| URI | https://ieeexplore.ieee.org/document/10991306 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5sD6IXXxXfrOB10ybZ5nG1tlShQWiLvZXZzQaLkookF3-9M0lbHyB4CyFZlp3MfvlmZ74BuIn9wNdZqKVCN5Aq8wKpYzQyJuqRZcifDAf0R0kwnKqHWXe2KlavamGstVXymXX4sjrLT5em5FBZm09xaM8NGtAIo6Au1toEVAgIQ8LGbbhe6Wi27_u92_6Y_g9cJoKectbv_-ikUgHJYA-S9RTq_JEXpyy0Yz5-qTP-e4770Pqq2ROPGzQ6gC2bH8LuN7nBI7gb5zSaGHMnJVF1w-Q8oco04mlRPIuRfZXckL6ok7YE0nMoBjSCnJS0H4tekrRgOuhPekO56qIgF24YFdJ6ftqx7LuhUcTnTOxjZI3SmAWpqzwkxsI6XzH5OvkgYpQSgmkkZmM97bn-MTTzZW5PQPjWYmhVZMKMkN2k2EHyd9S-6mrW5T-FFi_I_K0Wypiv1-Lsj_vnsMN24ewQ172AZvFe2kvC-EJfVbb9BNjVo5I |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB60go-Lr4pvV_C6aZNsXldrS6ttENpib2V3M8GipCLJxV_vbNLWBwjewhKGZWdnv53ZmW8AbiLXd1UaKC6k7XOROj5XkdQ8ItcjTaXZMiagP4j97ljcT7zJoli9rIVBxDL5DC3zWb7lJ3NdmFBZw7zi0Jnrr8OGJ4TwqnKtVUiFoDAgdNyE6wWTZqPXbt22h3RDsI0r6AhrKeFHL5USSjq7EC8nUWWQvFhFriz98Yuf8d-z3IP6V9Uee1zh0T6sYXYAO98IBw_hbpiRNDY0vZRY2Q_TZAqVymFPs_yZDfCVm5b0eZW2xST9J1mHJPBRQScya8VxHcad9qjV5Ys-CnxmB2HO0XGTJhrrDbQgj05HrgxRCyVTP7GFI8lnMUxfEVk7WaGUYUIYpiT5Nugox3aPoJbNMzwG5iLKAEWog5SwXSeyKcnipXKFpwwz_wnUzYJM3yqqjOlyLU7_GL-Cre5o0J_2e_HDGWwbHZlcEds-h1r-XuAFIX6uLks9fwKeC6bf |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE-EMBS+Conference+on+Biomedical+Engineering+and+Sciences&rft.atitle=Snore+Sound+Classification+With+Mel-Spectrogram+and+a+Fine-Tuned+CNN&rft.au=Sharan%2C+Roneel+V.&rft.au=Schuller%2C+Bjorn+W.&rft.date=2024-12-11&rft.pub=IEEE&rft.eissn=2573-3028&rft.spage=479&rft.epage=482&rft_id=info:doi/10.1109%2FIECBES61011.2024.10991306&rft.externalDocID=10991306 |
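
The abstract mentions two steps that a short example may help make concrete: dividing each segmented snore sound into frames, and weighting classes inversely to their frequency to offset class imbalance. The sketch below is illustrative only and is not taken from the paper. The function names, the frame length and hop parameters, the class labels, and the normalization `N / (K * n_c)` (total samples over number of classes times class count) are all assumptions; the record does not state which values or formula the authors used.

```python
from collections import Counter


def split_into_frames(samples, frame_len, hop):
    """Split a 1-D sequence into fixed-length frames.

    frame_len and hop are hypothetical parameters (in samples).
    A trailing partial frame is dropped.
    """
    if frame_len <= 0 or hop <= 0:
        raise ValueError("frame_len and hop must be positive")
    last_start = len(samples) - frame_len
    return [samples[s:s + frame_len] for s in range(0, last_start + 1, hop)]


def inverse_class_weights(labels):
    """Return a weight per class that is inversely proportional to its count.

    Assumed normalization: weight_c = N / (K * n_c), so a perfectly
    balanced dataset gives every class a weight of 1.0.
    """
    counts = Counter(labels)
    n_total = len(labels)
    n_classes = len(counts)
    return {c: n_total / (n_classes * n) for c, n in counts.items()}


# Toy signal of 10 samples, frames of 4 samples with a hop of 2.
frames = split_into_frames(list(range(10)), frame_len=4, hop=2)

# Toy frame labels with hypothetical class names "A" and "B".
weights = inverse_class_weights(["A", "A", "A", "B"])
```

In a fine-tuning setup, weights of this kind would typically be passed to a weighted cross-entropy loss so that errors on rare classes count more. Each frame would first be converted to a Mel-spectrogram before reaching the network, a step omitted here.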