Robust Speech Endpoint Detection Based on Improved Adaptive Band-Partitioning Spectral Entropy
The performance of speech recognition system is often degraded in adverse environments. Accurate Speech endpoint detection is very important for robust speech recognition. In this paper, an improved adaptive band-partitioning spectral entropy algorithm was proposed for speech endpoint detection, whi...
        Saved in:
      
    
          | Published in | Bio-Inspired Computational Intelligence and Applications Vol. 4688; pp. 36 - 45 | 
|---|---|
| Main Authors | , , , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Germany
          Springer Berlin / Heidelberg
    
        2007
     Springer Berlin Heidelberg  | 
| Series | Lecture Notes in Computer Science | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 3540747680 9783540747680  | 
| ISSN | 0302-9743 1611-3349  | 
| DOI | 10.1007/978-3-540-74769-7_5 | 
Cover
| Abstract | The performance of speech recognition system is often degraded in adverse environments. Accurate Speech endpoint detection is very important for robust speech recognition. In this paper, an improved adaptive band-partitioning spectral entropy algorithm was proposed for speech endpoint detection, which utilized the weighted power spectral subtraction to boost up the signal-to-noise ratio (SNR) as well as keep the robustness. The idea of adaptive band-partitioning spectral entropy is to divide a frame into some sub-bands which the number of it could be selected adaptively, and calculate spectral entropy of them. Although it has good robustness, the accuracy degrades rapidly when the SNR are low. Therefore, the weighted power spectral subtraction is presented for reducing the spectral effects of acoustically added noise in speech. The speech recognition experiment results indicate that the recognition accuracy have improved well in adverse environments. | 
    
|---|---|
| AbstractList | The performance of speech recognition system is often degraded in adverse environments. Accurate Speech endpoint detection is very important for robust speech recognition. In this paper, an improved adaptive band-partitioning spectral entropy algorithm was proposed for speech endpoint detection, which utilized the weighted power spectral subtraction to boost up the signal-to-noise ratio (SNR) as well as keep the robustness. The idea of adaptive band-partitioning spectral entropy is to divide a frame into some sub-bands which the number of it could be selected adaptively, and calculate spectral entropy of them. Although it has good robustness, the accuracy degrades rapidly when the SNR are low. Therefore, the weighted power spectral subtraction is presented for reducing the spectral effects of acoustically added noise in speech. The speech recognition experiment results indicate that the recognition accuracy have improved well in adverse environments. | 
    
| Author | Xu, Bolin Zheng, Yu Liu, Huaping Li, Xin  | 
    
| Author_xml | – sequence: 1 givenname: Xin surname: Li fullname: Li, Xin organization: School of Electromechanical Engineering and Automation, Shanghai University, 200072 Shanghai, China – sequence: 2 givenname: Huaping surname: Liu fullname: Liu, Huaping organization: School of Electromechanical Engineering and Automation, Shanghai University, 200072 Shanghai, China – sequence: 3 givenname: Yu surname: Zheng fullname: Zheng, Yu organization: School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China – sequence: 4 givenname: Bolin surname: Xu fullname: Xu, Bolin organization: Department of Electronic and Information Engineering, Nanjing University, 210093 Nanjing, China  | 
    
| BookMark | eNp9UMtOwzAQNFAQbekXcMkPGPx-HEspUKkSiMcVy3EcGihJiN1K_D0OBYkTvng1uzO7MyMwqJvaA3CK0RlGSJ5rqSCFnCEomRQaSsP3wCShNGHfkNwHQywwhpQyfQBGvw2FBmCIKCJQS0aP0pBimjGi0TGYhPCK0qNYMKWG4Pm-yTchZg-t926Vzeuibao6Zpc-eherps4ubPBFlorFe9s121RPC9vGautTqy7gne1i1U9W9Usv42Jn10kodk37eQIOS7sOfvLzj8HT1fxxdgOXt9eL2XQJW6wlh1y6knvECyxdoXPhbU6lxoVkpfACE1oi64TAvGRWK0cYEyIvbaFyjh2hnI4B3umGtkt3-M7kTfMWDEamD9Ok2Aw1KR_znZxJYSYO2XGSrY-ND9H4nuR83Ttwq2TSd8EIiTFB3Ahi-J9F_5MoEpQgYlTaSb8A4KSGNQ | 
    
| ContentType | Book Chapter | 
    
| Copyright | Springer-Verlag Berlin Heidelberg 2007 | 
    
| Copyright_xml | – notice: Springer-Verlag Berlin Heidelberg 2007 | 
    
| DBID | FFUUA | 
    
| DEWEY | 006.3 | 
    
| DOI | 10.1007/978-3-540-74769-7_5 | 
    
| DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only | 
    
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Computer Science | 
    
| EISBN | 9783540747697 3540747699  | 
    
| EISSN | 1611-3349 | 
    
| Editor | Irwin, George W Fei, Minrui Ma, Shiwei  | 
    
| Editor_xml | – sequence: 1 fullname: Ma, Shiwei – sequence: 2 fullname: Irwin, George W – sequence: 3 fullname: Fei, Minrui  | 
    
| EndPage | 45 | 
    
| ExternalDocumentID | EBC6711205_62_55 EBC3063202_8_54  | 
    
| GroupedDBID | 089 0D6 0DA 0E8 2HV 38. A4J AABBV AAHDE AAJYQ AATVQ ABBUY ABBVZ ABCYT ABMNI ACDTA ACDUY ACFGI ADQVG AEDXK AEHEY AEJLV AEKFX AETDV AEZAY AGNDD AHMWK AHNNE ALMA_UNASSIGNED_HOLDINGS ATJMZ AZZ BBABE CZZ FFUUA IEZ IV0 JJU LZA MA. MW~ NUP SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z82 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 5QI 875 AASHB ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RNI RSU SVGTG VI1 ~02  | 
    
| ID | FETCH-LOGICAL-p1975-57cf5e05d17cd9b6eab3791d74f6e6123f0ac6615f4a98c24466bfad8b51c2353 | 
    
| ISBN | 3540747680 9783540747680  | 
    
| ISSN | 0302-9743 | 
    
| IngestDate | Wed Sep 17 03:24:14 EDT 2025 Tue Oct 21 09:40:10 EDT 2025 Tue Oct 21 00:17:42 EDT 2025  | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | false | 
    
| LCCallNum | Q342 -- .I5685 2007eb | 
    
| Language | English | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-p1975-57cf5e05d17cd9b6eab3791d74f6e6123f0ac6615f4a98c24466bfad8b51c2353 | 
    
| OCLC | 184944290 1113544862  | 
    
| PQID | EBC3063202_8_54 | 
    
| PageCount | 10 | 
    
| ParticipantIDs | springer_books_10_1007_978_3_540_74769_7_5 proquest_ebookcentralchapters_6711205_62_55 proquest_ebookcentralchapters_3063202_8_54  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2007 | 
    
| PublicationDateYYYYMMDD | 2007-01-01 | 
    
| PublicationDate_xml | – year: 2007 text: 2007  | 
    
| PublicationDecade | 2000 | 
    
| PublicationPlace | Germany | 
    
| PublicationPlace_xml | – name: Germany – name: Berlin, Heidelberg  | 
    
| PublicationSeriesTitle | Lecture Notes in Computer Science | 
    
| PublicationSubtitle | International Conference on Life System Modeling and Simulation, LSMS 2007, Shanghai, China, September 14-17, 2007. Proceedings | 
    
| PublicationTitle | Bio-Inspired Computational Intelligence and Applications | 
    
| PublicationYear | 2007 | 
    
| Publisher | Springer Berlin / Heidelberg Springer Berlin Heidelberg  | 
    
| Publisher_xml | – name: Springer Berlin / Heidelberg – name: Springer Berlin Heidelberg  | 
    
| RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Nierstrasz, Oscar Steffen, Bernhard Kittler, Josef Vardi, Moshe Y. Weikum, Gerhard Sudan, Madhu Naor, Moni Mitchell, John C. Terzopoulos, Demetri Pandu Rangan, C. Kanade, Takeo Hutchison, David Tygar, Doug  | 
    
| RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri – sequence: 13 givenname: Doug surname: Tygar fullname: Tygar, Doug – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard  | 
    
| SSID | ssj0000316488 ssj0002792  | 
    
| Score | 1.3525862 | 
    
| Snippet | The performance of speech recognition system is often degraded in adverse environments. Accurate Speech endpoint detection is very important for robust speech... | 
    
| SourceID | springer proquest  | 
    
| SourceType | Publisher | 
    
| StartPage | 36 | 
    
| SubjectTerms | Speech Enhancement Speech Recognition Speech Recognition System Speech Signal Voice Activity Detection  | 
    
| Title | Robust Speech Endpoint Detection Based on Improved Adaptive Band-Partitioning Spectral Entropy | 
    
| URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3063202&ppg=54 http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6711205&ppg=55 http://link.springer.com/10.1007/978-3-540-74769-7_5  | 
    
| Volume | 4688 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9swDBbSDAOGHfbGuhd02GmBB8c2ZfuwQzJkaIuuGLZ2yC4TJFtGc3GCxb70__R_jpQs59GhQHcxHEOwBZKhpI_kR8bei3ysdQZZALg6BokK0Q-KKAlKAFXkGTGEU73z1zNxdJGczGE-GFxvZS21jf5YXP2zruR_tIrPUK9UJXsHzfYvxQd4j_rFK2oYr3ub312Y1YVgF8vguKZAOYG0tjmDB_aOt3k2CRmfbIWp-xQcG8efLzY5OYvWLkStWvn1zCLKxrmDX61_NLfjptTuZ9vivi91u26oo70pLkezulwtF3WDHq0xrh_5FFfMkqITDsnA-0mpVjZ3aYqzDL6RGXuA-AeVgBJ7wIxy6TveA5KqWX867QIfZ8vG5pONfG8K76p2sIx0D8vwWOboFqqvDqrCU5BwLaB89Rd6djwbOWdpnDMXRNEYO0rUzkHHYmupd0SWNxaR_bwR-lYepBIO2AF-fcjuTWYnpz97KA_9okiyDUk9cTK64JWbUldS5KZ8v0fb3O-eCcuRHe99cefcsxeqtzug88fsIVXFcCpXQTk_YQNTP2WPvNx5J_dn7LezAu6sgHsr4L0VcGsFHG-8FXBvBfyGFXBvBbyzgufs4svs_PNR0PXwCFbjPIUA0qICE0I5Tosy18IoHaf5uEyTShii_qlCVeAeEapE5VkRUXqBrlSZaRgXUQzxCzasl7V5ybiJQwWV1rlKdZJkoICo0wifqLQAUxyyD15W0mYadOnNhZPMWuLpOI7CSGYSkkM2un2wSPEkEoIUkQTAV3vZSxq8lp7tG3UmY3xfKK3OJOrs1V0Gv2YPNn-EN2zY_GnNW9zmNvpdZ2Z_AV6eooM | 
    
| linkProvider | Library Specific Holdings | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Bio-Inspired+Computational+Intelligence+and+Applications&rft.au=Li%2C+Xin&rft.au=Liu%2C+Huaping&rft.au=Zheng%2C+Yu&rft.au=Xu%2C+Bolin&rft.atitle=Robust+Speech+Endpoint+Detection+Based+on+Improved+Adaptive+Band-Partitioning+Spectral+Entropy&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2007-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783540747680&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=36&rft.epage=45&rft_id=info:doi/10.1007%2F978-3-540-74769-7_5 | 
    
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F3063202-l.jpg http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6711205-l.jpg  |