Robust Speech Endpoint Detection Based on Improved Adaptive Band-Partitioning Spectral Entropy

Bibliographic Details
Published in Bio-Inspired Computational Intelligence and Applications Vol. 4688; pp. 36-45
Main Authors Li, Xin, Liu, Huaping, Zheng, Yu, Xu, Bolin
Format Book Chapter
Language English
Published Germany Springer Berlin / Heidelberg 2007
Springer Berlin Heidelberg
Series Lecture Notes in Computer Science
ISBN 3540747680
9783540747680
ISSN 0302-9743
1611-3349
DOI 10.1007/978-3-540-74769-7_5


Abstract The performance of speech recognition systems is often degraded in adverse environments. Accurate speech endpoint detection is very important for robust speech recognition. In this paper, an improved adaptive band-partitioning spectral entropy algorithm is proposed for speech endpoint detection; it uses weighted power spectral subtraction to boost the signal-to-noise ratio (SNR) while preserving robustness. The idea of adaptive band-partitioning spectral entropy is to divide a frame into sub-bands, the number of which can be selected adaptively, and to calculate their spectral entropy. Although this method is robust, its accuracy degrades rapidly when the SNR is low. Therefore, weighted power spectral subtraction is introduced to reduce the spectral effects of acoustically added noise in speech. Speech recognition experiments indicate that recognition accuracy improves considerably in adverse environments.
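The two ingredients named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the rule for choosing the number of sub-bands or the subtraction weights, so num_bands, alpha and floor below are hypothetical, caller-supplied parameters, and the sub-bands are assumed to be of equal width. Only the Python standard library is used.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum over the non-negative frequency bins."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_subtract(power, noise_power, alpha=1.0, floor=0.01):
    """Weighted power spectral subtraction with a spectral floor.

    alpha (noise weight) and floor are illustrative assumptions,
    not values taken from the paper.
    """
    return [max(p - alpha * n, floor * p)
            for p, n in zip(power, noise_power)]


def band_spectral_entropy(power, num_bands):
    """Entropy of the energy distribution over equal-width sub-bands.

    num_bands is supplied by the caller; the paper's adaptive
    selection rule is not described in the abstract.
    """
    width = len(power) // num_bands
    if width == 0:
        raise ValueError("num_bands exceeds the number of spectral bins")
    # Bins left over after the last full band are ignored.
    energies = [sum(power[b * width:(b + 1) * width])
                for b in range(num_bands)]
    total = sum(energies)
    if total <= 0.0:
        return 0.0
    probs = [e / total for e in energies]
    return -sum(p * math.log(p) for p in probs if p > 0.0)
```

In this sketch a flat spectrum yields the maximum entropy, log(num_bands), while a spectrum concentrated in a single sub-band yields zero. Endpoint detectors of this family typically compare the per-frame entropy against a threshold; how the paper sets that threshold is not stated in this record.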
Author Xu, Bolin
Zheng, Yu
Liu, Huaping
Li, Xin
Author_xml – sequence: 1
  givenname: Xin
  surname: Li
  fullname: Li, Xin
  organization: School of Electromechanical Engineering and Automation, Shanghai University, 200072 Shanghai, China
– sequence: 2
  givenname: Huaping
  surname: Liu
  fullname: Liu, Huaping
  organization: School of Electromechanical Engineering and Automation, Shanghai University, 200072 Shanghai, China
– sequence: 3
  givenname: Yu
  surname: Zheng
  fullname: Zheng, Yu
  organization: School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China
– sequence: 4
  givenname: Bolin
  surname: Xu
  fullname: Xu, Bolin
  organization: Department of Electronic and Information Engineering, Nanjing University, 210093 Nanjing, China
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2007
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2007
DEWEY 006.3
DOI 10.1007/978-3-540-74769-7_5
Discipline Computer Science
EISBN 9783540747697
3540747699
EISSN 1611-3349
Editor Irwin, George W
Fei, Minrui
Ma, Shiwei
Editor_xml – sequence: 1
  fullname: Ma, Shiwei
– sequence: 2
  fullname: Irwin, George W
– sequence: 3
  fullname: Fei, Minrui
EndPage 45
ISBN 3540747680
9783540747680
ISSN 0302-9743
IsPeerReviewed false
IsScholarly false
LCCallNum Q342 -- .I5685 2007eb
Language English
OCLC 184944290
1113544862
PQID EBC3063202_8_54
PageCount 10
PublicationCentury 2000
PublicationDate 2007
PublicationDateYYYYMMDD 2007-01-01
PublicationDate_xml – year: 2007
  text: 2007
PublicationDecade 2000
PublicationPlace Germany
PublicationPlace_xml – name: Germany
– name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle International Conference on Life System Modeling and Simulation, LSMS 2007, Shanghai, China, September 14-17, 2007. Proceedings
PublicationTitle Bio-Inspired Computational Intelligence and Applications
PublicationYear 2007
Publisher Springer Berlin / Heidelberg
Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin / Heidelberg
– name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
StartPage 36
SubjectTerms Speech Enhancement
Speech Recognition
Speech Recognition System
Speech Signal
Voice Activity Detection
Title Robust Speech Endpoint Detection Based on Improved Adaptive Band-Partitioning Spectral Entropy
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3063202&ppg=54
http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6711205&ppg=55
http://link.springer.com/10.1007/978-3-540-74769-7_5
Volume 4688