Likelihood-based semi-supervised model selection with applications to speech processing

In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and o...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	White, Christopher M, Khudanpur, Sanjeev P, Wolfe, Patrick J
Format	Paper Journal Article
Language	English
Published	Ithaca Cornell University Library, arXiv.org 20.11.2009
Subjects	Automatic speech recognition Computer Science - Computation and Language Computer Science - Learning Labeling Labels Likelihood ratio Machine learning Minimax technique Pattern recognition Speech processing Statistical tests Statistics - Applications Statistics - Machine Learning Voice recognition
Online Access	Get full text
ISSN	2331-8422
DOI	10.48550/arxiv.0911.3944

Cover

Abstract	In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and other large-scale practical applications, however, such labeled development data are typically costly and difficult to obtain. This article proposes an alternative semi-supervised framework for likelihood-based model selection that leverages unlabeled data by using trained classifiers representing each model to automatically generate putative labels. The errors that result from this automatic labeling are shown to be amenable to results from robust statistics, which in turn provide for minimax-optimal censored likelihood ratio tests that recover the nonparametric sign test as a limiting case. This approach is then validated experimentally using a state-of-the-art automatic speech recognition system to select between candidate word pronunciations using unlabeled speech data that only potentially contain instances of the words under test. Results provide supporting evidence for the utility of this approach, and suggest that it may also find use in other applications of machine learning.
AbstractList	IEEE Journal of Selected Topics in Signal Processing, vol. 4, pp. 1016-1026, 2010 In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and other large-scale practical applications, however, such labeled development data are typically costly and difficult to obtain. This article proposes an alternative semi-supervised framework for likelihood-based model selection that leverages unlabeled data by using trained classifiers representing each model to automatically generate putative labels. The errors that result from this automatic labeling are shown to be amenable to results from robust statistics, which in turn provide for minimax-optimal censored likelihood ratio tests that recover the nonparametric sign test as a limiting case. This approach is then validated experimentally using a state-of-the-art automatic speech recognition system to select between candidate word pronunciations using unlabeled speech data that only potentially contain instances of the words under test. Results provide supporting evidence for the utility of this approach, and suggest that it may also find use in other applications of machine learning. In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and other large-scale practical applications, however, such labeled development data are typically costly and difficult to obtain. This article proposes an alternative semi-supervised framework for likelihood-based model selection that leverages unlabeled data by using trained classifiers representing each model to automatically generate putative labels. The errors that result from this automatic labeling are shown to be amenable to results from robust statistics, which in turn provide for minimax-optimal censored likelihood ratio tests that recover the nonparametric sign test as a limiting case. This approach is then validated experimentally using a state-of-the-art automatic speech recognition system to select between candidate word pronunciations using unlabeled speech data that only potentially contain instances of the words under test. Results provide supporting evidence for the utility of this approach, and suggest that it may also find use in other applications of machine learning.
Author	Khudanpur, Sanjeev P Wolfe, Patrick J White, Christopher M
Author_xml	– sequence: 1 givenname: Christopher surname: White middlename: M fullname: White, Christopher M – sequence: 2 givenname: Sanjeev surname: Khudanpur middlename: P fullname: Khudanpur, Sanjeev P – sequence: 3 givenname: Patrick surname: Wolfe middlename: J fullname: Wolfe, Patrick J
BackLink	https://doi.org/10.48550/arXiv.0911.3944$$DView paper in arXiv https://doi.org/10.1109/JSTSP.2010.2076050$$DView published paper (Access to full text may be restricted)
BookMark	eNotj0FLw0AUhBdRsNbePUnAc-Lu292ke5SiVSh4KXgML5sXuzXNxmxS9d-bWE_DDMMw3xU7b3xDjN0Inqil1vweu293TLgRIpFGqTM2AylFvFQAl2wRwp5zDmkGWssZe9u4D6rdzvsyLjBQGQU6uDgMLXVHN_mDL6ke05ps73wTfbl-F2Hb1s7iFISo91FoiewuajtvKQTXvF-ziwrrQIt_nbPt0-N29RxvXtcvq4dNjFpAXBmtVVohCZuhBWHAGtQ6s4BlpqzAyiCVhAZlVYiCp2iVsGhTAhAEpZyz29PsH3Tedu6A3U8-wecT_Fi4OxXGa58DhT7f-6Frxks58GXGMykVyF9jQGD_
ContentType	Paper Journal Article
Copyright	2009. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: 2009. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS AKY EPD GOX
DOI	10.48550/arxiv.0911.3944
DatabaseName	ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central Technology Collection (ProQuest) ProQuest One Community College ProQuest Central SciTech Premium Collection ProQuest Engineering Collection Engineering Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection arXiv Computer Science arXiv Statistics arXiv.org
DatabaseTitle	Publicly Available Content Database Engineering Database Technology Collection ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest Central (New) ProQuest One Academic ProQuest One Academic (New) Engineering Collection
DatabaseTitleList	Publicly Available Content Database
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Physics
EISSN	2331-8422
ExternalDocumentID	0911_3944
Genre	Working Paper/Pre-Print
GroupedDBID	8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS AKY EPD GOX
ID	FETCH-LOGICAL-a512-f95546fae1c7ac2192c9a557c2ad74c1af9aedea9a3fb1b06ac41cac6e221e2d3
IEDL.DBID	BENPR
IngestDate	Wed Jul 23 01:26:58 EDT 2025 Mon Jun 30 09:21:03 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-a512-f95546fae1c7ac2192c9a557c2ad74c1af9aedea9a3fb1b06ac41cac6e221e2d3
Notes	SourceType-Working Papers-1 ObjectType-Working Paper/Pre-Print-1 content type line 50
OpenAccessLink	https://www.proquest.com/docview/2087073342?pq-origsite=%requestingapplication%&accountid=15518
PQID	2087073342
PQPubID	2050157
ParticipantIDs	arxiv_primary_0911_3944 proquest_journals_2087073342
PublicationCentury	2000
PublicationDate	20091120
PublicationDateYYYYMMDD	2009-11-20
PublicationDate_xml	– month: 11 year: 2009 text: 20091120 day: 20
PublicationDecade	2000
PublicationPlace	Ithaca
PublicationPlace_xml	– name: Ithaca
PublicationTitle	arXiv.org
PublicationYear	2009
Publisher	Cornell University Library, arXiv.org
Publisher_xml	– name: Cornell University Library, arXiv.org
SSID	ssj0002672553
Score	1.4292324
SecondaryResourceType	preprint
Snippet	In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of... IEEE Journal of Selected Topics in Signal Processing, vol. 4, pp. 1016-1026, 2010 In conventional supervised pattern recognition tasks, model selection is...
SourceID	arxiv proquest
SourceType	Open Access Repository Aggregation Database
SubjectTerms	Automatic speech recognition Computer Science - Computation and Language Computer Science - Learning Labeling Labels Likelihood ratio Machine learning Minimax technique Pattern recognition Speech processing Statistical tests Statistics - Applications Statistics - Machine Learning Voice recognition
SummonAdditionalLinks	– databaseName: arXiv.org dbid: GOX link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV09T8MwELXaTiwIxFehgAdWQ-w4TTMiRFUhPpYiukVn5ywioI2aFvHz8TkpQkKMsezlOed39t29Y-wC5GiklXYCQZGoNkoKEiYicpLij1HkQhLNw-Nw8qzvZsmsw843tTCw_Co_G31gU195MpOXVLrZZV3vJ1At79OsCTYGJa52-s8072GGkT8Ha2CL8Q7bbt08ft3syy7r4HyPvdyXb_hekpCwIPYoeI0fpajXFRksfYe-NLwOvWk8YJxeSfnvGDNfLXhdIdpXXjUp_p569tl0fDu9mYi2sYEAz6_CZZQa5gClTcH6I0PZDJIktQqKVFsJLgMsEDKInZEmGoLV0oIdolISVREfsN58MccjxrXV_oIWGYACtHXGWKKXZKSsv2YpTPvsMACSV412RU5Q5QRVnw02EOXtb1vnKvLmm8axVsf_LjxhWyGiIqW3sAHrrZZrPPXEvDJnYXu-AQPpkDQ priority: 102 providerName: Cornell University
Title	Likelihood-based semi-supervised model selection with applications to speech processing
URI	https://www.proquest.com/docview/2087073342 https://arxiv.org/abs/0911.3944
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NS8NAEB1qi-DNb6u15OA1NbvZNOlBBKUfiK1FKvYWJpsNBrWNTSue_O3ubBMVBC-Bzd5md-ft7My8B3CGLAgEF4mtkBOptmKUJPRsJ2GUf3ScxBTRDEftwYO4mXrTCozKXhgqqyx9onHU8VzSG7kO0vXO8l1X8MvszSbVKMqulhIaWEgrxBeGYmwDapyYsapQu-qOxvffry687es7tLvOVxoyr3NcfKTvLQ2brEVNovqSav788c0GcHrbUBtjphY7UFGzXdg0dZoy34PH2_RZvaTERWwTAMVWrl5TO19ldOZpbKRtrNzI22ibW_TQav1OU1vLuZVnSsknK1t3CWj02odJrzu5HtiFNoKNGqLtpEPVZQkqJn2U2utw2UHP8yXH2BeSYdJBFSvsoJtELHLaKAWTKNuKc6Z47B5AdTafqSOwhBQ6xnMixBiFTKJIEkJ5AZc6UuPKr8OhMUiYrekvQjJVSKaqQ6M0UVjs_Dz8Wafj_6dPYMtkZhjTJ7UB1eVipU41wC-jJmwEvX6zWDs96t9N9Xf42f0CGoWq3A
linkProvider	ProQuest
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB5RoqrcoKUtr3YP7dHUu17H8QEh8VIoIUIoVblZ4_VYjYDExAHaH8d_Y2bjtEhIvXG0LfkwszvfvD-AL6g7HWtsGRAaWapNWoqEcRCWWuqPYVj6JprTfrv7w36_iC8W4GE-CyNtlXOb6A11MXaSI-cgnU9WEkXW7FY3gbBGSXV1TqGBDbVCseNXjDWDHSf0555DuHrn-ID1_dWYo8PBfjdoWAYCZLALylT6tEok7RJ0fH-NSzGOE2ewSKzTWKZIBWGKUZnrPGyjs9qha5MxmkwR8W9fQctGNuXYr7V32D87_5vkMe2EXfZoVh71u8O-4eT38G6bUVpvy0wq-8T-zTMo8Ph2tAytM6xosgILNHoLr31bqKvfwc_e8JKuhrL6OBC8K1RN18Ogvq3ExMizZ9JRtWfTYRUryeuqp1VxNR2ruiJyv1Q1G0pgsFyFwUsI6T0sjsYj-gjKOsshZZgjFmhdmedOADHuGMeBoaFkDT54gWTVbNtGJqLKRFRrsDkXUdZctDr7dyzW___5M7zpDk57We-4f7IBS74opDUbiU1YnE5uaYt9i2n-qdGgguyFz8wjp6TmTA
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Likelihood-based+semi-supervised+model+selection+with+applications+to+speech+processing&rft.jtitle=arXiv.org&rft.au=White%2C+Christopher+M&rft.au=Khudanpur%2C+Sanjeev+P&rft.au=Wolfe%2C+Patrick+J&rft.date=2009-11-20&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422&rft_id=info:doi/10.48550%2Farxiv.0911.3944