Likelihood-based semi-supervised model selection with applications to speech processing
In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and o...
Saved in:
| Published in | arXiv.org |
|---|---|
| Main Authors | , , |
| Format | Paper Journal Article |
| Language | English |
| Published |
Ithaca
Cornell University Library, arXiv.org
20.11.2009
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 2331-8422 |
| DOI | 10.48550/arxiv.0911.3944 |
Cover
| Abstract | In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and other large-scale practical applications, however, such labeled development data are typically costly and difficult to obtain. This article proposes an alternative semi-supervised framework for likelihood-based model selection that leverages unlabeled data by using trained classifiers representing each model to automatically generate putative labels. The errors that result from this automatic labeling are shown to be amenable to results from robust statistics, which in turn provide for minimax-optimal censored likelihood ratio tests that recover the nonparametric sign test as a limiting case. This approach is then validated experimentally using a state-of-the-art automatic speech recognition system to select between candidate word pronunciations using unlabeled speech data that only potentially contain instances of the words under test. Results provide supporting evidence for the utility of this approach, and suggest that it may also find use in other applications of machine learning. |
|---|---|
| AbstractList | IEEE Journal of Selected Topics in Signal Processing, vol. 4, pp.
1016-1026, 2010 In conventional supervised pattern recognition tasks, model selection is
typically accomplished by minimizing the classification error rate on a set of
so-called development data, subject to ground-truth labeling by human experts
or some other means. In the context of speech processing systems and other
large-scale practical applications, however, such labeled development data are
typically costly and difficult to obtain. This article proposes an alternative
semi-supervised framework for likelihood-based model selection that leverages
unlabeled data by using trained classifiers representing each model to
automatically generate putative labels. The errors that result from this
automatic labeling are shown to be amenable to results from robust statistics,
which in turn provide for minimax-optimal censored likelihood ratio tests that
recover the nonparametric sign test as a limiting case. This approach is then
validated experimentally using a state-of-the-art automatic speech recognition
system to select between candidate word pronunciations using unlabeled speech
data that only potentially contain instances of the words under test. Results
provide supporting evidence for the utility of this approach, and suggest that
it may also find use in other applications of machine learning. In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some other means. In the context of speech processing systems and other large-scale practical applications, however, such labeled development data are typically costly and difficult to obtain. This article proposes an alternative semi-supervised framework for likelihood-based model selection that leverages unlabeled data by using trained classifiers representing each model to automatically generate putative labels. The errors that result from this automatic labeling are shown to be amenable to results from robust statistics, which in turn provide for minimax-optimal censored likelihood ratio tests that recover the nonparametric sign test as a limiting case. This approach is then validated experimentally using a state-of-the-art automatic speech recognition system to select between candidate word pronunciations using unlabeled speech data that only potentially contain instances of the words under test. Results provide supporting evidence for the utility of this approach, and suggest that it may also find use in other applications of machine learning. |
| Author | Khudanpur, Sanjeev P Wolfe, Patrick J White, Christopher M |
| Author_xml | – sequence: 1 givenname: Christopher surname: White middlename: M fullname: White, Christopher M – sequence: 2 givenname: Sanjeev surname: Khudanpur middlename: P fullname: Khudanpur, Sanjeev P – sequence: 3 givenname: Patrick surname: Wolfe middlename: J fullname: Wolfe, Patrick J |
| BackLink | https://doi.org/10.48550/arXiv.0911.3944$$DView paper in arXiv https://doi.org/10.1109/JSTSP.2010.2076050$$DView published paper (Access to full text may be restricted) |
| BookMark | eNotj0FLw0AUhBdRsNbePUnAc-Lu292ke5SiVSh4KXgML5sXuzXNxmxS9d-bWE_DDMMw3xU7b3xDjN0Inqil1vweu293TLgRIpFGqTM2AylFvFQAl2wRwp5zDmkGWssZe9u4D6rdzvsyLjBQGQU6uDgMLXVHN_mDL6ke05ps73wTfbl-F2Hb1s7iFISo91FoiewuajtvKQTXvF-ziwrrQIt_nbPt0-N29RxvXtcvq4dNjFpAXBmtVVohCZuhBWHAGtQ6s4BlpqzAyiCVhAZlVYiCp2iVsGhTAhAEpZyz29PsH3Tedu6A3U8-wecT_Fi4OxXGa58DhT7f-6Frxks58GXGMykVyF9jQGD_ |
| ContentType | Paper Journal Article |
| Copyright | 2009. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| Copyright_xml | – notice: 2009. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS AKY EPD GOX |
| DOI | 10.48550/arxiv.0911.3944 |
| DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central Technology Collection (ProQuest) ProQuest One Community College ProQuest Central SciTech Premium Collection ProQuest Engineering Collection Engineering Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection arXiv Computer Science arXiv Statistics arXiv.org |
| DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest Central (New) ProQuest One Academic ProQuest One Academic (New) Engineering Collection |
| DatabaseTitleList | Publicly Available Content Database |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Physics |
| EISSN | 2331-8422 |
| ExternalDocumentID | 0911_3944 |
| Genre | Working Paper/Pre-Print |
| GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS AKY EPD GOX |
| ID | FETCH-LOGICAL-a512-f95546fae1c7ac2192c9a557c2ad74c1af9aedea9a3fb1b06ac41cac6e221e2d3 |
| IEDL.DBID | BENPR |
| IngestDate | Wed Jul 23 01:26:58 EDT 2025 Mon Jun 30 09:21:03 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a512-f95546fae1c7ac2192c9a557c2ad74c1af9aedea9a3fb1b06ac41cac6e221e2d3 |
| Notes | SourceType-Working Papers-1 ObjectType-Working Paper/Pre-Print-1 content type line 50 |
| OpenAccessLink | https://www.proquest.com/docview/2087073342?pq-origsite=%requestingapplication%&accountid=15518 |
| PQID | 2087073342 |
| PQPubID | 2050157 |
| ParticipantIDs | arxiv_primary_0911_3944 proquest_journals_2087073342 |
| PublicationCentury | 2000 |
| PublicationDate | 20091120 |
| PublicationDateYYYYMMDD | 2009-11-20 |
| PublicationDate_xml | – month: 11 year: 2009 text: 20091120 day: 20 |
| PublicationDecade | 2000 |
| PublicationPlace | Ithaca |
| PublicationPlace_xml | – name: Ithaca |
| PublicationTitle | arXiv.org |
| PublicationYear | 2009 |
| Publisher | Cornell University Library, arXiv.org |
| Publisher_xml | – name: Cornell University Library, arXiv.org |
| SSID | ssj0002672553 |
| Score | 1.4292324 |
| SecondaryResourceType | preprint |
| Snippet | In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of... IEEE Journal of Selected Topics in Signal Processing, vol. 4, pp. 1016-1026, 2010 In conventional supervised pattern recognition tasks, model selection is... |
| SourceID | arxiv proquest |
| SourceType | Open Access Repository Aggregation Database |
| SubjectTerms | Automatic speech recognition Computer Science - Computation and Language Computer Science - Learning Labeling Labels Likelihood ratio Machine learning Minimax technique Pattern recognition Speech processing Statistical tests Statistics - Applications Statistics - Machine Learning Voice recognition |
| SummonAdditionalLinks | – databaseName: arXiv.org dbid: GOX link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV09T8MwELXaTiwIxFehgAdWQ-w4TTMiRFUhPpYiukVn5ywioI2aFvHz8TkpQkKMsezlOed39t29Y-wC5GiklXYCQZGoNkoKEiYicpLij1HkQhLNw-Nw8qzvZsmsw843tTCw_Co_G31gU195MpOXVLrZZV3vJ1At79OsCTYGJa52-s8072GGkT8Ha2CL8Q7bbt08ft3syy7r4HyPvdyXb_hekpCwIPYoeI0fpajXFRksfYe-NLwOvWk8YJxeSfnvGDNfLXhdIdpXXjUp_p569tl0fDu9mYi2sYEAz6_CZZQa5gClTcH6I0PZDJIktQqKVFsJLgMsEDKInZEmGoLV0oIdolISVREfsN58MccjxrXV_oIWGYACtHXGWKKXZKSsv2YpTPvsMACSV412RU5Q5QRVnw02EOXtb1vnKvLmm8axVsf_LjxhWyGiIqW3sAHrrZZrPPXEvDJnYXu-AQPpkDQ priority: 102 providerName: Cornell University |
| Title | Likelihood-based semi-supervised model selection with applications to speech processing |
| URI | https://www.proquest.com/docview/2087073342 https://arxiv.org/abs/0911.3944 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NS8NAEB1qi-DNb6u15OA1NbvZNOlBBKUfiK1FKvYWJpsNBrWNTSue_O3ubBMVBC-Bzd5md-ft7My8B3CGLAgEF4mtkBOptmKUJPRsJ2GUf3ScxBTRDEftwYO4mXrTCozKXhgqqyx9onHU8VzSG7kO0vXO8l1X8MvszSbVKMqulhIaWEgrxBeGYmwDapyYsapQu-qOxvffry687es7tLvOVxoyr3NcfKTvLQ2brEVNovqSav788c0GcHrbUBtjphY7UFGzXdg0dZoy34PH2_RZvaTERWwTAMVWrl5TO19ldOZpbKRtrNzI22ibW_TQav1OU1vLuZVnSsknK1t3CWj02odJrzu5HtiFNoKNGqLtpEPVZQkqJn2U2utw2UHP8yXH2BeSYdJBFSvsoJtELHLaKAWTKNuKc6Z47B5AdTafqSOwhBQ6xnMixBiFTKJIEkJ5AZc6UuPKr8OhMUiYrekvQjJVSKaqQ6M0UVjs_Dz8Wafj_6dPYMtkZhjTJ7UB1eVipU41wC-jJmwEvX6zWDs96t9N9Xf42f0CGoWq3A |
| linkProvider | ProQuest |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB5RoqrcoKUtr3YP7dHUu17H8QEh8VIoIUIoVblZ4_VYjYDExAHaH8d_Y2bjtEhIvXG0LfkwszvfvD-AL6g7HWtsGRAaWapNWoqEcRCWWuqPYVj6JprTfrv7w36_iC8W4GE-CyNtlXOb6A11MXaSI-cgnU9WEkXW7FY3gbBGSXV1TqGBDbVCseNXjDWDHSf0555DuHrn-ID1_dWYo8PBfjdoWAYCZLALylT6tEok7RJ0fH-NSzGOE2ewSKzTWKZIBWGKUZnrPGyjs9qha5MxmkwR8W9fQctGNuXYr7V32D87_5vkMe2EXfZoVh71u8O-4eT38G6bUVpvy0wq-8T-zTMo8Ph2tAytM6xosgILNHoLr31bqKvfwc_e8JKuhrL6OBC8K1RN18Ogvq3ExMizZ9JRtWfTYRUryeuqp1VxNR2ruiJyv1Q1G0pgsFyFwUsI6T0sjsYj-gjKOsshZZgjFmhdmedOADHuGMeBoaFkDT54gWTVbNtGJqLKRFRrsDkXUdZctDr7dyzW___5M7zpDk57We-4f7IBS74opDUbiU1YnE5uaYt9i2n-qdGgguyFz8wjp6TmTA |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Likelihood-based+semi-supervised+model+selection+with+applications+to+speech+processing&rft.jtitle=arXiv.org&rft.au=White%2C+Christopher+M&rft.au=Khudanpur%2C+Sanjeev+P&rft.au=Wolfe%2C+Patrick+J&rft.date=2009-11-20&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422&rft_id=info:doi/10.48550%2Farxiv.0911.3944 |