Mathematical Theory of Machine Learning Models for Estimating Probability Distributions
The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its succes...
        Saved in:
      
    
          | Main Author | |
|---|---|
| Format | Dissertation | 
| Language | English | 
| Published | 
            ProQuest Dissertations & Theses
    
        01.01.2023
     | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9798379720971 | 
Cover
| Abstract | The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: In theory, the model is trained to be exactly the same as the empirical distribution of the finite samples, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Meanwhile, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper aims at solving these problems. First, we provide a mathematical framework from which all the well-known models can be derived. The main factor is whether to use reweighting or transport for the modeling and evaluation of distributions. It leads to different choices of distribution representations and loss types, and their combinations give rise to the diversity of distribution learning models. Beyond a categorization, this perspective greatly facilitates our analysis of training and generalization, such that our proof techniques become applicable to broad categories of models instead of particular instances. Second, we resolve the aforementioned paradox by showing that both generalization and memorization will take place, but over different time scales. On one hand, the models satisfy the property of universal convergence, so their concentration onto the empirical distribution is inevitable in the long term. On the other hand, these models enjoy implicit regularization during training, so that their generalization errors at early-stopping escape from the curse of dimensionality. Third, we obtain comprehensive results on the training behavior of distribution learning models. For models with either the potential representation or fixed generator representation, we establish global convergence to the target distributions. For models with the free generator representation, we show that they all possess a large family of spurious critical points, which sheds light on the training difficulty of these models. Furthermore, we uncover the mechanisms underlying the mode collapse phenomenon that disrupts the training of the generative adversarial networks. | 
    
|---|---|
| AbstractList | The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: In theory, the model is trained to be exactly the same as the empirical distribution of the finite samples, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Meanwhile, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper aims at solving these problems. First, we provide a mathematical framework from which all the well-known models can be derived. The main factor is whether to use reweighting or transport for the modeling and evaluation of distributions. It leads to different choices of distribution representations and loss types, and their combinations give rise to the diversity of distribution learning models. Beyond a categorization, this perspective greatly facilitates our analysis of training and generalization, such that our proof techniques become applicable to broad categories of models instead of particular instances. Second, we resolve the aforementioned paradox by showing that both generalization and memorization will take place, but over different time scales. On one hand, the models satisfy the property of universal convergence, so their concentration onto the empirical distribution is inevitable in the long term. On the other hand, these models enjoy implicit regularization during training, so that their generalization errors at early-stopping escape from the curse of dimensionality. Third, we obtain comprehensive results on the training behavior of distribution learning models. For models with either the potential representation or fixed generator representation, we establish global convergence to the target distributions. For models with the free generator representation, we show that they all possess a large family of spurious critical points, which sheds light on the training difficulty of these models. Furthermore, we uncover the mechanisms underlying the mode collapse phenomenon that disrupts the training of the generative adversarial networks. | 
    
| Author | Yang, Hongkang | 
    
| Author_xml | – sequence: 1 givenname: Hongkang surname: Yang fullname: Yang, Hongkang  | 
    
| BookMark | eNotj01LxDAYhAMqqGv_Q8BzIU26m-Qo6_oBLXooeFzeNG-2kZpokj3sv7fiXmbggZlhbslliAEvSKWlVkJqyZmWzTWpcvaGMaaFYC2_IR89lAm_oPgRZjpMGNOJRkd7GCcfkHYIKfhwoH20OGfqYqK7XPxfYqHvKRowfvblRB99LsmbY_Ex5Dty5WDOWJ19RYan3bB9qbu359ftQ1cfNpzXa6FGORqmrbHONVwrubYoBXeKW4GNNMAWqCQC4KIOrADDUSoztsstsSL3_7XfKf4cMZf9ZzymsCzuueKbhrWN5OIXdYFRCQ | 
    
| ContentType | Dissertation | 
    
| Copyright | Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works. | 
    
| Copyright_xml | – notice: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works. | 
    
| DBID | 050 053 0BH 0JN AAFGM ABUWG ADZZV AFKRA AGAJT AQTIP AZQEC BENPR CBPLH CCPQU DWQXO EU9 G20 M8- PHGZM PHGZT PIMPY PKEHL PQCXX PQEST PQQKQ PQUKI PRINS  | 
    
| DatabaseName | Dissertations & Theses Europe Full Text: Health & Medicine Dissertations & Theses Europe Full Text: Science & Technology ProQuest Dissertations and Theses Professional Dissertations & Theses @ Princeton University ProQuest Central Korea - hybrid linking ProQuest Central (Alumni) ProQuest Central (Alumni) - hybrid linking ProQuest Central UK/Ireland ProQuest Central Essentials - hybrid linking ProQuest Women's & Gender Studies - hybrid linking ProQuest Central Essentials ProQuest Central ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest One Community College ProQuest Central ProQuest Dissertations & Theses A&I ProQuest Dissertations & Theses Global ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest Central - hybrid linking ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China  | 
    
| DatabaseTitle | Publicly Available Content Database Dissertations & Theses Europe Full Text: Health & Medicine ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest Dissertations & Theses: Open ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest Dissertations and Theses Professional ProQuest Central China ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest Dissertations & Theses Global Dissertations & Theses Europe Full Text: Science & Technology ProQuest Central Dissertations & Theses @ Princeton University ProQuest One Academic UKI Edition ProQuest Central Korea ProQuest Central (New) ProQuest One Academic ProQuest Dissertations & Theses A&I ProQuest One Academic (New)  | 
    
| DatabaseTitleList | Publicly Available Content Database | 
    
| Database_xml | – sequence: 1 dbid: BENPR name: ProQuest Central url: http://www.proquest.com/pqcentral?accountid=15518 sourceTypes: Aggregation Database  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Statistics Computer Science  | 
    
| Genre | Dissertation/Thesis | 
    
| GroupedDBID | 050 053 0BH 0JN 8R4 8R5 ABUWG AFKRA AZQEC BENPR CBPLH CCPQU DWQXO EU9 G20 M8- PHGZM PHGZT PIMPY PKEHL PQEST PQQKQ PQUKI PRINS Q2X  | 
    
| ID | FETCH-LOGICAL-g622-538c7cb09dbdff129875de732f82d3e17ba029887eaae87efad3ab2e78bc48373 | 
    
| IEDL.DBID | BENPR | 
    
| ISBN | 9798379720971 | 
    
| IngestDate | Sun Jun 29 12:41:03 EDT 2025 | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | false | 
    
| Language | English | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-g622-538c7cb09dbdff129875de732f82d3e17ba029887eaae87efad3ab2e78bc48373 | 
    
| Notes | SourceType-Dissertations & Theses-1 ObjectType-Dissertation/Thesis-1 content type line 12  | 
    
| OpenAccessLink | https://www.proquest.com/docview/2826104172?pq-origsite=%requestingapplication%&accountid=15518 | 
    
| PQID | 2826104172 | 
    
| PQPubID | 18750 | 
    
| ParticipantIDs | proquest_journals_2826104172 | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 20230101 | 
    
| PublicationDateYYYYMMDD | 2023-01-01 | 
    
| PublicationDate_xml | – month: 01 year: 2023 text: 20230101 day: 01  | 
    
| PublicationDecade | 2020 | 
    
| PublicationYear | 2023 | 
    
| Publisher | ProQuest Dissertations & Theses | 
    
| Publisher_xml | – name: ProQuest Dissertations & Theses | 
    
| SSID | ssib000933042 | 
    
| Score | 1.9755247 | 
    
| Snippet | The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by... | 
    
| SourceID | proquest | 
    
| SourceType | Aggregation Database | 
    
| SubjectTerms | Applied Mathematics Computer science Statistics  | 
    
| Title | Mathematical Theory of Machine Learning Models for Estimating Probability Distributions | 
    
| URI | https://www.proquest.com/docview/2826104172 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NTwIxEJ0gXIgmKmr8QNOD143Sdml7MCYKhJhAiMHIjfSTiwEFPPjv7SxdNSHxsodueunHvL7O9D2Aax4XCuXGZrlECzNhVCaDyDPEFsXythShqPIdtvsv_GmSTyowLN_CYFllGROLQO0WFu_IbyI1iEjPI97ev39k6BqF2dXSQkMnawV3V0iM7UCNojJWFWoP3eHoeZu_7ymhIjdTgqKI0lYgLtCldwC7nT9Z8UOo-HkD9ku_BZK2XwPqeDLcCCsfwevgR3BVv5HNE3uyCGRQVEd6koRTZwTdzt5WJB5OSTf2xR6xdbSMG7kojP0iHRTPTb5Xq2MY97rjx36WXBKyWRuJJJNWWHOrnHEhxBGOBMR5wWiQ1DHfEkajyroUXmsfv0E7pg31QhqLavLsBKrzxdyfAhGOW4eCgcxKLr03yphWUBxzfY7m9Aya5ShN00pfTX_n5fz_3xdQR6v2zfVFE6rr5ae_jIC-Nldplr4BmyKjKQ | 
    
| linkProvider | ProQuest | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NS8MwFH_odlAU1Kn4bQ56LGqaLslhCLrJpm6ITPQ2mibZZWy6KeIf5__me12qguBtlx7appTk5X3n9wM4EigoXJgsShRRmEmjI-VlEpFt0XFSVdLnXb6davNBXD8lT3PwWZyFobbKQifmitqOMsqRn2BogJZeoL09f36JiDWKqqsFhUYaqBVsLYcYCwc7btzHO4Zwk1qrjut9zPlVo3vZjALLQNSvUiAWq0xm5lRbY73HP0QH3joZc6-4jd2ZNCmhlCvp0tTh1ac2Tg13UpmM0Nhj_Ow8lEUsNMZ-5YtG5-7-b7pgWUuNL2vJCbPpj97PjdnVKizVfxXh12DODSuwUtA7sLDbK7BIjugUx3kdHtvf-K7pgE1P9LORZ-28GdOxgNPaZ0SuNpgw9IVZA8fSCLx7N0a9kffhfrA6YfUGmq3JBnRnMV2bUBqOhm4LmLQis4RPGGdKKOeMNubMa0GlRcsTvg17xSz1wsaa9H7EYOf_x4ew0Oy2b3u3rc7NLiwSS_w0c7IHpdfxm9tHX-LVHIQVY9CbsYx8AbID4LM | 
    
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT8JAEJ4gJMZooqLGB-oe9Nio25bdHoiJAgERQgxGb6Tb3eVCQAFj-In-K2fKVk1IvHHpoe02ze7szOw8vg_gIkBB4YFKvFAShZlQkSetCD2yLZEflqWwaZVvp9x4Dh5ew9ccfGW9MFRWmenEVFHrcUIx8is8GqClD9DeXllXFtGt1m_f3j1ikKJMa0anETuaBV1J4cZck0fLzD_xODetNKu49pec12u9-4bnGAe8QZkOZb5MRKKuI620tfi36MxrI3xuJde-uREqJsRyKUwcG7zaWPux4kZIlRAyu4-fXYMC5b5QRxTuap3u03LoYCsSEb4cCU74TUs2IDVs9R3YrP5JyO9CzoyKsJ1RPTC384uwQU7pAtN5D17aP1iv8ZAtuvvZ2LJ2WphpmMNsHTAiWhtOGfrFrIZjaQTe7U5Qh6Q1uXNWJdxeR7k13YfeKqbrAPKj8cgcAhM6SDRhFfqJDKQxKlLqxkYBpRk1D_kRlLJZ6rtNNu3_isTx_4_PYR1lpf_Y7LROYIMI4xdBlBLkZ5MPc4puxUyduQVj0F-xiHwDcL7k4g | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Mathematical+Theory+of+Machine+Learning+Models+for+Estimating+Probability+Distributions&rft.DBID=050%3B053%3B0BH%3B0JN%3BAAFGM%3BABUWG%3BADZZV%3BAFKRA%3BAGAJT%3BAQTIP%3BAZQEC%3BBENPR%3BCBPLH%3BCCPQU%3BDWQXO%3BEU9%3BG20%3BM8-%3BPHGZM%3BPHGZT%3BPIMPY%3BPKEHL%3BPQCXX%3BPQEST%3BPQQKQ%3BPQUKI%3BPRINS&rft.PQPubID=18750&rft.au=Yang%2C+Hongkang&rft.date=2023-01-01&rft.pub=ProQuest+Dissertations+%26+Theses&rft.isbn=9798379720971&rft.externalDBID=HAS_PDF_LINK | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/lc.gif&client=summon&freeimage=true | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/mc.gif&client=summon&freeimage=true | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/sc.gif&client=summon&freeimage=true |