Mathematical Theory of Machine Learning Models for Estimating Probability Distributions

The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its succes...

Full description

Saved in:
Bibliographic Details
Main Author Yang, Hongkang
Format Dissertation
LanguageEnglish
Published ProQuest Dissertations & Theses 01.01.2023
Subjects
Online AccessGet full text
ISBN9798379720971

Cover

Abstract The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: In theory, the model is trained to be exactly the same as the empirical distribution of the finite samples, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Meanwhile, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper aims at solving these problems. First, we provide a mathematical framework from which all the well-known models can be derived. The main factor is whether to use reweighting or transport for the modeling and evaluation of distributions. It leads to different choices of distribution representations and loss types, and their combinations give rise to the diversity of distribution learning models. Beyond a categorization, this perspective greatly facilitates our analysis of training and generalization, such that our proof techniques become applicable to broad categories of models instead of particular instances. Second, we resolve the aforementioned paradox by showing that both generalization and memorization will take place, but over different time scales. On one hand, the models satisfy the property of universal convergence, so their concentration onto the empirical distribution is inevitable in the long term. On the other hand, these models enjoy implicit regularization during training, so that their generalization errors at early-stopping escape from the curse of dimensionality. Third, we obtain comprehensive results on the training behavior of distribution learning models. For models with either the potential representation or fixed generator representation, we establish global convergence to the target distributions. For models with the free generator representation, we show that they all possess a large family of spurious critical points, which sheds light on the training difficulty of these models. Furthermore, we uncover the mechanisms underlying the mode collapse phenomenon that disrupts the training of the generative adversarial networks.
AbstractList The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: In theory, the model is trained to be exactly the same as the empirical distribution of the finite samples, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Meanwhile, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper aims at solving these problems. First, we provide a mathematical framework from which all the well-known models can be derived. The main factor is whether to use reweighting or transport for the modeling and evaluation of distributions. It leads to different choices of distribution representations and loss types, and their combinations give rise to the diversity of distribution learning models. Beyond a categorization, this perspective greatly facilitates our analysis of training and generalization, such that our proof techniques become applicable to broad categories of models instead of particular instances. Second, we resolve the aforementioned paradox by showing that both generalization and memorization will take place, but over different time scales. On one hand, the models satisfy the property of universal convergence, so their concentration onto the empirical distribution is inevitable in the long term. On the other hand, these models enjoy implicit regularization during training, so that their generalization errors at early-stopping escape from the curse of dimensionality. Third, we obtain comprehensive results on the training behavior of distribution learning models. For models with either the potential representation or fixed generator representation, we establish global convergence to the target distributions. For models with the free generator representation, we show that they all possess a large family of spurious critical points, which sheds light on the training difficulty of these models. Furthermore, we uncover the mechanisms underlying the mode collapse phenomenon that disrupts the training of the generative adversarial networks.
Author Yang, Hongkang
Author_xml – sequence: 1
  givenname: Hongkang
  surname: Yang
  fullname: Yang, Hongkang
BookMark eNotj01LxDAYhAMqqGv_Q8BzIU26m-Qo6_oBLXooeFzeNG-2kZpokj3sv7fiXmbggZlhbslliAEvSKWlVkJqyZmWzTWpcvaGMaaFYC2_IR89lAm_oPgRZjpMGNOJRkd7GCcfkHYIKfhwoH20OGfqYqK7XPxfYqHvKRowfvblRB99LsmbY_Ex5Dty5WDOWJ19RYan3bB9qbu359ftQ1cfNpzXa6FGORqmrbHONVwrubYoBXeKW4GNNMAWqCQC4KIOrADDUSoztsstsSL3_7XfKf4cMZf9ZzymsCzuueKbhrWN5OIXdYFRCQ
ContentType Dissertation
Copyright Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Copyright_xml – notice: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
DBID 050
053
0BH
0JN
AAFGM
ABUWG
ADZZV
AFKRA
AGAJT
AQTIP
AZQEC
BENPR
CBPLH
CCPQU
DWQXO
EU9
G20
M8-
PHGZM
PHGZT
PIMPY
PKEHL
PQCXX
PQEST
PQQKQ
PQUKI
PRINS
DatabaseName Dissertations & Theses Europe Full Text: Health & Medicine
Dissertations & Theses Europe Full Text: Science & Technology
ProQuest Dissertations and Theses Professional
Dissertations & Theses @ Princeton University
ProQuest Central Korea - hybrid linking
ProQuest Central (Alumni)
ProQuest Central (Alumni) - hybrid linking
ProQuest Central UK/Ireland
ProQuest Central Essentials - hybrid linking
ProQuest Women's & Gender Studies - hybrid linking
ProQuest Central Essentials
ProQuest Central
ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection
ProQuest One Community College
ProQuest Central
ProQuest Dissertations & Theses A&I
ProQuest Dissertations & Theses Global
ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest Central - hybrid linking
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
DatabaseTitle Publicly Available Content Database
Dissertations & Theses Europe Full Text: Health & Medicine
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest Dissertations & Theses: Open
ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection
ProQuest Dissertations and Theses Professional
ProQuest Central China
ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection
ProQuest Dissertations & Theses Global
Dissertations & Theses Europe Full Text: Science & Technology
ProQuest Central
Dissertations & Theses @ Princeton University
ProQuest One Academic UKI Edition
ProQuest Central Korea
ProQuest Central (New)
ProQuest One Academic
ProQuest Dissertations & Theses A&I
ProQuest One Academic (New)
DatabaseTitleList Publicly Available Content Database
Database_xml – sequence: 1
  dbid: BENPR
  name: ProQuest Central
  url: http://www.proquest.com/pqcentral?accountid=15518
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Statistics
Computer Science
Genre Dissertation/Thesis
GroupedDBID 050
053
0BH
0JN
8R4
8R5
ABUWG
AFKRA
AZQEC
BENPR
CBPLH
CCPQU
DWQXO
EU9
G20
M8-
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
Q2X
ID FETCH-LOGICAL-g622-538c7cb09dbdff129875de732f82d3e17ba029887eaae87efad3ab2e78bc48373
IEDL.DBID BENPR
ISBN 9798379720971
IngestDate Sun Jun 29 12:41:03 EDT 2025
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-g622-538c7cb09dbdff129875de732f82d3e17ba029887eaae87efad3ab2e78bc48373
Notes SourceType-Dissertations & Theses-1
ObjectType-Dissertation/Thesis-1
content type line 12
OpenAccessLink https://www.proquest.com/docview/2826104172?pq-origsite=%requestingapplication%&accountid=15518
PQID 2826104172
PQPubID 18750
ParticipantIDs proquest_journals_2826104172
PublicationCentury 2000
PublicationDate 20230101
PublicationDateYYYYMMDD 2023-01-01
PublicationDate_xml – month: 01
  year: 2023
  text: 20230101
  day: 01
PublicationDecade 2020
PublicationYear 2023
Publisher ProQuest Dissertations & Theses
Publisher_xml – name: ProQuest Dissertations & Theses
SSID ssib000933042
Score 1.9755247
Snippet The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by...
SourceID proquest
SourceType Aggregation Database
SubjectTerms Applied Mathematics
Computer science
Statistics
Title Mathematical Theory of Machine Learning Models for Estimating Probability Distributions
URI https://www.proquest.com/docview/2826104172
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NTwIxEJ0gXIgmKmr8QNOD143Sdml7MCYKhJhAiMHIjfSTiwEFPPjv7SxdNSHxsodueunHvL7O9D2Aax4XCuXGZrlECzNhVCaDyDPEFsXythShqPIdtvsv_GmSTyowLN_CYFllGROLQO0WFu_IbyI1iEjPI97ev39k6BqF2dXSQkMnawV3V0iM7UCNojJWFWoP3eHoeZu_7ymhIjdTgqKI0lYgLtCldwC7nT9Z8UOo-HkD9ku_BZK2XwPqeDLcCCsfwevgR3BVv5HNE3uyCGRQVEd6koRTZwTdzt5WJB5OSTf2xR6xdbSMG7kojP0iHRTPTb5Xq2MY97rjx36WXBKyWRuJJJNWWHOrnHEhxBGOBMR5wWiQ1DHfEkajyroUXmsfv0E7pg31QhqLavLsBKrzxdyfAhGOW4eCgcxKLr03yphWUBxzfY7m9Aya5ShN00pfTX_n5fz_3xdQR6v2zfVFE6rr5ae_jIC-Nldplr4BmyKjKQ
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NS8MwFH_odlAU1Kn4bQ56LGqaLslhCLrJpm6ITPQ2mibZZWy6KeIf5__me12qguBtlx7appTk5X3n9wM4EigoXJgsShRRmEmjI-VlEpFt0XFSVdLnXb6davNBXD8lT3PwWZyFobbKQifmitqOMsqRn2BogJZeoL09f36JiDWKqqsFhUYaqBVsLYcYCwc7btzHO4Zwk1qrjut9zPlVo3vZjALLQNSvUiAWq0xm5lRbY73HP0QH3joZc6-4jd2ZNCmhlCvp0tTh1ac2Tg13UpmM0Nhj_Ow8lEUsNMZ-5YtG5-7-b7pgWUuNL2vJCbPpj97PjdnVKizVfxXh12DODSuwUtA7sLDbK7BIjugUx3kdHtvf-K7pgE1P9LORZ-28GdOxgNPaZ0SuNpgw9IVZA8fSCLx7N0a9kffhfrA6YfUGmq3JBnRnMV2bUBqOhm4LmLQis4RPGGdKKOeMNubMa0GlRcsTvg17xSz1wsaa9H7EYOf_x4ew0Oy2b3u3rc7NLiwSS_w0c7IHpdfxm9tHX-LVHIQVY9CbsYx8AbID4LM
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT8JAEJ4gJMZooqLGB-oe9Nio25bdHoiJAgERQgxGb6Tb3eVCQAFj-In-K2fKVk1IvHHpoe02ze7szOw8vg_gIkBB4YFKvFAShZlQkSetCD2yLZEflqWwaZVvp9x4Dh5ew9ccfGW9MFRWmenEVFHrcUIx8is8GqClD9DeXllXFtGt1m_f3j1ikKJMa0anETuaBV1J4cZck0fLzD_xODetNKu49pec12u9-4bnGAe8QZkOZb5MRKKuI620tfi36MxrI3xuJde-uREqJsRyKUwcG7zaWPux4kZIlRAyu4-fXYMC5b5QRxTuap3u03LoYCsSEb4cCU74TUs2IDVs9R3YrP5JyO9CzoyKsJ1RPTC384uwQU7pAtN5D17aP1iv8ZAtuvvZ2LJ2WphpmMNsHTAiWhtOGfrFrIZjaQTe7U5Qh6Q1uXNWJdxeR7k13YfeKqbrAPKj8cgcAhM6SDRhFfqJDKQxKlLqxkYBpRk1D_kRlLJZ6rtNNu3_isTx_4_PYR1lpf_Y7LROYIMI4xdBlBLkZ5MPc4puxUyduQVj0F-xiHwDcL7k4g
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Mathematical+Theory+of+Machine+Learning+Models+for+Estimating+Probability+Distributions&rft.DBID=050%3B053%3B0BH%3B0JN%3BAAFGM%3BABUWG%3BADZZV%3BAFKRA%3BAGAJT%3BAQTIP%3BAZQEC%3BBENPR%3BCBPLH%3BCCPQU%3BDWQXO%3BEU9%3BG20%3BM8-%3BPHGZM%3BPHGZT%3BPIMPY%3BPKEHL%3BPQCXX%3BPQEST%3BPQQKQ%3BPQUKI%3BPRINS&rft.PQPubID=18750&rft.au=Yang%2C+Hongkang&rft.date=2023-01-01&rft.pub=ProQuest+Dissertations+%26+Theses&rft.isbn=9798379720971&rft.externalDBID=HAS_PDF_LINK
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798379720971/sc.gif&client=summon&freeimage=true