Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families

Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, th...

Full description

Saved in:
Bibliographic Details
Main Authors Rajendran, Goutham, Kivva, Bohdan, Gao, Ming, Aragam, Bryon
Format Journal Article
LanguageEnglish
Published 10.10.2021
Subjects
Online AccessGet full text
DOI10.48550/arxiv.2110.04719

Cover

Abstract Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings.
AbstractList Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings.
Author Gao, Ming
Rajendran, Goutham
Aragam, Bryon
Kivva, Bohdan
Author_xml – sequence: 1
  givenname: Goutham
  surname: Rajendran
  fullname: Rajendran, Goutham
– sequence: 2
  givenname: Bohdan
  surname: Kivva
  fullname: Kivva, Bohdan
– sequence: 3
  givenname: Ming
  surname: Gao
  fullname: Gao, Ming
– sequence: 4
  givenname: Bryon
  surname: Aragam
  fullname: Aragam, Bryon
BackLink https://doi.org/10.48550/arXiv.2110.04719$$DView paper in arXiv
BookMark eNqFjrkOwkAMRLeAgusDqPAHcCRAxFGCOHroIwucYGnXGzkbRP6egOipRhq90byuaYkXMmYYR9PlOkmiGeqLn9N53BTRchVvOia_BK1uoVICS6jCkgMLFN7W4h2jhcCOtnBSonsNaHOvHB6uHMNOKXcoDZ55dRjYyxhQ7kCvovmV8Fln6NgylX3TztCWNPhlz4yOh-v-PPkqpYWyQ63Tj1r6VVv8J95ld0eh
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
EPD
GOX
DOI 10.48550/arxiv.2110.04719
DatabaseName arXiv Computer Science
arXiv Statistics
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2110_04719
GroupedDBID AKY
EPD
GOX
ID FETCH-arxiv_primary_2110_047193
IEDL.DBID GOX
IngestDate Wed Jul 23 02:03:02 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2110_047193
OpenAccessLink https://arxiv.org/abs/2110.04719
ParticipantIDs arxiv_primary_2110_04719
PublicationCentury 2000
PublicationDate 2021-10-10
PublicationDateYYYYMMDD 2021-10-10
PublicationDate_xml – month: 10
  year: 2021
  text: 2021-10-10
  day: 10
PublicationDecade 2020
PublicationYear 2021
Score 3.551119
SecondaryResourceType preprint
Snippet Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Computer Science - Learning
Statistics - Machine Learning
Title Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families
URI https://arxiv.org/abs/2110.04719
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdZ1NT8JAEIYnwMmLkajB7zl4BF1kS1luaiTERD2oSW_Nth0qCbRNUQP_3pltjV64trPNdJvtzNPOvAtwGZlR7JPmlRYzm2iV9HuWwzBTysha8slTTkj76Xk4fdePgRc0AH97YWy5nn9X-sDR6lro5Erx-9M0ocmJgjTzvgTVz0knxVXb_9lxjukO_QsSkz3YrbM7vK0eRxsalO1D-uo0Wr9KwnqPhhTnGRb5YiM9wWwvG7yPUUpgkg3aRZozsX8sV128Kyld2gxreVOZxC4y-yOtizyTQh8e7b5RMPAewMXk4e1-2nOuhUWlIxGK16HzenAILaZ96gDGQ2s45qq-9Yz2_VmkZoxgVjNmmMQM6Ag6265yvP3UCezcSDGGlGKoU2jxfdMZR9PP6NxN6Q9caXp8
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Structure+learning+in+polynomial+time%3A+Greedy+algorithms%2C+Bregman+information%2C+and+exponential+families&rft.au=Rajendran%2C+Goutham&rft.au=Kivva%2C+Bohdan&rft.au=Gao%2C+Ming&rft.au=Aragam%2C+Bryon&rft.date=2021-10-10&rft_id=info:doi/10.48550%2Farxiv.2110.04719&rft.externalDocID=2110_04719