Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families

Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, th...

Full description

Saved in:

Bibliographic Details
Main Authors	Rajendran, Goutham, Kivva, Bohdan, Gao, Ming, Aragam, Bryon
Format	Journal Article
Language	English
Published	10.10.2021
Subjects	Computer Science - Artificial Intelligence Computer Science - Learning Statistics - Machine Learning
Online Access	Get full text
DOI	10.48550/arxiv.2110.04719

Cover

Abstract	Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings.
AbstractList	Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings.
Author	Gao, Ming Rajendran, Goutham Aragam, Bryon Kivva, Bohdan
Author_xml	– sequence: 1 givenname: Goutham surname: Rajendran fullname: Rajendran, Goutham – sequence: 2 givenname: Bohdan surname: Kivva fullname: Kivva, Bohdan – sequence: 3 givenname: Ming surname: Gao fullname: Gao, Ming – sequence: 4 givenname: Bryon surname: Aragam fullname: Aragam, Bryon
BackLink	https://doi.org/10.48550/arXiv.2110.04719$$DView paper in arXiv
BookMark	eNqFjrkOwkAMRLeAgusDqPAHcCRAxFGCOHroIwucYGnXGzkbRP6egOipRhq90byuaYkXMmYYR9PlOkmiGeqLn9N53BTRchVvOia_BK1uoVICS6jCkgMLFN7W4h2jhcCOtnBSonsNaHOvHB6uHMNOKXcoDZ55dRjYyxhQ7kCvovmV8Fln6NgylX3TztCWNPhlz4yOh-v-PPkqpYWyQ63Tj1r6VVv8J95ld0eh
ContentType	Journal Article
Copyright	http://creativecommons.org/licenses/by/4.0
Copyright_xml	– notice: http://creativecommons.org/licenses/by/4.0
DBID	AKY EPD GOX
DOI	10.48550/arxiv.2110.04719
DatabaseName	arXiv Computer Science arXiv Statistics arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2110_04719
GroupedDBID	AKY EPD GOX
ID	FETCH-arxiv_primary_2110_047193
IEDL.DBID	GOX
IngestDate	Wed Jul 23 02:03:02 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_2110_047193
OpenAccessLink	https://arxiv.org/abs/2110.04719
ParticipantIDs	arxiv_primary_2110_04719
PublicationCentury	2000
PublicationDate	2021-10-10
PublicationDateYYYYMMDD	2021-10-10
PublicationDate_xml	– month: 10 year: 2021 text: 2021-10-10 day: 10
PublicationDecade	2020
PublicationYear	2021
Score	3.551119
SecondaryResourceType	preprint
Snippet	Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Artificial Intelligence Computer Science - Learning Statistics - Machine Learning
Title	Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families
URI	https://arxiv.org/abs/2110.04719
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdZ1NT8JAEIYnwMmLkajB7zl4BF1kS1luaiTERD2oSW_Nth0qCbRNUQP_3pltjV64trPNdJvtzNPOvAtwGZlR7JPmlRYzm2iV9HuWwzBTysha8slTTkj76Xk4fdePgRc0AH97YWy5nn9X-sDR6lro5Erx-9M0ocmJgjTzvgTVz0knxVXb_9lxjukO_QsSkz3YrbM7vK0eRxsalO1D-uo0Wr9KwnqPhhTnGRb5YiM9wWwvG7yPUUpgkg3aRZozsX8sV128Kyld2gxreVOZxC4y-yOtizyTQh8e7b5RMPAewMXk4e1-2nOuhUWlIxGK16HzenAILaZ96gDGQ2s45qq-9Yz2_VmkZoxgVjNmmMQM6Ag6265yvP3UCezcSDGGlGKoU2jxfdMZR9PP6NxN6Q9caXp8
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Structure+learning+in+polynomial+time%3A+Greedy+algorithms%2C+Bregman+information%2C+and+exponential+families&rft.au=Rajendran%2C+Goutham&rft.au=Kivva%2C+Bohdan&rft.au=Gao%2C+Ming&rft.au=Aragam%2C+Bryon&rft.date=2021-10-10&rft_id=info:doi/10.48550%2Farxiv.2110.04719&rft.externalDocID=2110_04719