Structural Learning of Bayesian Networks Via Constrained Hill Climbing Algorithms: Adjusting Trade-off between Efficiency and Accuracy
Learning Bayesian networks is known to be an NP‐hard problem, and this, combined with the growing interest in learning models from high‐dimensional domains, leads to the necessity of finding more efficient learning algorithms. Recent papers have proposed constrained approaches of successfully and wi...
Saved in:
| Published in | International journal of intelligent systems Vol. 30; no. 3; pp. 292 - 325 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Blackwell Publishing Ltd
01.03.2015
John Wiley & Sons, Inc |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0884-8173 1098-111X |
| DOI | 10.1002/int.21701 |
Cover
| Summary: | Learning Bayesian networks is known to be an NP‐hard problem, and this, combined with the growing interest in learning models from high‐dimensional domains, leads to the necessity of finding more efficient learning algorithms. Recent papers have proposed constrained approaches of successfully and widely used local search algorithms, such as Hill Climbing. One of these algorithms families, called constrained Hill Climbing (CHC), greatly improves upon the efficiency of the original approach, obtaining models with slightly lower quality but maintaining their theoretical properties. In this paper, we propose three different modifications to the most scalable version of these algorithms, fast constrained Hill Climbing, to improve the quality of its output by relaxing the constraints imposed to include some diversification in the search process. The aim of these new approaches is to adjust the trade‐off between efficiency and accuracy of the algorithm, as they do not modify its complexity and only imply a few more search iterations. We perform an intensive experimental evaluation of the modifications proposed with an extensive comparison between the original algorithms and the new modifications covering several scenarios with quite large data sets. Available code and data for further use of the algorithms presented in this paper and experiment replication can be available at http://simd.albacete.org/supplements/FastCHC.html. |
|---|---|
| Bibliography: | istex:EAEC7FB6003FB0A604A9520C59FF0D106AFCA75C ark:/67375/WNG-D0Q3Z1SJ-Z FEDER funds and the Spanish Government (MICINN) - No. TIN2010-20900-C04-03; No. TIN2013-46638-C3-3-P ArticleID:INT21701 This paper is an extension of a previous conference paper. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
| ISSN: | 0884-8173 1098-111X |
| DOI: | 10.1002/int.21701 |