Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses

The problem of closed frequent itemset discovery is a fundamental problem of data mining, having applications in numerous domains. It is thus very important to have efficient parallel algorithms to solve this problem, capable of efficiently harnessing the power of multicore processors that exists in...

Full description

Saved in:
Bibliographic Details
Published in2010 International Conference on High Performance Computing and Simulation pp. 521 - 528
Main Authors Negrevergne, B, Termier, A, Méhaut, J, Uno, T
Format Conference Proceeding
LanguageEnglish
Japanese
Published IEEE 01.06.2010
Subjects
Online AccessGet full text
ISBN9781424468270
1424468272
DOI10.1109/HPCS.2010.5547082

Cover

More Information
Summary:The problem of closed frequent itemset discovery is a fundamental problem of data mining, having applications in numerous domains. It is thus very important to have efficient parallel algorithms to solve this problem, capable of efficiently harnessing the power of multicore processors that exists in our computers (notebooks as well as desktops). In this paper we present PLCM QS , a parallel algorithm based on the LCM algorithm, recognized as the most efficient algorithm for sequential discovery of closed frequent itemsets. We also present a simple yet powerfull parallelism interface based on the concept of Tuple Space, which allows an efficient dynamic sharing of the work. Thanks to a detailed experimental study, we show that PLCM QS is efficient on both on sparse and dense databases.
ISBN:9781424468270
1424468272
DOI:10.1109/HPCS.2010.5547082