An Algorithm to Mine Normalized Weighted Sequential Patterns Using a Prefix-projected Database

Sequential pattern mining is an important subject in data mining with broad applications in many different areas. However, previous sequential mining algorithms mostly aimed to calculate the number of occurrences (the support) without regard to the degree of importance of different data items. In th...

Full description

Saved in:
Bibliographic Details
Published inSerdica : journal of computing Vol. 9; no. 2; pp. 105 - 122
Main Authors Demetrovics, János, Thi, Vu Duc, Duong, Tran Huy
Format Journal Article
LanguageEnglish
Published 18.04.2016
Online AccessGet full text
ISSN1312-6555
1314-7897
1314-7897
DOI10.55630/sjc.2015.9.105-122

Cover

More Information
Summary:Sequential pattern mining is an important subject in data mining with broad applications in many different areas. However, previous sequential mining algorithms mostly aimed to calculate the number of occurrences (the support) without regard to the degree of importance of different data items. In this paper, we propose to explore the search space of subsequences with normalized weights. We are not only interested in the numberof occurrences of the sequences (supports of sequences), but also concernedabout importance of sequences (weights). When generating subsequencecandidates we use both the support and the weight of the candidates whilemaintaining the downward closure property of these patterns which allowsto accelerate the process of candidate generation.
ISSN:1312-6555
1314-7897
1314-7897
DOI:10.55630/sjc.2015.9.105-122