AN ONLINE FREQUENCY RATE BASED ALGORITHM FOR MINING FREQUENT SEQUENCES IN EVOLVING DATA STREAMS
| Published in | Challenges In Information Technology Management pp. 56 - 62 |
|---|---|
| Main Authors | , |
| Format | Book Chapter |
| Language | English |
| Published | WORLD SCIENTIFIC, 01.05.2008 |
| ISBN | 9789812819062 981281907X 9789812819079 9812819061 9789814470674 9814470678 |
| DOI | 10.1142/9789812819079_0009 |
| Summary: | Mining sequential patterns to discover frequent sequences has been widely studied as a data mining problem. A challenging research problem is to extend this approach to data streams. A data stream is an unbounded, continuously generated sequence of data transactions. In this paper, we propose an online single-pass algorithm called OFSD (Online Frequent Sequence Discovery) to mine the set of all frequent sequences in a data stream whose frequency rates satisfy a user-defined minimum frequency rate (fu). The algorithm significantly reduces the number of elements in the candidate set (the set of candidate sequences that must be kept for further exploration), which improves its performance in comparison with other general solutions. The simulation results show the effects of varying fu and the application-defined threshold (CM) on the frequent phrase detection process. |
|---|---|