AN ONLINE FREQUENCY RATE BASED ALGORITHM FOR MINING FREQUENT SEQUENCES IN EVOLVING DATA STREAMS
Mining sequential patterns for discovering frequent sequences has been widely studied as a data mining problem. A challenging research is to extend its use to data streams. A data steam is an unbounded, continuously generated sequence of data transactions. In this paper, we propose an online single-...
Saved in:
| Published in | Challenges In Information Technology Management pp. 56 - 62 |
|---|---|
| Main Authors | , |
| Format | Book Chapter |
| Language | English |
| Published |
WORLD SCIENTIFIC
01.05.2008
|
| Subjects | |
| Online Access | Get full text |
| ISBN | 9789812819062 981281907X 9789812819079 9812819061 9789814470674 9814470678 |
| DOI | 10.1142/9789812819079_0009 |
Cover
| Abstract | Mining sequential patterns for discovering frequent sequences has been widely studied as a data mining problem. A challenging research is to extend its use to data streams. A data steam is an unbounded, continuously generated sequence of data transactions. In this paper, we propose an online single-pass algorithm called OFSD (Online Frequent Sequence Discovery), to mine the set of all frequent sequences in a data stream whose frequency rates satisfy a minimum user defined frequency rate (fu). The algorithm significantly reduces the number of elements in the candidate set (a set of candidate sequences that should be kept for further exploration) that efficiently increases its performance in comparison with other general solutions. The simulation results show the effects of fu variation and the application defined threshold (CM) on the frequent phrase detection process. |
|---|---|
| AbstractList | Mining sequential patterns for discovering frequent sequences has been widely studied as a data mining problem. A challenging research is to extend its use to data streams. A data steam is an unbounded, continuously generated sequence of data transactions. In this paper, we propose an online single-pass algorithm called OFSD (Online Frequent Sequence Discovery), to mine the set of all frequent sequences in a data stream whose frequency rates satisfy a minimum user defined frequency rate (fu). The algorithm significantly reduces the number of elements in the candidate set (a set of candidate sequences that should be kept for further exploration) that efficiently increases its performance in comparison with other general solutions. The simulation results show the effects of fu variation and the application defined threshold (CM) on the frequent phrase detection process. |
| Author | GHORBANI, ALI A. BAROUNI-EBRAHIMI, M. |
| Author_xml | – sequence: 1 givenname: M. surname: BAROUNI-EBRAHIMI fullname: BAROUNI-EBRAHIMI, M. organization: Faculty of Computer Science, University of New Brunswick Fredericton, NB – sequence: 2 givenname: ALI A. surname: GHORBANI fullname: GHORBANI, ALI A. organization: Faculty of Computer Science, University of New Brunswick Fredericton, NB |
| BookMark | eNqdkE9LwzAYxiMq6Ga_gKd8gWmSpvlz8BC3rCt0Kbbd0FNY01SqY4V14Ne3YyqCN08PvM_v9x6eEbjYdTsPwC1GdxhTci-5kAITgSXi0iKE5BkY_Vyez0Hwi2DkCgR9_zZgiEiMI3oNrDIwM2liNJzn-mmlzfQF5qrU8FEVegZVGmd5Ui6WcJ7lcJmYxMTfZAmLk6ELmBio11m6PtYzVSpYlLlWy-IGXDabbe-DrxyD1VyX08UkzeJkqtLJKxGimTAuIlcJ2bAQUeSY99g54isaMRZJWRHPBeUbSVxNq5pILwjnuPZ8oMKI0XAM-OnvR7ff1r1r_e7QNq2zVde99xYje9zL_t1rMB_-Z9pq3_om_ATlpGm0 |
| ContentType | Book Chapter |
| Copyright | World Scientific Publishing Co. Pte. Ltd. |
| Copyright_xml | – notice: World Scientific Publishing Co. Pte. Ltd. |
| DOI | 10.1142/9789812819079_0009 |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISBN | 981281907X 9789812819079 9789814470674 9814470678 |
| Editor | Cheung, Ronnie Chan, Man-Chung Liu, James N K |
| Editor_xml | – sequence: 1 givenname: Man-Chung surname: Chan fullname: Chan, Man-Chung organization: Hong Kong Polytechnic University – sequence: 2 givenname: Ronnie surname: Cheung fullname: Cheung, Ronnie organization: Hong Kong Polytechnic University – sequence: 3 givenname: James N K surname: Liu fullname: Liu, James N K organization: Hong Kong Polytechnic University |
| EndPage | 62 |
| ExternalDocumentID | 10.1142/9789812819079_0009 |
| GroupedDBID | -VX 089 20A 38. 9WS A4J AABBV AAFQY AATMT ABARN ABCYV ABGJO ABIAV ABMRC ABQPQ ACBYE ACRAN ACZWY ADVEM AERYV AFOJC AFTHB AIXPE AJFER AKHYG ALMA_UNASSIGNED_HOLDINGS ALUEM AMYDA AZZ BBABE CZZ DUGUG EBSCA ECOWB GEOUK J-X JJU MYL PD4 PQQKQ PVBBV WMAQA XI1 YSPEL |
| ID | FETCH-LOGICAL-g288f-6785cb89f63040c6ee1cc2eb4566599b2e7847a92cd4bd29e82771de7cc235643 |
| ISBN | 9789812819062 981281907X 9789812819079 9812819061 9789814470674 9814470678 |
| IngestDate | Sat Mar 08 06:12:39 EST 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | OpenURL |
| MeetingName | Proceedings of the International Conference |
| MergedId | FETCHMERGED-LOGICAL-g288f-6785cb89f63040c6ee1cc2eb4566599b2e7847a92cd4bd29e82771de7cc235643 |
| PageCount | 7 |
| ParticipantIDs | worldscientific_books_10_1142_9789812819079_0009 worldscientific_books_10_1142_9789812819079_0009_brief |
| PublicationCentury | 2000 |
| PublicationDate | 20080500 |
| PublicationDateYYYYMMDD | 2008-05-01 |
| PublicationDate_xml | – month: 05 year: 2008 text: 20080500 |
| PublicationDecade | 2000 |
| PublicationTitle | Challenges In Information Technology Management |
| PublicationYear | 2008 |
| Publisher | WORLD SCIENTIFIC |
| Publisher_xml | – name: WORLD SCIENTIFIC |
| SSID | ssj0000291154 |
| Score | 1.3324859 |
| Snippet | Mining sequential patterns for discovering frequent sequences has been widely studied as a data mining problem. A challenging research is to extend its use to... |
| SourceID | worldscientific |
| SourceType | Enrichment Source Publisher |
| StartPage | 56 |
| SubjectTerms | Part A: Development of Enabling Technologies |
| Title | AN ONLINE FREQUENCY RATE BASED ALGORITHM FOR MINING FREQUENT SEQUENCES IN EVOLVING DATA STREAMS |
| URI | https://www.worldscientific.com/doi/10.1142/9789812819079_0009 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9swDBbW7rIOGPZEuxd02M1wliiyJR2VRlk8JDbmOkV2CiJHHnbJgDa79NeXsvxqUgzILoYlS0psfiIpiqQQ-sJNQJiB-R2u6dCnOjS-TZvlG13YbF46MGtr75jH4XRBvy-DZes6VEaX7HQvv3s0ruR_qAp1QFcbJXsEZZtBoQLugb5wBQrDdU_5fWhmdXkF6mNQbmGSe1VYUUnN1lz-iHfLSKYJLPp8NUrlFDhXaRHtNW440yQdybislbPIk70upmTsJfEsipU3SdWPhYovf3qpzBSs9q_UGDp8S9Iom849WFp68yi2lrCqZeZduR7Kxix76jqZXdvHY5lJUEtTJSvreG2B4K2_n-NzSTobO5tYFk2iOhWlW6EKXm7V9d2JMRWXDMKOvHXM-JCTU-KcN9oR7Ea5aOVWvVe_J84aJ8N_jXKCThgHtvgUhL-aN0a5PhE2PZFLzFR3CcnDMhNNmVIG0t4eHdY0HnQKbOkKrhmvY7Yo-Xr4j87QizJHrouDtW5iHT0ne4me29gXbINS4B1foSdm-xqddXJWvkErGWMHAtyAAFsQ4BIEuAEBBhBgB4K6ZYYbEOAoxjUIsAUBrkDwFi0mKruc-tUpHf4vwnnhw4sFueaiCIcgEPLQmEGeE6NBMw8DITQxDDSgtSD5huoNEYYTxgYbw6DVMACF-B063f7ZmnOEDQ0FZXanNx_QXJB1kdttXl4IzjQLigvU3_tGKzv9blcuup6sDj_rBQqP7bLSN79N8f743_qAnrXT4yM63d38NZ9AZd3pzxXQ7gFoTm8H |
| linkProvider | ProQuest Ebooks |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Challenges+In+Information+Technology+Management&rft.au=BAROUNI-EBRAHIMI%2C+M.&rft.au=GHORBANI%2C+ALI+A.&rft.atitle=AN+ONLINE+FREQUENCY+RATE+BASED+ALGORITHM+FOR+MINING+FREQUENT+SEQUENCES+IN+EVOLVING+DATA+STREAMS&rft.date=2008-05-01&rft.pub=WORLD+SCIENTIFIC&rft.isbn=9789812819079&rft.spage=56&rft.epage=62&rft_id=info:doi/10.1142%2F9789812819079_0009&rft.externalDBID=n%2Fa&rft.externalDocID=10.1142%2F9789812819079_0009 |
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fwww.worldscientific.com%2Faction%2FshowCoverImage%3Fdoi%3D10.1142%2F9789812819079_0009 |