AN ONLINE FREQUENCY RATE BASED ALGORITHM FOR MINING FREQUENT SEQUENCES IN EVOLVING DATA STREAMS

Bibliographic Details
Published in Challenges In Information Technology Management, pp. 56-62
Main Authors BAROUNI-EBRAHIMI, M., GHORBANI, ALI A.
Format Book Chapter
Language English
Published WORLD SCIENTIFIC 01.05.2008
ISBN 9789812819062
981281907X
9789812819079
9812819061
9789814470674
9814470678
DOI 10.1142/9789812819079_0009

Abstract Mining sequential patterns to discover frequent sequences has been widely studied as a data mining problem. A challenging research problem is to extend it to data streams. A data stream is an unbounded, continuously generated sequence of data transactions. In this paper, we propose an online single-pass algorithm called OFSD (Online Frequent Sequence Discovery) to mine the set of all frequent sequences in a data stream whose frequency rates satisfy a minimum user-defined frequency rate (fu). The algorithm significantly reduces the number of elements in the candidate set (the set of candidate sequences that should be kept for further exploration), which improves its performance in comparison with other general solutions. The simulation results show the effects of varying fu and the application-defined threshold (CM) on the frequent phrase detection process.
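Illustration The abstract does not spell out how OFSD works, so the following is only a naive single-pass baseline, not the authors' algorithm. It assumes that a transaction is an ordered list of items, that a sequence is a contiguous run of items within one transaction, and that the frequency rate of a sequence is the fraction of transactions seen so far that contain it. The function name frequent_sequences and the parameters f_u and max_len are hypothetical, and the CM threshold is not modelled because the abstract does not define it.

```python
from collections import Counter


def frequent_sequences(stream, f_u, max_len=3):
    """Naive single-pass baseline (NOT OFSD): return the sequences whose
    frequency rate is at least f_u, mapped to that rate.

    Assumptions (not taken from the paper): a sequence is a contiguous run
    of items, and its frequency rate is the fraction of transactions that
    contain it at least once.
    """
    counts = Counter()
    seen = 0
    for transaction in stream:
        seen += 1
        items = tuple(transaction)
        # Collect each distinct contiguous subsequence once per transaction.
        subsequences = {
            items[i:i + n]
            for n in range(1, max_len + 1)
            for i in range(len(items) - n + 1)
        }
        counts.update(subsequences)
    if seen == 0:
        return {}
    return {seq: c / seen for seq, c in counts.items() if c / seen >= f_u}


if __name__ == "__main__":
    example_stream = [
        ["data", "stream", "mining"],
        ["data", "stream"],
        ["stream", "mining"],
        ["data", "mining"],
    ]
    for seq, rate in sorted(frequent_sequences(example_stream, 0.5, 2).items()):
        print(seq, rate)
```

This baseline keeps a counter for every subsequence it has ever seen, which is exactly the candidate-set growth that the abstract says OFSD reduces; it is meant only to make the frequency rate threshold concrete.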
Author Affiliations BAROUNI-EBRAHIMI, M.: Faculty of Computer Science, University of New Brunswick, Fredericton, NB
GHORBANI, ALI A.: Faculty of Computer Science, University of New Brunswick, Fredericton, NB
Copyright World Scientific Publishing Co. Pte. Ltd.
Discipline Engineering
EISBN 981281907X
9789812819079
9789814470674
9814470678
Editors Chan, Man-Chung (Hong Kong Polytechnic University)
Cheung, Ronnie (Hong Kong Polytechnic University)
Liu, James N K (Hong Kong Polytechnic University)
MeetingName Proceedings of the International Conference
SubjectTerms Part A: Development of Enabling Technologies
URI https://www.worldscientific.com/doi/10.1142/9789812819079_0009