Research and Realization of Text Mining Algorithm on Web

Bibliographic Details
Published in CIS Workshops 2007: 2007 International Conference on Computational Intelligence and Security Workshops: proceedings: 15-19 December, 2007, Harbin, Heilongjiang, China, pp. 413-416
Main Authors Shiqun Yin, Yuhui Qiu, Jike Ge
Format Conference Proceeding
Language English
Published IEEE 01.12.2007
ISBN 9780769530734
0769530737
DOI 10.1109/CISW.2007.4425522

Abstract It is widely recognized that text information on the Web is growing at an astounding pace. Research on and application of Web text mining is an important branch of data mining. At present, people mainly use information retrieval (IR) or search engines to look up Web information. However, IR focuses on finding information that is explicitly present rather than knowledge latent in a document; search engines can hardly adapt to the differing needs of different users or provide individualized service; and it is very difficult to mine the data further. Text mining on the Web aims to resolve these problems. This paper discusses an algorithm that uses text mining techniques to follow a specified website or Web page according to the user's request, to extract and represent text features, and to classify the data using feedback judgement combined with the Web page text content for later use. We present experiments on different data sets that demonstrate that our algorithm is more effective than the traditional algorithm. The process of Web text mining, the information extraction method, the mining algorithm, and the realization technique are discussed in detail.
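
The abstract names three steps (extracting and representing text features, classifying page text, and using feedback judgement) but this record does not give the algorithm itself. The following is only a minimal sketch of one common way such a pipeline might look, assuming a TF-IDF vector space representation, a nearest-centroid classifier with cosine similarity, and a simple centroid update as the feedback step. All function names, the `rate` parameter, and the sample documents are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return sparse TF-IDF vectors (term -> weight) and the IDF table."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(t for tokens in tokenized for t in set(tokens))
    # Smoothed IDF: terms found in every document keep a small weight.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0
           for t, df in doc_freq.items()}
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({t: (c / total) * idf[t] for t, c in counts.items()})
    return vectors, idf


def vectorize(text, idf):
    """Represent unseen text with the known vocabulary; other terms are dropped."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    total = len(tokens) or 1
    return {t: (c / total) * idf[t] for t, c in counts.items() if t in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroids(vectors, labels):
    """Average the vectors of each class into one centroid per label."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter(labels)
    for vec, label in zip(vectors, labels):
        for t, w in vec.items():
            sums[label][t] += w
    return {label: {t: w / counts[label] for t, w in terms.items()}
            for label, terms in sums.items()}


def classify(vec, cents):
    """Pick the label whose centroid is most similar to the vector."""
    return max(cents, key=lambda label: cosine(vec, cents[label]))


def apply_feedback(cents, vec, correct_label, rate=0.5):
    """Assumed feedback step: move the confirmed class centroid toward vec."""
    centroid = cents.setdefault(correct_label, {})
    for t in set(centroid) | set(vec):
        centroid[t] = (1 - rate) * centroid.get(t, 0.0) + rate * vec.get(t, 0.0)


# Hypothetical training pages and labels, for illustration only.
docs = ["stock market shares rise", "football match goal score",
        "market trading stock prices", "team wins football league"]
labels = ["finance", "sport", "finance", "sport"]
vectors, idf = tfidf_vectors(docs)
cents = centroids(vectors, labels)
print(classify(vectorize("stock prices fall on the market", idf), cents))
```

In this sketch a user correction is folded back in by calling `apply_feedback` with the label the user confirms, so later pages with similar wording are drawn toward that class. A production system would more likely rely on an established library than on hand-written vector code.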
Author Affiliation Southwest Univ., Chongqing (Shiqun Yin, Yuhui Qiu, Jike Ge)
Discipline Computer Science
Subjects Computational intelligence
Computer security
Data mining
Feedback
Information retrieval
Search engines
Text categorization
Text mining
Text recognition
Web pages
URI https://ieeexplore.ieee.org/document/4425522