Research and Realization of Text Mining Algorithm on Web
| Published in | CIS Workshops 2007 : 2007 International Conference on Computational Intelligence and Security Workshops : proceedings : 15-19 December, 2007, Harbin, Heilongjiang, China pp. 413 - 416 |
|---|---|
| Main Authors | Shiqun Yin, Yuhui Qiu, Jike Ge |
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 01.12.2007 |
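| Subjects | Computational intelligence; Computer security; Data mining; Feedback; Information retrieval; Search engines; Text categorization; Text mining; Text recognition; Web pages |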
| ISBN | 9780769530734 0769530737 |
| DOI | 10.1109/CISW.2007.4425522 |
| Abstract | Text information on the Web is growing at an astounding pace, and the research and application of Web text mining is an important branch of data mining. Today people mainly use information retrieval (IR) or search engines to look up Web information. However, IR focuses on finding information that is explicitly present rather than knowledge latent in a document; search engines can hardly adapt to the differing needs of different users or provide individualized service; and mining the data further is very difficult. Web text mining aims to resolve these problems. This paper discusses an algorithm that uses text mining techniques to follow a designated website or Web page according to the user's request, to extract and represent text features, and to classify the data using feedback judgement combined with the Web page text content for later use. We present experiments on different data sets that demonstrate that our algorithm is more effective than the traditional algorithm. The process of Web text mining, the information extraction method, the mining algorithm, and the realization technique are discussed in detail. |
|---|---|
| Author | Shiqun Yin; Jike Ge; Yuhui Qiu |
| Author_xml | – sequence: 1 surname: Shiqun Yin fullname: Shiqun Yin organization: Southwest Univ., Chongqing – sequence: 2 surname: Yuhui Qiu fullname: Yuhui Qiu organization: Southwest Univ., Chongqing – sequence: 3 surname: Jike Ge fullname: Jike Ge organization: Southwest Univ., Chongqing |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/CISW.2007.4425522 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EndPage | 416 |
| ExternalDocumentID | 4425522 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AARBI AAWTH ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK OCL RIE RIL |
| ID | FETCH-LOGICAL-i90t-453a68974fce35d91d70152a1e1e400c541121656a60a51bd1f389f998712ccb3 |
| IEDL.DBID | RIE |
| ISBN | 9780769530734 0769530737 |
| IngestDate | Wed Aug 27 01:42:02 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i90t-453a68974fce35d91d70152a1e1e400c541121656a60a51bd1f389f998712ccb3 |
| PageCount | 4 |
| ParticipantIDs | ieee_primary_4425522 |
| PublicationCentury | 2000 |
| PublicationDate | 2007-Dec. |
| PublicationDateYYYYMMDD | 2007-12-01 |
| PublicationDate_xml | – month: 12 year: 2007 text: 2007-Dec. |
| PublicationDecade | 2000 |
| PublicationTitle | CIS Workshops 2007 : 2007 International Conference on Computational Intelligence and Security Workshops : proceedings : 15-19 December, 2007, Harbin, Heilongjiang, China |
| PublicationTitleAbbrev | CISW |
| PublicationYear | 2007 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0000393902 |
| Score | 1.4164169 |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 413 |
| SubjectTerms | Computational intelligence; Computer security; Data mining; Feedback; Information retrieval; Search engines; Text categorization; Text mining; Text recognition; Web pages |
| Title | Research and Realization of Text Mining Algorithm on Web |
| URI | https://ieeexplore.ieee.org/document/4425522 |
| hasFullText | 1 |
| inHoldings | 1 |
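
The abstract mentions extracting text features from Web pages and classifying them with feedback judgement, but this record does not give the algorithm itself. The sketch below is only an illustration of that general idea and is not the authors' method: it assumes a bag-of-words term-frequency representation, cosine similarity against per-class centroids, and a simple feedback step that reinforces the correct class after a misclassification. The names `tokenize`, `cosine`, and `FeedbackCentroidClassifier`, and the feedback weight of 2.0, are hypothetical choices made for this example.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term -> weight mappings."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


class FeedbackCentroidClassifier:
    """Hypothetical centroid classifier with user feedback (illustrative only)."""

    def __init__(self):
        # label -> accumulated term weights for that class
        self.centroids = defaultdict(Counter)

    def learn(self, text, label, weight=1.0):
        """Add the term counts of `text` to the centroid of `label`."""
        for term, count in Counter(tokenize(text)).items():
            self.centroids[label][term] += weight * count

    def classify(self, text):
        """Return the label whose centroid is most similar to `text`."""
        vec = Counter(tokenize(text))
        return max(self.centroids, key=lambda lab: cosine(vec, self.centroids[lab]))

    def feedback(self, text, correct_label):
        """Classify `text`; if the prediction is wrong, reinforce the correct class.

        The weight 2.0 is an arbitrary assumption for this sketch.
        """
        predicted = self.classify(text)
        if predicted != correct_label:
            self.learn(text, correct_label, weight=2.0)
        return predicted
```

A caller would first `learn` a few labelled page texts per class, then call `feedback` whenever a user corrects a prediction, so that later calls to `classify` reflect the correction. A fuller treatment would normally add stop-word removal and inverse-document-frequency weighting, which are omitted here for brevity.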