A framework for the selective dissemination of XML documents based on inferred user profiles
| Published in | 2003 19th International Conference on Data Engineering, pp. 531-542 |
|---|---|
| Main Authors | Stanoi, I.; Mihaila, G.; Padmanabhan, Sriram |
| Format | Conference Proceeding | 
| Language | English | 
| Published | IEEE, 2003 |
| Online Access | Get full text | 
| ISBN | 9780780376656 078037665X  | 
| DOI | 10.1109/ICDE.2003.1260819 | 
| Abstract | As the amount of data available online and the number of pervasive applications that take advantage of it increase, systems that support selective dissemination of information are becoming more popular. At the same time, XML is becoming the standard for document exchange over the Internet. A key capability of emerging information dissemination systems is therefore the effective filtering of a continuous stream of XML data items according to user preferences. Here we propose a model for information dissemination that integrates profile inference with data dissemination and takes advantage of the structured content in XML documents. Starting from the assumption that explicitly stating one's information interests is an inconvenient and error-prone process, we aim to automatically construct user profiles. We do this by clustering items previously deemed valuable by the user according to a novel similarity measure that takes advantage of the semantic content of XML. Furthermore, we index the profiles from all users into a multilevel index structure whose nodes naturally will be a close match to subject areas present in the document collection. Such an approach is both intuitive and efficient since the indexing structure is not primarily affected by an increasing number of users. To support our claims we experimentally validate our method and report on its effectiveness and efficiency. | 
    
| Author Affiliations | Stanoi, I.; Mihaila, G.; Padmanabhan, Sriram (all: IBM Thomas J. Watson Res. Center, NY, USA) |
| Subject Terms | Bandwidth; Deductive databases; Indexing; Information filtering; Information filters; Internet; Monitoring; Pressing; Traffic control; XML |
| URI | https://ieeexplore.ieee.org/document/1260819 | 
    