A framework for the selective dissemination of XML documents based on inferred user profiles

Bibliographic Details
Published in 2003 19th International Conference on Data Engineering, pp. 531-542
Main Authors Stanoi, I.; Mihaila, G.; Sriram Padmanabhan
Format Conference Proceeding
Language English
Published IEEE 2003
ISBN 9780780376656, 078037665X
DOI 10.1109/ICDE.2003.1260819

Abstract As the amount of data available online and the number of pervasive applications that take advantage of it increase, systems that support selective dissemination of information are becoming more popular. At the same time, XML is becoming the standard for document exchange over the Internet. A key capability of emerging information dissemination systems is therefore the effective filtering of a continuous stream of XML data items according to user preferences. Here we propose a model for information dissemination that integrates profile inference with data dissemination and takes advantage of the structured content in XML documents. Starting from the assumption that explicitly stating one's information interests is an inconvenient and error-prone process, we aim to automatically construct user profiles. We do this by clustering items previously deemed valuable by the user according to a novel similarity measure that takes advantage of the semantic content of XML. Furthermore, we index the profiles from all users into a multilevel index structure whose nodes naturally will be a close match to subject areas present in the document collection. Such an approach is both intuitive and efficient since the indexing structure is not primarily affected by an increasing number of users. To support our claims we experimentally validate our method and report on its effectiveness and efficiency.
Author Affiliations
Stanoi, I. (IBM Thomas J. Watson Research Center, NY, USA)
Mihaila, G. (IBM Thomas J. Watson Research Center, NY, USA)
Sriram Padmanabhan (IBM Thomas J. Watson Research Center, NY, USA)
Subjects Bandwidth; Deductive databases; Indexing; Information filtering; Information filters; Internet; Monitoring; Pressing; Traffic control; XML
Online Access https://ieeexplore.ieee.org/document/1260819