Ranked join indices

Bibliographic Details
Published in 2003 19th International Conference on Data Engineering, pp. 277-288
Main Authors Tsaparas, P., Palpanas, T., Kotidis, Y., Koudas, N., Srivastava, Divesh
Format Conference Proceeding
Language English
Published IEEE 2003
ISBN 9780780376656, 078037665X
DOI 10.1109/ICDE.2003.1260799

Abstract Many data sources contain entities that can be ordered by a variety of their attributes. Each such ordering effectively ranks the entities according to the values in the attribute domain. Users commonly correlate such sources for query processing through join operations. In query processing, it is desirable to incorporate user preferences towards specific attributes or their values. One way to incorporate such preferences is to use scoring functions that combine user preferences and attribute values and return a numerical score for each tuple in the join result. A target query, which we refer to as a top-k join query, then seeks the k tuples in the join result with the highest scores. We propose a novel technique, which we refer to as a ranked join index, to efficiently answer top-k join queries for arbitrary, user-specified preferences and a large class of scoring functions. Our ranked join index requires little space (compared to the entire join result) and provides performance guarantees. Moreover, our proposal provides a graceful tradeoff between its space requirements and worst-case search performance. We supplement our analytical results with a thorough experimental evaluation using a variety of real and synthetic data sets, demonstrating that, in comparison to other viable approaches, our technique offers significant performance benefits.
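
To make the notion of a top-k join query concrete, the following is a minimal sketch of a naive baseline, not the ranked join index proposed in the paper. It materializes an equi-join, scores every joined tuple, and keeps the k best. The function name `top_k_join`, the `(join_key, rank_value)` tuple layout, and the linear weighted-sum scoring function are all assumptions made for illustration; the abstract does not specify them.

```python
import heapq
from collections import defaultdict


def top_k_join(left, right, weights, k):
    """Naive top-k join baseline (illustrative only).

    left, right: iterables of (join_key, rank_value) pairs (hypothetical layout).
    weights: (w_left, w_right), user preferences for an assumed linear score.
    Returns up to k tuples (score, join_key, left_value, right_value),
    ordered from highest to lowest score.
    """
    w_left, w_right = weights

    # Build a hash table over the right relation, keyed by the join attribute.
    by_key = defaultdict(list)
    for key, value in right:
        by_key[key].append(value)

    # Lazily enumerate the join result and score each joined tuple.
    scored = (
        (w_left * lv + w_right * rv, key, lv, rv)
        for key, lv in left
        for rv in by_key.get(key, ())
    )

    # Keep only the k highest-scoring tuples.
    return heapq.nlargest(k, scored)


if __name__ == "__main__":
    hotels = [("NYC", 4.5), ("NYC", 3.0), ("SFO", 5.0)]
    flights = [("NYC", 2.0), ("SFO", 1.0), ("SFO", 4.0)]
    for row in top_k_join(hotels, flights, (0.5, 0.5), 2):
        print(row)
```

This baseline touches every tuple of the join result for each query, which is the cost an index built ahead of time is meant to avoid.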
Author Affiliations
1. Tsaparas, P. (Toronto Univ., Ont., Canada)
2. Palpanas, T. (Toronto Univ., Ont., Canada)
3. Kotidis, Y.
4. Koudas, N.
5. Srivastava, Divesh
Subject Terms Airports; Availability; Data engineering; Delay effects; Performance analysis; Proposals; Quality of service; Query processing
URI https://ieeexplore.ieee.org/document/1260799