Ranked join indices
A plethora of data sources contain data entities that could be ordered according to a variety of attributes associated with the entities. Such orderings result effectively in a ranking of the entities according to the values in the attribute domain. Commonly, users correlate such sources for query p...
        Saved in:
      
    
          | Published in | 2003 19th International Conference on Data Engineering pp. 277 - 288 | 
|---|---|
| Main Authors | , , , , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE
    
        2003
     | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9780780376656 078037665X  | 
| DOI | 10.1109/ICDE.2003.1260799 | 
Cover
| Abstract | A plethora of data sources contain data entities that could be ordered according to a variety of attributes associated with the entities. Such orderings result effectively in a ranking of the entities according to the values in the attribute domain. Commonly, users correlate such sources for query processing purposes through join operations. In query processing, it is desirable to incorporate user preferences towards specific attributes or their values. A way to incorporate such preferences is by utilizing scoring functions that combine user preferences and attribute values and return a numerical score for each tuple in the join result. Then, a target query, which we refer to as top-k join query, seeks to identify the k tuples in the join result with the highest scores. We propose a novel technique, which we refer to as ranked join index, to efficiently answer top-k join queries for arbitrary, user specified, preferences and a large class of scoring functions. Our rank join index requires small space (compared to the entire join result) and provides guarantees for its performance. Moreover, our proposal provides a graceful tradeoff between its space requirements and worst case search performance. We supplement our analytical results with a thorough experimental evaluation using a variety of real and synthetic data sets, demonstrating that, in comparison to other viable approaches, our technique offers significant performance benefits. | 
    
|---|---|
| AbstractList | A plethora of data sources contain data entities that could be ordered according to a variety of attributes associated with the entities. Such orderings result effectively in a ranking of the entities according to the values in the attribute domain. Commonly, users correlate such sources for query processing purposes through join operations. In query processing, it is desirable to incorporate user preferences towards specific attributes or their values. A way to incorporate such preferences is by utilizing scoring functions that combine user preferences and attribute values and return a numerical score for each tuple in the join result. Then, a target query, which we refer to as top-k join query, seeks to identify the k tuples in the join result with the highest scores. We propose a novel technique, which we refer to as ranked join index, to efficiently answer top-k join queries for arbitrary, user specified, preferences and a large class of scoring functions. Our rank join index requires small space (compared to the entire join result) and provides guarantees for its performance. Moreover, our proposal provides a graceful tradeoff between its space requirements and worst case search performance. We supplement our analytical results with a thorough experimental evaluation using a variety of real and synthetic data sets, demonstrating that, in comparison to other viable approaches, our technique offers significant performance benefits. | 
    
| Author | Tsaparas, P. Koudas, N. Palpanas, T. Kotidis, Y. Divesh Srivastava  | 
    
| Author_xml | – sequence: 1 givenname: P. surname: Tsaparas fullname: Tsaparas, P. organization: Toronto Univ., Ont., Canada – sequence: 2 givenname: T. surname: Palpanas fullname: Palpanas, T. organization: Toronto Univ., Ont., Canada – sequence: 3 givenname: Y. surname: Kotidis fullname: Kotidis, Y. – sequence: 4 givenname: N. surname: Koudas fullname: Koudas, N. – sequence: 5 surname: Divesh Srivastava fullname: Divesh Srivastava  | 
    
| BookMark | eNotjkFLAzEQRgMqVOueehIv_QO7nWSymZ2jrNUWCoWi55JNJpCqqXS9-O8t2I8H7_b47tR1ORZR6kFDozXwYt0_LxsDgI02Doj5SlVMHZxBcq51E1WN4wHOsy1yZ2_VbOfLh8T54ZjLPJeYg4z36ib5z1Gqi6fq_WX51q_qzfZ13T9t6qwJf2pDLVg7GOrIG5KgXXLMHClFq30A8YMFCCkiphiM7jikgJDc0AoiA07V4383i8j--5S__Ol3f7mOfy8hORc | 
    
| ContentType | Conference Proceeding | 
    
| DBID | 6IE 6IH CBEJK RIE RIO  | 
    
| DOI | 10.1109/ICDE.2003.1260799 | 
    
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP) 1998-present  | 
    
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| EndPage | 288 | 
    
| ExternalDocumentID | 1260799 | 
    
| GroupedDBID | 6IE 6IH 6IK 6IL AAJGR AAVQY AAWTH ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IERZE OCL RIB RIC RIE RIL RIO  | 
    
| ID | FETCH-LOGICAL-i173t-275044b2787a27ec16f6999d7fd41ac0eab400cfd33fdc2189cfc30f6b5e33903 | 
    
| IEDL.DBID | RIE | 
    
| ISBN | 9780780376656 078037665X  | 
    
| IngestDate | Tue Aug 26 17:53:07 EDT 2025 | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | true | 
    
| Language | English | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-i173t-275044b2787a27ec16f6999d7fd41ac0eab400cfd33fdc2189cfc30f6b5e33903 | 
    
| PageCount | 12 | 
    
| ParticipantIDs | ieee_primary_1260799 | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 20030000 | 
    
| PublicationDateYYYYMMDD | 2003-01-01 | 
    
| PublicationDate_xml | – year: 2003 text: 20030000  | 
    
| PublicationDecade | 2000 | 
    
| PublicationTitle | 2003 19th International Conference on Data Engineering | 
    
| PublicationTitleAbbrev | ICDE | 
    
| PublicationYear | 2003 | 
    
| Publisher | IEEE | 
    
| Publisher_xml | – name: IEEE | 
    
| SSID | ssj0000453984 | 
    
| Score | 1.7842149 | 
    
| Snippet | A plethora of data sources contain data entities that could be ordered according to a variety of attributes associated with the entities. Such orderings result... | 
    
| SourceID | ieee | 
    
| SourceType | Publisher | 
    
| StartPage | 277 | 
    
| SubjectTerms | Airports Availability Data engineering Delay effects Performance analysis Proposals Quality of service Query processing  | 
    
| Title | Ranked join indices | 
    
| URI | https://ieeexplore.ieee.org/document/1260799 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8MgGCbbTp782Izf6cGj7aBQKOe5ZZrMGOOS3ZYCL8lc0hnTXvz1vmXdjMaDN8qBAM3L8_B-PBByy4y00goWy0LbWMjM4jkodYxYnHPuM2mLxjUwe5LTuXhcZIsOudvXwgBASD6DpGmGWL7b2LpxlQ0Zkm-ldZd0VS63tVp7fwpSExxehJt5TtFsZLYT2Nl9yzaqyagePozux0ENNGkH_fG6SgCXySGZ7aa1zSlZJ3VlEvv5S7Hxv_M-IoPvMr7oeQ9Qx6QDZZ-cvBTlGlz0tlmVUROwxoNiQOaT8etoGrcvI8QrpngVB1F2YVK0tiJVYJn0EpmeU94JVlgKhUHbtN7hdjuLKK6tt5x6aTLgXFN-SnrlpoQzEgFoYEAdV8YJL1JDpeUKaZFKkXzn7pz0mwUt37fiF8t2LRd_d1-Sg5DtFnwUV6RXfdRwjahdmZvwu74AOr-QZg | 
    
| linkProvider | IEEE | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NT8IwFH9BPOjJDzB-u4NHN9a16-gZIaBAjIGEG1nbtwRJhjHj4l_vWxkYjQdvXQ9bP_L6--19_Apwz7Q00gjmy1QZX8jY0DkolU9Y3OY8i6VJS9fAaCz7U_E0i2c1eNjVwiCiSz7DoGy6WL5dmXXpKmsxIt-JUnuwHwsh4k211s6jQuSEPiDcv3k7JMOR8VZiZ_ssq7gmC1Vr0HnsOj3QoHrtj_tVHLz0jmC0Hdgmq2QZrAsdmM9fmo3_HfkxNL8L-byXHUSdQA3zBpy-pvkSrfe2WuReGbKmo6IJ01530un71d0I_oIlvPCdLLvQEdlbGiVomMwkcT2bZFaw1ISYarJOk1lacGsIx5XJDA8zqWPkXIX8DOr5Ksdz8BAVMgwtT7QVmYh0KA1PiBglEdHvtr2ARjmh-ftG_mJezeXy7-47OOhPRsP5cDB-voJDl_vmPBbXUC8-1nhDGF7oW7d1Xwelk7M | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2003+19th+International+Conference+on+Data+Engineering&rft.atitle=Ranked+join+indices&rft.au=Tsaparas%2C+P.&rft.au=Palpanas%2C+T.&rft.au=Kotidis%2C+Y.&rft.au=Koudas%2C+N.&rft.date=2003-01-01&rft.pub=IEEE&rft.isbn=9780780376656&rft.spage=277&rft.epage=288&rft_id=info:doi/10.1109%2FICDE.2003.1260799&rft.externalDocID=1260799 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780780376656/lc.gif&client=summon&freeimage=true | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780780376656/mc.gif&client=summon&freeimage=true | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780780376656/sc.gif&client=summon&freeimage=true |