Query Expansion of Zero-Hit Subject Searches: Using a Thesaurus in Conjunction with NLP Techniques

The focus of our study is zero-hit queries in keyword subject searches and the effort of increasing recall in these cases by reformulating and, then, expanding the initial queries using an external source of knowledge, namely a thesaurus. To this end, the objectives of this study are twofold. First,...

Full description

Saved in:
Bibliographic Details
Published inLecture notes in computer science pp. 433 - 438
Main Authors Kapidakis, Sarantos, Mastora, Anna, Peponakis, Manolis
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2012
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN9783642332890
3642332897
ISSN0302-9743
1611-3349
1611-3349
DOI10.1007/978-3-642-33290-6_48

Cover

Abstract The focus of our study is zero-hit queries in keyword subject searches and the effort of increasing recall in these cases by reformulating and, then, expanding the initial queries using an external source of knowledge, namely a thesaurus. To this end, the objectives of this study are twofold. First, we perform the mapping of query terms to the thesaurus terms. Second, we use the matched terms to expand the user’s initial query by taking advantage of the thesaurus relations and implementing natural language processing (NLP) techniques. We report on the overall procedure and elaborate on key points and considerations of each step of the process.
AbstractList The focus of our study is zero-hit queries in keyword subject searches and the effort of increasing recall in these cases by reformulating and, then, expanding the initial queries using an external source of knowledge, namely a thesaurus. To this end, the objectives of this study are twofold. First, we perform the mapping of query terms to the thesaurus terms. Second, we use the matched terms to expand the user’s initial query by taking advantage of the thesaurus relations and implementing natural language processing (NLP) techniques. We report on the overall procedure and elaborate on key points and considerations of each step of the process.
Author Kapidakis, Sarantos
Peponakis, Manolis
Mastora, Anna
Author_xml – sequence: 1
  givenname: Sarantos
  surname: Kapidakis
  fullname: Kapidakis, Sarantos
  email: sarantos@ionio.gr
  organization: Laboratory on Digital Libraries & Electronic Publishing, Archives & Library Science Department, Ionian University, Corfu, Greece
– sequence: 2
  givenname: Anna
  surname: Mastora
  fullname: Mastora, Anna
  organization: Laboratory on Digital Libraries & Electronic Publishing, Archives & Library Science Department, Ionian University, Corfu, Greece
– sequence: 3
  givenname: Manolis
  surname: Peponakis
  fullname: Peponakis, Manolis
  organization: National Hellenic Research Foundation / National Documentation Centre, Athens, Greece
BookMark eNp9kMtOwzAQRc1LokD_gIV_wDCOHTtmh6rykCoeomzYWI47aVOKU-JGpX-PW1iwYjZXc-feWZwTchiagIScc7jgAPrS6IIJpmTGhMgMMGVlsUf6yRbJ3Hlqn_S44jwlpDn4eysMHJIeCMiY0VIck36Mc0ijlVage6R87rDd0OHX0oVYN4E2FX3DtmF39Yq-dOUcfVJ0rZ9hvKKvsQ5T6ug4ba5ru0jrQAdNmHfBr7b1db2a0YfREx2jn4X6s8N4Ro4qt4jY_9VT8nozHA_u2Ojx9n5wPWKR56ZgBahsAkYqnEheoc5L8JKbTCCUufJOSy9L7XGCzqPOeCEnKjeVLitXAORKnJL8528Xlm6zdouFXbb1h2s3loPdsrQJjBU2obE7bnbLMvWyn15M8TDF1pZN8x7_L30DdD1zoA
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2012
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2012
DBID ABOKW
UNPAY
DOI 10.1007/978-3-642-33290-6_48
DatabaseName Unpaywall for CDI: Monographs and Miscellaneous Content
Unpaywall
DatabaseTitleList
Database_xml – sequence: 1
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
Computer Science
EISBN 9783642332906
3642332900
EISSN 1611-3349
Editor Loizides, Fernando
Zaphiris, Panayiotis
Rasmussen, Edie
Buchanan, George
Editor_xml – sequence: 1
  givenname: Panayiotis
  surname: Zaphiris
  fullname: Zaphiris, Panayiotis
  email: panayiotis.zaphiris@cut.ac.cy
– sequence: 2
  givenname: George
  surname: Buchanan
  fullname: Buchanan, George
  email: george.buchanan.1@city.ac.uk
– sequence: 3
  givenname: Edie
  surname: Rasmussen
  fullname: Rasmussen, Edie
  email: edie.rasmussen@ubc.ca
– sequence: 4
  givenname: Fernando
  surname: Loizides
  fullname: Loizides, Fernando
  email: fernando.loizides@gmail.com
EndPage 438
ExternalDocumentID oai:lekythos.library.ucy.ac.cy:10797/13690
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RNI
RSU
SVGTG
VI1
~02
ABOKW
UNPAY
ID FETCH-LOGICAL-s1598-8062d0946ed41fe75b0c41923e0b56ca74c4b7cedeace72184d659f7bfa800563
IEDL.DBID UNPAY
ISBN 9783642332890
3642332897
ISSN 0302-9743
1611-3349
IngestDate Sun Oct 26 03:59:35 EDT 2025
Wed Sep 17 02:39:47 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
License other-oa
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-s1598-8062d0946ed41fe75b0c41923e0b56ca74c4b7cedeace72184d659f7bfa800563
OpenAccessLink https://proxy.k.utb.cz/login?url=http://hdl.handle.net/10797/13690
PageCount 6
ParticipantIDs unpaywall_primary_10_1007_978_3_642_33290_6_48
springer_books_10_1007_978_3_642_33290_6_48
PublicationCentury 2000
PublicationDate 2012
PublicationDateYYYYMMDD 2012-01-01
PublicationDate_xml – year: 2012
  text: 2012
PublicationDecade 2010
PublicationPlace Berlin, Heidelberg
PublicationPlace_xml – name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle Second International Conference, TPDL 2012, Paphos, Cyprus, September 23-27, 2012. Proceedings
PublicationTitle Lecture notes in computer science
PublicationYear 2012
Publisher Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0000767607
ssj0002792
Score 1.4053333
Snippet The focus of our study is zero-hit queries in keyword subject searches and the effort of increasing recall in these cases by reformulating and, then, expanding...
SourceID unpaywall
springer
SourceType Open Access Repository
Publisher
StartPage 433
SubjectTerms NLP techniques
Query expansion
Thesaurus
Zero-hit queries
Title Query Expansion of Zero-Hit Subject Searches: Using a Thesaurus in Conjunction with NLP Techniques
URI http://link.springer.com/10.1007/978-3-642-33290-6_48
http://hdl.handle.net/10797/13690
UnpaywallVersion submittedVersion
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Na8JAEB2sHkp7sJ_U0soeeiuxmmw2pjcRRUTEghbbS9hNNmArieSD1v76zprESksLPYeFsG93Zt7OzBuAG_RinCKZ1bjftDSqc67xtmtrwvbQ43jMt7Mq3zEbzOhwbs5LUMwn_CYvgNzEtu5aBlK4PagwE8PtMlRm40nnKcsO6JqdF9EzJcdnUHunPS7L_GOAjR90G1mSo6b8bBOeB7CfBiu-fuPL5Y5P6Ve_OnOyUpLXRpqIhvvxU6jx9989gkPVrEBUFwFu0DGUZHAC1WJUA8lv7imIh1RGa9J7x8uv3sdI6JNnGYXaYJEQNB7qNYZkpccyviebQgLCCR6imKdRGpNFQLph8IJeUCFJ1PMtGY8mZFpowMZnMOv3pt2Blo9X0GKMYdrom5juIbtj0qMtX1qmaLoqJ2zIpjCZyy3qUmG50kPbLC1FBT1EwLeEz9tKQdQ4h3IQBvICiCmYznXq8qakuApDlpZwqUF9D80XM80a3Ba77ijyEDuFWjJi5BgOYuRsMHIURjVobIFxVpnoxp8LLv-74ArKSZTKa4wnElGHSqc3HD3W85P1CYQMxMc
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3fS8MwEA5zexB9mD9xopIH3ySza9N09U3GxhAZEzaYvoQkTUEd7egPdP71XtZ2DkXB5xIo-ZK7-3J33yF0CV5MUCCzRISWR6gtBBFd5RPpB-BxAhb6RZXviA2n9G7mzmqomk_4TV4AuInvXXccoHBbqMFcCLfrqDEdjW8fi-yATfyyiJ4ZOT6H-hvtcUXmHwJs-GD7wJK4mfKzTnjuoO08Wojlm5jPN3zKoPnVmVOUkry280y21cdPocbff3cP7ZpmBWy6CGCD9lFNRweoWY1qwOXNPUTyIdfJEvff4fKb9zEch_hJJzEZPmcYjId5jcFF6bFOb_CqkAALDIcoFXmSp_g5wr04egEvaJDE5vkWj-7HeFJpwKZHaDroT3pDUo5XICnEMF3wTcwOgN0xHdBOqD1XWsrkhB1tSZcp4VFFpad0ALZZe4YKBoBA6MlQdI2CqHOM6lEc6ROEXclsYVMlLE1hFYQsHamoQ8MAzBdz3Ra6qnadG_KQ8kotGTDiDgeM-AojbjBqofYaGL4oRDf-XHD63wVnqJ4luT6HeCKTF-WJ-gRlx8My
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Theory+and+Practice+of+Digital+Libraries&rft.au=Kapidakis%2C+Sarantos&rft.au=Mastora%2C+Anna&rft.au=Peponakis%2C+Manolis&rft.atitle=Query+Expansion+of+Zero-Hit+Subject+Searches%3A+Using+a+Thesaurus+in+Conjunction+with+NLP+Techniques&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2012-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642332890&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=433&rft.epage=438&rft_id=info:doi/10.1007%2F978-3-642-33290-6_48
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon