Topic based language models for ad hoc information retrieval

We propose a topic based approach to language modelling for ad-hoc information retrieval (IR). Many smoothed estimators used for the multinomial query model in IR rely upon the estimated background collection probabilities. In this paper, we propose a topic based language modelling approach, that us...

Bibliographic Details
Published in 2004 IEEE International Joint Conference on Neural Networks, Vol. 4, pp. 3281-3286
Main Authors Azzopardi, L., Girolami, M., van Rijsbergen, C.J.
Format Conference Proceeding
Language English
Published Piscataway NJ IEEE 2004
ISBN 0780383591
9780780383593
ISSN 1098-7576
DOI 10.1109/IJCNN.2004.1381205

Abstract We propose a topic based approach to language modelling for ad-hoc information retrieval (IR). Many smoothed estimators used for the multinomial query model in IR rely upon the estimated background collection probabilities. In this paper, we propose a topic based language modelling approach that uses a more informative prior based on the topical content of a document. In our experiments, the proposed model provides IR performance comparable to the standard models, but when combined in a two stage language model, it outperforms all other estimated models.
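The abstract's point that smoothed estimators depend on background collection probabilities can be illustrated with a minimal sketch. It uses Jelinek-Mercer interpolation, one standard smoothing method, and is not the authors' topic based model. The function name `smoothed_query_log_likelihood`, the `background` mapping and the weight `lam` are illustrative assumptions, not names from the paper.

```python
import math
from collections import Counter


def smoothed_query_log_likelihood(query, doc, background, lam=0.5):
    """Log-likelihood of a query under a smoothed multinomial document model.

    Jelinek-Mercer interpolation (an illustrative choice, not the paper's
    estimator): p(t|d) = (1 - lam) * tf(t, d) / |d| + lam * background[t].
    `background` maps each term to its prior probability; here it is the
    collection model, but any other prior could be passed instead.
    """
    doc_tf = Counter(doc)
    doc_len = len(doc)
    score = 0.0
    for term in query:
        p_doc = doc_tf[term] / doc_len if doc_len else 0.0
        p = (1 - lam) * p_doc + lam * background.get(term, 0.0)
        if p == 0.0:
            # Term unseen in both the document and the prior.
            return float("-inf")
        score += math.log(p)
    return score


# Toy collection of two tokenised documents (made-up data).
docs = [
    ["topic", "model", "retrieval"],
    ["neural", "network", "training"],
]

# Background collection probabilities: relative term frequency overall.
all_terms = [t for d in docs for t in d]
collection_tf = Counter(all_terms)
background = {t: c / len(all_terms) for t, c in collection_tf.items()}

query = ["topic", "retrieval"]
scores = [smoothed_query_log_likelihood(query, d, background) for d in docs]
ranking = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
```

In this sketch the second document still receives a finite score for query terms it does not contain, because the background model supplies the missing probability mass. A more informative prior of the kind the abstract describes would, under this framing, be supplied through the `background` argument; how that prior is estimated is not specified here.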
Author_xml – sequence: 1
  givenname: L.
  surname: Azzopardi
  fullname: Azzopardi, L.
  organization: Sch. of ICT, Paisley Univ., UK
– sequence: 2
  givenname: M.
  surname: Girolami
  fullname: Girolami, M.
– sequence: 3
  givenname: C.J.
  surname: van Rijsbergen
  fullname: van Rijsbergen, C.J.
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17623984 (View record in Pascal Francis)
ContentType Conference Proceeding
Copyright 2006 INIST-CNRS
Copyright_xml – notice: 2006 INIST-CNRS
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
Pascal-Francis
Discipline Anatomy & Physiology
Computer Science
Applied Sciences
EndPage 3286
ExternalDocumentID 17623984
1381205
Genre orig-research
IsPeerReviewed false
IsScholarly false
Keywords Information retrieval
Modelling language
Neural network
Probabilistic approach
Database query
Modeling
Language English
License CC BY 4.0
MeetingName 2004 International Joint Conference on Neural Networks (proceedings)
PublicationCentury 2000
PublicationDate 20040000
2004
PublicationDateYYYYMMDD 2004-01-01
PublicationDate_xml – year: 2004
  text: 20040000
PublicationDecade 2000
PublicationPlace Piscataway NJ
PublicationPlace_xml – name: Piscataway NJ
PublicationTitle 2004 IEEE International Joint Conference on Neural Networks
PublicationTitleAbbrev IJCNN
PublicationYear 2004
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 3281
SubjectTerms Applied sciences
Artificial intelligence
Computer science; control theory; systems
Exact sciences and technology
Hidden Markov models
Information retrieval
Information systems. Data bases
Mathematical model
Memory organisation. Data processing
Optical computing
Smoothing methods
Software
Vocabulary
Title Topic based language models for ad hoc information retrieval
URI https://ieeexplore.ieee.org/document/1381205
Volume 4