A Multiple-Instance Learning Approach to Sentence Selection for Question Ranking

In example-based retrieval a system is queried with a document aiming to retrieve other similar or relevant documents. We address an instance of this problem: question retrieval in community Question Answering (cQA) forums. In this scenario, both the document collection and the queries are relativel...

Full description

Saved in:

Bibliographic Details
Published in	Advances in Information Retrieval Vol. 10193; pp. 437 - 449
Main Authors	Romeo, Salvatore, Da San Martino, Giovanni, Barrón-Cedeño, Alberto, Moschitti, Alessandro
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2017 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Mean Average Precision Multiple Instance Learn Parse Tree Sentence Pair Statistical Machine Translation
Online Access	Get full text
ISBN	3319566075 9783319566078
ISSN	0302-9743 1611-3349 1611-3349
DOI	10.1007/978-3-319-56608-5_34

Cover

More Information
Summary:	In example-based retrieval a system is queried with a document aiming to retrieve other similar or relevant documents. We address an instance of this problem: question retrieval in community Question Answering (cQA) forums. In this scenario, both the document collection and the queries are relatively short multi-sentence documents subject to noise and redundancy, which makes it harder for learning-to-rank algorithms to build upon the proper text representation. In order to only exploit the relevant fragments of the query and collection documents, we treat them as a sequence of sentences, in a multiple-instance learning fashion. By automatically pre-selecting the best sentences for our tree-kernel-based learning model, we improve over using full text performance on the dataset of the 2016 SemEval cQA challenge in terms of accuracy and speed, reaching the state of the art.
ISBN:	3319566075 9783319566078
ISSN:	0302-9743 1611-3349 1611-3349
DOI:	10.1007/978-3-319-56608-5_34