Combined Syntactic and Semantic Kernels for Text Classification
The exploitation of syntactic structures and semantic background knowledge has always been an appealing subject in the context of text retrieval and information management. The usefulness of this kind of information has been shown most prominently in highly specialized tasks, such as classification...
Saved in:
| Published in | Advances in Information Retrieval Vol. 4425; pp. 307 - 318 |
|---|---|
| Main Authors | , , |
| Format | Book Chapter |
| Language | English |
| Published |
Germany
Springer Berlin / Heidelberg
2007
Springer Berlin Heidelberg |
| Series | Lecture Notes in Computer Science |
| Subjects | |
| Online Access | Get full text |
| ISBN | 9783540714941 3540714944 |
| ISSN | 0302-9743 1611-3349 |
| DOI | 10.1007/978-3-540-71496-5_29 |
Cover
| Summary: | The exploitation of syntactic structures and semantic background knowledge has always been an appealing subject in the context of text retrieval and information management. The usefulness of this kind of information has been shown most prominently in highly specialized tasks, such as classification in Question Answering (QA) scenarios. So far, however, additional syntactic or semantic information has been used only individually. In this paper, we propose a principled approach for jointly exploiting both types of information. We propose a new type of kernel, the Semantic Syntactic Tree Kernel (SSTK), which incorporates linguistic structures, e.g. syntactic dependencies, and semantic background knowledge, e.g. term similarity based on WordNet, to automatically learn question categories in QA. We show the power of this approach in a series of experiments with a well known Question Classification dataset. |
|---|---|
| ISBN: | 9783540714941 3540714944 |
| ISSN: | 0302-9743 1611-3349 |
| DOI: | 10.1007/978-3-540-71496-5_29 |