Boosting for Text Classification with Semantic Features
Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic level than single words for text classification purposes. In this paper we propose such an enhancement of the classical document...
Saved in:
Published in | Advances in Web Mining and Web Usage Analysis pp. 149 - 166 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Berlin, Heidelberg
Springer Berlin Heidelberg
2006
|
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3540471278 9783540471271 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/11899402_10 |
Cover
Summary: | Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic level than single words for text classification purposes. In this paper we propose such an enhancement of the classical document representation through concepts extracted from background knowledge. Boosting, a successful machine learning technique is used for classification. Comparative experimental evaluations in three different settings support our approach through consistent improvement of the results. An analysis of the results shows that this improvement is due to two separate effects. |
---|---|
ISBN: | 3540471278 9783540471271 |
ISSN: | 0302-9743 1611-3349 |
DOI: | 10.1007/11899402_10 |