Boosting for Text Classification with Semantic Features

Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic level than single words for text classification purposes. In this paper we propose such an enhancement of the classical document...

Full description

Saved in:

Bibliographic Details
Published in	Advances in Web Mining and Web Usage Analysis pp. 149 - 166
Main Authors	Bloehdorn, Stephan, Hotho, Andreas
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg 2006
Series	Lecture Notes in Computer Science
Subjects	Feature Representation Lexical Entry Noun Phrase Semantic Feature Weak Learner
Online Access	Get full text
ISBN	3540471278 9783540471271
ISSN	0302-9743 1611-3349
DOI	10.1007/11899402_10

Cover

More Information
Summary:	Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic level than single words for text classification purposes. In this paper we propose such an enhancement of the classical document representation through concepts extracted from background knowledge. Boosting, a successful machine learning technique is used for classification. Comparative experimental evaluations in three different settings support our approach through consistent improvement of the results. An analysis of the results shows that this improvement is due to two separate effects.
ISBN:	3540471278 9783540471271
ISSN:	0302-9743 1611-3349
DOI:	10.1007/11899402_10