IGTree: Using Trees for Compression and Classification in Lazy Learning Algorithms

We describe the IGTree learning algorithm, which compresses an instance base into a tree structure. The concept of information gain is used as a heuristic function for performing this compression. IGTree produces trees that, compared to other lazy learning approaches, reduce storage requirements and...

Full description

Saved in:
Bibliographic Details
Published inThe Artificial intelligence review Vol. 11; no. 1-5; pp. 407 - 423
Main Authors Daelemans, Walter, Van Den Bosch, Antal, Weijters, Ton
Format Journal Article
LanguageEnglish
Published Dordrecht Springer Nature B.V 01.02.1997
Subjects
Online AccessGet full text
ISSN0269-2821
1573-7462
DOI10.1023/A:1006506017891

Cover

More Information
Summary:We describe the IGTree learning algorithm, which compresses an instance base into a tree structure. The concept of information gain is used as a heuristic function for performing this compression. IGTree produces trees that, compared to other lazy learning approaches, reduce storage requirements and the time required to compute classifications. Furthermore, we obtained similar or better generalization accuracy with IGTree when trained on two complex linguistic tasks, viz. letter-phoneme transliteration and part-of-speech-tagging, when compared to alternative lazy learning and decision tree approaches (viz., IB1, information-gain-weighted IB1, and C4.5). A third experiment, with the task of word hyphenation, demonstrates that when the mutual differences in information gain of features is too small, IGTree as well as information-gain-weighted IB1 perform worse than IB1. These results indicate that IGTree is a useful algorithm for problems characterized by the availability of a large number of training instances described by symbolic features with sufficiently differing information gain values.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:0269-2821
1573-7462
DOI:10.1023/A:1006506017891