Decision tree classification algorithm for non-equilibrium data set based on random forests
Published in | Journal of Intelligent & Fuzzy Systems, Vol. 39, no. 2, pp. 1639-1648 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published | London, England: SAGE Publications (Sage Publications Ltd), 01.01.2020 |
ISSN | 1064-1246 1875-8967 |
DOI | 10.3233/JIFS-179937 |
Summary: | To address the poor accuracy and high complexity of current classification algorithms for non-equilibrium (class-imbalanced) data sets, this paper proposes a decision tree classification algorithm for non-equilibrium data sets based on random forests. Wavelet packet decomposition is used to denoise the non-equilibrium data, and the SNM algorithm is combined with RFID to remove redundant data from the data sets. The preprocessed data sets are then classified with the random forest method: for each sample subset, majority and minority samples are drawn by Bootstrap resampling under certain constraints, CART is trained on the resampled data to construct a decision tree, and the final classification result is obtained by voting over the CART decision trees. Experimental results show that the proposed algorithm achieves high classification accuracy with low complexity, making it a feasible classification algorithm for non-equilibrium data sets. |
---|---|
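
The classification stage described in the summary (constrained Bootstrap resampling of majority and minority samples, one CART tree per resample, and a final vote) can be illustrated with a minimal pure-Python sketch. It is not the paper's implementation. The record does not state the resampling constraints, so the sketch assumes the simplest one: every class contributes the same number of draws as the smallest class. It also omits the wavelet packet denoising and SNM/RFID redundancy removal, and it skips the per-split feature subsampling that full random forests normally use. All function names and the toy data are hypothetical.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values()) if n else 0.0

def best_split(X, y):
    """Return the (feature, threshold) that lowers weighted Gini most, or None."""
    best, best_score, n = None, gini(y), len(y)
    for f in range(len(X[0])):
        for t in sorted({row[f] for row in X}):
            left = [y[i] for i in range(n) if X[i][f] <= t]
            right = [y[i] for i in range(n) if X[i][f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score - 1e-12:
                best, best_score = (f, t), score
    return best

def build_tree(X, y, depth=0, max_depth=5):
    """Grow a CART-style tree; internal nodes are tuples, leaves are labels.
    Labels must therefore not be tuples themselves."""
    split = best_split(X, y) if depth < max_depth else None
    if split is None:
        return Counter(y).most_common(1)[0][0]  # leaf: most common label
    f, t = split
    left = [i for i in range(len(y)) if X[i][f] <= t]
    right = [i for i in range(len(y)) if X[i][f] > t]
    return (f, t,
            build_tree([X[i] for i in left], [y[i] for i in left], depth + 1, max_depth),
            build_tree([X[i] for i in right], [y[i] for i in right], depth + 1, max_depth))

def predict_tree(node, row):
    """Walk from the root to a leaf and return its label."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def balanced_bootstrap(X, y, rng):
    """ASSUMED constraint: draw, with replacement, as many samples from each
    class as the smallest class contains."""
    by_class = {}
    for i, label in enumerate(y):
        by_class.setdefault(label, []).append(i)
    k = min(len(idx) for idx in by_class.values())
    picks = [rng.choice(idx) for idx in by_class.values() for _ in range(k)]
    return [X[i] for i in picks], [y[i] for i in picks]

def fit_forest(X, y, n_trees=15, seed=0):
    """Train one tree per balanced bootstrap sample."""
    rng = random.Random(seed)
    return [build_tree(*balanced_bootstrap(X, y, rng)) for _ in range(n_trees)]

def predict_forest(forest, row):
    """Final label by majority vote over the trees."""
    return Counter(predict_tree(t, row) for t in forest).most_common(1)[0][0]

# Toy imbalanced data (hypothetical): 40 majority samples, 5 minority samples.
X = [[i * 0.1, (i % 5) * 0.1] for i in range(40)] + [[5.0 + j * 0.1, 0.2] for j in range(5)]
y = [0] * 40 + [1] * 5
forest = fit_forest(X, y)
majority_vote = predict_forest(forest, [0.0, 0.0])
minority_vote = predict_forest(forest, [5.2, 0.2])
```

Balancing each bootstrap sample keeps any single tree from being dominated by the majority class, which is one common way to read "resampling with constraints". A different constraint, such as a fixed majority-to-minority ratio, would only change `balanced_bootstrap`.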