Solution Methods for Classification Problems with Categorical Attributes

The article considers various methods for classification of a set of objects into two classes when all the attributes are categorical (nominal or factor attributes), i.e., describe the membership of an object in a category. Some methods are a simple generalization of classical methods (Bayesian algo...

Full description

Saved in:
Bibliographic Details
Published inComputational mathematics and modeling Vol. 26; no. 3; pp. 408 - 428
Main Author D’yakonov, A. G.
Format Journal Article
LanguageEnglish
Published New York Springer US 01.07.2015
Subjects
Online AccessGet full text
ISSN1046-283X
1573-837X
DOI10.1007/s10598-015-9281-2

Cover

More Information
Summary:The article considers various methods for classification of a set of objects into two classes when all the attributes are categorical (nominal or factor attributes), i.e., describe the membership of an object in a category. Some methods are a simple generalization of classical methods (Bayesian algorithms, singular decomposition methods), others are fundamentally novel. An efficient technique is proposed for encoding categorical attributes by real numbers, which makes it possible to apply classical machine-learning methods (e.g., the random forest). A generalization of the k nearest neighbors (kNN) algorithm and Zhuravlev’s estimate calculation algorithm (AEC) achieve best performance on real-life data. All methods have been tested on an applied problem involving construction of a recommender system for a security service.
ISSN:1046-283X
1573-837X
DOI:10.1007/s10598-015-9281-2