Multigranulation Relative Entropy-Based Mixed Attribute Outlier Detection in Neighborhood Systems


Bibliographic Details
Published in: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol. 52, no. 8, pp. 5175-5187
Main Authors: Yuan, Zhong; Chen, Hongmei; Li, Tianrui; Zhang, Xianyong; Sang, Binbin
Format: Journal Article
Language: English
Published: New York: IEEE, 01.08.2022
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
ISSN: 2168-2216, 2168-2232
DOI: 10.1109/TSMC.2021.3119119

Summary: Outlier detection is widely used in fields such as intrusion detection, credit card fraud detection, and medical diagnosis. Most existing outlier detection algorithms are designed for either numeric or categorical attributes. However, real-world data usually contain mixed attributes. In this article, we propose a novel mixed attribute outlier detection method based on multigranulation relative entropy by employing the neighborhood rough set. First, the neighborhood system is constructed by optimizing the mixed distance metric and the radius of the statistical value. Second, the neighborhood entropy is introduced as an uncertainty measure of data. Furthermore, three kinds of multigranulation relative entropy-based matrices are defined by three kinds of attribute sequences, and the multigranulation relative entropy-based outlier factor is integrated to indicate the outlier degree of every object. Based on the proposed outlier detection model, the corresponding algorithm is designed. Finally, the proposed algorithm is compared with nine other algorithms through experiments on public data. The experimental results show that the proposed technique is adaptive and effective.
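The first two steps named in the summary (a neighborhood system over mixed attributes, then neighborhood entropy as an uncertainty measure) can be illustrated with a minimal Python sketch. Everything specific here is an assumption and not taken from the article: the distance is a simple HEOM-style mix (scaled absolute difference for numeric columns, 0/1 mismatch for integer-coded categorical columns), the radius is a fixed constant and not derived from a statistical value, the entropy is the commonly used form -(1/n) Σ log2(|δ(x_i)|/n), and the per-object score is a simple stand-in, not the multigranulation relative entropy-based outlier factor.

```python
import numpy as np

def mixed_distance(X_num, X_cat):
    """Pairwise distance over mixed attributes (assumed HEOM-style metric)."""
    X_num = np.asarray(X_num, dtype=float)
    X_cat = np.asarray(X_cat)
    # Min-max scale each numeric column to [0, 1]; constant columns map to 0.
    lo = X_num.min(axis=0)
    span = X_num.max(axis=0) - lo
    span[span == 0] = 1.0
    Z = (X_num - lo) / span
    d_num = np.abs(Z[:, None, :] - Z[None, :, :])                    # (n, n, p)
    d_cat = (X_cat[:, None, :] != X_cat[None, :, :]).astype(float)   # (n, n, q)
    d = np.concatenate([d_num, d_cat], axis=2)
    return np.sqrt((d ** 2).sum(axis=2))

def neighborhoods(D, radius):
    """Boolean matrix: row i marks the objects within `radius` of object i."""
    return D <= radius

def neighborhood_entropy(N):
    """Entropy of the neighborhood system: -(1/n) * sum(log2(|delta(x_i)| / n))."""
    n = N.shape[0]
    return float(-np.mean(np.log2(N.sum(axis=1) / n)))

def object_scores(N):
    """Stand-in outlier score: each object's own term in the entropy sum.
    Objects with small neighborhoods get large scores."""
    n = N.shape[0]
    return -np.log2(N.sum(axis=1) / n)

# Toy data (hypothetical): one numeric and one integer-coded categorical attribute.
X_num = [[1.0], [1.1], [0.9], [1.0], [5.0]]
X_cat = [[0], [0], [0], [0], [1]]

D = mixed_distance(X_num, X_cat)
N = neighborhoods(D, radius=0.2)   # fixed radius, chosen only for illustration
entropy = neighborhood_entropy(N)
scores = object_scores(N)
print("neighborhood sizes:", N.sum(axis=1))
print("most isolated object:", int(np.argmax(scores)))
```

In this toy data the last object differs from the rest on both attributes, so its neighborhood contains only itself and it receives the largest score. The article's method goes further by building relative entropy matrices over three attribute sequences, which this sketch does not attempt.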