Pre-Cutoff Value Calculation Method for Accelerating Metric Space Outlier Detection

Outlier detection is an important data mining technique. In this article, the triangle inequality of distances is leveraged to design a pre-cutoff value (PCV) algorithm that calculates the outlier degree pre-threshold without additional distance computations. This algorithm is suitable for accelerat...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of grid and high performance computing Vol. 16; no. 1; pp. 1 - 17
Main Authors Xu, Honglong, Liang, Zhonghao, Huang, Kaide, Huang, Guoshun, He, Yan
Format Journal Article
LanguageEnglish
Published Hershey IGI Global 01.01.2024
Subjects
Online AccessGet full text
ISSN1938-0259
1938-0267
1938-0267
DOI10.4018/IJGHPC.334125

Cover

More Information
Summary:Outlier detection is an important data mining technique. In this article, the triangle inequality of distances is leveraged to design a pre-cutoff value (PCV) algorithm that calculates the outlier degree pre-threshold without additional distance computations. This algorithm is suitable for accelerating various metric space outlier detection algorithms. Experimental results on multiple real datasets demonstrate that the PCV algorithm reduces the runtime and number of distance computations for the iORCA algorithm by 14.59% and 15.73%, respectively. Even compared to the new high-performance algorithm ADPOD, the PCV algorithm achieves 1.41% and 0.45% reductions. Notably, the non-outlier exclusion for the first data block in the dataset is significantly improved, with an exclusion rate of up to 36.5%, leading to a 23.54% reduction in detection time for that data block. While demonstrating excellent results, the PCV algorithm maintains the data type generality of metric space algorithms.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1938-0259
1938-0267
1938-0267
DOI:10.4018/IJGHPC.334125