A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data

One of the most common tasks nowadays in big data environments is the need to classify large amounts of data. There are numerous classification models designed to perform best in different environments and datasets, each with its advantages and disadvantages. However, when dealing with big data, the...

Full description

Saved in:

Bibliographic Details
Published in	Computation Vol. 8; no. 3; p. 80
Main Authors	Kalyvas, Christos, Maragoudakis, Manolis
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.09.2020
Subjects	Algorithms Big Data Classification Data mining Datasets decision boundary Genetic algorithms Internet of Things Methods Neural networks Optimization Performance degradation Principal components analysis Queries skyline Support vector machines
Online Access	Get full text
ISSN	2079-3197 2079-3197
DOI	10.3390/computation8030080

Cover

More Information
Summary:	One of the most common tasks nowadays in big data environments is the need to classify large amounts of data. There are numerous classification models designed to perform best in different environments and datasets, each with its advantages and disadvantages. However, when dealing with big data, their performance is significantly degraded because they are not designed—or even capable—of handling very large datasets. The current approach is based on a novel proposal of exploiting the dynamics of skyline queries to efficiently identify the decision boundary and classify big data. A comparison against the popular k-nearest neighbor (k-NN), support vector machines (SVM) and naïve Bayes classification algorithms shows that the proposed method is faster than the k-NN and the SVM. The novelty of this method is based on the fact that only a small number of computations are needed in order to make a prediction, while its full potential is revealed in very large datasets.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2079-3197 2079-3197
DOI:	10.3390/computation8030080