An Innovative Model for Extracting OLAP Cubes from NOSQL Database Based on Scalable Naïve Bayes Classifier

Bibliographic Details
Published in	Mathematical problems in engineering Vol. 2022; pp. 1 - 11
Main Authors	Davardoost, Farnaz; Babazadeh Sangar, Amin; Majidzadeh, Kambiz
Format	Journal Article
Language	English
Published	New York: Hindawi; John Wiley & Sons, Inc; 11.04.2022
Subjects	Algorithms; Big Data; Classifiers; Cubes; Data analysis; Data management; Documents; Information management; Intelligence (information); Methods; Online analytical processing; Relational data bases; Social networks
ISSN	1024-123X, 1026-7077, 1563-5147
DOI	10.1155/2022/2860735

More Information
Summary:	Because data are increasingly large and unstructured, relational databases are no longer well suited to managing them. As a result, new databases known as NOSQL have been introduced. The issue is that such databases are difficult to analyze. Online analytical processing (OLAP) is the foundational technology for data analysis in business intelligence. Because OLAP technologies were designed primarily for relational database systems, performing OLAP on NOSQL is difficult. In this article, we present a model for extracting OLAP cubes from a document-oriented NOSQL database, using a scalable Naïve Bayes classifier. The proposed solution is divided into three stages: preparation, Naïve Bayes, and NBMR. Our proposed algorithm, NBMR, is based on the Naïve Bayes classifier (NBC) and the MapReduce (MR) programming model. NOSQL database documents with nearly the same attributes are assigned to the same class, so that OLAP cubes can be used to perform data analysis. Because the proposed model allows for distributed and parallel Naïve Bayes classifier computation, it is suitable for large-scale data sets. The proposed model is an efficient approach in terms of speed and the reduced number of required comparisons.
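The summary's idea of grouping documents with nearly the same attributes by combining a Naïve Bayes classifier with map and reduce steps can be illustrated with a minimal, single-process sketch. This is not the authors' NBMR algorithm, whose details are not given in this record. The sample documents, the class labels `sale` and `customer`, the use of attribute names as features, the Laplace smoothing, and all function names are assumptions made purely for illustration.

```python
import math
from collections import defaultdict

def map_doc(label, doc):
    """Map step: emit one count for the class and one per attribute name."""
    yield ("class", label), 1
    for attr in doc:
        yield ("attr", label, attr), 1

def reduce_counts(pairs):
    """Reduce step: sum the emitted counts per key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return totals

def train(labeled_docs):
    """Run the map step over every labeled document, then reduce."""
    pairs = (kv for label, doc in labeled_docs for kv in map_doc(label, doc))
    return reduce_counts(pairs)

def classify(counts, doc, vocab):
    """Return the class with the highest Naive Bayes log-score for doc."""
    classes = [key[1] for key in counts if key[0] == "class"]
    n_docs = sum(counts[("class", c)] for c in classes)
    best, best_score = None, -math.inf
    for c in classes:
        n_attr = sum(counts.get(("attr", c, a), 0) for a in vocab)
        score = math.log(counts[("class", c)] / n_docs)  # log prior
        for attr in doc:
            seen = counts.get(("attr", c, attr), 0)
            # Laplace smoothing (an assumption) avoids log(0) for unseen attributes.
            score += math.log((seen + 1) / (n_attr + len(vocab)))
        if score > best_score:
            best, best_score = c, score
    return best

# Hypothetical documents from a document-oriented store, with assumed labels.
labeled = [
    ("sale", {"product": "pen", "price": 2, "quantity": 10}),
    ("sale", {"product": "ink", "price": 5, "quantity": 1, "store": "A"}),
    ("customer", {"name": "Ann", "city": "Oslo"}),
    ("customer", {"name": "Bob", "city": "Rome", "age": 41}),
]
vocab = sorted({attr for _, doc in labeled for attr in doc})
counts = train(labeled)

print(classify(counts, {"product": "cap", "price": 9}, vocab))
print(classify(counts, {"name": "Cy", "age": 30}, vocab))
```

In a real MapReduce deployment the map calls would run in parallel over partitions of the collection and the framework would group keys before the reduce step; here both stages run in one process only to show the shape of the computation.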