Parallelization of Top Algorithm Through a New Hybrid Recommendation System for Big Data in Spark Cloud Computing Framework

In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to re...

Full description

Saved in:
Bibliographic Details
Published inIEEE systems journal Vol. 15; no. 4; pp. 4876 - 4886
Main Authors El Handri, Kaoutar, Idrissi, Abdellah
Format Journal Article
LanguageEnglish
Published IEEE 01.12.2021
Subjects
Online AccessGet full text
ISSN1932-8184
1937-9234
DOI10.1109/JSYST.2020.3019368

Cover

Abstract In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to retrieve the most useful data objects in a set of choices. This problem is compounded by the use of <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> in cases of multiple dimensions and extensive data analytics. In this article, we provide a novel parallel algorithm in a distributed recommender system based on the Apache Spark platform. The purpose of this approach was to implement the multicriteria decision aiding support and dominating query approach run by using matrix factorization and singular value decomposition (SVD)-based model as a sophisticated machine learning technique. Simultaneously, applying the resilient distributed datasets paradigm in cloud computing, which presents a favorable environment for big data management. Extensive experimental results in terms of accuracy, and scalability indicated the new algorithm's advantage compared to other <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> algorithms. Accordingly, our recommender system based on the conceived algorithm achieved high precision (62%-82%, depending on the data) to verify the profoundly positive effect of the use of the Spark framework and the SVD-based model while applying the commonly used evaluation metrics in the recommendation systems.
AbstractList In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to retrieve the most useful data objects in a set of choices. This problem is compounded by the use of <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> in cases of multiple dimensions and extensive data analytics. In this article, we provide a novel parallel algorithm in a distributed recommender system based on the Apache Spark platform. The purpose of this approach was to implement the multicriteria decision aiding support and dominating query approach run by using matrix factorization and singular value decomposition (SVD)-based model as a sophisticated machine learning technique. Simultaneously, applying the resilient distributed datasets paradigm in cloud computing, which presents a favorable environment for big data management. Extensive experimental results in terms of accuracy, and scalability indicated the new algorithm's advantage compared to other <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> algorithms. Accordingly, our recommender system based on the conceived algorithm achieved high precision (62%-82%, depending on the data) to verify the profoundly positive effect of the use of the Spark framework and the SVD-based model while applying the commonly used evaluation metrics in the recommendation systems.
Author Idrissi, Abdellah
El Handri, Kaoutar
Author_xml – sequence: 1
  givenname: Kaoutar
  orcidid: 0000-0001-6732-2627
  surname: El Handri
  fullname: El Handri, Kaoutar
  email: kaoutar.elhandri@um5s.net.ma
  organization: Intelligent Processing Systems Team, Computer Science Laboratory, Department of Computer Science, Faculty of Sciences, Mohammed V University, Rabat, Morocco
– sequence: 2
  givenname: Abdellah
  orcidid: 0000-0001-5696-3007
  surname: Idrissi
  fullname: Idrissi, Abdellah
  email: idriab@gmail.com
  organization: Intelligent Processing Systems Team, Computer Science Laboratory, Department of Computer Science, Faculty of Sciences, Mohammed V University, Rabat, Morocco
BookMark eNp9kMtOwzAQRS1UJNrCD8DGP5DiR5LayxJoC6oA0bBgFU0TOzVN4shJVRV-nr7EggWrOxrp3NGcHupUtlIIXVMyoJTI26f5xzweMMLIgBMqeSjOUHeXQ08y7ncOM_MEFf4F6jXNJyGBCIayi75fwUFRqMJ8QWtsha3Gsa3xqMitM-2yxPHS2XW-xICf1QZPtwtnMvymUluWqsqO0HzbtKrE2jp8Z3J8Dy1gs1vX4FY4Kuw6w5Et63VrqhyPHZRqY93qEp1rKBp1dco-eh8_xNHUm71MHqPRzEspZdITSgAJwxR8LekizbTSwmfZIhU6lDqQWpFMABMSGOfCD0IpmfS1rxkXQhPgfcSOvamzTeOUTmpnSnDbhJJkry856Ev2-pKTvh0k_kCpaQ_ftg5M8T96c0SNUur3lqQyCIjkP6Qmgfw
CODEN ISJEB2
CitedBy_id crossref_primary_10_1016_j_engappai_2023_106602
crossref_primary_10_1155_2022_5811341
crossref_primary_10_3390_electronics13071386
crossref_primary_10_1007_s12652_022_03911_z
crossref_primary_10_1155_2021_7213246
crossref_primary_10_2174_2666255813999201109201006
crossref_primary_10_23939_ujit2022_02_068
Cites_doi 10.1137/S0895480102412856
10.1023/A:1006544522159
10.1201/9781351236423-24
10.1145/1189769.1189772
10.1145/3326128
10.14778/2824032.2824080
10.1109/CompComm.2018.8781032
10.1145/1142473.1142530
10.1109/ICIT.2015.7125310
10.1145/568518.568519
10.1007/s00158-017-1793-2
10.1007/s00158-003-0368-6
10.1504/IJITST.2020.104579
10.1109/ACCESS.2019.2891544
10.1145/1391729.1391730
10.1109/TKDE.2005.99
10.1109/ICDAR.1999.791887
10.1145/375551.375567
10.1145/3010089.3010125
10.1109/BigData.2016.7840649
10.5539/nct.v2n1p62
10.1504/IJWGS.2014.058759
10.1109/ICEEE.2014.6978334
10.1109/I-SPAN.2012.9
10.1007/s13369-020-04568-6
10.1109/BigDataService.2017.28
10.1109/MC.2009.263
10.1016/j.procs.2016.04.214
10.1016/S0022-0000(03)00026-6
10.1007/978-0-387-85820-3_8
10.1109/TKDE.2018.2790387
10.1007/978-981-13-1747-7_17
10.1145/2043932.2044001
10.1016/j.procs.2015.04.237
ContentType Journal Article
DBID 97E
RIA
RIE
AAYXX
CITATION
DOI 10.1109/JSYST.2020.3019368
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Xplore (NTUSG)
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 1937-9234
EndPage 4886
ExternalDocumentID 10_1109_JSYST_2020_3019368
9195509
Genre orig-research
GroupedDBID 0R~
29I
4.4
5GY
5VS
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACIWK
AENEX
AETIX
AGQYO
AGSQL
AHBIQ
AKJIK
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
HZ~
IFIPE
IPLJI
JAVBF
LAI
M43
O9-
OCL
RIA
RIE
RNS
AAYXX
CITATION
ID FETCH-LOGICAL-c1129-8e8a066ca4f91bcdfef842dbc8f69f59fe0d8a289a233845699294f4f2388f0a3
IEDL.DBID RIE
ISSN 1932-8184
IngestDate Thu Apr 24 23:06:02 EDT 2025
Wed Oct 01 02:25:51 EDT 2025
Wed Aug 27 02:33:11 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1129-8e8a066ca4f91bcdfef842dbc8f69f59fe0d8a289a233845699294f4f2388f0a3
ORCID 0000-0001-6732-2627
0000-0001-5696-3007
PageCount 11
ParticipantIDs crossref_primary_10_1109_JSYST_2020_3019368
crossref_citationtrail_10_1109_JSYST_2020_3019368
ieee_primary_9195509
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2021-Dec.
2021-12-00
PublicationDateYYYYMMDD 2021-12-01
PublicationDate_xml – month: 12
  year: 2021
  text: 2021-Dec.
PublicationDecade 2020
PublicationTitle IEEE systems journal
PublicationTitleAbbrev JSYST
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
References ref13
ref12
rousset (ref40) 2019; e 35
ref15
agrebi (ref32) 2018
ref14
fifa (ref48) 0
witwit (ref5) 2017
ayvazyan (ref29) 2012
ref11
ref10
ref17
ref16
ref19
el handri (ref9) 0
ref50
ref46
ref45
ref47
ref42
ref41
idrissi (ref37) 2014; 60
claypool (ref35) 1999
ref49
funk (ref43) 2006
ref8
ref7
ref4
ref3
badr (ref1) 2013
ref34
ref36
ref31
ref30
ref33
paterek (ref28) 0; 2007
ref2
anastasiu (ref44) 2016; 240
ref38
el handri (ref39) 0
ref23
ref26
ref25
chen (ref6) 2006
ref20
ref22
ref21
ref27
shoro (ref24) 2015; 15
im (ref18) 0
References_xml – ident: ref13
  doi: 10.1137/S0895480102412856
– ident: ref36
  doi: 10.1023/A:1006544522159
– ident: ref41
  doi: 10.1201/9781351236423-24
– year: 2017
  ident: ref5
  article-title: Non-weighted aggregate evaluation function of multi-objective optimization for knock engine modeling
– ident: ref11
  doi: 10.1145/1189769.1189772
– ident: ref47
  doi: 10.1145/3326128
– volume: e 35
  year: 2019
  ident: ref40
  publication-title: Extraction et Gestion Des Connaissances (Series RNTI)
– ident: ref49
  doi: 10.14778/2824032.2824080
– ident: ref17
  doi: 10.1109/CompComm.2018.8781032
– ident: ref50
  doi: 10.1145/1142473.1142530
– ident: ref25
  doi: 10.1109/ICIT.2015.7125310
– ident: ref10
  doi: 10.1145/568518.568519
– volume: 60
  year: 2014
  ident: ref37
  article-title: Skyline in cloud computing
  publication-title: J Theor Appl Inf Technol
– ident: ref42
  doi: 10.1007/s00158-017-1793-2
– volume: 240
  year: 2016
  ident: ref44
  article-title: Big data and recommender systems
  publication-title: Novtica J Spanish Comput Scientist Assoc
– year: 2013
  ident: ref1
  article-title: Traitement de requêtes top-k multicritères et application à la recherche par le contenu dans les bases de données multimédia
– year: 2006
  ident: ref43
  article-title: Netflix update: Try this at home
– ident: ref31
  doi: 10.1007/s00158-003-0368-6
– ident: ref7
  doi: 10.1504/IJITST.2020.104579
– ident: ref30
  doi: 10.1109/ACCESS.2019.2891544
– ident: ref15
  doi: 10.1145/1391729.1391730
– ident: ref3
  doi: 10.1109/TKDE.2005.99
– year: 0
  ident: ref48
  article-title: Predict fifa 2018 man of the match
– ident: ref45
  doi: 10.1109/ICDAR.1999.791887
– volume: 2007
  start-page: 5
  year: 0
  ident: ref28
  article-title: Improving regularized singular value decomposition for collaborative filtering
  publication-title: Proc KDD Cup Workshop
– ident: ref12
  doi: 10.1145/375551.375567
– ident: ref38
  doi: 10.1145/3010089.3010125
– year: 2012
  ident: ref29
  article-title: Etude de champs de température séparables avec une double décomposition en valeurs singulières: Quelques applications à la caractérisation des propriétés thermophysiques des matérieux et au contrôle non destructif
– ident: ref21
  doi: 10.1109/BigData.2016.7840649
– ident: ref22
  doi: 10.5539/nct.v2n1p62
– year: 1999
  ident: ref35
  article-title: Combing content-based and collaborative filters in an online newspaper
  publication-title: Proc ACM SIGIR Workshop Recommender Syst Implementation Eval
– ident: ref16
  doi: 10.1504/IJWGS.2014.058759
– ident: ref46
  doi: 10.1109/ICEEE.2014.6978334
– ident: ref23
  doi: 10.1109/I-SPAN.2012.9
– ident: ref26
  doi: 10.1007/s13369-020-04568-6
– year: 2006
  ident: ref6
  article-title: Multiple criteria decision analysis: Classification problems and solutions
– ident: ref19
  doi: 10.1109/BigDataService.2017.28
– year: 0
  ident: ref9
  article-title: Efficient $Top_{k_{WS}}$ algorithm for synthetics and real datasets
  publication-title: Artific Intellig Int J
– year: 2018
  ident: ref32
  article-title: Méthodes d'aide à la décision multi-attribut et multi-acteur pour résoudre le problème de sélection dans un environnement certain/incertain: Cas de la localisation des centres de distribution
– ident: ref8
  doi: 10.1109/MC.2009.263
– ident: ref4
  doi: 10.1016/j.procs.2016.04.214
– ident: ref14
  doi: 10.1016/S0022-0000(03)00026-6
– start-page: 89
  year: 0
  ident: ref18
  article-title: Parallel top-k query processing using MapReduce
  publication-title: Proc Int Conf Database Syst Adv Appl
– volume: 15
  year: 2015
  ident: ref24
  article-title: Big data analysis: Apache spark perspective
  publication-title: Global J Comput Sci Technol
– ident: ref2
  doi: 10.1007/978-0-387-85820-3_8
– ident: ref33
  doi: 10.1109/TKDE.2018.2790387
– ident: ref34
  doi: 10.1007/978-981-13-1747-7_17
– ident: ref20
  doi: 10.1145/2043932.2044001
– start-page: 359
  year: 0
  ident: ref39
  article-title: Étude comparative de topk basée sur l'algorithme de fagin en utilisant des métriques de corrélation dans la qualité de service de cloud computing
  publication-title: Proc EGC
– ident: ref27
  doi: 10.1016/j.procs.2015.04.237
SSID ssj0058579
Score 2.2578824
Snippet In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval...
SourceID crossref
ieee
SourceType Enrichment Source
Index Database
Publisher
StartPage 4876
SubjectTerms <inline-formula xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> <tex-math notation="LaTeX"> Top_{k}</tex-math> </inline-formula>
Big Data
Collaboration
Collaborative filtering (CF)
Filtering
Funk singular value decomposition (SVD)
Machine learning
Machine learning algorithms
multiple criteria decision aiding (MCDA)
Parallel algorithms
Query processing
Recommender systems
Skyline
Spark
Title Parallelization of Top Algorithm Through a New Hybrid Recommendation System for Big Data in Spark Cloud Computing Framework
URI https://ieeexplore.ieee.org/document/9195509
Volume 15
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Electronic Library (IEL)
  customDbUrl:
  eissn: 1937-9234
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0058579
  issn: 1932-8184
  databaseCode: RIE
  dateStart: 20070101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7Vkx58VbG-mIM33brdZtvk6KsUoSK0gp6WJJvR0seWsj2of97JPkoREW8hJCQwIfm-3ZnvY-xchzZsow49guLG41KjJ2KXTKMbwkptQ2McUew9trrP_OElfKmwy2UtjLU2Sz6zddfM_uXHiVm4T2VXsiEJUMs1ttYWrbxWq7x1CfVmunoOj3j0CPGyQMaXVw_91_6AqGBADJUQTdPJqq48QiuuKtmj0tlmvXI7eS7JqL5Idd18_lBq_O9-d9hWgS7hOj8Ou6xip3tsc0VzsMq-ntTc-aeMiwJMSBAGyQyux2_JfJi-T2CQW_eAAroBofvharrA0dQJLZdbMEEudA6EeOFm-AZ3KlUwpO6Zmo_gdpwsYsj9ImhN6JQJYPvsuXM_uO16hQODZxwO84QVijCJURxlQ5sYLQoexNoIbEkMJVo_Foo4mwqI6hIWk4S2OHIkICDQV80Dtj5NpvaQAQbYbvJQB02NPFBIDWFkbDTGwoY-1lijDElkCnly55IxjjKa4ssoC2PkwhgVYayxi-WcWS7O8efoqgvRcmQRnaPfu4_ZRuDSV7LMlRO2ns4X9pTwR6rPsoP3DT6o2TE
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NTxsxEB1Remg50BZaAW1hDtzohs3GBvtIaaNACUJikehpZXs9NCJko2hzgP75jvcjiqqq4mZZtmxpLPu93Zn3APat9PKYrIwYirtIaEuRykMyje0qr62XzgWiOLw8GtyI81t5uwJfFrUw3vsq-cx3QrP6l58Xbh4-lR3qrmZArV_ASymEkHW1VnvvMu6tlPUCIon4GRJtiUysD8-vf16nTAYT5qiMaXpBWHXpGVryVamelf4bGLYbqrNJ7jvz0nbc019ajc_d8VtYb_AlntQH4h2s-MkGrC2pDm7C7yszCw4q46YEEwvCtJjiyfiumI3KXw-Y1uY9aJDvQBw8hqouDET1gZerTZiwljpHxrz4dXSH30xpcMTdUzO7x9NxMc-xdozgNbHfpoC9h5v-9_R0EDUeDJELSCxSXhlGJc4I0l3rcvKkRJJbp-hIk9Tk41wZZm0mYbLLaEwz3hIkiKGAotj0PsDqpJj4LUBK6LgnpE16lkRiiBvK6dxZypWXMW1Dtw1J5hqB8uCTMc4qohLrrApjFsKYNWHchoPFnGktz_Hf0ZshRIuRTXR2_t29B68G6fAiuzi7_PERXichmaXKY_kEq-Vs7j8zGintbnUI_wDFM9x-
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Parallelization+of+%24Top_%7Bk%7D%24+Algorithm+Through+a+New+Hybrid+Recommendation+System+for+Big+Data+in+Spark+Cloud+Computing+Framework&rft.jtitle=IEEE+systems+journal&rft.au=El+Handri%2C+Kaoutar&rft.au=Idrissi%2C+Abdellah&rft.date=2021-12-01&rft.issn=1932-8184&rft.eissn=1937-9234&rft.volume=15&rft.issue=4&rft.spage=4876&rft.epage=4886&rft_id=info:doi/10.1109%2FJSYST.2020.3019368&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_JSYST_2020_3019368
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-8184&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-8184&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-8184&client=summon