Parallelization of Top Algorithm Through a New Hybrid Recommendation System for Big Data in Spark Cloud Computing Framework
In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to re...
        Saved in:
      
    
          | Published in | IEEE systems journal Vol. 15; no. 4; pp. 4876 - 4886 | 
|---|---|
| Main Authors | , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
            IEEE
    
        01.12.2021
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1932-8184 1937-9234  | 
| DOI | 10.1109/JSYST.2020.3019368 | 
Cover
| Abstract | In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to retrieve the most useful data objects in a set of choices. This problem is compounded by the use of <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> in cases of multiple dimensions and extensive data analytics. In this article, we provide a novel parallel algorithm in a distributed recommender system based on the Apache Spark platform. The purpose of this approach was to implement the multicriteria decision aiding support and dominating query approach run by using matrix factorization and singular value decomposition (SVD)-based model as a sophisticated machine learning technique. Simultaneously, applying the resilient distributed datasets paradigm in cloud computing, which presents a favorable environment for big data management. Extensive experimental results in terms of accuracy, and scalability indicated the new algorithm's advantage compared to other <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> algorithms. Accordingly, our recommender system based on the conceived algorithm achieved high precision (62%-82%, depending on the data) to verify the profoundly positive effect of the use of the Spark framework and the SVD-based model while applying the commonly used evaluation metrics in the recommendation systems. | 
    
|---|---|
| AbstractList | In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval has received increasing attention from both the industry and academia. This query handling allows users to retrieve the most useful data objects in a set of choices. This problem is compounded by the use of <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> in cases of multiple dimensions and extensive data analytics. In this article, we provide a novel parallel algorithm in a distributed recommender system based on the Apache Spark platform. The purpose of this approach was to implement the multicriteria decision aiding support and dominating query approach run by using matrix factorization and singular value decomposition (SVD)-based model as a sophisticated machine learning technique. Simultaneously, applying the resilient distributed datasets paradigm in cloud computing, which presents a favorable environment for big data management. Extensive experimental results in terms of accuracy, and scalability indicated the new algorithm's advantage compared to other <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> algorithms. Accordingly, our recommender system based on the conceived algorithm achieved high precision (62%-82%, depending on the data) to verify the profoundly positive effect of the use of the Spark framework and the SVD-based model while applying the commonly used evaluation metrics in the recommendation systems. | 
    
| Author | Idrissi, Abdellah El Handri, Kaoutar  | 
    
| Author_xml | – sequence: 1 givenname: Kaoutar orcidid: 0000-0001-6732-2627 surname: El Handri fullname: El Handri, Kaoutar email: kaoutar.elhandri@um5s.net.ma organization: Intelligent Processing Systems Team, Computer Science Laboratory, Department of Computer Science, Faculty of Sciences, Mohammed V University, Rabat, Morocco – sequence: 2 givenname: Abdellah orcidid: 0000-0001-5696-3007 surname: Idrissi fullname: Idrissi, Abdellah email: idriab@gmail.com organization: Intelligent Processing Systems Team, Computer Science Laboratory, Department of Computer Science, Faculty of Sciences, Mohammed V University, Rabat, Morocco  | 
    
| BookMark | eNp9kMtOwzAQRS1UJNrCD8DGP5DiR5LayxJoC6oA0bBgFU0TOzVN4shJVRV-nr7EggWrOxrp3NGcHupUtlIIXVMyoJTI26f5xzweMMLIgBMqeSjOUHeXQ08y7ncOM_MEFf4F6jXNJyGBCIayi75fwUFRqMJ8QWtsha3Gsa3xqMitM-2yxPHS2XW-xICf1QZPtwtnMvymUluWqsqO0HzbtKrE2jp8Z3J8Dy1gs1vX4FY4Kuw6w5Et63VrqhyPHZRqY93qEp1rKBp1dco-eh8_xNHUm71MHqPRzEspZdITSgAJwxR8LekizbTSwmfZIhU6lDqQWpFMABMSGOfCD0IpmfS1rxkXQhPgfcSOvamzTeOUTmpnSnDbhJJkry856Ev2-pKTvh0k_kCpaQ_ftg5M8T96c0SNUur3lqQyCIjkP6Qmgfw | 
    
| CODEN | ISJEB2 | 
    
| CitedBy_id | crossref_primary_10_1016_j_engappai_2023_106602 crossref_primary_10_1155_2022_5811341 crossref_primary_10_3390_electronics13071386 crossref_primary_10_1007_s12652_022_03911_z crossref_primary_10_1155_2021_7213246 crossref_primary_10_2174_2666255813999201109201006 crossref_primary_10_23939_ujit2022_02_068  | 
    
| Cites_doi | 10.1137/S0895480102412856 10.1023/A:1006544522159 10.1201/9781351236423-24 10.1145/1189769.1189772 10.1145/3326128 10.14778/2824032.2824080 10.1109/CompComm.2018.8781032 10.1145/1142473.1142530 10.1109/ICIT.2015.7125310 10.1145/568518.568519 10.1007/s00158-017-1793-2 10.1007/s00158-003-0368-6 10.1504/IJITST.2020.104579 10.1109/ACCESS.2019.2891544 10.1145/1391729.1391730 10.1109/TKDE.2005.99 10.1109/ICDAR.1999.791887 10.1145/375551.375567 10.1145/3010089.3010125 10.1109/BigData.2016.7840649 10.5539/nct.v2n1p62 10.1504/IJWGS.2014.058759 10.1109/ICEEE.2014.6978334 10.1109/I-SPAN.2012.9 10.1007/s13369-020-04568-6 10.1109/BigDataService.2017.28 10.1109/MC.2009.263 10.1016/j.procs.2016.04.214 10.1016/S0022-0000(03)00026-6 10.1007/978-0-387-85820-3_8 10.1109/TKDE.2018.2790387 10.1007/978-981-13-1747-7_17 10.1145/2043932.2044001 10.1016/j.procs.2015.04.237  | 
    
| ContentType | Journal Article | 
    
| DBID | 97E RIA RIE AAYXX CITATION  | 
    
| DOI | 10.1109/JSYST.2020.3019368 | 
    
| DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Xplore (NTUSG) CrossRef  | 
    
| DatabaseTitle | CrossRef | 
    
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering | 
    
| EISSN | 1937-9234 | 
    
| EndPage | 4886 | 
    
| ExternalDocumentID | 10_1109_JSYST_2020_3019368 9195509  | 
    
| Genre | orig-research | 
    
| GroupedDBID | 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACIWK AENEX AETIX AGQYO AGSQL AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL RIA RIE RNS AAYXX CITATION  | 
    
| ID | FETCH-LOGICAL-c1129-8e8a066ca4f91bcdfef842dbc8f69f59fe0d8a289a233845699294f4f2388f0a3 | 
    
| IEDL.DBID | RIE | 
    
| ISSN | 1932-8184 | 
    
| IngestDate | Thu Apr 24 23:06:02 EDT 2025 Wed Oct 01 02:25:51 EDT 2025 Wed Aug 27 02:33:11 EDT 2025  | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 4 | 
    
| Language | English | 
    
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037  | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c1129-8e8a066ca4f91bcdfef842dbc8f69f59fe0d8a289a233845699294f4f2388f0a3 | 
    
| ORCID | 0000-0001-6732-2627 0000-0001-5696-3007  | 
    
| PageCount | 11 | 
    
| ParticipantIDs | crossref_primary_10_1109_JSYST_2020_3019368 crossref_citationtrail_10_1109_JSYST_2020_3019368 ieee_primary_9195509  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2021-Dec. 2021-12-00  | 
    
| PublicationDateYYYYMMDD | 2021-12-01 | 
    
| PublicationDate_xml | – month: 12 year: 2021 text: 2021-Dec.  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | IEEE systems journal | 
    
| PublicationTitleAbbrev | JSYST | 
    
| PublicationYear | 2021 | 
    
| Publisher | IEEE | 
    
| Publisher_xml | – name: IEEE | 
    
| References | ref13 ref12 rousset (ref40) 2019; e 35 ref15 agrebi (ref32) 2018 ref14 fifa (ref48) 0 witwit (ref5) 2017 ayvazyan (ref29) 2012 ref11 ref10 ref17 ref16 ref19 el handri (ref9) 0 ref50 ref46 ref45 ref47 ref42 ref41 idrissi (ref37) 2014; 60 claypool (ref35) 1999 ref49 funk (ref43) 2006 ref8 ref7 ref4 ref3 badr (ref1) 2013 ref34 ref36 ref31 ref30 ref33 paterek (ref28) 0; 2007 ref2 anastasiu (ref44) 2016; 240 ref38 el handri (ref39) 0 ref23 ref26 ref25 chen (ref6) 2006 ref20 ref22 ref21 ref27 shoro (ref24) 2015; 15 im (ref18) 0  | 
    
| References_xml | – ident: ref13 doi: 10.1137/S0895480102412856 – ident: ref36 doi: 10.1023/A:1006544522159 – ident: ref41 doi: 10.1201/9781351236423-24 – year: 2017 ident: ref5 article-title: Non-weighted aggregate evaluation function of multi-objective optimization for knock engine modeling – ident: ref11 doi: 10.1145/1189769.1189772 – ident: ref47 doi: 10.1145/3326128 – volume: e 35 year: 2019 ident: ref40 publication-title: Extraction et Gestion Des Connaissances (Series RNTI) – ident: ref49 doi: 10.14778/2824032.2824080 – ident: ref17 doi: 10.1109/CompComm.2018.8781032 – ident: ref50 doi: 10.1145/1142473.1142530 – ident: ref25 doi: 10.1109/ICIT.2015.7125310 – ident: ref10 doi: 10.1145/568518.568519 – volume: 60 year: 2014 ident: ref37 article-title: Skyline in cloud computing publication-title: J Theor Appl Inf Technol – ident: ref42 doi: 10.1007/s00158-017-1793-2 – volume: 240 year: 2016 ident: ref44 article-title: Big data and recommender systems publication-title: Novtica J Spanish Comput Scientist Assoc – year: 2013 ident: ref1 article-title: Traitement de requêtes top-k multicritères et application à la recherche par le contenu dans les bases de données multimédia – year: 2006 ident: ref43 article-title: Netflix update: Try this at home – ident: ref31 doi: 10.1007/s00158-003-0368-6 – ident: ref7 doi: 10.1504/IJITST.2020.104579 – ident: ref30 doi: 10.1109/ACCESS.2019.2891544 – ident: ref15 doi: 10.1145/1391729.1391730 – ident: ref3 doi: 10.1109/TKDE.2005.99 – year: 0 ident: ref48 article-title: Predict fifa 2018 man of the match – ident: ref45 doi: 10.1109/ICDAR.1999.791887 – volume: 2007 start-page: 5 year: 0 ident: ref28 article-title: Improving regularized singular value decomposition for collaborative filtering publication-title: Proc KDD Cup Workshop – ident: ref12 doi: 10.1145/375551.375567 – ident: ref38 doi: 10.1145/3010089.3010125 – year: 2012 ident: ref29 article-title: Etude de champs de température séparables avec une double décomposition en valeurs singulières: Quelques applications à la caractérisation des propriétés thermophysiques des matérieux et au contrôle non destructif – ident: ref21 doi: 10.1109/BigData.2016.7840649 – ident: ref22 doi: 10.5539/nct.v2n1p62 – year: 1999 ident: ref35 article-title: Combing content-based and collaborative filters in an online newspaper publication-title: Proc ACM SIGIR Workshop Recommender Syst Implementation Eval – ident: ref16 doi: 10.1504/IJWGS.2014.058759 – ident: ref46 doi: 10.1109/ICEEE.2014.6978334 – ident: ref23 doi: 10.1109/I-SPAN.2012.9 – ident: ref26 doi: 10.1007/s13369-020-04568-6 – year: 2006 ident: ref6 article-title: Multiple criteria decision analysis: Classification problems and solutions – ident: ref19 doi: 10.1109/BigDataService.2017.28 – year: 0 ident: ref9 article-title: Efficient $Top_{k_{WS}}$ algorithm for synthetics and real datasets publication-title: Artific Intellig Int J – year: 2018 ident: ref32 article-title: Méthodes d'aide à la décision multi-attribut et multi-acteur pour résoudre le problème de sélection dans un environnement certain/incertain: Cas de la localisation des centres de distribution – ident: ref8 doi: 10.1109/MC.2009.263 – ident: ref4 doi: 10.1016/j.procs.2016.04.214 – ident: ref14 doi: 10.1016/S0022-0000(03)00026-6 – start-page: 89 year: 0 ident: ref18 article-title: Parallel top-k query processing using MapReduce publication-title: Proc Int Conf Database Syst Adv Appl – volume: 15 year: 2015 ident: ref24 article-title: Big data analysis: Apache spark perspective publication-title: Global J Comput Sci Technol – ident: ref2 doi: 10.1007/978-0-387-85820-3_8 – ident: ref33 doi: 10.1109/TKDE.2018.2790387 – ident: ref34 doi: 10.1007/978-981-13-1747-7_17 – ident: ref20 doi: 10.1145/2043932.2044001 – start-page: 359 year: 0 ident: ref39 article-title: Étude comparative de topk basée sur l'algorithme de fagin en utilisant des métriques de corrélation dans la qualité de service de cloud computing publication-title: Proc EGC – ident: ref27 doi: 10.1016/j.procs.2015.04.237  | 
    
| SSID | ssj0058579 | 
    
| Score | 2.2578824 | 
    
| Snippet | In the era of big data, parallel <inline-formula><tex-math notation="LaTeX">Top_{k}</tex-math></inline-formula> query processing under information retrieval... | 
    
| SourceID | crossref ieee  | 
    
| SourceType | Enrichment Source Index Database Publisher  | 
    
| StartPage | 4876 | 
    
| SubjectTerms | <inline-formula xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> <tex-math notation="LaTeX"> Top_{k}</tex-math> </inline-formula> Big Data Collaboration Collaborative filtering (CF) Filtering Funk singular value decomposition (SVD) Machine learning Machine learning algorithms multiple criteria decision aiding (MCDA) Parallel algorithms Query processing Recommender systems Skyline Spark  | 
    
| Title | Parallelization of Top Algorithm Through a New Hybrid Recommendation System for Big Data in Spark Cloud Computing Framework | 
    
| URI | https://ieeexplore.ieee.org/document/9195509 | 
    
| Volume | 15 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Electronic Library (IEL) customDbUrl: eissn: 1937-9234 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0058579 issn: 1932-8184 databaseCode: RIE dateStart: 20070101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7Vkx58VbG-mIM33brdZtvk6KsUoSK0gp6WJJvR0seWsj2of97JPkoREW8hJCQwIfm-3ZnvY-xchzZsow49guLG41KjJ2KXTKMbwkptQ2McUew9trrP_OElfKmwy2UtjLU2Sz6zddfM_uXHiVm4T2VXsiEJUMs1ttYWrbxWq7x1CfVmunoOj3j0CPGyQMaXVw_91_6AqGBADJUQTdPJqq48QiuuKtmj0tlmvXI7eS7JqL5Idd18_lBq_O9-d9hWgS7hOj8Ou6xip3tsc0VzsMq-ntTc-aeMiwJMSBAGyQyux2_JfJi-T2CQW_eAAroBofvharrA0dQJLZdbMEEudA6EeOFm-AZ3KlUwpO6Zmo_gdpwsYsj9ImhN6JQJYPvsuXM_uO16hQODZxwO84QVijCJURxlQ5sYLQoexNoIbEkMJVo_Foo4mwqI6hIWk4S2OHIkICDQV80Dtj5NpvaQAQbYbvJQB02NPFBIDWFkbDTGwoY-1lijDElkCnly55IxjjKa4ssoC2PkwhgVYayxi-WcWS7O8efoqgvRcmQRnaPfu4_ZRuDSV7LMlRO2ns4X9pTwR6rPsoP3DT6o2TE | 
    
| linkProvider | IEEE | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NTxsxEB1Remg50BZaAW1hDtzohs3GBvtIaaNACUJikehpZXs9NCJko2hzgP75jvcjiqqq4mZZtmxpLPu93Zn3APat9PKYrIwYirtIaEuRykMyje0qr62XzgWiOLw8GtyI81t5uwJfFrUw3vsq-cx3QrP6l58Xbh4-lR3qrmZArV_ASymEkHW1VnvvMu6tlPUCIon4GRJtiUysD8-vf16nTAYT5qiMaXpBWHXpGVryVamelf4bGLYbqrNJ7jvz0nbc019ajc_d8VtYb_AlntQH4h2s-MkGrC2pDm7C7yszCw4q46YEEwvCtJjiyfiumI3KXw-Y1uY9aJDvQBw8hqouDET1gZerTZiwljpHxrz4dXSH30xpcMTdUzO7x9NxMc-xdozgNbHfpoC9h5v-9_R0EDUeDJELSCxSXhlGJc4I0l3rcvKkRJJbp-hIk9Tk41wZZm0mYbLLaEwz3hIkiKGAotj0PsDqpJj4LUBK6LgnpE16lkRiiBvK6dxZypWXMW1Dtw1J5hqB8uCTMc4qohLrrApjFsKYNWHchoPFnGktz_Hf0ZshRIuRTXR2_t29B68G6fAiuzi7_PERXichmaXKY_kEq-Vs7j8zGintbnUI_wDFM9x- | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Parallelization+of+%24Top_%7Bk%7D%24+Algorithm+Through+a+New+Hybrid+Recommendation+System+for+Big+Data+in+Spark+Cloud+Computing+Framework&rft.jtitle=IEEE+systems+journal&rft.au=El+Handri%2C+Kaoutar&rft.au=Idrissi%2C+Abdellah&rft.date=2021-12-01&rft.issn=1932-8184&rft.eissn=1937-9234&rft.volume=15&rft.issue=4&rft.spage=4876&rft.epage=4886&rft_id=info:doi/10.1109%2FJSYST.2020.3019368&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_JSYST_2020_3019368 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-8184&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-8184&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-8184&client=summon |