Accuracy Comparison of Data Mining Algorithms Used in the Diagnosis of Breast Cancer: A Scoping Review Study

Bibliographic Details
Published in Applied medical informatics Vol. 41; no. 4; pp. 129-139
Main Authors Khara, Rouhallah; Piri, Zakieh; Dehghan, Mohamad; Arab-Zozani, Morteza
Format Journal Article
Language English
Published Cluj-Napoca SRIMA Publishing House 01.12.2019
Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca
ISSN 1224-5593
EISSN 2067-7855

Abstract Introduction: Breast cancer is recognized as one of the most widespread types of invasive cancer, and early diagnosis is crucial to treating it. Data mining is the process of discovering and identifying information in large datasets; its techniques and algorithms can be applied to the early detection of breast cancer. The present study compares the performance of various algorithms used to diagnose breast cancer. Method: This scoping review followed the JBI methodology framework. The search was conducted in relevant electronic databases, and data extracted according to PICO were analyzed in Excel. Results: The most commonly used algorithms were SVM (8 cases), J48 and Naive Bayes (7 cases), and MLP (6 cases); 34 algorithms were each used in only one study. FSRAIRS2 achieved the highest accuracy reported among the algorithms (100%), and Canopy was the least accurate (accuracy = 65%). Conclusion: Applying any data mining or knowledge discovery method to a dataset requires discussing the accuracy of the extracted model on test data. This study investigated 48 common algorithms in one of the most crucial areas of medicine. Using algorithms with high accuracy, automated and semi-automated tools can be designed and used by professionals for the timely detection of breast cancer.
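The abstract ranks algorithms by accuracy on test data. As a minimal sketch of what that comparison involves, assuming accuracy means the fraction of held-out test cases classified correctly, the snippet below scores two classifiers and ranks them. The labels, predictions, and the names `classifier_a` and `classifier_b` are hypothetical illustrations and are not taken from the review.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of test cases whose predicted label matches the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("label lists must have the same length")
    if not y_true:
        raise ValueError("at least one test case is required")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical held-out labels (M = malignant, B = benign) and hypothetical
# predictions from two classifiers; none of these values come from the review.
y_test = ["M", "B", "B", "M", "B", "M", "B", "B"]
predictions = {
    "classifier_a": ["M", "B", "B", "M", "B", "B", "B", "B"],
    "classifier_b": ["M", "M", "B", "B", "B", "B", "B", "M"],
}

# Score every classifier on the same test set, then sort from most to least accurate.
scores = {name: accuracy(y_test, y_pred) for name, y_pred in predictions.items()}
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)

for name, score in ranked:
    print(f"{name}: {score:.1%}")
```

Accuracy figures are only comparable when the algorithms are scored on the same test set, and on imbalanced data accuracy alone can hide missed malignant cases, so sensitivity and specificity are often reported alongside it.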
Copyright 2019. This work is published under https://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Discipline Medicine
OpenAccessLink https://doaj.org/article/da080f4b9759434c992fda47f7cb9153
SubjectTerms Accuracy
Algorithms
Automation
Breast Cancer
Clustering
Data Mining
Diagnosis
Mammography
Medical diagnosis
Model accuracy
Model testing
Mortality
Software
Tumors
Women's health