Estimation of risk factors associated with colorectal cancer: an application of knowledge discovery in databases

Bibliographic Details
Published in Kuwait journal of science Vol. 43; no. 2
Main Authors Feyza Firat, Ahmet Kadir Arslan, Cemil Colak, Hakan Harputluoglu
Format Journal Article
Language English
Published Elsevier 01.05.2016
Subjects
ISSN 2307-4108
EISSN 2307-4116

Abstract Colorectal cancer (CRC) is one of the leading causes of cancer death worldwide. The goal of this study is to predict important risk factors of CRC using knowledge discovery in databases (KDD) methods. The study used retrospective data from patients who had been diagnosed with CRC. Records dated between 1 January 2010 and 1 March 2014 were selected at random from the Turgut Ozal Medical Centre databases. The study included 160 individuals: 80 patients admitted to the Department of Oncology and diagnosed with CRC, and 80 control subjects categorized as non-CRC. The groups were matched for age and gender. We mined retrospective CRC data from large integrated health systems with electronic health records. Specific demographic and clinical variables, including calcium, hemoglobin, white blood cells, platelets, potassium, sodium, glucose, creatinine and total bilirubin, were used in multilayer perceptron (MLP) artificial neural network (ANN) modeling. In each group, 45 individuals (56.3%) were male and 35 (43.7%) were female. The mean age of the CRC patients and the control group was 58.6±13.0 years. Accuracy was 71.31% in the training dataset (n=122) and 81.82% in the testing dataset. Area under the curve (AUC) values for the training and testing datasets were 0.73 and 0.81, respectively. The suggested MLP ANN model identified calcium, creatinine, potassium, platelets, sodium, hemoglobin and total bilirubin as significant factors. Taken together, the suggested MLP ANN model might be used to estimate risk factors associated with CRC as an application of medical KDD.
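The abstract reports two evaluation metrics, accuracy and AUC, for the training and testing datasets. As a rough illustration of how such figures are commonly computed for a binary case/control classifier, the following minimal sketch uses only the Python standard library. The function names, the 0.5 decision threshold and the toy labels and scores are assumptions made for illustration; they are not taken from the study, which does not describe its software or thresholds in this record.

```python
from itertools import product


def accuracy(y_true, y_pred):
    """Fraction of predicted labels that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def auc(y_true, scores):
    """AUC as the probability that a random case outranks a random control.

    A tie between a case score and a control score counts as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one control")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not from the study: 1 = CRC, 0 = control.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]

# Assumed decision threshold of 0.5 for turning scores into class labels.
predicted = [1 if s >= 0.5 else 0 for s in scores]

print(f"accuracy = {accuracy(labels, predicted):.4f}")
print(f"AUC      = {auc(labels, scores):.4f}")
```

The pairwise definition of AUC used here is equivalent to the area under the ROC curve and avoids any dependence on a particular threshold, which is one reason accuracy and AUC can differ for the same model.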
Author Ahmet Kadir Arslan
Cemil Colak
Hakan Harputluoglu
Feyza Firat
Author_xml – sequence: 1
  fullname: Feyza Firat
– sequence: 2
  fullname: Ahmet Kadir Arslan
  organization: Inonu University, Department of Biostatistics and Medical Informatics
– sequence: 3
  fullname: Cemil Colak
– sequence: 4
  fullname: Hakan Harputluoglu
ContentType Journal Article
DBID DOA
DatabaseName DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Sciences (General)
EISSN 2307-4116
ExternalDocumentID oai_doaj_org_article_0d7caff67362475a9bdb02da86419653
GroupedDBID ABDBF
ACUHS
AENEX
AFWDF
ALMA_UNASSIGNED_HOLDINGS
EOJEC
GROUPED_DOAJ
OBODZ
ID FETCH-LOGICAL-d221t-488bd41ebb5397076f8956969297feb0e43de432ce8c6ad0ed461215fefc90cc3
IEDL.DBID DOA
ISSN 2307-4108
IngestDate Fri Oct 03 12:51:39 EDT 2025
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-d221t-488bd41ebb5397076f8956969297feb0e43de432ce8c6ad0ed461215fefc90cc3
OpenAccessLink https://doaj.org/article/0d7caff67362475a9bdb02da86419653
ParticipantIDs doaj_primary_oai_doaj_org_article_0d7caff67362475a9bdb02da86419653
PublicationCentury 2000
PublicationDate 2016-05-01
PublicationDateYYYYMMDD 2016-05-01
PublicationDate_xml – month: 05
  year: 2016
  text: 2016-05-01
  day: 01
PublicationDecade 2010
PublicationTitle Kuwait journal of science
PublicationYear 2016
Publisher Elsevier
Publisher_xml – name: Elsevier
SSID ssj0001342157
SourceID doaj
SourceType Open Website
SubjectTerms Artificial neural networks
colorectal cancer
knowledge discovery in databases
risk factors
Title Estimation of risk factors associated with colorectal cancer: an application of knowledge discovery in databases
URI https://doaj.org/article/0d7caff67362475a9bdb02da86419653
Volume 43
linkProvider Directory of Open Access Journals