Diabetes Prediction using Machine Learning Algorithms with Feature Selection and Dimensionality Reduction

Bibliographic Details
Published in International Conference on Advanced Computing and Communication Systems (Online), Vol. 1, pp. 141-146
Main Authors Sivaranjani, S, Ananya, S, Aravinth, J, Karthika, R
Format Conference Proceeding
Language English
Published IEEE 19.03.2021
ISBN 9781665405201, 1665405201
ISSN 2575-7288
DOI 10.1109/ICACCS51430.2021.9441935


Abstract Diabetes has become one of the most life-threatening and, at the same time, most common diseases, not only in India but around the world. It is now seen in all age groups and is attributed to lifestyle, genetic, stress and age factors. Whatever the cause of diabetes, the outcome can be severe if it goes unnoticed. Various methods are currently used to predict diabetes and diabetes-related diseases. In the proposed work, we use the machine learning algorithms Support Vector Machine (SVM) and Random Forest (RF) to help identify the potential chances of being affected by diabetes-related diseases. After pre-processing the data, the features that influence the prediction are selected using step forward and step backward feature selection. The Principal Component Analysis (PCA) dimensionality reduction method is then analyzed on the selected features. The prediction accuracy is 83% with Random Forest (RF), which is significant in comparison with Support Vector Machine (SVM), whose accuracy is 81.4%.
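The workflow named in the abstract (pre-processing, sequential feature selection, PCA, then RF and SVM) can be illustrated with a minimal sketch. This is not the authors' code: it assumes scikit-learn, uses its `SequentialFeatureSelector` as a stand-in for step forward/backward selection, and runs on synthetic data sized like the PIMA dataset listed in the subject terms (768 rows, 8 features). The number of selected features, the number of PCA components and all hyperparameters are illustrative guesses, so the scores it prints are unrelated to the 83% and 81.4% reported above.

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in data (assumption): same shape as the PIMA dataset,
# but NOT the real measurements.
X, y = make_classification(
    n_samples=768, n_features=8, n_informative=5, n_redundant=2, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def build_pipeline(classifier, direction="forward"):
    """Scale -> sequential feature selection -> PCA -> classifier.

    n_features_to_select=5 and n_components=3 are hypothetical choices;
    the catalog record does not state the values used in the paper.
    """
    return Pipeline([
        ("scale", StandardScaler()),
        ("select", SequentialFeatureSelector(
            clone(classifier), n_features_to_select=5,
            direction=direction, cv=3)),
        ("pca", PCA(n_components=3)),
        ("clf", classifier),
    ])


models = {
    "RF": RandomForestClassifier(n_estimators=30, random_state=0),
    "SVM": SVC(kernel="rbf"),
}

fitted = {name: build_pipeline(clf).fit(X_train, y_train)
          for name, clf in models.items()}
results = {name: pipe.score(X_test, y_test) for name, pipe in fitted.items()}

for name, accuracy in results.items():
    print(f"{name}: test accuracy = {accuracy:.3f}")
```

Passing `direction="backward"` to `build_pipeline` switches the selector to step backward selection, which starts from all features and removes one at a time instead of adding them.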
Author affiliations
Sivaranjani, S (siva01.11.1999@gmail.com): Amrita School of Engineering, Department of Electronics and Communication Engineering, Coimbatore
Ananya, S (ananyasankar7@gmail.com): Amrita School of Engineering, Department of Electronics and Communication Engineering, Coimbatore
Aravinth, J (j_aravinth@cb.amrita.edu): Amrita School of Engineering, Department of Electronics and Communication Engineering, Coimbatore
Karthika, R (r_karthika@cb.amrita.edu): Amrita School of Engineering, Department of Electronics and Communication Engineering, Coimbatore
Discipline Computer Science
EISBN 9781665405218, 166540521X
EISSN 2575-7288
SubjectTerms classifier
Diabetes
Dimensionality reduction
Feature extraction
feature selection
Machine learning algorithms
PIMA dataset
pre-processing
Radio frequency
Random Forest (RF)
Random forests
Support Vector Machine (SVM)
Support vector machines
URI https://ieeexplore.ieee.org/document/9441935