Ant Colony-Based Sequential Hybrid Sampling Method for Handling the Imbalanced Data Problem in Drug-Target Interaction Prediction

Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notab...

Full description

Saved in:
Bibliographic Details
Published in2024 7th International Conference of Computer and Informatics Engineering (IC2IE) pp. 1 - 7
Main Authors Harahap, Rizky Nurhaliza, Hendrawan, Rahmat, Kurniawan, Isman
Format Conference Proceeding
LanguageEnglish
Published IEEE 12.09.2024
Subjects
Online AccessGet full text
DOI10.1109/IC2IE63342.2024.10748152

Cover

Abstract Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notably class imbalance. This study investigates the application of oversampling followed by Ant Colony Optimization (ACO) based undersampling, to address class imbalance problem in DTI prediction using Sequential Hybrid Sampling (SHS) approach. Several oversampling methods such as SMOTE, Random Oversampling (ROS), and ADASYN were used in the experiment to get the best results. The evaluation results show that by using F1-score as the fitness function, it is found that using ACO as undersampling based with SHS approach can improve the performance of the classifier. Gradient boosting is used as the classification method and F1-score, G-Mean, and Balanced Accuracy are used as evaluation metrics. The result show that implementation SHS with SMOTE and ACO were achieved the highest F1-score of 43.24%, Geometric Mean (G-Mean) of 50.75%, and Balanced Accuracy Score (BAS) of 62.41%. These results highlight that the implementation of SHS with ACO can improve the performance of the classifier in DTI prediction compared to the classic oversampling technique. By advancing the understanding and methodologies for handling class imbalance in DTI prediction, this study contributes to the broader goal of enhancing drug discovery and development processes.
AbstractList Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notably class imbalance. This study investigates the application of oversampling followed by Ant Colony Optimization (ACO) based undersampling, to address class imbalance problem in DTI prediction using Sequential Hybrid Sampling (SHS) approach. Several oversampling methods such as SMOTE, Random Oversampling (ROS), and ADASYN were used in the experiment to get the best results. The evaluation results show that by using F1-score as the fitness function, it is found that using ACO as undersampling based with SHS approach can improve the performance of the classifier. Gradient boosting is used as the classification method and F1-score, G-Mean, and Balanced Accuracy are used as evaluation metrics. The result show that implementation SHS with SMOTE and ACO were achieved the highest F1-score of 43.24%, Geometric Mean (G-Mean) of 50.75%, and Balanced Accuracy Score (BAS) of 62.41%. These results highlight that the implementation of SHS with ACO can improve the performance of the classifier in DTI prediction compared to the classic oversampling technique. By advancing the understanding and methodologies for handling class imbalance in DTI prediction, this study contributes to the broader goal of enhancing drug discovery and development processes.
Author Kurniawan, Isman
Harahap, Rizky Nurhaliza
Hendrawan, Rahmat
Author_xml – sequence: 1
  givenname: Rizky Nurhaliza
  surname: Harahap
  fullname: Harahap, Rizky Nurhaliza
  email: lizahrp@student.telkomuniversity.ac.id
  organization: Telkom University,School of Computing,Bandung,Indonesia
– sequence: 2
  givenname: Rahmat
  surname: Hendrawan
  fullname: Hendrawan, Rahmat
  email: rahmathendrawann@student.telkomuniversity.ac.id
  organization: Telkom University,School of Computing,Bandung,Indonesia
– sequence: 3
  givenname: Isman
  surname: Kurniawan
  fullname: Kurniawan, Isman
  email: ismankrn@telkomuniversity.ac.id
  organization: Telkom University,School of Computing,Bandung,Indonesia
BookMark eNo1UMtOwzAQNBIcoPQPOPgHUvyIY-dY0kIjFYFE79Um3rSWEru47qFH_pxQ4DSj0cxod-7ItQ8eCaGczThn5WNdiXpZSJmLmWAin3Gmc8OVuCLTUpdGSq6YUpLdkq-5T7QKffDn7AmOaOkHfp7QJwc9XZ2b6EYFhkPv_I6-YtoHS7sQ6Qq8vWhpj7QeGujBt2N6AQnoewxNjwN1ni7iaZdtIO4w0donjNAmF_xoQesu9J7cdNAfcfqHE7J5Xm6qVbZ-e6mr-TpzJU_ZeD7q0uZdyXVjjAIBjW2ZLKzSgBoMYwWzTP-8xYWVbWEMQ1NYEF2juJUT8vBb6xBxe4hugHje_g8jvwEwbF7A
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/IC2IE63342.2024.10748152
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798331505530
EndPage 7
ExternalDocumentID 10748152
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i91t-481e79d4f917b885a2abdc036d57ae7a80060d07055312d3c6880e86da2fb51d3
IEDL.DBID RIE
IngestDate Wed Nov 20 06:17:43 EST 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i91t-481e79d4f917b885a2abdc036d57ae7a80060d07055312d3c6880e86da2fb51d3
PageCount 7
ParticipantIDs ieee_primary_10748152
PublicationCentury 2000
PublicationDate 2024-Sept.-12
PublicationDateYYYYMMDD 2024-09-12
PublicationDate_xml – month: 09
  year: 2024
  text: 2024-Sept.-12
  day: 12
PublicationDecade 2020
PublicationTitle 2024 7th International Conference of Computer and Informatics Engineering (IC2IE)
PublicationTitleAbbrev IC2IE
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8854983
Snippet Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Accuracy
ACO
Ant colony optimization
Classification algorithms
Diffusion tensor imaging
Drug discovery
Drug target interaction
Drugs
Measurement
Optimization
oversampling
Robustness
Sampling methods
SHS
SMOTE
undersampling
Title Ant Colony-Based Sequential Hybrid Sampling Method for Handling the Imbalanced Data Problem in Drug-Target Interaction Prediction
URI https://ieeexplore.ieee.org/document/10748152
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA62J08qVnyTg9fd7nuzR-2DVmgRWqG3ksdsKeoqZfdQb_5zZ7JdRUHwNoQsCQmzM0m-bz7GbpQnU0974GQgiJJjEnQptIxMSaPS97Ug7vBkmoweo_tFvNiR1S0XBgAs-AxcMu1bvnnVFV2VdQk8KDDgtFgrFUlN1mrQOV7WHfeC8SAJw4gIVkHkNt1_CKfYuDE8YNNmxBou8uRWpXL1-69ijP-e0iHrfFP0-MNX8Dlie1Acs4_bouQ9_J0VW-cOw5PhM4uURi9-5qMtkbP4TBKGvFjxidWO5pi08hGVWqA2zAb5-EUR3FHj131ZShqFJGf4uuD9TbVy5hY7zu1VYs2KwC703ENmh82Hg3lv5Ow0Fpx15pcOzh3SzEQ5ntqUELEMpDIao5qJUwmpFFSvxXhUcif0AxPqBP0dRGJkkKvYN-EJaxevBZwyDkESaOEpLzdZFGU5HuQkZhuYL4BIAcQZ69DyLd_qKhrLZuXO_2i_YPu0i46Va7hk7XJTwRUmAKW6thv_CSDAsZc
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELagDDABoog3HliT5uEkzgh9KIG2QmqQulVO7FQVkKIqGcrGP-fOaUAgIbGdrFi2nFzubH_ffYTcpJYIrMxSRqg4UnKkDy4FlhQBalTadsaROzwa-9ETu5960w1ZXXNhlFIafKZMNPVdvlxmFR6VdRA8yCHgbJMdjzHm1XStBp9jhZ2468R933UZUqwcZjYdfkin6Mgx2CfjZswaMPJsVmVqZu-_yjH-e1IHpP1N0qOPX-HnkGyp4oh83BYl7cIPrVgbdxCgJJ1orDT48QuN1kjPohOBKPJiTkdaPZpC2kojLLaAbZAP0vg1RcBjBr17ohQ4CorO0EVBe6tqbiQaPU71YWLNi4BH8MIHzTZJBv2kGxkblQVjEdqlAXNXQShZDvu2lHNPOCKVGcQ16QVCBYJjxRZpYdEd13akm_ng8Yr7Ujh56tnSPSatYlmoE0KV4zsZt1IrlyFjYQ5bOQH5BmQMigdK8VPSxuWbvdV1NGbNyp390X5NdqNkNJwN4_HDOdnDN2po8YYL0ipXlbqEdKBMr_RH8AmYh7Tk
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+7th+International+Conference+of+Computer+and+Informatics+Engineering+%28IC2IE%29&rft.atitle=Ant+Colony-Based+Sequential+Hybrid+Sampling+Method+for+Handling+the+Imbalanced+Data+Problem+in+Drug-Target+Interaction+Prediction&rft.au=Harahap%2C+Rizky+Nurhaliza&rft.au=Hendrawan%2C+Rahmat&rft.au=Kurniawan%2C+Isman&rft.date=2024-09-12&rft.pub=IEEE&rft.spage=1&rft.epage=7&rft_id=info:doi/10.1109%2FIC2IE63342.2024.10748152&rft.externalDocID=10748152