Ant Colony-Based Sequential Hybrid Sampling Method for Handling the Imbalanced Data Problem in Drug-Target Interaction Prediction
Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notab...
Saved in:
| Published in | 2024 7th International Conference of Computer and Informatics Engineering (IC2IE) pp. 1 - 7 |
|---|---|
| Main Authors | , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
12.09.2024
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.1109/IC2IE63342.2024.10748152 |
Cover
| Abstract | Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notably class imbalance. This study investigates the application of oversampling followed by Ant Colony Optimization (ACO) based undersampling, to address class imbalance problem in DTI prediction using Sequential Hybrid Sampling (SHS) approach. Several oversampling methods such as SMOTE, Random Oversampling (ROS), and ADASYN were used in the experiment to get the best results. The evaluation results show that by using F1-score as the fitness function, it is found that using ACO as undersampling based with SHS approach can improve the performance of the classifier. Gradient boosting is used as the classification method and F1-score, G-Mean, and Balanced Accuracy are used as evaluation metrics. The result show that implementation SHS with SMOTE and ACO were achieved the highest F1-score of 43.24%, Geometric Mean (G-Mean) of 50.75%, and Balanced Accuracy Score (BAS) of 62.41%. These results highlight that the implementation of SHS with ACO can improve the performance of the classifier in DTI prediction compared to the classic oversampling technique. By advancing the understanding and methodologies for handling class imbalance in DTI prediction, this study contributes to the broader goal of enhancing drug discovery and development processes. |
|---|---|
| AbstractList | Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational techniques, particularly machine learning, are increasingly used due to cost and time efficiencies. However, DTI prediction faces challenges, notably class imbalance. This study investigates the application of oversampling followed by Ant Colony Optimization (ACO) based undersampling, to address class imbalance problem in DTI prediction using Sequential Hybrid Sampling (SHS) approach. Several oversampling methods such as SMOTE, Random Oversampling (ROS), and ADASYN were used in the experiment to get the best results. The evaluation results show that by using F1-score as the fitness function, it is found that using ACO as undersampling based with SHS approach can improve the performance of the classifier. Gradient boosting is used as the classification method and F1-score, G-Mean, and Balanced Accuracy are used as evaluation metrics. The result show that implementation SHS with SMOTE and ACO were achieved the highest F1-score of 43.24%, Geometric Mean (G-Mean) of 50.75%, and Balanced Accuracy Score (BAS) of 62.41%. These results highlight that the implementation of SHS with ACO can improve the performance of the classifier in DTI prediction compared to the classic oversampling technique. By advancing the understanding and methodologies for handling class imbalance in DTI prediction, this study contributes to the broader goal of enhancing drug discovery and development processes. |
| Author | Kurniawan, Isman Harahap, Rizky Nurhaliza Hendrawan, Rahmat |
| Author_xml | – sequence: 1 givenname: Rizky Nurhaliza surname: Harahap fullname: Harahap, Rizky Nurhaliza email: lizahrp@student.telkomuniversity.ac.id organization: Telkom University,School of Computing,Bandung,Indonesia – sequence: 2 givenname: Rahmat surname: Hendrawan fullname: Hendrawan, Rahmat email: rahmathendrawann@student.telkomuniversity.ac.id organization: Telkom University,School of Computing,Bandung,Indonesia – sequence: 3 givenname: Isman surname: Kurniawan fullname: Kurniawan, Isman email: ismankrn@telkomuniversity.ac.id organization: Telkom University,School of Computing,Bandung,Indonesia |
| BookMark | eNo1UMtOwzAQNBIcoPQPOPgHUvyIY-dY0kIjFYFE79Um3rSWEru47qFH_pxQ4DSj0cxod-7ItQ8eCaGczThn5WNdiXpZSJmLmWAin3Gmc8OVuCLTUpdGSq6YUpLdkq-5T7QKffDn7AmOaOkHfp7QJwc9XZ2b6EYFhkPv_I6-YtoHS7sQ6Qq8vWhpj7QeGujBt2N6AQnoewxNjwN1ni7iaZdtIO4w0donjNAmF_xoQesu9J7cdNAfcfqHE7J5Xm6qVbZ-e6mr-TpzJU_ZeD7q0uZdyXVjjAIBjW2ZLKzSgBoMYwWzTP-8xYWVbWEMQ1NYEF2juJUT8vBb6xBxe4hugHje_g8jvwEwbF7A |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IC2IE63342.2024.10748152 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798331505530 |
| EndPage | 7 |
| ExternalDocumentID | 10748152 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i91t-481e79d4f917b885a2abdc036d57ae7a80060d07055312d3c6880e86da2fb51d3 |
| IEDL.DBID | RIE |
| IngestDate | Wed Nov 20 06:17:43 EST 2024 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i91t-481e79d4f917b885a2abdc036d57ae7a80060d07055312d3c6880e86da2fb51d3 |
| PageCount | 7 |
| ParticipantIDs | ieee_primary_10748152 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Sept.-12 |
| PublicationDateYYYYMMDD | 2024-09-12 |
| PublicationDate_xml | – month: 09 year: 2024 text: 2024-Sept.-12 day: 12 |
| PublicationDecade | 2020 |
| PublicationTitle | 2024 7th International Conference of Computer and Informatics Engineering (IC2IE) |
| PublicationTitleAbbrev | IC2IE |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.8854983 |
| Snippet | Drug Target Interaction (DTI) is crucial in pharmaceutical research, aiding drug discovery, repositioning, and side effect identification. Computational... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Accuracy ACO Ant colony optimization Classification algorithms Diffusion tensor imaging Drug discovery Drug target interaction Drugs Measurement Optimization oversampling Robustness Sampling methods SHS SMOTE undersampling |
| Title | Ant Colony-Based Sequential Hybrid Sampling Method for Handling the Imbalanced Data Problem in Drug-Target Interaction Prediction |
| URI | https://ieeexplore.ieee.org/document/10748152 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA62J08qVnyTg9fd7nuzR-2DVmgRWqG3ksdsKeoqZfdQb_5zZ7JdRUHwNoQsCQmzM0m-bz7GbpQnU0974GQgiJJjEnQptIxMSaPS97Ug7vBkmoweo_tFvNiR1S0XBgAs-AxcMu1bvnnVFV2VdQk8KDDgtFgrFUlN1mrQOV7WHfeC8SAJw4gIVkHkNt1_CKfYuDE8YNNmxBou8uRWpXL1-69ijP-e0iHrfFP0-MNX8Dlie1Acs4_bouQ9_J0VW-cOw5PhM4uURi9-5qMtkbP4TBKGvFjxidWO5pi08hGVWqA2zAb5-EUR3FHj131ZShqFJGf4uuD9TbVy5hY7zu1VYs2KwC703ENmh82Hg3lv5Ow0Fpx15pcOzh3SzEQ5ntqUELEMpDIao5qJUwmpFFSvxXhUcif0AxPqBP0dRGJkkKvYN-EJaxevBZwyDkESaOEpLzdZFGU5HuQkZhuYL4BIAcQZ69DyLd_qKhrLZuXO_2i_YPu0i46Va7hk7XJTwRUmAKW6thv_CSDAsZc |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELagDDABoog3HliT5uEkzgh9KIG2QmqQulVO7FQVkKIqGcrGP-fOaUAgIbGdrFi2nFzubH_ffYTcpJYIrMxSRqg4UnKkDy4FlhQBalTadsaROzwa-9ETu5960w1ZXXNhlFIafKZMNPVdvlxmFR6VdRA8yCHgbJMdjzHm1XStBp9jhZ2468R933UZUqwcZjYdfkin6Mgx2CfjZswaMPJsVmVqZu-_yjH-e1IHpP1N0qOPX-HnkGyp4oh83BYl7cIPrVgbdxCgJJ1orDT48QuN1kjPohOBKPJiTkdaPZpC2kojLLaAbZAP0vg1RcBjBr17ohQ4CorO0EVBe6tqbiQaPU71YWLNi4BH8MIHzTZJBv2kGxkblQVjEdqlAXNXQShZDvu2lHNPOCKVGcQ16QVCBYJjxRZpYdEd13akm_ng8Yr7Ujh56tnSPSatYlmoE0KV4zsZt1IrlyFjYQ5bOQH5BmQMigdK8VPSxuWbvdV1NGbNyp390X5NdqNkNJwN4_HDOdnDN2po8YYL0ipXlbqEdKBMr_RH8AmYh7Tk |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+7th+International+Conference+of+Computer+and+Informatics+Engineering+%28IC2IE%29&rft.atitle=Ant+Colony-Based+Sequential+Hybrid+Sampling+Method+for+Handling+the+Imbalanced+Data+Problem+in+Drug-Target+Interaction+Prediction&rft.au=Harahap%2C+Rizky+Nurhaliza&rft.au=Hendrawan%2C+Rahmat&rft.au=Kurniawan%2C+Isman&rft.date=2024-09-12&rft.pub=IEEE&rft.spage=1&rft.epage=7&rft_id=info:doi/10.1109%2FIC2IE63342.2024.10748152&rft.externalDocID=10748152 |