A class skew-insensitive ACO-based decision tree algorithm for imbalanced data sets

Bibliographic Details
Published in Indonesian Journal of Electrical Engineering and Computer Science Vol. 21; no. 1; p. 412
Main Authors Bin Mohd Razali, Muhamad Hasbullah; Bin Saian, Rizauddin; Bee Wah, Yap; Ku-Mahamud, Ku Ruhana
Format Journal Article
Language English
Published 01.01.2021
ISSN 2502-4752
EISSN 2502-4760
DOI 10.11591/ijeecs.v21.i1.pp412-419


Abstract Ant-tree-miner (ATM) has an advantage over conventional decision tree algorithms in terms of feature selection. However, real-world applications commonly involve the imbalanced class problem, in which the classes differ in importance. This condition prevents the entropy-based heuristic of the existing ATM algorithm from developing effective decision boundaries, because the heuristic is biased towards the dominant class. Consequently, the induced decision trees are dominated by the majority class and lack predictive ability on the rare class. This study proposes an enhanced algorithm called Hellinger-ant-tree-miner (HATM), inspired by the ant colony optimization (ACO) metaheuristic, for imbalanced learning with decision tree classification. The proposed algorithm was compared with the existing ATM algorithm on nine publicly available imbalanced data sets. A simulation study reveals the superiority of HATM as the sample size increases under a skewed class distribution (imbalance ratio < 50%). Experimental results demonstrate that performance measured by BACC improves over the existing algorithm, owing to the class skew-insensitivity of the Hellinger distance. A statistical significance test shows that HATM has a higher mean BACC score than ATM.
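As an illustration of the two quantities the abstract relies on, the following minimal Python sketch computes a Hellinger distance score for a candidate split and a BACC score. It rests on assumptions not stated in this record: the split score follows the commonly used two-class Hellinger distance formulation for decision trees, which may differ from the exact heuristic used in HATM, and BACC is read as balanced accuracy (the mean of the true positive and true negative rates). The function names `hellinger_split_score` and `balanced_accuracy` are hypothetical.

```python
import math
from collections import Counter


def hellinger_split_score(branches, labels, positive):
    """Hellinger distance between the positive-class and negative-class
    distributions over the branches of a candidate split.

    branches[i] is the branch that sample i falls into, labels[i] its class.
    Assumption: standard two-class formulation, not necessarily HATM's own.
    """
    n_pos = sum(1 for y in labels if y == positive)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        return 0.0  # only one class present: the split cannot separate anything

    pos_counts = Counter(b for b, y in zip(branches, labels) if y == positive)
    neg_counts = Counter(b for b, y in zip(branches, labels) if y != positive)

    total = 0.0
    for b in set(branches):
        # Each term uses within-class fractions, so the class prior cancels out.
        diff = math.sqrt(pos_counts[b] / n_pos) - math.sqrt(neg_counts[b] / n_neg)
        total += diff ** 2
    return math.sqrt(total)


def balanced_accuracy(y_true, y_pred, positive):
    """Mean of the true positive rate and the true negative rate."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p != positive)
    n_pos = sum(1 for t in y_true if t == positive)
    n_neg = len(y_true) - n_pos
    tpr = tp / n_pos if n_pos else 0.0
    tnr = tn / n_neg if n_neg else 0.0
    return (tpr + tnr) / 2
```

Because the score is built from within-class fractions only, multiplying the number of majority-class samples in every branch by the same factor leaves it unchanged, which is the sense in which the Hellinger distance is described as class skew-insensitive. Balanced accuracy, in turn, gives a classifier that always predicts the majority class a score of 0.5 regardless of how rare the minority class is.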
License CC BY-NC 4.0 (http://creativecommons.org/licenses/by-nc/4.0)
URI https://ijeecs.iaescore.com/index.php/IJEECS/article/download/22253/14558