Assessing the Limitations of Relief-Based Algorithms in Detecting Higher-Order Interactions

The investigation of epistasis becomes increasingly complex as more loci are considered due to the exponential expansion of possible interactions. Consequently, selecting key features that influence epistatic interactions is crucial for effective downstream analyses. Recognizing this challenge, this...

Full description

Saved in:
Bibliographic Details
Published inResearch square
Main Authors Freda, Philip J, Ye, Suyu, Zhang, Robert, Moore, Jason H, Urbanowicz, Ryan J
Format Journal Article
LanguageEnglish
Published United States 02.09.2024
Subjects
Online AccessGet full text
ISSN2693-5015
DOI10.21203/rs.3.rs-4870116/v1

Cover

Abstract The investigation of epistasis becomes increasingly complex as more loci are considered due to the exponential expansion of possible interactions. Consequently, selecting key features that influence epistatic interactions is crucial for effective downstream analyses. Recognizing this challenge, this study investigates the efficiency of Relief-Based Algorithms (RBAs) in detecting higher-order epistatic interactions, which may be critical for understanding the genetic architecture of complex traits. RBAs are uniquely non-exhaustive, eliminating the need to construct features for every possible interaction and thus improving computational tractability. Motivated by previous research indicating that some RBAs rank predictive features involved in higher-order epistasis as highly negative, we explore the utility of absolute value ranking of RBA feature weights as an alternative method to capture complex interactions. We evaluate ReliefF, MultiSURF, and MultiSURFstar on simulated genetic datasets that model various patterns of genotype-phenotype associations, including 2-way to 5-way genetic interactions, and compare their performance to two control methods: a random shuffle and mutual information. Our findings indicate that while RBAs effectively identify lower-order (2 to 3-way) interactions, their capability to detect higher-order interactions is significantly limited, primarily by large feature count but also by signal noise. Specifically, we observe that RBAs are successful in detecting fully penetrant 4-way XOR interactions using an absolute value ranking approach, but this is restricted to datasets with a minimal number of total features. These results highlight the inherent limitations of current RBAs and underscore the need for enhanced detection capabilities for the investigation of epistasis, particularly in datasets with large feature counts and complex higher-order interactions.
AbstractList The investigation of epistasis becomes increasingly complex as more loci are considered due to the exponential expansion of possible interactions. Consequently, selecting key features that influence epistatic interactions is crucial for effective downstream analyses. Recognizing this challenge, this study investigates the efficiency of Relief-Based Algorithms (RBAs) in detecting higher-order epistatic interactions, which may be critical for understanding the genetic architecture of complex traits. RBAs are uniquely non-exhaustive, eliminating the need to construct features for every possible interaction and thus improving computational tractability. Motivated by previous research indicating that some RBAs rank predictive features involved in higher-order epistasis as highly negative, we explore the utility of absolute value ranking of RBA feature weights as an alternative method to capture complex interactions. We evaluate ReliefF, MultiSURF, and MultiSURFstar on simulated genetic datasets that model various patterns of genotype-phenotype associations, including 2-way to 5-way genetic interactions, and compare their performance to two control methods: a random shuffle and mutual information. Our findings indicate that while RBAs effectively identify lower-order (2 to 3-way) interactions, their capability to detect higher-order interactions is significantly limited, primarily by large feature count but also by signal noise. Specifically, we observe that RBAs are successful in detecting fully penetrant 4-way XOR interactions using an absolute value ranking approach, but this is restricted to datasets with a minimal number of total features. These results highlight the inherent limitations of current RBAs and underscore the need for enhanced detection capabilities for the investigation of epistasis, particularly in datasets with large feature counts and complex higher-order interactions.
Author Freda, Philip J
Ye, Suyu
Moore, Jason H
Urbanowicz, Ryan J
Zhang, Robert
Author_xml – sequence: 1
  givenname: Philip J
  surname: Freda
  fullname: Freda, Philip J
  organization: Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
– sequence: 2
  givenname: Suyu
  surname: Ye
  fullname: Ye, Suyu
  organization: Whiting School of Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD, 21218, USA
– sequence: 3
  givenname: Robert
  surname: Zhang
  fullname: Zhang, Robert
  organization: University of Pennsylvania, Philadelphia, PA, 19104, USA
– sequence: 4
  givenname: Jason H
  surname: Moore
  fullname: Moore, Jason H
  organization: Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
– sequence: 5
  givenname: Ryan J
  surname: Urbanowicz
  fullname: Urbanowicz, Ryan J
  organization: Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/39281873$$D View this record in MEDLINE/PubMed
BookMark eNo9kF1rwjAUhsPYmM75CwYjf6DakzRpe-nch4IgjO1qFyVNTzTQpiWpDv_9nLrdnPfi8Ly8PHfk2rUOCXmAeMKAxXzqw4RPfIiSLI0B5HQPV2TIZM4jEYMYkHEItoxFyniaA9ySAc9ZBlnKh-RrFgIe325D-y3SlW1sr3rbukBbQ9-xtmiiJxWworN603rbb5tAraPP2KPuf7mF3WzRR2tfoadL16NX-tRwT26MqgOOLzkin68vH_NFtFq_LeezVdTBcW6UCiWMzAXTKTLU0phY5DrXHCouJWqlWaKRZyqTJpE8wxS0LHmpdM6VAcVHJDn37lynDt-qrovO20b5QwFxcVJU-FCczkVRsYcj9njGul3ZYPXP_MnhP6E_aRY
ContentType Journal Article
DBID NPM
UNPAY
DOI 10.21203/rs.3.rs-4870116/v1
DatabaseName PubMed
Unpaywall
DatabaseTitle PubMed
DatabaseTitleList PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
EISSN 2693-5015
ExternalDocumentID 10.21203/rs.3.rs-4870116/v1
39281873
Genre Journal Article
Preprint
GrantInformation_xml – fundername: NIAID NIH HHS
  grantid: R01 AI173095
– fundername: NIA NIH HHS
  grantid: R01 AG066833
– fundername: NLM NIH HHS
  grantid: R01 LM010098
GroupedDBID NPM
UNPAY
ID FETCH-LOGICAL-p1011-75a5f6952c7e2ec6ff059c9c31d366ecac24ce38a86f4638e71c6b3bac93af1a3
IEDL.DBID UNPAY
IngestDate Sun Oct 26 04:15:57 EDT 2025
Tue Aug 05 11:42:43 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Keywords RBA
Relief-based algorithm
univariate
epistasis
ReliefF
feature selection
heterogeneity
high-order interactions
Language English
License cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-p1011-75a5f6952c7e2ec6ff059c9c31d366ecac24ce38a86f4638e71c6b3bac93af1a3
OpenAccessLink https://proxy.k.utb.cz/login?url=https://doi.org/10.21203/rs.3.rs-4870116/v1
PMID 39281873
ParticipantIDs unpaywall_primary_10_21203_rs_3_rs_4870116_v1
pubmed_primary_39281873
PublicationCentury 2000
PublicationDate 2024-Sep-02
PublicationDateYYYYMMDD 2024-09-02
PublicationDate_xml – month: 09
  year: 2024
  text: 2024-Sep-02
  day: 02
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Research square
PublicationTitleAlternate Res Sq
PublicationYear 2024
References 39354639 - BioData Min. 2024 Oct 1;17(1):37. doi: 10.1186/s13040-024-00390-0
References_xml – reference: 39354639 - BioData Min. 2024 Oct 1;17(1):37. doi: 10.1186/s13040-024-00390-0
SSID ssib057237911
Score 1.8845876
SecondaryResourceType preprint
Snippet The investigation of epistasis becomes increasingly complex as more loci are considered due to the exponential expansion of possible interactions....
SourceID unpaywall
pubmed
SourceType Open Access Repository
Index Database
Title Assessing the Limitations of Relief-Based Algorithms in Detecting Higher-Order Interactions
URI https://www.ncbi.nlm.nih.gov/pubmed/39281873
https://doi.org/10.21203/rs.3.rs-4870116/v1
UnpaywallVersion acceptedVersion
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NS8MwGA66HTz5gV8THTl4bbc2adIc58cYwuYODiYeSpImOpzdaDtFf71JU8YQBL3k0hTKm6Tv8348TwC4JIowjkVq2ypMgCJi5fHQXhmmGKYa84hTS04ejshggu-m0bTW2bZcmI36vfmpdlEnL3zk54VngLWtGXTeTajTJJEB3g3QnIzGvUenJfTb7A3PsrPKlvzzg8_nGy6kv-e42UWlPGg7R179VSl8-fVDl_GPX7cPdmsoCXtu7Q_AlsoOwZOr4hqHBA20gxV_ySXl4EJD24CstHdlXFcKe_PnRT4rX94KOMvgjbLlBPuea_3w7q0oJ6wyho78UByBSf_24Xrg1RcoeMvApj5pxCNNWBRKqkIlidYGTEkmUZAiQpTkMsRSoZjHRGNzEBUNJBFIcMkQ1wFHx6CRLTJ1CqBBEUGsI6EVFzhl3ZgTwXSEWBxQFWDaAifOzMnSqWQkBngZMEBRC3hru68fmtCjMmCSF0k11AY0MfvZP-efg0aZr9SFwQalaIPt0XjYrnfGN9oCuxc
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LS8MwHA6yHTz5wNdEJQev7dbm1RznYwzB6cHBxENJ0kSHsxttp-hfb9KUMQRBL7k0hfJL0t_3e3xfADinmnKBZebaKmyAIhMdiNhdGaY5ZgYLIpgjJ9-O6HCMbyZk0uhsOy7MWv3e_lR7qFuUIQqLMrDA2tUMuu821GlTYoF3C7THo_v-o9cS-m32mmfZXOYL8fkhZrM1FzLY9tzsslYedJ0jr-GykqH6-qHL-Mev2wFbDZSEfb_2u2BD53vgyVdxrUOCFtrBmr_kk3JwbqBrQNYmuLCuK4P92fO8mFYvbyWc5vBKu3KCe8-3fgR3TpQT1hlDT34o98F4cP1wOQyaCxSCReRSn4wIYignsWI61ooaY8GU4gpFGaJUK6FirDRKREINtgdRs0hRiaRQHAkTCXQAWvk810cAWhQRJYZIo4XEGe8lgkpuCOJJxHSEWQccejOnC6-SkVrgZcEAQx0QrOy-emhDj9qAaVGm9dAY0Mbsx_-cfwJaVbHUpxYbVPKs2RPffK26Cw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Assessing+the+Limitations+of+Relief-Based+Algorithms+in+Detecting+Higher-Order+Interactions&rft_id=info:doi/10.21203%2Frs.3.rs-4870116%2Fv1&rft.externalDocID=10.21203%2Frs.3.rs-4870116%2Fv1