Assessing the Limitations of Relief-Based Algorithms in Detecting Higher-Order Interactions
| Published in | Research square |
|---|---|
| Main Authors | Freda, Philip J; Ye, Suyu; Zhang, Robert; Moore, Jason H; Urbanowicz, Ryan J |
| Format | Journal Article |
| Language | English |
| Published | United States, 02.09.2024 |
| ISSN | 2693-5015 |
| DOI | 10.21203/rs.3.rs-4870116/v1 |
| Abstract | The investigation of epistasis becomes increasingly complex as more loci are considered due to the exponential expansion of possible interactions. Consequently, selecting key features that influence epistatic interactions is crucial for effective downstream analyses. Recognizing this challenge, this study investigates the efficiency of Relief-Based Algorithms (RBAs) in detecting higher-order epistatic interactions, which may be critical for understanding the genetic architecture of complex traits. RBAs are uniquely non-exhaustive, eliminating the need to construct features for every possible interaction and thus improving computational tractability. Motivated by previous research indicating that some RBAs rank predictive features involved in higher-order epistasis as highly negative, we explore the utility of absolute value ranking of RBA feature weights as an alternative method to capture complex interactions. We evaluate ReliefF, MultiSURF, and MultiSURFstar on simulated genetic datasets that model various patterns of genotype-phenotype associations, including 2-way to 5-way genetic interactions, and compare their performance to two control methods: a random shuffle and mutual information. Our findings indicate that while RBAs effectively identify lower-order (2-way to 3-way) interactions, their capability to detect higher-order interactions is significantly limited, primarily by large feature count but also by signal noise. Specifically, we observe that RBAs are successful in detecting fully penetrant 4-way XOR interactions using an absolute value ranking approach, but this is restricted to datasets with a minimal number of total features. These results highlight the inherent limitations of current RBAs and underscore the need for enhanced detection capabilities for the investigation of epistasis, particularly in datasets with large feature counts and complex higher-order interactions. |
|---|---|
| Author | Freda, Philip J; Ye, Suyu; Moore, Jason H; Urbanowicz, Ryan J; Zhang, Robert |
| Author_xml | 1. Freda, Philip J (Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA); 2. Ye, Suyu (Whiting School of Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD, 21218, USA); 3. Zhang, Robert (University of Pennsylvania, Philadelphia, PA, 19104, USA); 4. Moore, Jason H (Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA); 5. Urbanowicz, Ryan J (Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA) |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/39281873 (MEDLINE/PubMed record) |
| ContentType | Journal Article |
| DBID | NPM UNPAY |
| DOI | 10.21203/rs.3.rs-4870116/v1 |
| DatabaseName | PubMed Unpaywall |
| DatabaseTitle | PubMed |
| DatabaseTitleList | PubMed |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| EISSN | 2693-5015 |
| ExternalDocumentID | 10.21203/rs.3.rs-4870116/v1 39281873 |
| Genre | Journal Article Preprint |
| GrantInformation_xml | NIAID NIH HHS, grant R01 AI173095; NIA NIH HHS, grant R01 AG066833; NLM NIH HHS, grant R01 LM010098 |
| GroupedDBID | NPM UNPAY |
| IEDL.DBID | UNPAY |
| IngestDate | Sun Oct 26 04:15:57 EDT 2025; Tue Aug 05 11:42:43 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Keywords | RBA; Relief-based algorithm; univariate; epistasis; ReliefF; feature selection; heterogeneity; high-order interactions |
| Language | English |
| License | cc-by |
| LinkModel | DirectLink |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://doi.org/10.21203/rs.3.rs-4870116/v1 |
| PMID | 39281873 |
| ParticipantIDs | unpaywall_primary_10_21203_rs_3_rs_4870116_v1 pubmed_primary_39281873 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Sep-02 |
| PublicationDateYYYYMMDD | 2024-09-02 |
| PublicationDate_xml | – month: 09 year: 2024 text: 2024-Sep-02 day: 02 |
| PublicationDecade | 2020 |
| PublicationPlace | United States |
| PublicationPlace_xml | – name: United States |
| PublicationTitle | Research square |
| PublicationTitleAlternate | Res Sq |
| PublicationYear | 2024 |
| References | 39354639 - BioData Min. 2024 Oct 1;17(1):37. doi: 10.1186/s13040-024-00390-0 |
| SSID | ssib057237911 |
| Score | 1.8845876 |
| SecondaryResourceType | preprint |
| SourceID | unpaywall pubmed |
| SourceType | Open Access Repository Index Database |
| Title | Assessing the Limitations of Relief-Based Algorithms in Detecting Higher-Order Interactions |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/39281873 https://doi.org/10.21203/rs.3.rs-4870116/v1 |
| UnpaywallVersion | acceptedVersion |
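
The abstract mentions two ideas that a small sketch can make concrete: the rapid growth in the number of candidate interactions as the interaction order rises, and ranking features by the absolute value of their Relief weights. The Python below is an illustrative toy only, not the study's code. It uses a basic one-hit, one-miss Relief rather than ReliefF, MultiSURF, or MultiSURFstar, and the function names (`count_interactions`, `relief_weights`, `rank_features`) and the four-row XOR truth table are assumptions introduced here for illustration.

```python
import math


def count_interactions(n_features, order):
    """Number of distinct feature subsets of size `order` (hypothetical helper)."""
    return math.comb(n_features, order)


def relief_weights(X, y):
    """Basic Relief: one nearest hit and one nearest miss per instance.

    Distance is Hamming distance; ties keep the first neighbor found.
    This is a simplified stand-in, not ReliefF or MultiSURF.
    """
    n, p = len(X), len(X[0])
    weights = [0.0] * p
    for i in range(n):
        hit = miss = None
        hit_d = miss_d = math.inf
        for j in range(n):
            if i == j:
                continue
            d = sum(a != b for a, b in zip(X[i], X[j]))
            if y[j] == y[i] and d < hit_d:
                hit, hit_d = j, d
            elif y[j] != y[i] and d < miss_d:
                miss, miss_d = j, d
        if hit is None or miss is None:
            continue
        for f in range(p):
            # Reward differing from the nearest miss, penalize differing from the nearest hit.
            diff_miss = int(X[i][f] != X[miss][f])
            diff_hit = int(X[i][f] != X[hit][f])
            weights[f] += (diff_miss - diff_hit) / n
    return weights


def rank_features(weights, absolute=False):
    """Feature indices sorted from highest to lowest (absolute) weight."""
    if absolute:
        return sorted(range(len(weights)), key=lambda f: abs(weights[f]), reverse=True)
    return sorted(range(len(weights)), key=lambda f: weights[f], reverse=True)


# Toy data (assumption): features 0 and 1 form a 2-way XOR, feature 2 is constant noise.
X = [[0, 0, 0], [0, 1, 0], [1, 0, 0], [1, 1, 0]]
y = [0, 1, 1, 0]

weights = relief_weights(X, y)
print(weights)                                # [-0.5, -0.5, 0.0]
print(rank_features(weights))                 # [2, 0, 1]: noise feature ranked first
print(rank_features(weights, absolute=True))  # [0, 1, 2]: XOR features ranked first
print(count_interactions(100, 2), count_interactions(100, 5))  # 4950 75287520
```

In this toy, each instance's nearest hit differs in both interacting features, so the predictive features receive negative weights and fall below the uninformative one under plain ranking, while absolute value ranking places them first. The example makes no claim about how the evaluated algorithms behave on the simulated datasets described in the abstract.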