CIMLA: Interpretable AI for inference of differential causal networks

The discovery of causal relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model estimates a caus...

Full description

Saved in:
Bibliographic Details
Published inArXiv.org
Main Authors Dibaeinia, Payam, Sinha, Saurabh
Format Journal Article
LanguageEnglish
Published United States Cornell University 25.04.2023
Online AccessGet full text
ISSN2331-8422
2331-8422

Cover

More Information
Summary:The discovery of causal relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model estimates a causal quantity reflecting the influence of one variable on another, under certain assumptions. We leverage this insight to implement a new tool, CIMLA, for discovering condition-dependent changes in causal relationships. We then use CIMLA to identify differences in gene regulatory networks between biological conditions, a problem that has received great attention in recent years. Using extensive benchmarking on simulated data sets, we show that CIMLA is more robust to confounding variables and is more accurate than leading methods. Finally, we employ CIMLA to analyze a previously published single-cell RNA-seq data set collected from subjects with and without Alzheimer's disease (AD), discovering several potential regulators of AD.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Working Paper/Pre-Print-1
ObjectType-Feature-3
content type line 23
ISSN:2331-8422
2331-8422