CIMLA: Interpretable AI for inference of differential causal networks

Bibliographic Details
Published in	ArXiv.org
Main Authors	Dibaeinia, Payam; Sinha, Saurabh
Format	Journal Article
Language	English
Published	United States: Cornell University, 25.04.2023
ISSN	2331-8422

Summary:	The discovery of causal relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model estimates a causal quantity reflecting the influence of one variable on another, under certain assumptions. We leverage this insight to implement a new tool, CIMLA, for discovering condition-dependent changes in causal relationships. We then use CIMLA to identify differences in gene regulatory networks between biological conditions, a problem that has received great attention in recent years. Using extensive benchmarking on simulated data sets, we show that CIMLA is more robust to confounding variables and is more accurate than leading methods. Finally, we employ CIMLA to analyze a previously published single-cell RNA-seq data set collected from subjects with and without Alzheimer's disease (AD), discovering several potential regulators of AD.