A two-sample tree-based test for hierarchically organized genomic signals

Abstract This article addresses a common type of data encountered in genomic studies, where a signal along a linear chromosome exhibits a hierarchical organization. We propose a novel framework to assess the significance of dissimilarities between two sets of genomic matrices obtained from distinct...

Full description

Saved in:
Bibliographic Details
Published inJournal of the Royal Statistical Society Series C: Applied Statistics Vol. 73; no. 3; pp. 774 - 795
Main Authors Neuvial, Pierre, Randriamihamison, Nathanaël, Chavent, Marie, Foissac, Sylvain, Vialaneix, Nathalie
Format Journal Article
LanguageEnglish
Published UK Oxford University Press 01.06.2024
Wiley
Subjects
Online AccessGet full text
ISSN0035-9254
1467-9876
DOI10.1093/jrsssc/qlae011

Cover

More Information
Summary:Abstract This article addresses a common type of data encountered in genomic studies, where a signal along a linear chromosome exhibits a hierarchical organization. We propose a novel framework to assess the significance of dissimilarities between two sets of genomic matrices obtained from distinct biological conditions. Our approach relies on a data representation based on trees. It utilizes tree distances and an aggregation procedure for tests performed at the level of leaf pairs. Numerical experiments demonstrate its statistical validity and its superior accuracy and power compared to alternatives. The method’s effectiveness is illustrated using real-world data from GWAS and Hi-C data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0035-9254
1467-9876
DOI:10.1093/jrsssc/qlae011