A two-sample tree-based test for hierarchically organized genomic signals
Abstract This article addresses a common type of data encountered in genomic studies, where a signal along a linear chromosome exhibits a hierarchical organization. We propose a novel framework to assess the significance of dissimilarities between two sets of genomic matrices obtained from distinct...
        Saved in:
      
    
          | Published in | Journal of the Royal Statistical Society Series C: Applied Statistics Vol. 73; no. 3; pp. 774 - 795 | 
|---|---|
| Main Authors | , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        UK
          Oxford University Press
    
        01.06.2024
     Wiley  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 0035-9254 1467-9876  | 
| DOI | 10.1093/jrsssc/qlae011 | 
Cover
| Summary: | Abstract
This article addresses a common type of data encountered in genomic studies, where a signal along a linear chromosome exhibits a hierarchical organization. We propose a novel framework to assess the significance of dissimilarities between two sets of genomic matrices obtained from distinct biological conditions. Our approach relies on a data representation based on trees. It utilizes tree distances and an aggregation procedure for tests performed at the level of leaf pairs. Numerical experiments demonstrate its statistical validity and its superior accuracy and power compared to alternatives. The method’s effectiveness is illustrated using real-world data from GWAS and Hi-C data. | 
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14  | 
| ISSN: | 0035-9254 1467-9876  | 
| DOI: | 10.1093/jrsssc/qlae011 |