Recursive organizer (ROR): an analytic framework for sequence-based association analysis
The advent of next-generation sequencing technologies affords the ability to sequence thousands of subjects cost-effectively, and is revolutionizing the landscape of genetic research. With the evolving genotyping/sequencing technologies, it is not unrealistic to expect that we will soon obtain a pai...
Saved in:
| Published in | Human genetics Vol. 132; no. 7; pp. 745 - 759 |
|---|---|
| Main Authors | , |
| Format | Journal Article |
| Language | English |
| Published |
Berlin/Heidelberg
Springer-Verlag
01.07.2013
Springer Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0340-6717 1432-1203 1432-1203 |
| DOI | 10.1007/s00439-013-1285-4 |
Cover
| Summary: | The advent of next-generation sequencing technologies affords the ability to sequence thousands of subjects cost-effectively, and is revolutionizing the landscape of genetic research. With the evolving genotyping/sequencing technologies, it is not unrealistic to expect that we will soon obtain a pair of diploidic fully phased genome sequences from each subject in the near future. Here, in light of this potential, we propose an analytic framework called, recursive organizer (ROR), which recursively groups sequence variants based upon sequence similarities and their empirical disease associations, into fewer and potentially more interpretable super sequence variants (SSV). As an illustration, we applied ROR to assess an association between
HLA
-
DRB1
and type 1 diabetes (T1D), discovering SSVs of
HLA
-
DRB1
with sequence data from the Wellcome Trust Case Control Consortium. Specifically, ROR reduces 36 observed unique
HLA
-
DRB1
sequences into 8 SSVs that empirically associate with T1D, a fourfold reduction of sequence complexity. Using
HLA
-
DRB1
data from Type 1 Diabetes Genetics Consortium as cases and data from Fred Hutchinson Cancer Research Center as controls, we are able to validate associations of these SSVs with T1D. Further, SSVs consist of nine nucleotides, and each associates with its corresponding amino acids. Detailed examination of these selected amino acids reveals their potential functional roles in protein structures and possible implication to the mechanism of T1D. |
|---|---|
| Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0340-6717 1432-1203 1432-1203 |
| DOI: | 10.1007/s00439-013-1285-4 |