CpG island clusters and pro-epigenetic selection for CpGs in protein-coding exons of HOX and other transcription factors

CpG dinucleotides contribute to epigenetic mechanisms by being the only site for DNA methylation in mammalian somatic cells. They are also mutation hotspots and ∼5-fold depleted genome-wide. We report here a study focused on CpG sites in the coding regions of Hox and other transcription factor genes...

Full description

Saved in:
Bibliographic Details
Published inProceedings of the National Academy of Sciences - PNAS Vol. 107; no. 35; pp. 15485 - 15490
Main Authors Branciamore, Sergio, Chen, Zhao-Xia, Riggs, Arthur D., Rodin, Sergei N.
Format Journal Article
LanguageEnglish
Published United States National Academy of Sciences 31.08.2010
National Acad Sciences
Subjects
Online AccessGet full text
ISSN0027-8424
1091-6490
1091-6490
DOI10.1073/pnas.1010506107

Cover

More Information
Summary:CpG dinucleotides contribute to epigenetic mechanisms by being the only site for DNA methylation in mammalian somatic cells. They are also mutation hotspots and ∼5-fold depleted genome-wide. We report here a study focused on CpG sites in the coding regions of Hox and other transcription factor genes, comparing methylated genomes of Homo sapiens, Mus musculus, and Danio rerio with nonmethylated genomes of Drosophila melanogaster and Caenorhabditis elegans. We analyzed 4-fold degenerate, synonymous codons with the potential for CpG. That is, we studied "silent" changes that do not affect protein products but could damage epigenetic marking. We find that DNA-binding transcription factors and other developmentally relevant genes show, only in methylated genomes, a bimodal distribution of CpG usage. Several genetic code-based tests indicate, again for methylated genomes only, that the frequency of silent CpGs in Hox genes is much greater than expectation. Also informative are NCG-GNN and NCC-GNN codon doublets, for which an unusually high rate of G to C and C to G transversions was observed at the third (silent) position of the first codon. Together these results are interpreted as evidence for strong "pro-epigenetic" selection acting to preserve CpG sites in coding regions of many genes controlling development. We also report that DNA-binding transcription factors and developmentally important genes are dramatically overrepresented in or near clusters of three or more CpG islands, suggesting a possible relationship between evolutionary preservation of CpG dinucleotides in both coding regions and CpG islands.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
Author contributions: S.B., Z.-X.C., A.D.R., and S.N.R. designed research; S.B. and Z.-X.C. performed research; S.B., Z.-X.C., A.D.R., and S.N.R. analyzed data; and S.B., Z.-X.C., A.D.R., and S.N.R. wrote the paper.
Contributed by Arthur D. Riggs, July 27, 2010 (sent for review May 26, 2010)
ISSN:0027-8424
1091-6490
1091-6490
DOI:10.1073/pnas.1010506107