GC skew is a conserved property of unmethylated CpG island promoters across vertebrates

Stella R. Hartono, Ian F Korf, Frederic Chedin

Research output: Contribution to journalArticle

25 Citations (Scopus)

Abstract

GC skew is a measure of the strand asymmetry in the distribution of guanines and cytosines. GC skew favors R-loops, a type of three stranded nucleic acid structures that form upon annealing of an RNA strand to one strand of DNA, creating a persistent RNA:DNA hybrid. Previous studies show that GC skew is prevalent at thousands of human CpG island (CGI) promoters and transcription termination regions, which correspond to hotspots of R-loop formation. Here, we investigated the conservation of GC skew patterns in 60 sequenced chordates genomes. We report that GC skew is a conserved sequence characteristic of the CGI promoter class in vertebrates. Furthermore, we reveal that promoter GC skew peaks at the exon 1/ intron1 junction and that it is highly correlated with gene age and CGI promoter strength. Our data also show that GC skew is predictive of unmethylated CGI promoters in a range of vertebrate species and that it imparts significant DNA hypomethylation for promoters with intermediate CpG densities. Finally, we observed that terminal GC skew is conserved for a subset of vertebrate genes that tend to be located significantly closer to their downstream neighbors, consistent with a role for R-loop formation in transcription termination.

Original languageEnglish (US)
Pages (from-to)9729-9741
Number of pages13
JournalNucleic Acids Research
Volume43
Issue number20
DOIs
StatePublished - Jul 29 2015

Fingerprint

CpG Islands
Vertebrates
DNA
RNA
Chordata
Conserved Sequence
Cytosine
Guanine
Nucleic Acids
Genes
Exons
Genome

ASJC Scopus subject areas

  • Genetics

Cite this

GC skew is a conserved property of unmethylated CpG island promoters across vertebrates. / Hartono, Stella R.; Korf, Ian F; Chedin, Frederic.

In: Nucleic Acids Research, Vol. 43, No. 20, 29.07.2015, p. 9729-9741.

Research output: Contribution to journalArticle

@article{d7adb4d2b2844abe86fae739e2c33bee,
title = "GC skew is a conserved property of unmethylated CpG island promoters across vertebrates",
abstract = "GC skew is a measure of the strand asymmetry in the distribution of guanines and cytosines. GC skew favors R-loops, a type of three stranded nucleic acid structures that form upon annealing of an RNA strand to one strand of DNA, creating a persistent RNA:DNA hybrid. Previous studies show that GC skew is prevalent at thousands of human CpG island (CGI) promoters and transcription termination regions, which correspond to hotspots of R-loop formation. Here, we investigated the conservation of GC skew patterns in 60 sequenced chordates genomes. We report that GC skew is a conserved sequence characteristic of the CGI promoter class in vertebrates. Furthermore, we reveal that promoter GC skew peaks at the exon 1/ intron1 junction and that it is highly correlated with gene age and CGI promoter strength. Our data also show that GC skew is predictive of unmethylated CGI promoters in a range of vertebrate species and that it imparts significant DNA hypomethylation for promoters with intermediate CpG densities. Finally, we observed that terminal GC skew is conserved for a subset of vertebrate genes that tend to be located significantly closer to their downstream neighbors, consistent with a role for R-loop formation in transcription termination.",
author = "Hartono, {Stella R.} and Korf, {Ian F} and Frederic Chedin",
year = "2015",
month = "7",
day = "29",
doi = "10.1093/nar/gkv811",
language = "English (US)",
volume = "43",
pages = "9729--9741",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "20",

}

TY - JOUR

T1 - GC skew is a conserved property of unmethylated CpG island promoters across vertebrates

AU - Hartono, Stella R.

AU - Korf, Ian F

AU - Chedin, Frederic

PY - 2015/7/29

Y1 - 2015/7/29

N2 - GC skew is a measure of the strand asymmetry in the distribution of guanines and cytosines. GC skew favors R-loops, a type of three stranded nucleic acid structures that form upon annealing of an RNA strand to one strand of DNA, creating a persistent RNA:DNA hybrid. Previous studies show that GC skew is prevalent at thousands of human CpG island (CGI) promoters and transcription termination regions, which correspond to hotspots of R-loop formation. Here, we investigated the conservation of GC skew patterns in 60 sequenced chordates genomes. We report that GC skew is a conserved sequence characteristic of the CGI promoter class in vertebrates. Furthermore, we reveal that promoter GC skew peaks at the exon 1/ intron1 junction and that it is highly correlated with gene age and CGI promoter strength. Our data also show that GC skew is predictive of unmethylated CGI promoters in a range of vertebrate species and that it imparts significant DNA hypomethylation for promoters with intermediate CpG densities. Finally, we observed that terminal GC skew is conserved for a subset of vertebrate genes that tend to be located significantly closer to their downstream neighbors, consistent with a role for R-loop formation in transcription termination.

AB - GC skew is a measure of the strand asymmetry in the distribution of guanines and cytosines. GC skew favors R-loops, a type of three stranded nucleic acid structures that form upon annealing of an RNA strand to one strand of DNA, creating a persistent RNA:DNA hybrid. Previous studies show that GC skew is prevalent at thousands of human CpG island (CGI) promoters and transcription termination regions, which correspond to hotspots of R-loop formation. Here, we investigated the conservation of GC skew patterns in 60 sequenced chordates genomes. We report that GC skew is a conserved sequence characteristic of the CGI promoter class in vertebrates. Furthermore, we reveal that promoter GC skew peaks at the exon 1/ intron1 junction and that it is highly correlated with gene age and CGI promoter strength. Our data also show that GC skew is predictive of unmethylated CGI promoters in a range of vertebrate species and that it imparts significant DNA hypomethylation for promoters with intermediate CpG densities. Finally, we observed that terminal GC skew is conserved for a subset of vertebrate genes that tend to be located significantly closer to their downstream neighbors, consistent with a role for R-loop formation in transcription termination.

UR - http://www.scopus.com/inward/record.url?scp=84950143504&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84950143504&partnerID=8YFLogxK

U2 - 10.1093/nar/gkv811

DO - 10.1093/nar/gkv811

M3 - Article

C2 - 26253743

AN - SCOPUS:84950143504

VL - 43

SP - 9729

EP - 9741

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 20

ER -