Literature DB >> 32800095

Promoter CpG Density Predicts Downstream Gene Loss-of-Function Intolerance.

Leandros Boukas1, Hans T Bjornsson2, Kasper D Hansen3.   

Abstract

The aggregation and joint analysis of large numbers of exome sequences has recently made it possible to derive estimates of intolerance to loss-of-function (LoF) variation for human genes. Here, we demonstrate strong and widespread coupling between genic LoF intolerance and promoter CpG density across the human genome. Genes downstream of the most CpG-rich promoters (top 10% CpG density) have a 67.2% probability of being highly LoF intolerant, using the LOEUF metric from gnomAD. This is in contrast to 7.4% of genes downstream of the most CpG-poor (bottom 10% CpG density) promoters. Combining promoter CpG density with exonic and promoter conservation explains 33.4% of the variation in LOEUF, and the contribution of CpG density exceeds the individual contributions of exonic and promoter conservation. We leverage this to train a simple and easily interpretable predictive model that outperforms other existing predictors and allows us to classify 1,760 genes-which are currently unascertained in gnomAD-as highly LoF intolerant or not. These predictions have the potential to aid in the interpretation of novel variants in the clinical setting. Moreover, our results reveal that high CpG density is not merely a generic feature of human promoters but is preferentially encountered at the promoters of the most selectively constrained genes, calling into question the prevailing view that CpG islands are not subject to selection.
Copyright © 2020. Published by Elsevier Inc.

Entities:  

Keywords:  CpG density; CpG islands; GC content; dosage sensitivity; epigenetics; gnomAD; haploinsufficiency; loss-of-function; promoters; selection

Mesh:

Substances:

Year:  2020        PMID: 32800095      PMCID: PMC7477270          DOI: 10.1016/j.ajhg.2020.07.014

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  59 in total

1.  Primate CpG islands are maintained by heterogeneous evolutionary regimes involving minimal selection.

Authors:  Netta Mendelson Cohen; Ephraim Kenigsberg; Amos Tanay
Journal:  Cell       Date:  2011-05-27       Impact factor: 41.582

2.  Human disease genes.

Authors:  G Jimenez-Sanchez; B Childs; D Valle
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

3.  The human noncoding genome defined by genetic diversity.

Authors:  Julia di Iulio; Istvan Bartha; Emily H M Wong; Hung-Chun Yu; Victor Lavrenko; Dongchan Yang; Inkyung Jung; Michael A Hicks; Naisha Shah; Ewen F Kirkness; Martin M Fabani; William H Biggs; Bing Ren; J Craig Venter; Amalio Telenti
Journal:  Nat Genet       Date:  2018-02-26       Impact factor: 38.330

4.  Enhancer Domains Predict Gene Pathogenicity and Inform Gene Discovery in Complex Disease.

Authors:  Xinchen Wang; David B Goldstein
Journal:  Am J Hum Genet       Date:  2020-02-06       Impact factor: 11.025

5.  Predicting mRNA Abundance Directly from Genomic Sequence Using Deep Convolutional Neural Networks.

Authors:  Vikram Agarwal; Jay Shendure
Journal:  Cell Rep       Date:  2020-05-19       Impact factor: 9.423

6.  Shadow enhancers foster robustness of Drosophila gastrulation.

Authors:  Michael W Perry; Alistair N Boettiger; Jacques P Bothma; Michael Levine
Journal:  Curr Biol       Date:  2010-09-14       Impact factor: 10.834

7.  Characterising and predicting haploinsufficiency in the human genome.

Authors:  Ni Huang; Insuk Lee; Edward M Marcotte; Matthew E Hurles
Journal:  PLoS Genet       Date:  2010-10-14       Impact factor: 5.917

8.  Genome-scale DNA methylation maps of pluripotent and differentiated cells.

Authors:  Alexander Meissner; Tarjei S Mikkelsen; Hongcang Gu; Marius Wernig; Jacob Hanna; Andrey Sivachenko; Xiaolan Zhang; Bradley E Bernstein; Chad Nusbaum; David B Jaffe; Andreas Gnirke; Rudolf Jaenisch; Eric S Lander
Journal:  Nature       Date:  2008-07-06       Impact factor: 49.962

9.  Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.

Authors: 
Journal:  Science       Date:  2015-05-07       Impact factor: 47.728

10.  CpG islands influence chromatin structure via the CpG-binding protein Cfp1.

Authors:  John P Thomson; Peter J Skene; Jim Selfridge; Thomas Clouaire; Jacky Guy; Shaun Webb; Alastair R W Kerr; Aimée Deaton; Rob Andrews; Keith D James; Daniel J Turner; Robert Illingworth; Adrian Bird
Journal:  Nature       Date:  2010-04-15       Impact factor: 49.962

View more
  1 in total

1.  Prioritizing non-coding regions based on human genomic constraint and sequence context with deep learning.

Authors:  Dimitrios Vitsios; Ryan S Dhindsa; Lawrence Middleton; Ayal B Gussow; Slavé Petrovski
Journal:  Nat Commun       Date:  2021-03-08       Impact factor: 14.919

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.