| Literature DB >> 33057201 |
Alexander G Bick1,2,3,4, Joshua S Weinstock5, Satish K Nandakumar2,6, Charles P Fulco2,7, Erik L Bao2,6,8, Seyedeh M Zekavat2,9, Mindy D Szeto10,11, Xiaotian Liao2,6, Matthew J Leventhal2, Joseph Nasser2, Kyle Chang12, Cecelia Laurie13, Bala Bharathi Burugula14, Christopher J Gibson15, Amy E Lin16, Margaret A Taub17, Francois Aguet2, Kristin Ardlie2, Braxton D Mitchell18,19, Kathleen C Barnes10,20, Arden Moscati21, Myriam Fornage22,23, Susan Redline3,24,25, Bruce M Psaty26,27,28,29, Edwin K Silverman3,30, Scott T Weiss3,30, Nicholette D Palmer31, Ramachandran S Vasan32, Esteban G Burchard33,34, Sharon L R Kardia35, Jiang He36,37, Robert C Kaplan38,39, Nicholas L Smith27,29,40, Donna K Arnett41, David A Schwartz42, Adolfo Correa43, Mariza de Andrade44, Xiuqing Guo45, Barbara A Konkle46,47, Brian Custer48,49, Juan M Peralta50, Hongsheng Gui51, Deborah A Meyers52, Stephen T McGarvey53, Ida Yii-Der Chen54, M Benjamin Shoemaker55, Patricia A Peyser35, Jai G Broome13, Stephanie M Gogarten13, Fei Fei Wang13, Quenna Wong13, May E Montasser18, Michelle Daya10, Eimear E Kenny56, Kari E North57, Lenore J Launer58, Brian E Cade24,59, Joshua C Bis26, Michael H Cho3,30, Jessica Lasky-Su3,30, Donald W Bowden31, L Adrienne Cupples60, Angel C Y Mak33, Lewis C Becker61, Jennifer A Smith35,62, Tanika N Kelly36,37, Stella Aslibekyan63, Susan R Heckbert27,29, Hemant K Tiwari64, Ivana V Yang42, John A Heit65, Steven A Lubitz2,3,66, Jill M Johnsen46,47, Joanne E Curran50, Sally E Wenzel67, Daniel E Weeks68, Dabeeru C Rao69, Dawood Darbar70, Jee-Young Moon38, Russell P Tracy71, Erin J Buth13, Nicholas Rafaels20, Ruth J F Loos21,72, Peter Durda71, Yongmei Liu73, Lifang Hou74, Jiwon Lee24, Priyadarshini Kachroo3,30, Barry I Freedman75, Daniel Levy76,77, Lawrence F Bielak35, James E Hixson78, James S Floyd26,27,47, Eric A Whitsel79,80, Patrick T Ellinor2,3,66, Marguerite R Irvin63, Tasha E Fingerlin81, Laura M Raffield82, Sebastian M Armasu44, Marsha M Wheeler83, Ester C Sabino84, John Blangero50, L Keoki Williams51, Bruce D Levy3,85, Wayne Huey-Herng Sheu86, Dan M Roden87,88,89, Eric Boerwinkle89,90, JoAnn E Manson3,91,92, Rasika A Mathias61, Pinkal Desai93, Kent D Taylor94,95, Andrew D Johnson76,77, Paul L Auer96, Charles Kooperberg97, Cathy C Laurie13, Thomas W Blackwell5, Albert V Smith5, Hongyu Zhao98,99, Ethan Lange10, Leslie Lange10, Stephen S Rich100, Jerome I Rotter94,95, James G Wilson101,102, Paul Scheet12, Jacob O Kitzman14,103, Eric S Lander2,7,104, Jesse M Engreitz2,105, Benjamin L Ebert2,3,15,106, Alexander P Reiner27,97, Siddhartha Jaiswal107, Gonçalo Abecasis5,108, Vijay G Sankaran2,3,6, Sekar Kathiresan109,110,111,112, Pradeep Natarajan113,114,115.
Abstract
Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown1. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer2-4 and coronary heart disease5-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)6. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33057201 PMCID: PMC7944936 DOI: 10.1038/s41586-020-2819-2
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 69.504
Fig. 1|Identifying CHIP in TOPMed Genomes.
CHIP was identified in 97,631 whole genome sequenced peripheral blood samples through the curation of somatic driver mutations. Counts for 8 most common driver genes plotted. inset, CHIP prevalence increased with age. Center line represents general additive model spline, 95% confidence interval is shaded (n=82,807 individuals; two-sided t-test: p<10−300).
Extended Data Fig. 1|Characterizing TOPMed CHIP.
a, There was marked heterogeneity of CHIP clone size as measured by variant allele fraction by CHIP driver gene. Violin plot spanning minimum and maximum values calculated on full dataset (Supplementary Table 3). Sample size for each element in violin plot displayed in Fig. 1, b, 90% of individuals with CHIP had only one CHIP driver mutation identified c, CHIP prevalence with age was highly concordant across sequenced cohorts. CHIP prevalence was estimated from a logistic mixed model with spline-transformed age, sex, and cohort included as predictors. The cohort was included as a random intercept. Sample size for each cohort listed in Supplementary Table 1. d, CHIP prevalence with age in this study (blue triangles, N=82,807) was highly consistent with previously observed CHIP prevalence (dots represent mean point prevalence with shaded area represents 95% confidence interval; NGenovese=12,380; NJaiswal = 17,182; NXie = 2,728).
Extended Data Fig. 3|CHIP associates with Blood, Lipid, and Inflammatory traits.
a, CHIP consistently associated with increased Red Cell Distribution Width (RDW). JAK2, SF3B1 and SRSF2 showed driver gene specific effects on blood traits (see Supplementary Table S5) b, CHIP status was not consistently associated with lipid traits, other than JAK2 CHIP which was associated with decreased total cholesterol and a trend towards decreased LDL (see Supplementary Table S6) c, CHIP status is associated with inflammatory markers, however notable heterogeneity existed across CHIP mutations (see Supplementary Table S7). Associations utilized a two-sided t-test from a multivariate general linear model including age, smoking, race and gender and study center and were not adjusted for multiple comparisons. Sample sizes and exact p-values for each phenotype are listed in Supplementary Tables 5–7.
Extended Data Fig. 5|CHIP Single variant association regional association plots.
a, TERT locus b, TRIM59/KPNA4 locus c, TET2 locus. Two-sided association testing performed using SAIGE (N=65,405 individuals, see methods)
Fig. 3|African ancestry specific TET2 locus risk variant disrupts hematopoietic stem cell TET2 enhancer decreasing TET2 expression and increasing self-renewal.
a, the TET2 locus with fine-mapped risk variants, Activity-by-Contact (ABC) hematopoietic stem and progenitor cell (HSPC) enhancers, DNase-Seq CD34+ HSPC and RefSeq genes. ABC model predicts that rs79901204 disrupts a TET2 enhancer resulting in decreased TET2 expression (see methods). b, expanded view of TET2 enhancer element. c, rs79901204 disrupts a GATA motif/E-Box motif. d, rs79901204 is associated with decreased TET2 expression in human peripheral blood RNA-seq (NA/A=230, NA/T=16, NT/T=1, two-sided linear mixed model p=0.012). TPM, transcripts per million. Boxplot displays median, 25th and 75th percentiles, mean (diamond symbol) and outlier observations (black dots) e, luciferase assay in CD34+ primary cells demonstrates four-fold attenuation of enhancer activity by the rs79901204 T risk allele relative to the A reference allele (N=3, two-sided t-test p=0.007). f, deleting the TET2 enhancer (ENH) in CD34+ primary cells results in decreased TET2 expression relative to deletion of control locus AAVS1 (N=3, two-sided t-test, p=0.04). g, Human HSPCs were electroporated with Cas9 targeting a coding region of TET2 and AAVS1 (a control locus) and plated for primary and secondary colony-forming assays. h, two TET2 guides had differential editing efficiency. i, TET2 coding disruption leads to expanded secondary colony formation compared to AAVS1 controls (N=3, two-sided t-test p=0.01, p=0.002 for g1 and g2 respectively, with greater expansion identified in the TET2 guide with greater editing efficiency (two-sided t-test p=0.04). Mean and standard deviation of number of each colony type plotted. CFU-M, colony forming unit-macrophage; CFU-GM, granulocyte macrophage; CFU-GEMM, granulocyte erythrocyte macrophage megakaryocyte; CFU-G, granulocyte; BFU-E, burst forming unit-erythroid. In e, f, h, points represent independent replicates, mean values and error bars represent standard error are plotted.
Extended Data Fig. 6|CHIP transcriptome-wide association study (TWAS) results across 48 tissues identified 7 significant loci.
UTMOST algorithm applied to CHIP genome wide association study results from n=65,405 individuals (see methods). Genomic coordinates listed on x-axis. P-value from generalized Berk-Jones test on Y axis. Multiple hypothesis corrected threshold, p<2.9 × 10−6 displayed as dotted red line.
Extended Data Fig. 7|Tissue-specific results from the top 9 overall UTMOST-significant genes.
UTMOST algorithm applied to CHIP genome wide association study results from n=65,405 individuals. P-value from generalized Berk-Jones test. eQTL z-scores for associations with P<0.05 are displayed in each bar. GTEX eQTL tissue listed on Y-axis.
Extended Data Fig. 8 |CRISPR/Cas9 editing efficiency of TET2 Enhancer deletion in primary CD34+ HSPCs.
a, Schematic showing the position of the two sgRNAs used to delete the TET2 enhancer (512bp) containing rs79901204. B, Gel electrophoresis image of PCR products from genomic DNA of edited HSPCs indicating unedited (WT) and deletion bands at sgRNA target site. Percentages of deletion alleles determined by band intensity and is shown below each lane. The experiment contains 3 biological replicates and was performed once.
Extended Data Fig. 9 |rs79901204 associated with genome wide differential methylation signal,
Methylation Quantitative Trait association results of rs79901204 variant with cpg methylation probes identify an altered peripheral leukocyte methylation profile genome wide in N = 1747 individuals. The strongest signal is at the chr4 TET2 locus. P-values on Y-axis derived from two-sided linear mixed effects model (see methods). To account for multiple hypothesis testing, a Bonferroni threshold of p < 5.8 × 10−8 was used to establish statistical significance.
Extended Data Fig. 10 |Sensitivity of CHIP detection at various VAFs across sequencing depths.
A set of 30 samples from a previously published CHIP cohort (Gibbons et al, 2017) were computationally down sampled to 30x, 40x, 50x, 100x and 400x sequencing depth. TOPMed WGS data was typically in the 40x depth range across CHIP genes. WGS data has excellent sensitivity to detect CHIP clones with VAF >10%, and ~50% sensitivity to detect CHIP VAF 5–10%, with minimal ability to detect CHIP clones <5%.