| Literature DB >> 26690673 |
S Choufani1, C Cytrynbaum1,2,3, B H Y Chung4, A L Turinsky1,5, D Grafodatskaya1, Y A Chen1,6, A S A Cohen7, L Dupuis2,3, D T Butcher1, M T Siu1, H M Luk8, I F M Lo8, S T S Lam8, O Caluseriu9, D J Stavropoulos10,11, W Reardon12, R Mendoza-Londono1,2,13, M Brudno1,5,14, W T Gibson7, D Chitayat2,3,13,15, R Weksberg1,2,3,6,13.
Abstract
Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26690673 PMCID: PMC4703864 DOI: 10.1038/ncomms10207
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1DNA methylation signature associated with NSD1 mutations.
Unsupervised hierarchical clustering of 72 samples using the differentially methylated CpG sites comprising the NSD1+/−-specific signature is shown using Pearson correlation. Note that two patients (A1208 and DL179067) harbour the same mutation in NSD1 at the end of the gene (exon 22) and display slightly different DNA methylation changes compared with the other SS patients. Orange indicates high DNA methylation and blue indicates low DNA methylation. Pink bar represents SS patients with NSD1 whole-gene deletion and truncating mutations. Blue bar represents controls.
Figure 2Testing the sensitivity and the specificity of the NSD1+/− DNAm signature.
Using 19 SS and 53 healthy subjects from the discovery cohort (labelled as ‘primary'), we generated the median-methylation profiles of SS and control, respectively, on the CpG sites comprising the NSD1+/−-specific signature. We then estimated the specificity of these profiles on a validation set of GEO blood samples (n=1,056, blue dots), all of which were more similar to the control profile (specificity 100%). We also estimated the sensitivity on a separate SS validation set (n=19, magenta dots), all of which were more similar to the SS profile (100% sensitivity). Similarity was computed as the Pearson correlation to either the SS or the control DNAm profile. Out of 16 missense samples (green squares), 9 classify with SS and 7 with controls. Also shown are Weaver patients with EZH2 mutations (orange triangles) as well as the classification of the primary 53 controls (dark blue squares) and 19 SS (light magenta squares) used in the derivation of the NSD1-specific signature.
Figure 3NSD1-specific signature can be identified in SS fibroblasts.
Unsupervised hierarchical clustering of three SS-derived fibroblasts (Supplementary Data 1) and four control-derived fibroblasts clearly distinguish SS from controls. Orange indicates high DNA methylation and blue indicates low DNA methylation. Pink bar represents Sotos samples with NSD1 whole-gene deletion and truncating mutations. Blue bar represents controls.
Figure 4Photographs of two patients carrying NSD1 missense variants.
(a) Patient (DL136303) with a missense mutation of uncertain significance, clinical presentation characteristic of Sotos syndrome and a positive SS score based on our analysis. (b) Patient (DL181344) with a missense mutation of uncertain significance, clinical presentation that is not characteristic of Sotos syndrome and a negative SS score based on our analysis. Specific consent to publish facial photographs was obtained for the two patients.
Comparison between different prediction algorithms and NSD1 -specific signature for missense mutations.
| HK-5474 | p.Cys1606Tyr | Yes | 0.096 | Photos not available | Probably damaging (1) | Deleterious (0) | Medium (3.37) | Pathological (9) | Disease causing (1) | |
| HK-11693 | p.Pro1726Arg | Yes | 0.122 | Photos not available | Probably damaging (1) | Deleterious (0) | High (3.61) | Pathological (7) | Disease causing (1) | |
| HK-5581 | p.Val1968Ala | Yes | 0.016 | Typical Sotos | Probably damaging (1) | Deleterious (0) | High (4.11) | Neutral (1) | Disease causing (1) | |
| HK-3326 | p.Tyr1997Cys | Yes | 0.089 | Possible Sotos | Probably damaging (1) | Deleterious (0) | High (4.75) | Pathological (8) | Disease causing (1) | |
| HK-435 | p.Arg2017Trp | Yes | 0.131 | Typical Sotos | Probably damaging (1) | Deleterious (0) | High (4.89) | Pathological (9) | Disease causing (1) | |
| p.Ala1927Pro | Unknown | Yes | 0.110 | Typical Sotos | Probably damaging (1) | Deleterious (0) | Medium (3.12) | Pathological (3) | Disease causing (1) | |
| p.Cys2146Ser | Yes | 0.077 | Typical Sotos | Probably damaging (1) | Deleterious (0) | High (3.89) | Pathological (1) | Disease causing (1) | ||
| p.Cys2138Arg | Yes | 0.068 | Typical Sotos | Probably damaging (1) | Deleterious (0) | High (3.89) | Pathological (8) | Disease causing (1) | ||
| p.Arg2005Gly | Yes | 0.148 | Typical Sotos | Probably damaging (1) | Deleterious (0) | Medium (2.35) | Pathological (6) | Disease causing (1) | ||
| p.Asn1650Ser | Unknown | No | −0.141 | Unlikely Sotos | Benign (0) | Tolerated (0.09) | Low (1.08) | Pathological (0) | Disease causing (0.9) | |
| HK-9776 | p.Asn357Ser | Mat. inheritance | No | −0.122 | Unlikely Sotos | Benign (0.1) | Tolerated (0.73) | Neutral (−0.81) | Neutral (5) | Polymorphism (0.9) |
| HK-12366 | p.Asn1149Ser | Mat. inheritance | No | −0.131 | Photos not available | Benign (0) | Tolerated (0.28) | Neutral (0.20) | Neutral (2) | Polymorphism (1) |
| HK-6943 | p.Pro2225Gln | Mat. inheritance | No | −0.139 | Photos not available | Benign (0) | Tolerated (0.14) | Neutral (0.55) | Neutral (0) | Polymorphism (0.9) |
| HK-14867 | p.Gln2474Arg | No | −0.121 | Unlikely Sotos | Benign (0) | Deleterious (0.01) | Neutral (0.34) | Neutral (1) | Polymorphism (1) | |
| HK-11767 | p.Lys1786Arg | Mat. inheritance | No | −0.149 | Photos not available | Benign (0.2) | Deleterious (0) | Neutral (0.36) | Neutral (8) | Disease causing (0.9) |
| p.Ser1241Thr | Mat. inheritance | No | −0.137 | Unlikely Sotos | Benign (0) | Tolerated (0.14) | Neutral (0.55) | Neutral (8) | Polymorphism (1) |
Sample IDs representing the discovery cohort are in bold and the remaining samples are from the validation cohort.
Single-nucleotide variants were classified on the basis of DNA methylation data, clinical assessment and mutation databases. Each variant was classified by each of the mutation effect prediction algorithms independently.
*Photos available, but did not meet inclusion criteria for clinical review.
†Both parents tested negative for the variant in NSD1. Also, non-paternity was ruled out in this de novo variant case.
Figure 5Genomic distribution of the NSD1+/−-specific CpG sites.
Bar chart representing the percentage distribution of the CpG sites according to genomic annotations extracted from the Illumina 450K array annotation file. We compared the distribution of the CpGs between the data set (424,586 CpGs) and NSD1+/− CpGs (7,085 CpGs) for the regulatory feature group, relation to CpG island and other functional categories such as overlapping enhancer region, DNase hypersensitive sites (DHS) and type of differentially methylated regions (Reprogrammed DMR (RDMR), cancer DMR (cDMR), other DMR) in addition to relation to RefSeq group annotation. The numbers at the top of each bar represent the percentage distribution of CpGs within each category. N refers to north and S refers to south.
Figure 6Enrichment analysis of the genes overlapping NSD1+/− DNAm signature.
We identified 2,167 unique genes that overlap the CpGs comprising the NSD1+/−-specific signature and used DAVID (http://david.abcc.ncifcrf.gov) to identify the biological processes most enriched within our data set. Over-represented functional categories were visualized in Cytoscape (http://www.cytoscape.org) using the Enrichment Map plugin (www.baderlab.org/Software/EnrichmentMap). Network nodes represent statistically significant Gene Ontology—Biological Process terms (Benjamini-corrected P value <0.05), with node size proportional to the number of SS-associated genes annotated for each term. Edges represent overlaps between these gene sets, with edge thickness proportional to their Jaccard index. The results demonstrate enrichment in functional terms that relate to cellular morphogenesis, neuronal development and cellular differentiation, involving highly overlapping subsets of genes. In addition, we identified an enrichment of genes with roles in organ development, ion transport as well as embryonic developmental pathways.