| Literature DB >> 31311581 |
M T Siu1, D T Butcher1, A L Turinsky1,2, C Cytrynbaum1,3,4, D J Stavropoulos5,6, S Walker7, O Caluseriu8, M Carter9, Y Lou1, R Nicolson10, S Georgiades11, P Szatmari12,13, E Anagnostou14,15, S W Scherer1,4,7, S Choufani1, M Brudno1,2,16, R Weksberg17,18,19,20,21.
Abstract
BACKGROUND: Autism spectrum disorder (ASD) is a common and etiologically heterogeneous neurodevelopmental disorder. Although many genetic causes have been identified (> 200 ASD-risk genes), no single gene variant accounts for > 1% of all ASD cases. A role for epigenetic mechanisms in ASD etiology is supported by the fact that many ASD-risk genes function as epigenetic regulators and evidence that epigenetic dysregulation can interrupt normal brain development. Gene-specific DNAm profiles have been shown to assist in the interpretation of variants of unknown significance. Therefore, we investigated the epigenome in patients with ASD or two of the most common genomic variants conferring increased risk for ASD. Genome-wide DNA methylation (DNAm) was assessed using the Illumina Infinium HumanMethylation450 and MethylationEPIC arrays in blood from individuals with ASD of heterogeneous, undefined etiology (n = 52), and individuals with 16p11.2 deletions (16p11.2del, n = 9) or pathogenic variants in the chromatin modifier CHD8 (CHD8+/-, n = 7).Entities:
Keywords: Autism spectrum disorder; DNA methylation; Epigenetics; Genetic stratification; Genomic variants; Heterogeneity
Mesh:
Substances:
Year: 2019 PMID: 31311581 PMCID: PMC6636171 DOI: 10.1186/s13148-019-0684-3
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Demographic and deletion information for 16p11.2del training signature cases and age-, sex-matched controls
| Sample ID | Sex | Age (years) | ASD status | 16p11.2del (600 kb region) coordinates | Deletion size (kb) | Inheritance |
|---|---|---|---|---|---|---|
| 435 | M | 12 | Unknown | 29,656,657–30,190,593 | 534 | Unknown |
| 516 | M | 7 | Unknown | 29,656,657–30,190,593 | 534 | Unknown |
| 639 | F | 13 | Unknown | 29,592,751–30,190,593 | 598 | Unknown |
| 797 | M | 7 | Unknown | 29,590,493–30,190,593 | 600 | Unknown |
| 871 | M | 10 | Unknown | 29,590,493–30,190,593 | 600 | Unknown |
| 293 | M | 5 | Unknown | 29,567,295–30,178,406 | 611 | Unknown |
| 3-0269-000 | M | 4 | ASD | 29,567,309–30,177,807 | 610 | Maternal |
| 1-0019-004 | M | 7.5 | ASD | 29,652,799–30,199,507 | 547 | De novo |
| 7-0229-003 | M | 11.5 | ASD | 29,567,295–30,177,916 | 611 | Unknown |
| EP15W | M | 13 | Control | |||
| EP16W | F | 2 | Control | |||
| EP17W | F | 4 | Control | |||
| EP1W | F | 8 | Control | |||
| EP22W | M | 4 | Control | |||
| EP2W | F | 10 | Control | |||
| EP3W | F | 13 | Control | |||
| EP4W | M | 13 | Control | |||
| EP5W | F | 8 | Control | |||
| EP6W | M | 13 | Control | |||
| EP7W | M | 9 | Control | |||
| EP8W | M | 8 | Control | |||
| EP9W | M | 11 | Control | |||
| EP51 | M | 12 | Control | |||
| EP60W | M | 12 | Control | |||
| 460/2048 | M | 11.5 | Control | |||
| EP32W | M | 6 | Control | |||
| EP45W | M | 10 | Control | |||
| EP59W | M | 6 | Control | |||
| EP57W | M | 7 | Control | |||
| CT0011-003 | M | 4 | Control | |||
| MK-311/1601 | M | 10 | Control | |||
| KA-352/1734 | M | 7 | Control |
Mean age 16p11.2del 8.556 ± 3.196
Mean age Control 8.761 ± 3.35
16p11.2.del M, F 8, 1
Control M, F 17, 6
Variant details and respective classification scores for 16p11.2del CNV and CHD8 test cases
| Sample ID | Sex | Age (years) | 16p11.2del coordinates (size in kb) | Diagnostic classification for 16p11.2 typical deletion (pathogenic, benign, variant of unknown significance [VUS]) | Inheritance | ASD status | 16p11.2del score |
|---|---|---|---|---|---|---|---|
| 348 | M | 0.25 | 29,590,493–30,190,593 (600) | Pathogenic | Unknown | Unknown | 0.11505 |
| 228 | M | 0.4 | 29,590,493–30,190,593 (600) | Pathogenic | Unknown | Unknown | 0.12668 |
| 875 | F | 4 | 29,656,657–30,190,593 (534) | Pathogenic | Unknown | Unknown | 0.03073 |
| 172 | F | 6 | 29,590,493–30,190,593 (600) | Pathogenic | Unknown | Unknown | 0.04668 |
| 292 | M | 0.25 | 29,500,252–30,098,094 (598) | Pathogenic | Unknown | Unknown | 0.13978 |
| 2-0088-003 | F | 12 | 29,652,799–30,199,507 (547) | Pathogenic | De novo | ASD | -0.00441 |
| 3-0406-000 | M | 1.2 | 29,581,764–30,199,579 (618) | Pathogenic | Unknown | ASD | 0.07735 |
| 6-0196-03 | M | Unknown | 29,500,084–30,098,210 (598) | Pathogenic | Unknown | ASD | 0.04815 |
| 1-0616-003 | M | 7.5 | 28,751,255–28,952,351 (201) | Not determined* | Maternal | ASD | − 0.11936 |
| Sample ID | Sex | Age (years) | Diagnostic classification (pathogenic, benign, variant of unknown significance [VUS]) | Inheritance | ASD status | ||
| 14406.p1 | M | 13 | c.7493_7495del; p.His2498del (37) | VUS | De novo | ASD | − 0.03482 |
| 1-0494-003 | M | 4.75 | c.2219A>G; p.Asn740Ser (9) | Benign | Paternal | ASD | − 0.04451 |
| 2-1228-003 | F | 4.5 | c.6841C>G; p.Pro2281Ala (34) | Benign | Paternal | ASD | − 0.04178 |
| 1-0277-003 | M | Unknown | c.6565G>A; p.Gly2189Arg (33) | Benign | Maternal | ASD | − 0.03065 |
| 1-0507-003 | F | 14 | c.3940G>A; p.Ala1314Thr (19) | Benign | Maternal | ASD | − 0.05507 |
| 7-0167-003 | M | 4 | c.7499A>C; p.His2500Pro (37) | Benign | Unknown | ASD | − 0.05984 |
| 1-0559-003 | M | 7.7 | c.6148dupA; p.Thr2050fs (31) | Pathogenic | De novo | ASD | 0.0437 |
| M | 10 | c.4215G>T; Synonymous (21) | VUS | Paternal | ASD | − 0.04478 | |
| M | 40 | c.4215G>T; Synonymous (21) | VUS | Not applicable | Unaffected | − 0.04332 | |
| CHD8-V3** | M | 6.5 | c.6649C>T, p.Arg2217* (33) | Pathogenic | Paternal | ASD | 0.04616 |
| CHD8-V4 (sibling of CHD8-V3)** | M | 14 | c.6649C>T, p.Arg2217* (33) | Pathogenic | Paternal | ASD | 0.03631 |
| DW0013** | M | 13.5 | c.6947delC, p.Pro2316Leufs*39 (35) | Likely pathogenic | Unknown | ASD | 0.05183 |
| EX0070-W** | M | 6.75 | c.4327C>T, p.Arg1443Cys (21) | VUS | Paternal | ASD | − 0.04131 |
*Deletion encompassing SH2B1 gene, not overlapping 16p11.2del typical deletion region
**Test case data are from EPIC array, classified using 93 probes overlapping 450K and EPIC arrays from CHD8 signature
Demographic and variant information for CHD8 training signature cases and age-, sex-matched controls
| Sample ID | Sex | Age (years) | ASD status | Nucleotide change | Amino acid change | Exon | Inheritance |
|---|---|---|---|---|---|---|---|
| 11654.p1 | F | 12.5 | ASD | c.3517 A>G | p.Arg1173Gly | 16 | De novo |
| 12714.p1 | M | 5 | ASD | c.185C>G | p.Ser62X | 1 | De novo |
| 12752.p1 | F | 8 | ASD | c.6359_6360del | p.Leu2120ProfsX | 32 | De novo |
| 12991.p1 | M | 16 | ASD | c.6307_6308del | p.Glu2103ArgfsX | 31 | De novo |
| 13844.p1 | M | 5 | ASD | c.3712C>T | p.Gln1238X | 17 | De novo |
| 14016.p1 | M | 14 | ASD | c.4009C>T | p.Arg1337X | 19 | De novo |
| 14233.p1 | M | 10 | ASD | c.7113_7114del | p.Asn2371LysfsX2 | 36 | De novo |
| EP48W | F | 11 | Control | ||||
| EP58W | M | 15 | Control | ||||
| EP41W | M | 16 | Control | ||||
| EH-268 W | F | 6 | Control | ||||
| JS-265 W | M | 16 | Control | ||||
| EP34W | F | 13 | Control | ||||
| TG-278 W | M | 13 | Control | ||||
| TB-342/1706 | M | 16 | Control | ||||
| EP43W | F | 7 | Control | ||||
| EP32W | M | 6 | Control | ||||
| JL-137/1362 | M | 14 | Control | ||||
| EP45W | M | 10 | Control | ||||
| EP59W | M | 6 | Control | ||||
| EP57W | M | 7 | Control | ||||
| CT0009-003 | M | 8 | Control | ||||
| CT0011-003 | M | 4 | Control | ||||
| MK-311/1601 | M | 10 | Control | ||||
| EP39W | F | 13 | Control | ||||
| EP42W | F | 8 | Control | ||||
| KA-352/1734 | M | 7 | Control | ||||
| MM-119/1567 | M | 10 | Control |
Mean age CHD8 10.07 ± 4.325
Mean age Control 10.29 ± 3.836
CHD8 M, F 5, 2
Control M, F 15, 6
Fig. 1Whole blood DNAm comparison between heterogeneous ASD group (n = 52) and age-, sex-matched controls (n = 30). Following filtering by variance (40,550 sites), limma regression and Mann-Whitney U comparison, no CpG sites meet significance criteria of q ≤ 0.05. The 22 sites shown at uncorrected p < 0.001, |Δβ| ≥ 5% do not distinguish clearly between ASD cases and controls. Hierarchical clustering (Euclidian) and principal component analysis (PCA, first 3 principal components labeled) plot show that 16p11.2del (purple) and CHD8 (orange) cases are mixed with both heterogeneous ASD cases (gray) and controls (green). Data are normalized for visualization (mean = 0, variance = 1). In heat map, yellow represents high methylation and blue represents low methylation
Fig. 2a, b DNAm signatures identified in whole blood of individuals with 16p11.2del (600 kb risk locus) or CHD8. a Hierarchical clustering and PCA plot (first 3 principal components labeled) show that 16p11.2del training cases (purple; n = 9) are distinct from age-, sex-matched controls (green; n = 23) at the DNAm signature sites (115 CpG sites; q < 0.05, absolute methylation difference (|Δβ| ≥ 5%). b Hierarchical clustering and PCA plot (first 3 principal components labeled) show that CHD8 training cases (orange; n = 7) are distinct from age-, sex-matched controls (green; n = 21) at DNAm signature sites (103 CpG sites; q < 0.01, |Δβ| ≥ 5%) used for classification. Only a single CpG site overlaps between the 16p11.2del and CHD8 DNAm signatures: cg25970491 (CLTCL1), which is hypomethylated in both CHD8 and 16p11.2del cases relative to controls. All data are normalized for visualization (mean = 0, variance = 1). In heat map, yellow represents high methylation and blue represents low methylation
Fig. 3a–d Evaluating the sensitivity and specificity of the 16p11.2del and CHD8 DNAm signatures. The classification models (details in Ref. [15]) are represented by the following classification plots: a The classification model based on the 16p11.2del DNAm signature sites (115 sites; q < 0.05, |Δβ| ≥ 5%) accurately classified CHD8 signature cases, and independent test cases consisting of CHD8 sequence variants (orange box with cross) and heterogeneous ASD cases (open gray triangle) as more similar to controls (green C). The model also accurately classified 7 of 9 independent 16p11.2del test cases (solid purple circle) as more similar to 16p11.2del signature cases (open purple circle), demonstrating 100% sensitivity. Two variants (arrows) correctly received negative classification scores: hatched arrow indicates a mosaic 16p11.2del case (2-0088-003), solid arrow indicates a 16p11.2del distal to and not overlapping the 600 kb typical deletion region (1-0616-003). b The 16p11.2del DNAm signature was tested on an independent set of compiled blood control DNAm data (n = 162) extracted from the Gene Expression Omnibus (GEO) Database. All GEO controls (turquoise C) were properly classified with experimental controls (green C), as distinct from 16p11.2del signature training cases (open purple circle), demonstrating 100% specificity. c The classification model based on the CHD8 DNAm signature sites for classification (103 sites; q < 0.01, |Δβ| ≥ 5%) accurately classified 16p11.2del signature cases and independent test cases consisting of 16p11.2del variants (purple circle with cross) and heterogeneous ASD cases (gray triangle) as more similar to controls (green C). The model also accurately classified 1 of 9 independent CHD8 test cases consisting of sequence variants (solid orange square) as more similar to CHD8 signature training cases (open orange square) (unable to report sensitivity with one positive case, indicated by arrow). Eight variants correctly received negative classification scores (7 missense, 1 with an in-frame deletion in the last exon of CHD8, not predicted pathogenic). Classifications are in agreement with in silico predictions. d The CHD8 DNAm signature was tested on the same GEO controls (turquoise C) as in b; all but one GEO control were properly classified with experimental controls (green C), as distinct from CHD8 signature cases (open orange square), demonstrating >99.3% specificity
Fig. 4Evaluating the sensitivity of the CHD8 DNAm signature using additional CHD8 test cases (EPIC array). Additional independent test cases consisting of CHD8 sequence variants (n = 4; orange squares with cross) were run on the EPIC array. Of the 103 sites in the 450K-derived CHD8 DNAm classification signature, 92 sites overlapped those of the EPIC array and were thus used to classify the additional EPIC test cases. One VUS case received a negative classification score. Three of the four additional CHD8 test cases were accurately classified as more similar to CHD8 signature training cases (open orange square); these cases were known to have pathogenic CHD8 variants, demonstrating 100% sensitivity. The one sample classified as benign was an inherited missense variant, similar to those 450K test cases receiving negative scores. All other cases and controls (450K) were accurately classified as in Fig. 3
Fig. 5Biological and cross-tissue functional significance of differentially methylated genes. a Differentially methylated genes associated with the 16p11.2del and CHD8 DNAm signatures (q < 0.05, |Δβ| ≥ 5%) and the DNAm signature overlap with known ASD-risk genes (SFARI Gene). A single SFARI gene, clathrin heavy chain like 1 (CLTCL1), is significantly hypomethylated in both groups. An independent set of 9 SFARI genes are differentially methylated in the CHD8 DNAm signature. b Our findings are further corroborated by an independent study (48) showing that CHD8 human iPSC-derived neuronal precursor cells (NPCs) and differentiated neurons result in differentially expressed genes that overlap with some of the differentially methylated genes in CHD8 DNAm signature, including known ASD-risk genes (SFARI Gene), and c another study (49) showing that cerebral organoids derived from the iPSCs in b also have differentially expressed genes that overlap our CHD8 DNAm signature.