| Literature DB >> 29463886 |
Else Eising1, Amaia Carrion-Castillo1, Arianna Vino1, Edythe A Strand2, Kathy J Jakielski3, Thomas S Scerri4,5, Michael S Hildebrand6, Richard Webster7, Alan Ma8, Bernard Mazoyer9, Clyde Francks1,10, Melanie Bahlo4,5, Ingrid E Scheffer6,11, Angela T Morgan6,12, Lawrence D Shriberg13, Simon E Fisher14,15.
Abstract
Genetic investigations of people with impaired development of spoken language provide windows into key aspects of human biology. Over 15 years after FOXP2 was identified, most speech and language impairments remain unexplained at the molecular level. We sequenced whole genomes of nineteen unrelated individuals diagnosed with childhood apraxia of speech, a rare disorder enriched for causative mutations of large effect. Where DNA was available from unaffected parents, we discovered de novo mutations, implicating genes, including CHD3, SETD1A and WDR5. In other probands, we identified novel loss-of-function variants affecting KAT6A, SETBP1, ZFHX4, TNRC6B and MKL2, regulatory genes with links to neurodevelopment. Several of the new candidates interact with each other or with known speech-related genes. Moreover, they show significant clustering within a single co-expression module of genes highly expressed during early human brain development. This study highlights gene regulatory pathways in the developing brain that may contribute to acquisition of proficient speech.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29463886 PMCID: PMC6756287 DOI: 10.1038/s41380-018-0020-x
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
De novo exonic protein-altering variants in trios with CAS
| Proband | Chr | Base | Gene | Transcript | cDNA change | Protein change | Impact | RVISa | pLia | MIS_Za | GERPb | CADDc | Polyphend | Sifte | Classification |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 01 | 17 | 7806599 |
| ENST00000380358 | c.3682C>T | p.R1228W | Missense | + | NA | + | – | + | PrD | D | Pathogenic |
| 02 | 18 | 9221974 |
| ENST00000262126 | c.920C>G | p.S307C | Missense | + | NA | – | + | + | PrD | D | VUS |
| 11 | 62594638 |
| ENST00000294179 | c.412A>G | p.I138V | Missense | − | NA | – | + | + | PrD | T | VUS | |
| 19 | 607997 |
| ENST00000251287 | c.1252C>G | p.L418V | Missense | + | NA | + | + | + | PosD | T | VUS | |
| 04 | 16 | 30976714 |
| ENST00000262519 | c.1652_1656dup | p.P553Wfs*110 | Frameshift | + | + | NA | + | NA | NA | NA | Pathogenic |
| 07 | 9 | 137017143 |
| ENST00000358625 | c.623C>T | p.T208M | Missense | + | NA | + | + | + | PosD | D | Pathogenic |
| 08 | 1 | 212792854 |
| ENST00000366981 | c.503A>C | p.N168T | Missense | – | NA | – | + | + | B | D | VUS |
| 8 | 144944141 |
| ENST00000525985 | c.3281G>A | p.S1094N | Missense | NA | NA | – | + | – | B | D | VUS | |
| 15 | 63943544 |
| ENST00000443617 | c.10454G>A | p.S3485N | Missense | + | NA | + | + | + | B | T | VUS |
Variants were classified according to recent guidelines [28]
Chr chromosome, RVIS residual variation intolerance score, pLi probability of being loss-of-function intolerant, MIS_Z Z-score for missense constraint, GERP genomic evolutionary rate profiling, CADD combined annotation dependent depletion, Polyphen polymorphism phenotyping, sift: sorting intolerant from tolerant, NA not available or not applicable, VUS variant of uncertain significance
aPlus-sign: score indicating intolerant gene, minus-sign: score indicating tolerant gene
bPlus-sign: score indicating conserved base, minus-sign: score indicating not conserved base
cPlus-sign: score indicating intolerated variant, minus-sign: score indicating tolerated variant
dPrD: probably damaging, PosD: possibly damaging, B: benign
eD: deleterious, T: tolerated
a–eScores on which classification is based can be found in Supplementary Table 3
Fig. 1De novo pathogenic variants in CHD3, SETD1A and WDR5. a Sanger validation of de novo pathogenic variants in family 01, 04 and 07. b Conservation of the mutated amino acids across species. Blue boxes mark the mutated amino acid. The blue line indicates the site of the duplication. c Schematic representation of CHD3 (ENST00000380358) organization, with CHD C- and N-terminal domains in black, plant homeodomain (PHD) zinc finger domains in blue, chromatin organization modifier (CHROMO) domains in green, DEAD-like helicases superfamily (DEXDc) domain in yellow, a helicase superfamily c-terminal (HELICc) domain in red and two domains of unknown function (DUF) in gray. Blue line indicates site of the p.R1228W variant. d Schematic representation of SETD1A (ENST00000262519) protein, with an RNA recognition motif (RRM) in red, a complex proteins associated with Set1p (COMPASS) component N (N-SET) domain in yellow, a Su(var)3–9, Enhancer-of-zeste, Trithorax (SET) domain in green and a post-SET domain in blue. Blue line indicates site of the frameshift, and the blue shaded area indicates the deleted part of the protein. e Schematic representation of WDR5 (ENST00000358625), with WD40 repeats in green. Blue line indicates site of the p.T208M variant
Rare loss-of-function variants in intolerant genes identified in ten singleton cases with CAS
| Proband | Chr | Base | Gene | Transcript | cDNA change | Protein change | Impact | RVIS | pLI | Phenotype previously associated with LoF | Classification |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 10 | 8 | 41806856 |
| ENST00000265713 | c.1599-56_1621del | p.? | Splice acceptor variant | 3 | 1.00 | Developmental delay, language delay, hypotonia, specific facial features [ | Pathogenic |
| 13 | 18 | 42531084 |
| ENST00000282030 | c.1781del | p.P594Lfs*36 | Frameshift | 10 | 1.00 | Developmental delay and language delay [ | Pathogenic |
| 3 | 193380725 |
| ENST00000361908 | c.2581C>T | p.R861* | Stop gained | 10 | 0.99 | Dominant optic atrophy | VUS | |
| 1 | 21923758 |
| ENST00000542643 | c.*35-1G>A | p.? | Splice acceptor variant | 18 | 0.97 | NA | VUS | |
| 14 | 8 | 77761747 |
| ENST00000521891 | c.3646-1G>A | p.? | Splice acceptor variant | 1 | 1.00 | Developmental delay and specific facial features (gene deletions only) [ | Pathogenic |
| 15 | 22 | 40662274 |
| ENST00000454349 | c.2040G>A | p.W680* | Stop gained | 3 | 1.00 | Autism spectrum disorder [ | Pathogenic |
| 18 | 16 | 14234400 |
| ENST00000571589 | c.-63-1G>C | p.? | Splice acceptor variant | 3 | 1.00 | NA | Likely pathogenic |
Variants were classified according to recent guidelines [28]
Chr chromosome, RVIS residual variation intolerance score, pLi probability of being loss-of-function intolerant, NA not available, VUS variant of uncertain significance
Fig. 2Co-expression network analysis. a A co-expression network was calculated using gene expression data of brain samples collected between 8 weeks post conception up to 1 year of age from the cortex, hippocampus, amygdala, striatum and thalamus. A total of 16 modules were detected. Module 3 (indicated by black arrow) was highly enriched for the genes we implicated in CAS through whole-genome sequencing (CAS-WGS). b Enrichment of developmental disorder gene sets in the 16 modules. Gene sets included are the 10 genes implicated in CAS through WGS (CAS-WGS), and genes with de novo mutations in patients with the following (1) intellectual disability (ID, n = 230), (2) autism spectrum disorder (ASD, n = 2760), and (3) schizophrenia (SCZ, n = 711). Significant enrichments with False discovery rate (FDR)-corrected p value < 0.05 and odds ratio (OR) >1 are shown. Colors indicate FDR-corrected p values for enrichment. Numbers show OR
Fig. 3Human brain expression pattern and functional enrichment of module M3. a Spatial expression pattern of the CAS-related module at 13–24 weeks post conception, as visualized by the M3 module’s Eigengene. Red shows high expression, blue shows low expression. Multiple samples per region were averaged. No expression data were available for gray regions. b Developmental brain expression pattern of the enriched module during development, as visualized by the module Eigengene. Each dot represents a brain sample, the black line is the loess curve fitted through the data points. The blue vertical line represents time of birth. Pcw: post conception week. c Gene functions enriched in the module identified through gene ontology (GO) term enrichment followed by clustering of GO terms using the functional annotation clustering tool in DAVID. The p values represent the geometric mean of Bonferroni-corrected p values of all GO terms underlying each function. The blue vertical line represents the threshold for significant enrichment (p = 0.05). Cx cortex, PF prefrontal