| Literature DB >> 34737294 |
Livia O Loureiro1, Jennifer L Howe1, Miriam S Reuter2, Alana Iaboni3, Kristina Calli4, Delnaz Roshandel1, Iva Pritišanac5,6, Alan Moses6, Julie D Forman-Kay5,7, Brett Trost1, Mehdi Zarrei1, Olivia Rennie1, Lynette Y S Lau8, Christian R Marshall8,9, Siddharth Srivastava10, Brianna Godlewski10, Elizabeth D Buttermore10, Mustafa Sahin10, Dean Hartley11, Thomas Frazier12, Jacob Vorstman13,14, Stelios Georgiades15, Suzanne M E Lewis4, Peter Szatmari13,14,16, Clarrisa A Lisa Bradley1, Anne-Claude Tabet17,18, Marjolaine Willems19, Serge Lumbroso20, Amélie Piton21,22,23, James Lespinasse19, Richard Delorme17,24, Thomas Bourgeron17, Evdokia Anagnostou3,25, Stephen W Scherer26,27.
Abstract
Autism Spectrum Disorder (ASD) is genetically complex with ~100 copy number variants and genes involved. To try to establish more definitive genotype and phenotype correlations in ASD, we searched genome sequence data, and the literature, for recurrent predicted damaging sequence-level variants affecting single genes. We identified 18 individuals from 16 unrelated families carrying a heterozygous guanine duplication (c.3679dup; p.Ala1227Glyfs*69) occurring within a string of 8 guanines (genomic location [hg38]g.50,721,512dup) affecting SHANK3, a prototypical ASD gene (0.08% of ASD-affected individuals carried the predicted p.Ala1227Glyfs*69 frameshift variant). Most probands carried de novo mutations, but five individuals in three families inherited it through somatic mosaicism. We scrutinized the phenotype of p.Ala1227Glyfs*69 carriers, and while everyone (17/17) formally tested for ASD carried a diagnosis, there was the variable expression of core ASD features both within and between families. Defining such recurrent mutational mechanisms underlying an ASD outcome is important for genetic counseling and early intervention.Entities:
Year: 2021 PMID: 34737294 PMCID: PMC8568906 DOI: 10.1038/s41525-021-00254-0
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Genome annotation of the SHANK3 guanine duplication (rs797044936).
| Reference genome | Transcript accession | Exon | Genomic position | Coding change | Protein change | Annotation tool |
|---|---|---|---|---|---|---|
| Hg38 | NM_033517.1 | 21 | Chr22:50721512–50721513 | c.3679dup | p.(Ala1227Glyfs*69) | 1Alamut Visual v2.15.0 |
| 2NM_001080420.1 | 22 | Chr22:g.50721512dup | c.3727dup | p.(Ala1243Glyfs*69) | Alamut Visual v2.15.0 | |
| ENST00000262795.5 | 24 | Chr22:g.50721512dup | c.3676dup | p.(Ala1226Glyfs*69) | Alamut Visual v2.15.0 | |
| NM_001372044 (replaced NM_033517) | 22 | 22-50721503-50721504-T-TG | c.3855dupG | p.L1285fs | MSSNG | |
| Hg19 | NM_033517.1 | 21 | Chr22:51159940–51159941 | c.3679dup | p.(Ala1227Glyfs*69) | Alamut Visual v2.15.0 |
| NM_001080420.1 | 22 | Chr22:g.51159940dup | c.3727dup | p.(Ala1243Glyfs*69) | Alamut Visual v2.15.0 | |
| ENST00000262795.5 | 24 | Chr22:g.51159940dup | c.3676dup | p.(Ala1226Glyfs*69) | Alamut Visual v2.15.0 | |
| NM_033517 | 21 | chr22:51159932-51159932-T-TG | c.3630dup | p.L1210fs | VariCarta; GATK VariCarta | |
| chr22:51159933-51159933-G-GG | ||||||
| ENST00000262795.3 | 22 | 22-5119932-T-TG | c.3719_3720 insG | p.Ala1243GlyfsTer6 | O’Roak et al.[ | |
| ENST00000262795.3 | 22 | 22-5119932-T-TG | c.3720dupG | p.L1240fs | Feliciano et al.[ |
The annotation considers different reference genomes, the position of the duplication in the guanine string, and the annotation tool. The guanine duplication in each carrier in the main text of the paper is referred to as p.Ala1227Glyfs*69.
1Alamut Visual version 2.15 (SOPHiA GENETICS, Lausanne, Switzerland). This tool annotates the final G as duplicated and provides the coding and protein change as well as ClinVar entries and general population frequencies (Supplementary Fig. S3).
2NM_001080420.1 record has been removed from NCBI. This RefSeq was permanently suppressed because currently there is insufficient support for the transcript and the protein. Exon 11 was based on ab initio prediction and is not supported by transcript data.
At the time of submission, the most recent RefSeq NM_01372044, which replaces and updates NM_033517, was not available in the Alamut software.
Fig. 1Pedigrees of MSSNG families reported for the first time in this study and their Sanger sequencing confirmation.
A Pedigree MSSNG00342; B Pedigree 1–1047 (unaffected sibling was targeted Sanger sequenced but was not the whole-genome sequenced); C Pedigree 2-1774 (unaffected sibling sample was not available); D. Pedigree 7–0574 (will be available in MSSNG DB7). Gray shapes indicate individuals with an ASD diagnosis and carry the SHANK3 variant.
ASD probands identified in MSSNG, SSC, and other publications containing the p.Ala1227Glyfs*69 SHANK3 variant.
| ID | SEX | GENOMIC (H.G.) | REFERENCE SEQUENCE | CODING | PROTEIN | INHERITANCE | PUBLICATION/ COHORT | ANCESTRY | TECHNOLOGY | 1PRS |
|---|---|---|---|---|---|---|---|---|---|---|
| MSSNG00342-003 | M | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | Paternal, mosaic | MSSNG—This paper | European | WGS | 3.639 |
| MSSNG00342-004 | F | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | Paternal, mosaic | MSSNG—This paper | European | WGS | 6.336 |
| 1-1047-003 | M | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | Maternal, mosaic | MSSNG—This paper | European | WGS | −1.167 |
| 2-1774-003 | M | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | MSSNG—This paper | European | WGS | 7.035 | |
| 7-0572-003 | M | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | MSSNG-DB7—This paper | European | WGS | 15.606 | |
| 1505221080 | M | g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.A1227Gfs*69 | This paper | N/A | Direct Sanger sequencing | Not estimated | |
| HNDS_0130-01 | F | g.50721512dup (hg38) | NM_033517.1 | c.3679dupG | p.A1227Gfs*69 | This paper | N/A | WES | Not estimated | |
| ASD-2 pt1 | M | NM_033517.1 | c.3679dup | Maternal, mosaic | Durand et al.[ | European | FISH and direct sequencing | Not estimated | ||
| ASD-2 pt2 | M | NM_033517.1 | c.3679dup | Maternal, mosaic | Durand et al.[ | N/A | FISH and direct sequencing | Not estimated | ||
| S7 | F | g.51159940dupG (hg19) | NM_033517.1 | c.3679dupG | p.A1227Gfs*69 | Non-paternal | De Rubeis et al.[ | N/A | WES | Not estimated |
| S8 | M | g.51159940dupG (hg19) | NM_033517.1 | c.3679dupG | p.A1227Gfs*69 | De Rubeis et al.[ | N/A | WES | Not estimated | |
| B1 | F | g.51159940dupG (hg19) | NM_033517.1 | c.3679dupG | p.A1227Gfs*69 | De Rubeis et al.[ | N/A | WES | Not estimated | |
| AU013503 | F | (hg19) | NM_033517.1 | c.3679dupG | p.Ala1227fs | Zhou et al.[ | Chinese | Target sequencing | Not estimated | |
| AU035703 | F | (hg19) | NM_033517.1 | c.3679dupG | p.Ala1227fs | Zhou et al.[ | Chinese | Target sequencing | Not estimated | |
| 14470.p1 | M | 22-51159932 -T-TG(hg19) | ENST00000262795.3 | c.3719_3720insG | p.Ala1243GlyfsTer6 | O’Roak et al.[ | European | WES - O’Roak et al. 2014 WGS – This paper | 6.718 | |
| ASD-685 | M | NM_033517.1 | c.3630dupG | p.L1210fs | Du et al.[ | Chinese | WES | Not estimated | ||
| G01-GEA-71-HI | F | 22:51159932:T:TG (hg19) | Satterstrom et al.[ | N/A | WES | Not estimated | ||||
| SP0051409 | F | 22- 51159932 -T-TG(hg19) | ENST00000262795.3 | c.3720dupG | p.L1240fs | Feliciano et al.[ | N/A | WES | Not estimated | |
| Farwell - N/A | N/A | N/A | NM_033517.1 | c.3679dupG | p.A1227Gfs*69 | Farwell et al.[ | American | WES | Not estimated | |
2ClinVar SCV000850848 History of Neurodevelopmental Disorder | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | N/A | Ambry Genetics | N/A | Clinical testing | Not estimated |
ClinVar SCV000244220 Inborn genetic diseases | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | N/A | Ambry Genetics | N/A | Clinical testing | Not estimated |
ClinVar SCV001149930 22q13.3 deletion syndrome | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | Institute of Human Genetics, Klinikum rechts der Isar | N/A | Clinical testing | Not estimated | |
ClinVar SCV001149930 22q13.3 deletion syndrome | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn | N/A | Clinical testing | Not estimated | |
ClinVar SCV000329516 No condition provided | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | N/A | GeneDx | N/A | Clinical testing | Not estimated |
ClinVar SCV001468904 No condition provided | N/A | Chr22: g.50721512dup (hg38) | NM_033517.1 | c.3679dup | p.Ala1227fs | N/A | Laboratoire de Génétique Moléculaire, CHU Bordeaux | N/A | Clinical testing | Not estimated |
Thicker lines box individuals from the same family. WGS whole-genome sequencing, WES whole-exome sequencing, PRS polygenetic risk score, N/A not available. Note that individual S7 described in De Rubeis[38] has not been formally diagnosed with ASD but has reported autism-associated phenotypes. We also detected one female individual with ASD carrying a de novo G deletion (7-G’s) at this same site (c.3679del p.Ala1227Profs*57).
1For the PRS in addition to the main text only SNPs with minor allele frequency of >0.05 in controls and high imputation quality (INFO >0.9) were included. SNPs within the broad MHC region (Chr6:25–35MB) were excluded as well as all ambiguous SNPs to avoid potential strand conflicts. Only variants with good sequencing quality (filter = PASS) were included. Clumping was done with r2 threshold and radius set at 0.1 and 500 kb, respectively. Subsequently, PRS was generated with p value threshold of 0.1 weighing by the additive scale effect (log (OR)) of each variant and summing over the variants using PLINK 1.9. The scores were centered by the mean in whole population. 2Variants submitted and cataloged by ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/variation/208759/), Accession VCV000208759.8, provided from clinical testing and interpreted as pathogenic.
Phenotype of ASD probands identified in MSSNG, SSC, and other publications containing the SHANK3 p.Ala1227Glyfs*69 variant Thicker lines box individuals from the same family.
| ID | SEX | ASD | Dysmorphia | Intellectual disability/developmental delay | Other Medical comorbidities | Psychiatric comorbidity | Other Neurological comorbidities | Other Organ anomalies | Language/speech disorder |
|---|---|---|---|---|---|---|---|---|---|
| 1MSSNG00342-003 | M | Yes | Macrocephaly Mandibular prognathism. Malar flattening | Severe ID | Bipolar disorder | Conductive hearing loss | yes | ||
| 1MSSNG00342-004 | F | Yes | Macrocephaly Asymmetric facial features (r; Mandibular prognathism; Prominent supraorbital ridge. Abnormal iris pigmentation. Hirsutism. Thick skin texture | Mild ID | Depression with psychotic tendencies, Self-injurious behavior; late regression | yes | |||
| 11-1047-003 | M | Yes | Severe ID | ADHD | Epilepsy | severe | |||
| 12-1774-003 | M | Yes | Severe ID | Food sensitivities, G.I. distress, Eczema, Sleep disturbance | Anxiety | Epilepsy Hypotonia, Hypermobility | Severe, with early language loss | ||
| 17-0572-003 | M | Yes | ID | Lactose intolerance | PICA (compulsive ingestion of inedible matter) | severe | |||
| 11505221080 | M | Yes | Moderate ID | ADHD | Developmental Coordination Disorder | ||||
| 2ASD-2 pt1 | M | Yes | large ears and elbow extension limitation | Moderate ID | Neonatal hypotonia | Severe | |||
| 2ASD-2 pt2 | M | Yes | Macrocephaly by 9 months of age, followed by slow growth | Severe ID | Epilepsy | severe | |||
| 2S7 | F | Not available | Mild ID | Hypotonia, Visuo-motor deficits | Coronary artery fistula | severe | |||
| 2S8 | M | Yes | Severe ID | Hypotonia/Dysphagia, Abnormal gait | yes | ||||
| 2B1 | F | Yes | Preauricular skin tags Finger and toe-tapping MRI: Bilateral T2 hyper-intensities of posterior centrum semiovale | Mild ID | Scoliosis Sleep disturbance | Hyperactivity | Constipation | Mild | |
| 2AU013503 | F | Yes | Developmental delay | yes | |||||
| 2AU035703 | F | Yes | Developmental delay | Sleep disturbance | Hyperactivity | Abnormal gait Gastrointestinal complaints | Yes, with regression | ||
| 114470.p1 | M | Yes | Severe ID | Epilepsy | Severe | ||||
| 2ASD-685 | M | Yes | ID | ||||||
| G01-GEA-71-HI | F | Yes | Data not available | ||||||
| SP0051409 | F | Yes | Data not available | ||||||
| Farwell - N/A | N/A | Yes | Data not available |
ID intellectual disability, DD developmental delay, ADHD Attention deficit hyperactivity disorder, N/A not available.
Note that individual S7 described in De Rubeis et al.[38]. has not been diagnosed with ASD but has some autism-associated phenotypes. Graphical representation of these phenotypes is presented in Fig. 2.
1available clinical diagnosis, or scores in clinically significant range
2descriptions from original manuscript
Fig. 2Phenotypic heterogeneity in individuals (X-axis) carrying the SHANK3 p.Ala1227Glyfs*69 variant reported in the MSSNG[6], SSC[32,33], and in published papers[4,36,38–42,71].
Those individuals in the same family are grouped within the black boxes. Gray spaces indicate the absence of the phenotype. White spaces indicate that the phenotype might have not been accessed in the proband. Phenotypic categories are described in Table 3. Individual S7 was not formally reported as being formally tested for ASD. *Caution is needed in the interpretation of these frequencies since some phenotypes were not assessed for some individuals.
Fig. 3Impact of the SHANK3 p.Ala1227Glyfs*69 variant on the protein.
A (top left) Guanine string containing 8 Gs found in non-affected individuals; (top right) Guanine string containing nine Gs found in ASD-affected individuals and parents with somatic mutations; (bottom) Location of the frequent guanine duplication in the SHANK3 gene. ANK ankyrin repeats, SH3 SRC homology 3 domain, PDZ postsynaptic density 95/Discs large/zona occludens, HBS homer binding site, CBS cortactin binding site, SAM sterile alpha motif domain. B Alignment of wild type protein sequences, for each of three highly expressed splice isoforms, to the protein sequence of the variant around the position of the mutation; (note, in this figure the first transcript presented is ENST00000262795.5 and the protein change for this is p.Ala1226Glyfs*69 as shown in Table 1). C Normalized impact of the variant for the three isoforms using FAIDR, a tool that identifies physical features and the presence of consensus protein recognition motifs in intrinsically disordered protein regions[56]. (*Note that SCD, sequence charge decoration, a measure of charge patterning associated with phase separation, has values significantly above 2: 5.4, 7.0, and 10.2 for the three isoform.).