| Literature DB >> 34838103 |
Rebeca Borges-Monroy1,2,3, Chong Chu3, Caroline Dias1,4, Jaejoon Choi1,2,5, Soohyun Lee3, Yue Gao1,2,6, Taehwan Shin1,2,6, Peter J Park3, Christopher A Walsh7,8,9,10,11, Eunjung Alice Lee12,13,14.
Abstract
BACKGROUND: Retrotransposons have been implicated as causes of Mendelian disease, but their role in autism spectrum disorder (ASD) has not been systematically defined, because they are only called with adequate sensitivity from whole genome sequencing (WGS) data and a large enough cohort for this analysis has only recently become available.Entities:
Keywords: Alu; Autism spectrum disorder; LINE-1; Neurobiology; Polymorphic insertions; Retrotransposons; SVA; Transposable elements; de novo insertions; de novo rates
Year: 2021 PMID: 34838103 PMCID: PMC8627061 DOI: 10.1186/s13100-021-00256-w
Source DB: PubMed Journal: Mob DNA
Fig. 1Detection of transposable element insertions (TEIs) in the SSC cohort. A Pipeline and analysis overview. Quad and trio bam files were analyzed for TEIs using a dockerized version of xTea on the cloud in Amazon Web Services (AWS). Candidate TE insertions were filtered using xTea filters, and filters for regions of the genome with reference and known non-reference TEIs for a high confidence set. A custom pipeline for detection of de novo insertions was used, and candidates were manually inspected on the Integrative Genomics Viewer. Enrichment or depletion of TEIs in ASD genes, high pLI genes, genomic regions, and regulatory regions in fetal brain development was tested by simulation analyses. A subset of candidates was validated by full-length PCR. B Mean number of TEIs detected in the SSC cohort with standard deviation. C Percentage of insertions in the SSC cohort that were not found in previous studies (novel) or overlap with TEIs from previous analyses (known) for all TEIs including those in parents and children (left) and Venn diagram showing overlap with other large cohort studies for TEIs detected in unrelated parental samples in our cohort (right). D Cumulative fraction of TEIs in unrelated parental samples which are found at a certain population allele frequency (PAF) within the SSC cohort. 94% L1, 92% Alu, and 95% SVA insertions show < 1% PAF
Fig. 2Rates of de novo TEIs. A Combined rates of de novo TEIs per birth for ASD and controls compared to previous studies. The adjusted rate in our study accounts for lower sensitivity for detecting TEIs in short-read Illumina data compared to long-read sequencing data. B Rates of de novo TEIs per birth in probands with ASD and unaffected siblings (controls)
Fig. 3Enrichment of de novo TEIs in SFARI ASD genes. Observed numbers of de novo TEIs in a list of complied ASD genes are marked by red dots. Black dots and lines represent mean numbers and 95% confidence intervals of expected TEIs based on 10,000 random simulations, respectively. More de novo L1 insertions in ASD genes than expected are observed in cases only
Select de novo insertions in ASD and high pLI genes in affected individuals
| Proband ID | TE Type | Gene | SFARI Classification | pLI | Genic Region | Validation Status | Observed Phenotype | Previous Neurodevelopmental Phenotype associated with gene | Reference |
|---|---|---|---|---|---|---|---|---|---|
| 11,859.p1 | Alu | CSDE1 | No | 1 | Exon | Validated | ASD, language delay, ID, macrocephaly, history of vision correction, normal EEG at 4 years | LGD variants associated with ASD, developmental delay, ID, seizures, macrocephaly, ADHD, anxiety, ocular abnormalities | Guo et al. 2019 [ |
| 14,565.p1 | Alu | KBTBD6 | No | 0.935 | Exon | Validated | ASD, macrocephaly, uncoordinated, normal IQ, BMI Z-score − 3.91 | ||
| 12,548.p1 | Alu | APPBP2 | No | 0.999 | Intron | Validated | ASD, normal IQ, macrocephaly | ||
| 12,748.p1 | Alu | SYT1 | Syndromic | 0.837 | Intron | Validated | ASD, normal IQ, uncoordinated | Developmental delays, autistic features, hypotonia, ocular abnormalities, hyperkinetic movements associated with de novo missense variation | Baker et al., 2018 [ |
| 13,931.p1 | Alu | OTUD7A | Suggestive evidence | 0.975 | Intron | Validated | ASD, borderline IQ, normal EEG, and brain imaging | Neurodevelopmental phenotype of ASD, developmental delay, ID, seizures associated with 15q13.3 microdeletion syndrome | Yin et al. 2018 [ |
| 13,107.p1 | Alu | TOX3 | No | 0.994 | Intron | Validated | ASD, normal IQ | ||
| 14,315.p1 | Alu | JAZF1 | No | 0.958 | Intron | Validated | ASD, borderline verbal IQ, normal nonverbal IQ, normal EEG | ||
| 11,196.p1 | L1 | SRGAP3 | Minimal Evidence | 1 | Intron | Validated | ASD, above average IQ, no history of seizures, heart problems reported | Case report of translocation breakpoint at loci posited to be LoF associated with hypotonia and severe ID | Endris et al. 2002 [ |
| 13,684.p1 | L1 | HCN1 | Syndromic | 0.953 | Intron | NA | ASD, Tourette syndrome, above average IQ, GI problems, uncoordinated | Missense variation associated with a syndrome of seizures, intellectual disability, and autistic features, gene also implicated in Tourette syndrome, role in striatal neuronal function and enteric nervous system | Nava et al. 2014 [ |
| 14,080.p1 | L1 | DAB1 | Hypothesized | 0.981 | Intron | Validated | ASD, uncoordinated, GI problems | ASD, GI problems, schizophrenia, spinocerebellar ataxia-37 associated with non-coding nucleotide repeats | Corral-Juan et al. 2018 [ |
| 14,282.p1 | L1 | DPYD | Suggestive evidence | 0 | Intron | Not in LCL DNA, predicted blood mosaic | ASD, normal IQ | ASD, ID | Carter et al. 2011 [ |
| 11,234.p1 | L1 | NOTCH2 | No | 1 | Intron | NA | ASD, above average IQ | ||
| 13,451.p1 | L1 | DPP10 | Suggestive evidence | 1 | Intron | NA | ASD, borderline IQ | ASD | Marshall et al. 2008 [ |
| 14,404.p1 | SVA | GRAMD1B | No | 0.985 | Intron | NA, predicted blood mosaic | ASD, non-verbal, IQ in profound intellectual disability range, macrocephaly | Autosomal recessive intellectual disability | Santos-Cortez et al. 2018 [ |
| 14,523.p1 | SVA | ACACA | No | 1 | Intron | NA | ASD, above average IQ, macrocephaly | Acetyl-CoA carboxylase deficiency | Blom et al. 1981 [ |
A subset of de novo TEIs observed in individuals with ASD in genes relevant to ASD or with a high probability of being loss-of-function intolerant (pLI > 0.9). aSFARI annotations were obtained in 2019. LCL Lymphoblastoid cell line, LGD Likely gene disrupting, ID Intellectual disability, LoF Loss of function, ADHD Attention-deficit hyperactivity disorder, GI Gastrointestinal.
Fig. 4Genomic distribution of polymorphic and de novo TEIs. A 10,000 random simulations were performed for both polymorphic and de novo TEIs based on the observed rates. Log2 fold change of observed compared to expected counts in different genomic regions are shown for coding and gene regulatory regions. 95% confidence intervals were estimated based on the empirical distribution of the random simulations. Polymorphic TEIs from parental individuals are depleted in exons and regulatory regions in the developing fetal brain. De novo Alu (A) and L1 insertions (B) do not show this depletion compared to 10,000 random simulations. Two-sided empirical p-values and Benjamini–Yekutieli q-values based on multiple correction of all enrichment and depletions performed are represented
Fig. 5Full-length PCR validations and visual inspection. A Full-length PCR validation of the Alu insertion in CSDE1 and of the de novo L1 insertion in DAB1 in ASD cases. In lymphoblastoid cell line DNA, we validated the insertions in the ASD proband only. NTC: non-template control. B Integrative Genomics Viewer image at the insertion site in gene CSDE1 in an ASD case. For each individual, the sequencing coverage (top) and sequencing reads (bottom) are shown. The insertion shows the canonical signatures of target-primed reverse transcription (TPRT)-mediated retrotransposition: 15 bp target site duplication (TSD) between the two insertion breakpoints, a poly-A tail, supporting clipped reads, and discordant reads with mates mapping to the consensus Alu sequence. The mother has one small clipped read sequence at the breakpoint which has the same sequence as in the proband, suggesting that the insertion could be mosaic at a low allele frequency in the mother’s blood