Literature DB >> 31965078

Mobile element insertion detection in 89,874 clinical exomes.

Rebecca I Torene1, Kevin Galens2, Shuxi Liu2, Kevin Arvai2, Carlos Borroto2, Julie Scuffins2, Zhancheng Zhang2, Bethany Friedman2, Hana Sroka2, Jennifer Heeley3, Erin Beaver3, Lorne Clarke4, Sarah Neil4, Jagdeep Walia5, Danna Hull5, Jane Juusola2, Kyle Retterer2.   

Abstract

PURPOSE: Exome sequencing (ES) is increasingly used for the diagnosis of rare genetic disease. However, some pathogenic sequence variants within the exome go undetected due to the technical difficulty of identifying them. Mobile element insertions (MEIs) are a known cause of genetic disease in humans but have been historically difficult to detect via ES and similar targeted sequencing methods.
METHODS: We developed and applied a novel MEI detection method prospectively to samples received for clinical ES beginning in November 2017. Positive MEI findings were confirmed by an orthogonal method and reported back to the ordering provider. In this study, we examined 89,874 samples from 38,871 cases.
RESULTS: Diagnostic MEIs were present in 0.03% (95% binomial test confidence interval: 0.02-0.06%) of all cases and account for 0.15% (95% binomial test confidence interval: 0.08-0.25%) of cases with a molecular diagnosis. One diagnostic MEI was a novel founder event. Most patients with pathogenic MEIs had prior genetic testing, three of whom had previous negative DNA sequencing analysis of the diagnostic gene.
CONCLUSION: MEI detection from ES is a valuable diagnostic tool, reveals molecular findings that may be undetected by other sequencing assays, and increases diagnostic yield by 0.15%.

Entities:  

Keywords:  Mendelian disease; diagnostics; exome sequencing; mobile elements; rare disease

Mesh:

Year:  2020        PMID: 31965078      PMCID: PMC7200591          DOI: 10.1038/s41436-020-0749-x

Source DB:  PubMed          Journal:  Genet Med        ISSN: 1098-3600            Impact factor:   8.822


INTRODUCTION

Mobile elements are discrete segments of genomic DNA that can insert new copies elsewhere in the genome through an RNA intermediate. In humans, the vast majority of mobile elements no longer retain the ability to create new insertions.[1] A minority of mobile elements, primarily from the L1, Alu, and SVA families, remain active and are capable of producing new insertions.[2-4] It is estimated that 1 in 12–14 live human births has a de novo MEI.[5] As such, MEIs are an endogenous and ongoing source of variation in human genomes. MEIs cause disease by directly disrupting coding sequence or otherwise altering messenger RNA (mRNA). For example, the first disease-causing MEI variant identified in humans was a hemizygous variant in F8, causing hemophilia through loss of function.[6] A recent review created a comprehensive catalog of 124 disease-causing MEIs, and a subsequent study identified an additional 34 disease-causing MEIs in a number of genes and conditions.[7,8] MEIs are not routinely detected in clinical diagnostic testing except for some known founder events with moderate population frequencies.[9-11] Specialized variant-calling algorithms are needed to detect MEIs from next-generation sequencing (NGS) data, and the paucity of known disease-causing MEIs is thus likely due to ascertainment bias. Recently, diagnostic laboratories have been evaluating hereditary cancer genes for MEIs.[8,12] To date, there has been no systematic analysis of the role MEIs play more broadly in rare disease; however, it is estimated that MEIs are responsible for disease in 0.04% to 0.1% of individuals with suspicion of genetic disease.[5,13] We applied MEI discovery to a prospective cohort of clinical exome samples. In doing so, we determined the overall burden of diagnostic MEIs in a referral population for exome sequencing.

MATERIALS AND METHODS

Patients

We analyzed 89,874 clinical exome sequencing (ES) samples from 6 November 2017 to 31 August 2019 (Table S1, Fig. S1). Samples were sequenced by Illumina HiSeq or NovaSeq 2 × 100 or 2 × 150 reads after hybridization capture using either Agilent Clinical Research Exome or IDT xGen Exome v1.0 baits as previously described.[14] The study was conducted in accordance with all guidelines set forth by the Western Institutional Review Board, Puyallup, Washington (WIRB 20162523). Informed consent for genetic testing was obtained from all individuals undergoing testing, and WIRB waived authorization for use of de-identified aggregate data for these purposes. All positive findings were confirmed by Sanger sequencing and reported back to the ordering provider (Table S2). Patients for whom clinical data, including photos, are reported provided consent for their information to be presented. All reported variants were submitted to ClinVar and SCV IDs are pending. The general assertion criteria for variant classification are publicly available on the GeneDx ClinVar submission page (http://www.ncbi.nlm.nih.gov/clinvar/submitters/26957/).

MEI detection

MEI detection tools that rely on discordant read pairs have reduced sensitivity on NGS capture data relative to selection-free genome sequencing or to larger insert libraries where there would be higher rates of discordant read pairs. MEIs are underrepresented in targeted sequence fragments, and smaller insert-size libraries used to increase on-target percentage lead to fewer discordant read pairs spanning MEIs. Our clinical exomes have low rates of discordant read pairs (mean 1.4%, Fig. S2). Even so, MEIs occurring within a targeted capture region produce reads that partially map to the reference genome and partially map to MEI sequence (i.e., clipped reads). We therefore developed a custom MEI detection tool called SCRAMble (Soft Clipped Read Alignment Mapper) for application to targeted capture sequencing. In brief, SCRAMble identifies clusters of soft clipped reads in a BAM file, builds consensus sequences, aligns to representative L1Ta, AluYa5, and SVA-E sequences, and outputs MEI calls (Fig. S3, Table S3). We estimate a technical sensitivity of 85.0–91.5%. and a precision of 93.8–99.9% (Supplementary Methods, Table S4, Fig. S4). All reported MEI calls were confirmed by Sanger sequencing.

Comparison with other MEI callers

We compared runtimes and MEI calls for MELT,[15] Mobster,[16] and SCRAMble for 1075 sequential ES samples from January 2019 (Fig. S5). We also resequenced 12 of the 14 MEI positive samples from our cohort, ran all three MEI callers, and examined recall of the pathogenic MEI (Supplementary Methods). SCRAMble had the highest overall recall rate of known MEI polymorphisms in targeted regions (Table S5). SCRAMble also had the highest recall rate of pathogenic MEIs in resequenced positive samples (Table S5). For targeted sequencing, it may be valuable to apply MEI detection that does not rely upon discordant read pairs.

Statistical analysis

All statistical analyses, unless otherwise noted, were performed in the R computing environment.

Code availability

SCRAMble source code is available on GitHub for noncommercial use (https://github.com/GeneDx/scramble).

RESULTS

In November 2017, we began prospectively detecting MEIs in samples referred to our laboratory for clinical ES. In this study, we examined 89,874 samples from 38,871 cases. Among these individuals were 21,806 complete child–parent trios. The probands were referred for genetic testing for a variety of clinical indications with neurodevelopmental delay being the most common (Fig. S1). SCRAMble does not distinguish between heterozygous and homozygous genotypes, however, we use postprocessing steps in our pipeline to estimate genotype based on the SCRAMble call and local sequence coverage. Herein, we refer to MEIs detected in a given sample as “calls” and refer to unique MEI sites as “variants.” There were 1,101,790 calls of 23,014 MEI variants (12,380 Alus, 8531 L1s, and 2103 SVAs) for an average of 12.2 MEI calls per person (Fig. S6). The MEIs in this study show hallmarks of L1-mediated target primed reverse transcription including a median target site duplication of 14 bp, variable 5′ truncation, and similarity to the 5′TT/AAAA3′ endonuclease recognition sequence at the insertion site (unpublished data). In this study, 8753/38,871 cases (22.5%) had a positive molecular diagnosis. Fourteen MEIs were classified as pathogenic or likely pathogenic by applying American College of Medical Genetics and Genomics (ACMG) criteria as previously described[14] and confirmed by Sanger sequencing (Table 1). Of these, 13 were sufficient to explain the patients’ phenotypes, and one was identified as a secondary finding in BRCA2. Thus, diagnostic MEIs accounted for 13/8753 (0.15%, binomial test 95% confidence interval [CI] 0.08–0.25%) of the positive cases and 13/38,871 (0.03%, binomial test 95% CI 0.02–0.06%) of all cases. An Alu insertion identified in MAK in one patient (Table 1) is a known founder event in Ashkenazi Jews.[11] The secondary finding MEI in another patient (Table 1) was of the same MEI family, in the same orientation, and in the same position as a pathogenic variant that was previously reported;[8] however, Qian et al. reported a full length Alu while the Alu observed here was truncated at the 5’ end by 220 bp (Table 1). The remaining pathogenic MEIs have not been previously described to the best of our knowledge.
Table 1

Reported pathogenic mobile element insertions (MEIs).

GeneClassificationTranscriptAnnotationInheritanceOrientationTSD5’ truncation
ATMLPATHNM_139312c.169–4insL1AR21 bp8 bp
BRCA2aPATHNM_000059.3c.3407_3408insAluAD+17 bp220 bp
CHD8LPATHNM_001170629.1c.5419_5420insAluAD+16 bpNone
DEPDC5PATHNM_001242896c.2958_2959insAluAD15 bp211 bp
EFTUD2PATHNM_001258353c.1277_1278insAluAD16 bp2 bp
ETFBPATHNM_001985c.426_427insAluAR+16 bpNone
MAKPATHNM_001242385c.1297_1298insAluAR11 bpNone
NSD1PATHNM_022455.4c.1926_1927insAluAD17 bpNone
OCRLPATHNM_001587c.2557_2558insL1XL17 bp5657 bp
OFD1PATHNM_001330209c.416_417insAluXL15 bpNone
RPL11PATHNM_000975c.375_376insSVAAD16 bp716 bp
SLC26A3LPATHNM_000111c.131+3insAluAR13 bp1 bp
USH2APATHNM_206933.2c.8932_8933insAluAR14 bp1 bp
ZEB2PATHNM_014795c.1600_1601insAluAD9 bpNone

Orientation is relative to hg19.

AD autosomal dominant, AR autosomal recessive, Ins insertion, LPATH likely pathogenic, PATH pathogenic, TSD target site duplication, XL X-linked.

aSecondary finding. All other MEIs are considered diagnostic.

Reported pathogenic mobile element insertions (MEIs). Orientation is relative to hg19. AD autosomal dominant, AR autosomal recessive, Ins insertion, LPATH likely pathogenic, PATH pathogenic, TSD target site duplication, XL X-linked. aSecondary finding. All other MEIs are considered diagnostic. Twelve of the 13 probands with a diagnostic MEI had prior genetic tests that were negative, 8 of whom had sequencing-based tests. For three of the patients, the gene that contained the diagnostic MEI had been previously sequenced, but no positive results had been reported. Among the 13 diagnosed patients were those with multiple congenital anomalies, neurodevelopmental delay, and abnormalities of the eye, musculature, and metabolism. These are too few diagnostic MEIs from which to draw broader conclusions; however, we expect that the yield of diagnostic MEIs for various disease areas mirrors the referral population for ES and that pathogenic MEIs are not restricted by any particular disease. In one case, prenatal ES identified a novel homozygous Alu insertion in exon 4 of ETFB, in a fetus with bilaterally enlarged microcystic kidneys, echogenic bowel, unilateral postaxial polydactyly, and an abnormal placenta. Biallelic loss of function variants in ETFB cause glutaric acidemia IIB.[17] The parents were each confirmed to be heterozygous for the MEI. Consanguinity was denied and confirmed by kinship analysis from ES data. In addition, a 574-kb run of homozygosity was found in the proband covering the MEI site suggesting a possible Eastern European founder event approximately 35 generations ago (Fig. S7). In another case, a female with global developmental delay, macrocephaly, agenesis of the corpus callosum, gray matter dysplasia, interhemispheric cysts, natal teeth, cleft tongue, lingual cyst, dysmorphic facial features, and digital anomalies was referred for ES with a differential diagnosis of oral–facial–digital (OFD) syndrome (Fig. 1). The family had previously pursued genetic testing by Sanger sequencing of OFD1 with negative results, but ES with MEI detection revealed a de novo Alu insertion in exon 6 (Fig. 1).
Fig. 1

Case report: de novo Alu insertion in OFD1 causes oral–facial–digital syndrome.

a Patient with a clinical diagnosis of oral–facial–digital syndrome for whom a diagnostic Alu was identified in exon 6 of the OFD1 gene. b Clipped read evidence of a minus strand Alu can be seen in proband, but not in either parent indicating a de novo event.

Case report: de novo Alu insertion in OFD1 causes oral–facial–digital syndrome.

a Patient with a clinical diagnosis of oral–facial–digital syndrome for whom a diagnostic Alu was identified in exon 6 of the OFD1 gene. b Clipped read evidence of a minus strand Alu can be seen in proband, but not in either parent indicating a de novo event. A 40-year-old male patient with intellectual disability, dilated cardiomyopathy, macrocephaly, kidney disease, scoliosis, and 2–3 toe syndactyly was referred for ES. This patient previously had microarray, fragile X, and Marfan-like connective tissue NGS panel testing, all with negative results. By applying MEI detection, an Alu insertion was detected in exon 10 of NSD1 that was absent from the mother and presumed de novo. No sample from the father was submitted for testing. Heterozygous, loss of function variants in NSD1 have been identified in patients with Sotos syndrome, which has substantial phenotypic overlap with the patient.

DISCUSSION

Mobile elements are a source of pathogenic variants in a referral patient population with suspected genetic etiology. By applying mobile element detection to our clinical diagnostic pipeline, we are able to improve diagnostic yield and provide clinicians and patients with diagnoses that are otherwise not apparent. MEIs account for positive molecular diagnosis in more than 1 in 700 diagnosed cases, a similar rate to one previously described.[18] Similarly, Gardner et al. describe an analogous study using nearly 10,000 exome sequenced trios for probands with neurodevelopmental delay (NDD).[5] We observe similar diagnostic rates (4/9738, 0.04% in Gardner et al., and 13/38,871, 0.03% in this study, two-sided Fisher p value 0.76). Thus, despite using different MEI detection methods (MELT[15] vs. SCRAMble), different sequencing methods,[14,19] and having independent cohorts, our study and Gardner et al., largely recapitulate each other. We identified 13 diagnostic, pathogenic MEIs that were sufficient to explain patient phenotypes in individuals with rare disease and one pathogenic MEI as a secondary finding. Of note, all but one of these MEIs were novel and would not have been detected by genotyping of known founder events. The pathogenic MEIs described here, although few, were of substantial consequence to the patients and families involved. For the case of the 11-year-old girl with a clinical diagnosis of oral–facial–digital syndrome (Fig. 1), OFD1 had already been sequenced before the diagnostic MEI was found. Her first sequencing test was by Sanger sequencing. We suspect that the Alu insertion caused preferential polymerase chain reaction (PCR) amplification of the shorter, wild-type allele causing allele dropout and blinding the lab to the MEI. Before having a confirmed molecular diagnosis, there was the potential of unanticipated comorbidities of a condition of unknown molecular etiology. With a definitive molecular diagnosis, the patient’s condition can be confidently managed. Likewise, a de novo diagnostic MEI was found in NSD1 that led to a diagnosis of Sotos syndrome in an adult male. A confirmed diagnosis allows the proband to be appropriately screened and managed for potential comorbidities of Sotos syndrome such as cardiac and renal anomalies. A limitation of our current diagnostic pipeline for MEIs is our inability to interpret noncoding MEIs on a case-by-case basis. There are now multiple examples of noncoding MEIs that cause Mendelian disease.[7] It is possible we are detecting more disease-causing MEIs, but are unable to classify them as pathogenic since they are in noncoding regions. Altogether, diagnostic MEIs accounted for a small proportion of positive cases (0.15%). We consider this the lower limit of the true diagnostic rate since MEI-containing sequence fragments are not captured as efficiently as wild-type alleles in library preparation and are thus underrepresented in capture-based NGS data. We expect the rate of diagnostic MEIs to climb as (1) improvements to bioinformatics tools and wet lab methods, including the transition to PCR-free clinical genome sequencing, increase the sensitivity for detecting MEIs, and (2) RNA and functional studies are performed to better understand the impact of regulatory and intronic MEIs. In the meantime, MEI discovery and genotyping are providing much needed answers for patients and families, including those who have already had genetic testing performed. Supplementary information
  12 in total

1.  A BBS1 SVA F retrotransposon insertion is a frequent cause of Bardet-Biedl syndrome.

Authors:  Clarisse Delvallée; Samuel Nicaise; Manuela Antin; Anne-Sophie Leuvrey; Elsa Nourisson; Carmen C Leitch; Georgios Kellaris; Corinne Stoetzel; Véronique Geoffroy; Sophie Scheidecker; Boris Keren; Christel Depienne; Joakim Klar; Niklas Dahl; Jean-François Deleuze; Emmanuelle Génin; Richard Redon; Florence Demurger; Koenraad Devriendt; Michèle Mathieu-Dramard; Christine Poitou-Bernert; Sylvie Odent; Nicholas Katsanis; Jean-Louis Mandel; Erica E Davis; Hélène Dollfus; Jean Muller
Journal:  Clin Genet       Date:  2020-11-14       Impact factor: 4.438

2.  Characterizing mobile element insertions in 5675 genomes.

Authors:  Yiwei Niu; Xueyi Teng; Honghong Zhou; Yirong Shi; Yanyan Li; Yiheng Tang; Peng Zhang; Huaxia Luo; Quan Kang; Tao Xu; Shunmin He
Journal:  Nucleic Acids Res       Date:  2022-03-21       Impact factor: 16.971

Review 3.  Evaluation of copy number variants for genetic hearing loss: a review of current approaches and recent findings.

Authors:  Wafaa Abbasi; Courtney E French; Shira Rockowitz; Margaret A Kenna; A Eliot Shearer
Journal:  Hum Genet       Date:  2021-11-22       Impact factor: 4.132

4.  Retrotransposon insertion as a novel mutational cause of spinal muscular atrophy.

Authors:  Myriam Vezain; Christel Thauvin-Robinet; Yoann Vial; Sophie Coutant; Séverine Drunat; Jon Andoni Urtizberea; Anne Rolland; Agnès Jacquin-Piques; Séverine Fehrenbach; Gaël Nicolas; François Lecoquierre; Pascale Saugier-Veber
Journal:  Hum Genet       Date:  2022-09-23       Impact factor: 5.881

5.  Noncanonical Splice Site and Deep Intronic FRMD7 Variants Activate Cryptic Exons in X-linked Infantile Nystagmus.

Authors:  Junwon Lee; Han Jeong; Dongju Won; Saeam Shin; Seung-Tae Lee; Jong Rak Choi; Suk Ho Byeon; Helen J Kuht; Mervyn G Thomas; Jinu Han
Journal:  Transl Vis Sci Technol       Date:  2022-06-01       Impact factor: 3.048

6.  Detection of mobile elements insertions for routine clinical diagnostics in targeted sequencing data.

Authors:  German Demidov; Joohyun Park; Sorin Armeanu-Ebinger; Cristiana Roggia; Ulrike Faust; Isabell Cordts; Maria Blandfort; Tobias B Haack; Christopher Schroeder; Stephan Ossowski
Journal:  Mol Genet Genomic Med       Date:  2021-09-07       Impact factor: 2.183

Review 7.  Variant calling: Considerations, practices, and developments.

Authors:  Stepanka Zverinova; Victor Guryev
Journal:  Hum Mutat       Date:  2021-12-16       Impact factor: 4.700

Review 8.  How to proceed after "negative" exome: A review on genetic diagnostics, limitations, challenges, and emerging new multiomics techniques.

Authors:  Saskia B Wortmann; Machteld M Oud; Mariëlle Alders; Karlien L M Coene; Saskia N van der Crabben; René G Feichtinger; Alejandro Garanto; Alex Hoischen; Mirjam Langeveld; Dirk Lefeber; Johannes A Mayr; Charlotte W Ockeloen; Holger Prokisch; Richard Rodenburg; Hans R Waterham; Ron A Wevers; Bart P C van de Warrenburg; Michel A A P Willemsen; Nicole I Wolf; Lisenka E L M Vissers; Clara D M van Karnebeek
Journal:  J Inherit Metab Dis       Date:  2022-05-22       Impact factor: 4.750

9.  Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders.

Authors:  Susan M Hiatt; James M J Lawlor; Lori H Handley; Ryne C Ramaker; Brianne B Rogers; E Christopher Partridge; Lori Beth Boston; Melissa Williams; Christopher B Plott; Jerry Jenkins; David E Gray; James M Holt; Kevin M Bowling; E Martina Bebin; Jane Grimwood; Jeremy Schmutz; Gregory M Cooper
Journal:  HGG Adv       Date:  2021-01-16

10.  One in seven pathogenic variants can be challenging to detect by NGS: an analysis of 450,000 patients with implications for clinical sensitivity and genetic test implementation.

Authors:  Stephen E Lincoln; Tina Hambuch; Justin M Zook; Sara L Bristow; Kathryn Hatchell; Rebecca Truty; Michael Kennemer; Brian H Shirts; Andrew Fellowes; Shimul Chowdhury; Eric W Klee; Shazia Mahamdallie; Megan H Cleveland; Peter M Vallone; Yan Ding; Sheila Seal; Wasanthi DeSilva; Farol L Tomson; Catherine Huang; Russell K Garlick; Nazneen Rahman; Marc Salit; Stephen F Kingsmore; Matthew J Ferber; Swaroop Aradhya; Robert L Nussbaum
Journal:  Genet Med       Date:  2021-05-18       Impact factor: 8.822

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.