Literature DB >> 28124432

Detection of genomic rearrangements from targeted resequencing data in Parkinson's disease patients.

Nino Spataro1, Ana Roca-Umbert1, Laura Cervera-Carles2,3, Mònica Vallès1, Roger Anglada4, Javier Pagonabarraga2,3, Berta Pascual-Sedano2,3,5, Antònia Campolongo2,3,5, Jaime Kulisevsky2,3,5, Ferran Casals4, Jordi Clarimón2,3, Elena Bosch1.   

Abstract

BACKGROUND: The analysis of coverage depth in next-generation sequencing data allows the detection of gene dose alterations. We explore the frequency of such structural events in a Spanish cohort of sporadic PD cases.
METHODS: Gene dose alterations were detected with the eXome-Hidden Markov Model (XHMM) software from depth of coverage in resequencing data available for 38 Mendelian and other risk PD loci in 394 individuals (249 cases and 145 controls) and subsequently validated by quantitative PCR.
RESULTS: We identified 10 PD patients with exon dosage alterations in PARK2, GBA-GBAP1, and DJ1. Additional functional variants, including 2 novel nonsense mutations (p.Arg1552Ter in LRRK2 and p.Trp90Ter in PINK1), were confirmed by Sanger sequencing. This combined approach disclosed the genetic cause of 12 PD cases.
CONCLUSIONS: Gene dose alterations related to PD can be correctly identified from targeting resequencing data. This approach substantially improves the detection rate of cases with causal genetic alterations.
© 2016 The Authors. Movement Disorders published by Wiley Periodicals, Inc. on behalf of International Parkinson and Movement Disorder Society. © 2016 International Parkinson and Movement Disorder Society.

Entities:  

Keywords:  Parkinson's disease; XHMM software; next generation sequencing; structural variants

Mesh:

Year:  2016        PMID: 28124432      PMCID: PMC5297984          DOI: 10.1002/mds.26845

Source DB:  PubMed          Journal:  Mov Disord        ISSN: 0885-3185            Impact factor:   10.338


Next‐generation sequencing (NGS) technologies such as whole‐genome, whole‐exome, and custom targeting sequencing strategies are increasingly being used to understand the genetic factors underlying both common and rare neurological disorders.1, 2 Notably, the recent advent of the NGS technologies has been accompanied by the implementation of new bioinformatic tools to detect copy number variants (CNVs) from the depth of read mapping, even after applying specific targeted enrichment.3, 4, 5, 6, 7 Most of these new tools are based on the assumption that differences in the depth of coverage among specific genomic regions and across multiple samples can be used as an indicator of the relative number of copies, resulting in a semiquantitative estimation of CNVs. However, as far as we know, this approach has never been attempted in PD. The main aim of this study was to explore whether CNVs help to explain some sporadic PD patients in which no causative point mutations are found. In particular, we used our own resequencing data of 38 PD‐associated genes in 249 PD cases and 145 unrelated controls of European ancestry8 to study the presence of structural variants predicted by eXome‐Hidden Markov Model (XHMM) software (https://atgu.mgh.harvard.edu/xhmm/), which was specifically developed to recover information on copy number variation from normalized read depth data obtained from targeted sequencing experiments.3 Subsequently, all the predicted CNVs were validated through quantitative polymerase chain reaction (PCR). Moreover, detailed frequencies for all potentially functional exon dose alterations detected here and for previously described pathogenic single nucleotide polymorphisms (SNPs) and indels in the same resequencing dataset are provided to understand their relative relevance in PD.

Methods

Participants and Targeted Resequencing Data

Targeted resequencing data from 394 individuals including 249 idiopathic PD cases and 145 unrelated controls of European origin were compiled from Spataro et al. (2015).8 Details on eligibility criteria, clinical and demographic features of PD patients and targeting resequencing design are available in Appendix 1 (Supplementary Data).

Coverage and Detection of CNVs

The mean coverage per target and sample was 49.39X, and 91% of the target bases were covered at ≥15X depth.8 The detection of CNVs was performed with the XHMM software, which uses principal component analysis normalization and a hidden Markov model to detect and genotype CNVs from normalized read‐depth data from targeted sequencing experiments.3 Phred‐scaled quality scores for the CNV events in the inferred intervals ranged from 30 to 99, with a mean of 80 (see Table S1). Gene dose alterations in Mendelian PD genes were then subsequently validated by quantitative PCR (see details in Appendix 2 in the Supplementary Data).

PCR and Sanger Sequencing Validation Analysis

Frameshift deletions and stop gain mutations within the PARK2, LRRK2 and PINK1 genes plus genomic rearrangements around the GBAGBAP1 region were confirmed by PCR and Sanger sequencing analyses (see details in Appendix 3 in the Supplementary Data).

Collapsing Tests

The potential enrichment of exon dosage alterations was tested in the whole set of cases and controls using the full list of CNVs reported in Table S1. CNV enrichment was performed with the VariantTools package (http://varianttools.sourceforge.net/), which includes up to 12 different collapsing tests.9

Results

Our study of structural genetic alterations across the 38 PD‐associated genes previously sequenced through NGS8 disclosed a total of 11 structural variants in the PARK2, GBA, and DJ1 genes, affecting 10 of 249 PD cases. All of these genomic alterations were predicted by the XHMM sofware3 (see details in Table S1) and were subsequently validated by quantitative PCR (Figure 1 and Figure S1). CNVs were also detected in 2 of 145 controls in a heterozygous state for the recessive gene PARK2 and the GBAGBAP1 region. The clinical features of the PD samples in which structural variants were found are provided in Appendix 1 (Supplementary Data). Additional functional sequence alterations detected in the same dataset are listed in Table 1 (see Table S2 for complete genotypes).
Figure 1

Detection of copy number variation: (A) PARK2 gene, (B) GBA‐GBAP1 region, and (C) DJ1 gene. In the upper panel, sequencing depth of coverage for those samples inferred to carry copy number variants by the eXome‐Hidden Markov Model (XHMM) software (https://atgu.mgh.harvard.edu/xhmm/) is indicated in colors along each exon; the color matches that of the individual label on the Y axis of the bottom panel. In all other samples, the gray background represents sequencing depth of coverage. In the bottom panel, a schematic representation of the corresponding validated copy number variants is presented for each gene. Red bars, deletion; light blue bars, duplication; black diamonds, frameshift indels. The bottom track represents a schematic representation of the gene structure.

Table 1

Summary of pathogenic alterations found in Mendelian genes related to PD.

GeneMutation type DNAchange Proteinchange Cases (2N = 498) Controls (2N = 290)Sample ID (coverage)
PARK2 Missensec.802G>A1 p.Arg234Gln30Cas74 (58), Cas172(126), Cas214 (75)
(M Recessive)Frameshiftc.154delA2 p.Asn52Metfs31Cas211 (31), Cas246ab (32), Cas20ab(30), Con142(45)
Exon deletionEx3‐4del30Cas57c (0)d, Cas246 (20)d
Exon deletionEx2del20Cas232c (0)d
Missensec.1244C>A3 p.Thr415Asn10Cas211 (76)
Missensec.574A>C4 p.Met192Leu10Cas76 (72)
Exon deletionEx2‐4del10Cas20 (34)d
Exon deletionEx3‐6del10Cas241 (24)d
Exon duplicationEx3dup20Cas148c (149)d
Frameshiftc.220‐221insGT5 p.Trp74Cysfs10Cas11 (29)
Frameshiftc.101‐102delAG6 p.Gln34Argfs10Cas241ab (25)
LRRK2 (M Dominant)Missensec.6055G>A7 p.Gly2019Ser30Cas213 (126), Cas226 (39), Cas113 (108)
Stop gainc.4654C>T8 p.Arg1552Ter10Cas55a (81)
DJ1 Exon deletionEx4del10Cas136 (36)d
(M Recessive)
PINK1 Stop gainc.270G>A8 p.Trp90Ter20Cas154ac (2)
(M Recessive)Stop gainc.1366C>T9 p.Gln456Ter10Cas194 (30)
GBA RecNcil10Cas103 (61)d

1rs144032774 (C/T), 2rs754809877 (T/‐), 3rs778125254 (G/T), 4rs9456735 (T/G), 5rs746646126 (‐/CA), 6rs55777503 (CT/‐), 7rs34637584 (G/A), 8Not reported before, 9rs45539432 (C/T), aConfirmed with Sanger sequencing, bHemizygous, cHomozygous, dAverage coverage across duplication/deletion. Abbreviations: 2N, number of chromosomes.

Summary of pathogenic alterations found in Mendelian genes related to PD. 1rs144032774 (C/T), 2rs754809877 (T/‐), 3rs778125254 (G/T), 4rs9456735 (T/G), 5rs746646126 (‐/CA), 6rs55777503 (CT/‐), 7rs34637584 (G/A), 8Not reported before, 9rs45539432 (C/T), aConfirmed with Sanger sequencing, bHemizygous, cHomozygous, dAverage coverage across duplication/deletion. Abbreviations: 2N, number of chromosomes. Detection of copy number variation: (A) PARK2 gene, (B) GBAGBAP1 region, and (C) DJ1 gene. In the upper panel, sequencing depth of coverage for those samples inferred to carry copy number variants by the eXome‐Hidden Markov Model (XHMM) software (https://atgu.mgh.harvard.edu/xhmm/) is indicated in colors along each exon; the color matches that of the individual label on the Y axis of the bottom panel. In all other samples, the gray background represents sequencing depth of coverage. In the bottom panel, a schematic representation of the corresponding validated copy number variants is presented for each gene. Red bars, deletion; light blue bars, duplication; black diamonds, frameshift indels. The bottom track represents a schematic representation of the gene structure.

Structural Variants in the PARK2 Gene

We identified 4 different exon deletions and 1 exon duplication in the PARK2 gene, affecting a total of 6 PD cases (Figure 1A). All of them have already been described in PD cases.10, 11 The 2 largest PARK2 deletions spanned from exon 3 to 6 and from exon 2 to 4 and were found in heterozygosis in Cas241 and Cas20, respectively. These 2 PD cases are examples of compound heterozygotes because both individuals also present different frameshift mutations in exon 2, causing a premature stop codon: Cas241 is heterozygote for a dinucleotide deletion (p.Gln34Argfs), whereas Cas20 is hemizygote for a single‐nucleotide deletion (p.Asn52Metfs; see Table 1, Table S2, and Figure S2). The PARK2 deletion comprising exons 3 to 4 was found in 2 different PD cases: Cas57, which is homozygote for the deletion, and Cas246, which is an additional PARK2 compound heterozygote also carrying the p.Asn52Metfs mutation. Finally, Cas232 is found homozygote for a PARK2 deletion affecting exon 2, and Cas148 is homozygote for a duplication affecting exon 3. As expected, early age at onset (before 45 years of age) was found in 4 of the 6 patients with structural variants in PARK2.

Structural Variants in the GBA‐PGBA1 Region

The high homology between GBA and its neighboring pseudogene (GBAP1), which share 96% of sequence identity, not only explains several gene‐pseudogene rearrangements and gene‐conversion events12 but also complicates the analysis of the whole region.13 Our analysis disclosed 4 individuals presenting CNVs along the GBAGBAP1 region (see Figure 1B). Cas103, which has already been described in Setó‐Salvia et al. (2012),14 is heterozygote for a recombinant deletion known as the Rec‐Ncil allele (where GBAP1 exons 1 to 10 and GBA exons 11 and 12 are deleted). In contrast, Cas211, Cas62, and Cas136 are heterozygotes for 2 different duplications along the same region. Because exons 11 and 12 in the gene and the pseudogene are nearly identical, we could not verify by quantitative PCR the exact limits of the GBAGBAP1 rearrangements detected (Figure 1B). Subsequent PCR and sequencing analysis allowed us to confirm that Cas211 is heterozygote for the reciprocal product of the Rec‐Ncil deleted allele (exon 11 and 12 of the GBA gene and GBAP1 exons 1 to 10 are duplicated), whereas Cas62 and Cas136 are heterozygotes for a duplication affecting the 3′unstranslated region of the GBA gene and most of the GBAP1 pseudogene (see details in Figure S3 and Table S3). In addition to the GBAGBAP1 duplication rearrangement, Cas62 also carries the p.Asp370Ser mutation described to increase risk for late‐onset PD15 and known to be the most common causal mutation for Gaucher's disease in Ashkenazi Jews16 in a heterozygous state (Table S2).

Structural Variants in the DJ1 Gene

Among all analyzed participants, only 1 PD patient (Cas136) carried a heterozygous deletion comprising the whole exon 4 of the DJ1 gene (Figure 1C). As far as we are aware, this alteration has never been reported before.10, 11 Interestingly, the same individual is also heterozygote for 1 duplication in the GBAGBAP1 region. Therefore, Cas136 could be a particular case of a double heterozygote for genomic rearrangements occurring in 2 different PD loci.

Other Functional Mutations and Enrichment of Rare Variation

Besides the detected structural variants, other functional alterations (including several known PD Mendelian mutations) had been previously detected in the same dataset.8 Frequencies and details are presented in Table 1 and Appendix 4 (Supplemental Data). Here, we further validated particular genotypes by Sanger sequencing in some PD cases to confirm genotypes in regions with low coverage depth as well as novel potential causative variants. Notably, we confirmed the identification of 2 stop‐gain mutations, to our knowledge previously unreported, having checked the PDmutDB,10, 11 1000 Genomes Project,17 ExAC database (http://exac.broadinstitute.org/, accessed April 26, 2016) and the dbSNP database18: p.Arg1552Ter in LRRK2, which was found in 1 heterozygote carrier (Cas55), and p.Trp90Ter in PINK1, which was found in homozygosis (Cas154) and could also represent a new causal variant for PD (see Figure S2). In the same dataset, we have previously shown that PD cases displayed significantly higher proportions of rare (minor allele frequency (MAF) <1%) code‐altering (nonsynonymous SNPs, nonsense mutations, and coding indels) variants than controls on Mendelian genes.8 Notably, when performing the same type of collapsing analyses, considering only the exon dosage alterations identified here, PD cases also show significant enrichment of CNV in PD Mendelian genes when compared with controls (P value <.05 in 10 of 12 tests; Figure S4).

Discussion

Our analysis demonstrates the usefulness of NGS for discovering different types of variants with a potential role in human disease. Among the detected inactivating variants, we not only report point mutations and small indels but also different exon dosage variants already known to be involved in PD aetiology. Notably, all predicted CNVs were subsequently confirmed by quantitative PCR, suggesting that the analysis of coverage from resequencing data across multiple individuals provides valuable information for the identification of exon dosage variants. It should be noted, however, that the sensitivity of the XHMM software could not be evaluated with this research design and that our study was focused on a limited set of candidate genes. Thus, an unknown number of CNVs could remain undiscovered in this set of patients. Whereas the analysis of functional SNP variation and indels in Mendelian genes related to PD allowed us to identify putative causative variants for 6 PD cases (Cas211, Cas213, Cas226, Cas113, Cas55, and Cas154), the joint analysis of these inactivating variants, together with the exon dose alterations detected here, probably helps to explain the disease phenotype of 6 additional PD cases (Cas246, Cas20, Cas57, Cas232, Cas148, Cas241) in our Spanish cohort of 249 PD cases (2.4%). Thus, as demonstrated in this dataset, CNVs in the form of exon dose alterations are at least as important as indels and other functional SNP variations in PD Mendelian genes when explaining the phenotype of apparently sporadic PD cases. Given the recognized role of structural variants in several neurodegenerative disorders and because many NGS‐based projects with large numbers of individuals are currently underway to study rare and common variant association, efforts should be made to integrate the analysis of potential CNVs in these new datasets.

Author Roles

1. Research Project: A. Conception, B. Organization, C. Execution; 2. Statistical Analysis: A. Design, B. Execution, C. Review and Critique; 3. Manuscript Preparation: A. Writing the First Draft, B. Review and Critique. N.S.: 1A, 1C, 2A, 2B, 3A A.R‐U.: 1C, 2B, 3A L.C.: 1B, 1C M.V.: 1C R.A.: 1C J.P.: 1B, 3B B.P‐S.: 1B, 3B A.C.: 1B, 3B J.K.: 1B, 3B F.C.: 1C, 3B J.C.: 1A, 1B, 2A, 2C, 3B E.B.: 1A, 1B, 2A, 2C, 3B

Full financial disclosure for the previous 12 months

All authors report nothing to disclose. Additional Supporting Information may be found in the online version of this article at the publisher's web‐site. Supporting Information Click here for additional data file. Supporting Information Click here for additional data file.
  18 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth.

Authors:  Menachem Fromer; Jennifer L Moran; Kimberly Chambert; Eric Banks; Sarah E Bergen; Douglas M Ruderfer; Robert E Handsaker; Steven A McCarroll; Michael C O'Donovan; Michael J Owen; George Kirov; Patrick F Sullivan; Christina M Hultman; Pamela Sklar; Shaun M Purcell
Journal:  Am J Hum Genet       Date:  2012-10-05       Impact factor: 11.025

3.  Modeling read counts for CNV detection in exome sequencing data.

Authors:  Michael I Love; Alena Myšičková; Ruping Sun; Vera Kalscheuer; Martin Vingron; Stefan A Haas
Journal:  Stat Appl Genet Mol Biol       Date:  2011-11-08

4.  The human glucocerebrosidase gene and pseudogene: structure and evolution.

Authors:  M Horowitz; S Wilder; Z Horowitz; O Reiner; T Gelbart; E Beutler
Journal:  Genomics       Date:  1989-01       Impact factor: 5.736

5.  Reciprocal and nonreciprocal recombination at the glucocerebrosidase gene region: implications for complexity in Gaucher disease.

Authors:  Nahid Tayebi; Barbara K Stubblefield; Joseph K Park; Eduard Orvisky; Jamie M Walker; Mary E LaMarca; Ellen Sidransky
Journal:  Am J Hum Genet       Date:  2003-02-13       Impact factor: 11.025

6.  Glucocerebrosidase mutations confer a greater risk of dementia during Parkinson's disease course.

Authors:  Núria Setó-Salvia; Javier Pagonabarraga; Henry Houlden; Berta Pascual-Sedano; Oriol Dols-Icardo; Arianna Tucci; Coro Paisán-Ruiz; Antonia Campolongo; Sofía Antón-Aguirre; Inés Martín; Laia Muñoz; Enric Bufill; Lluïsa Vilageliu; Daniel Grinberg; Mónica Cozar; Rafael Blesa; Alberto Lleó; John Hardy; Jaime Kulisevsky; Jordi Clarimón
Journal:  Mov Disord       Date:  2011-12-15       Impact factor: 10.338

7.  Copy number variation detection and genotyping from exome sequence data.

Authors:  Niklas Krumm; Peter H Sudmant; Arthur Ko; Brian J O'Roak; Maika Malig; Bradley P Coe; Aaron R Quinlan; Deborah A Nickerson; Evan E Eichler
Journal:  Genome Res       Date:  2012-05-14       Impact factor: 9.043

Review 8.  Genetic etiology of Parkinson disease associated with mutations in the SNCA, PARK2, PINK1, PARK7, and LRRK2 genes: a mutation update.

Authors:  Karen Nuytemans; Jessie Theuns; Marc Cruts; Christine Van Broeckhoven
Journal:  Hum Mutat       Date:  2010-07       Impact factor: 4.878

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  cnvScan: a CNV screening and annotation tool to improve the clinical utility of computational CNV prediction from exome sequencing data.

Authors:  Pubudu Saneth Samarakoon; Hanne Sørmo Sorte; Asbjørg Stray-Pedersen; Olaug Kristin Rødningen; Torbjørn Rognes; Robert Lyle
Journal:  BMC Genomics       Date:  2016-01-14       Impact factor: 3.969

View more
  7 in total

Review 1.  The contribution of CNVs to the most common aging-related neurodegenerative diseases.

Authors:  Giulia Gentile; Valentina La Cognata; Sebastiano Cavallaro
Journal:  Aging Clin Exp Res       Date:  2020-02-06       Impact factor: 3.636

2.  Evaluation of the detection of GBA missense mutations and other variants using the Oxford Nanopore MinION.

Authors:  Melissa Leija-Salazar; Fritz J Sedlazeck; Marco Toffoli; Stephen Mullin; Katya Mokretar; Maria Athanasopoulou; Aimee Donald; Reena Sharma; Derralynn Hughes; Anthony H V Schapira; Christos Proukakis
Journal:  Mol Genet Genomic Med       Date:  2019-01-13       Impact factor: 2.183

3.  Assessing the relationship between monoallelic PRKN mutations and Parkinson's risk.

Authors:  Steven J Lubbe; Bernabe I Bustos; Jing Hu; Dimitri Krainc; Theresita Joseph; Jason Hehir; Manuela Tan; Weijia Zhang; Valentina Escott-Price; Nigel M Williams; Cornelis Blauwendraat; Andrew B Singleton; Huw R Morris
Journal:  Hum Mol Genet       Date:  2021-03-25       Impact factor: 6.150

4.  PRKRAP1 and Other Pseudogenes in Movement Disorders: The Troublemakers in Genetic Analyses Are More Than Genomic Fossils.

Authors:  Francesca Magrinelli; Katja Lohmann
Journal:  Mov Disord Clin Pract       Date:  2022-06-23

5.  Comprehensive short and long read sequencing analysis for the Gaucher and Parkinson's disease-associated GBA gene.

Authors:  Marco Toffoli; Xiao Chen; Michael A Eberle; Christos Proukakis; Fritz J Sedlazeck; Chiao-Yin Lee; Stephen Mullin; Abigail Higgins; Sofia Koletsi; Monica Emili Garcia-Segura; Esther Sammler; Sonja W Scholz; Anthony H V Schapira
Journal:  Commun Biol       Date:  2022-07-06

6.  Results of next-generation sequencing gene panel diagnostics including copy-number variation analysis in 810 patients suspected of heritable thoracic aortic disorders.

Authors:  Eline Overwater; Luisa Marsili; Marieke J H Baars; Annette F Baas; Irma van de Beek; Eelco Dulfer; Johanna M van Hagen; Yvonne Hilhorst-Hofstee; Marlies Kempers; Ingrid P Krapels; Leonie A Menke; Judith M A Verhagen; Kak K Yeung; Petra J G Zwijnenburg; Maarten Groenink; Peter van Rijn; Marjan M Weiss; Els Voorhoeve; J Peter van Tintelen; Arjan C Houweling; Alessandra Maugeri
Journal:  Hum Mutat       Date:  2018-07-12       Impact factor: 4.878

7.  A genetic analysis of a Spanish population with early onset Parkinson's disease.

Authors:  Tejera-Parrado Cristina; Mir Pablo; Periñán María Teresa; Vela-Desojo Lydia; Abreu-Rodríguez Irene; Alonso-Cánovas Araceli; Bernal-Bernal Inmaculada; Bonilla-Toribio Marta; Buiza-Rueda Dolores; Catalán-Alonso María José; García-Ramos Rocío; García-Ruiz Pedro José; Huertas-Fernández Ismael; Jesús Silvia; Miguel A-Espinosa Labrador; López-Manzanares Lydia; Martínez-Castrillo Juan Carlos; Ignacio J Posada; Rojo-Sebastián Ana; Ruiz-Huete Cristina; Del Val Javier; Pilar Gómez-Garre
Journal:  PLoS One       Date:  2020-09-01       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.