Literature DB >> 25333066

Diagnosis of autosomal dominant polycystic kidney disease using efficient PKD1 and PKD2 targeted next-generation sequencing.

Daniel Trujillano1, Gemma Bullich2, Stephan Ossowski3, José Ballarín4, Roser Torra4, Xavier Estivill5, Elisabet Ars2.   

Abstract

Molecular diagnostics of autosomal dominant polycystic kidney disease (ADPKD) relies on mutation screening of PKD1 and PKD2, which is complicated by extensive allelic heterogeneity and the presence of six highly homologous sequences of PKD1. To date, specific sequencing of PKD1 requires laborious long-range amplifications. The high cost and long turnaround time of PKD1 and PKD2 mutation analysis using conventional techniques limits its widespread application in clinical settings. We performed targeted next-generation sequencing (NGS) of PKD1 and PKD2. Pooled barcoded DNA patient libraries were enriched by in-solution hybridization with PKD1 and PKD2 capture probes. Bioinformatics analysis was performed using an in-house developed pipeline. We validated the assay in a cohort of 36 patients with previously known PKD1 and PKD2 mutations and five control individuals. Then, we used the same assay and bioinformatics analysis in a discovery cohort of 12 uncharacterized patients. We detected 35 out of 36 known definitely, highly likely, and likely pathogenic mutations in the validation cohort, including two large deletions. In the discovery cohort, we detected 11 different pathogenic mutations in 10 out of 12 patients. This study demonstrates that laborious long-range PCRs of the repeated PKD1 region can be avoided by in-solution enrichment of PKD1 and PKD2 and NGS. This strategy significantly reduces the cost and time for simultaneous PKD1 and PKD2 sequence analysis, facilitating routine genetic diagnostics of ADPKD.

Entities:  

Keywords:  Autosomal dominant polycystic kidney disease; genetic counseling; molecular diagnostics; targeted NGS

Year:  2014        PMID: 25333066      PMCID: PMC4190876          DOI: 10.1002/mgg3.82

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


Introduction

Autosomal dominant polycystic kidney disease (ADPKD; OMIM IDs: 173900; 613095) is the most common inherited kidney disease, with an incidence of 1 in 400–1000 (Iglesias et al. 1983; Dalgaard and Norby 1989). ADPKD is caused by mutations in PKD1 (16p13.3; OMIM ID: 601313) in approximately 85% of the cases (The European Polycystic Kidney Disease Consortium 1994), and in PKD2 (4q21; OMIM ID: 173910) in the remaining 15% (Mochizuki et al. 1996). ADPKD is characterized by the development and progressive enlargement of cysts in the kidneys and other organs, eventually leading to end-stage renal disease (ESRD). The ADPKD phenotype displays a significant variability that is greatly influenced by the affected gene. Thus, PKD1 patients have a median age at ESRD of 58 years compared to 79 years for PKD2 mutated patients (Cornec-Le Gall et al. 2013). Diagnosis of ADPKD is mainly performed by renal imaging such as ultrasonography, computed tomography, or magnetic nuclear resonance (Pei et al. 2009). However, molecular diagnostics is necessary in several situations: (1) when a definite diagnosis is required in young individuals, such as a potential living related donor in an affected family with equivocal imaging data; (2) in patients with a negative family history of ADPKD, because of potential phenotypic overlap with several other kidney cystic diseases; (3) in families affected by early-onset polycystic kidney disease, since in this cases hypomorphic alleles and/or oligogenic inheritance can be involved (Rossetti et al. 2009; Bergmann et al. 2011; Harris and Hopp 2013); and (4) in patients requesting genetic counseling, especially in couples wishing a preimplantation genetic diagnosis (Harris and Rossetti 2010). Approximately 70% of the 5′ genomic region of the PKD1 gene (exons 1–33) is duplicated six times on chromosome 16p within six pseudogenes (PKD1P1 to PKD1P6), which share a 97.7% sequence identity with the genuine gene (Bogdanova et al. 2001; Rossetti et al. 2012). This, together with a high GC content, the presence of many missense variants, the absence of mutation hot spots, and the high allelic heterogeneity of ADPKD, makes the molecular diagnostics of ADPKD challenging. In addition, most mutations are private variants, with a total of 1272 pathogenic PKD1 and 202 pathogenic PKD2 mutations reported to date (March 2014, ADPKD Database [PKDB], http://pkdb.mayo.edu). Thus, genetic diagnosis by conventional techniques of a new ADPKD family requires long-range polymerase chain reaction (LR-PCR) of the repeated region of PKD1 followed by nested PCRs (Rossetti et al. 2002), combined with Sanger sequencing of all 46 PKD1 and 15 PKD2 exons. When pathogenic mutations are not identified by Sanger sequencing, multiplex ligation-dependent probe amplification (MLPA) analysis is also performed to identify potential insertions and deletions. Therefore, there is a demand for more simple and cost-effective molecular approaches that could be used for routine diagnosis, especially now with the coming specific therapies that will require differential genetic diagnosis (Torres and Harris 2006). To address these challenges, we have developed and validated an assay that couples genome partitioning and next-generation sequencing (NGS), to comprehensively perform in one-step mutation screening in PKD1 and PKD2, as an alternative to cumbersome conventional genetic testing methods.

Material and Methods

Subjects

High-quality genomic DNA from 53 unrelated patients was obtained from blood lymphocytes, using standard protocols. The validation cohort included 36 ADPKD patients and five control individuals that had previously undergone conventional genetic diagnosis by Sanger sequencing of all PKD1 and PKD2 exons and, if negative, MLPA was also applied. The discovery cohort consisted of 12 ADPKD consecutive patients received for genetic diagnosis for which no mutations were known. ADPKD diagnosis was based on standard clinical and imaging criteria. Blood samples were obtained from other family members if they were available. All samples were codified and bioinformatics mutation analysis was blindly performed. Signed informed consent was obtained for all participants. This study was approved by the institutional review board.

Capture and multiplexed sequencing of the PKD1 and PKD2 genes

To carry out DNA capture, we designed a custom NimbleGen SeqCap EZ Choice Library (Roche, Inc., Madison, WI) to target the complete genomic sequence of the PKD1 and PKD2 genes, and 1 kb of genomic sequence flanking at the 5′ and 3′ ends of each gene, accounting for 121,322 bp. Our design also included probes to target additional genes related to other human inherited diseases, for a total of 2.1 Mb of captured DNA after removal of repetitive sequences. DNA probes were selected using the most stringent settings for probe design (uniqueness tested by Sequence Search and Alignment by Hashing Algorithm [SSAHA]) (Ning et al. 2001). However, in order to be able to generate capture probes for the duplicated PKD1 regions, we altered the settings for probe design of this specific region to allow probes to have up to 10 close matches in the genome. No probe redundancy was allowed in the final capture design for the rest of target regions. The Browser Extensible Data file of captured regions is available on request to the authors. Libraries were prepared with the TruSeq DNA Sample Preparation Kits (Illumina, Inc., San Diego, CA). Genomic capture from pooled libraries was carried out using NimbleGen SeqCap EZ Library (Roche, Inc.) following User's Guide v3.0 instructions, as previously described (Trujillano et al. 2013). The libraries of the patients of the validation cohort and the five controls were prepared and sequenced together with seven samples of other diseases using the same capture design and enrichment protocol in two pools of 24 samples, for a total of 48 samples multiplexed in two HiSeq 2000 (Illumina, Inc.) lanes to generate 2 × 100 bp paired-end reads. The 12 patients of the discovery cohort were enriched in a single capture reaction and were sequenced in a Miseq (Illumina, Inc.) run to generate 2 × 250 bp paired-end reads.

Bioinformatics analysis and mutation identification and classification

The resulting fastq files were analyzed with an in-house developed pipeline previously described (Trujillano et al. 2013). All the bioinformatics tools used in this study were run using default settings unless stated otherwise. For the patients included in this study, only the sequencing data produced for PKD1 and PKD2 were analyzed, as stated in the signed informed consent. The reference sequences used were NM_001009944.2 for PKD1 and NM_000297.2 for PKD2. In order to identify pathogenic mutations that could cause ADPKD, we applied the following cascade of filtering steps (Walsh et al. 2010): We required all candidate variants on both sequenced DNA strands and to account for ≥20% of total reads at that site in order to filter out spurious variant calls caused by misaligned reads in the duplicated region of PKD1. Common polymorphisms (≥5% in the general population) were discarded by comparison with dbSNP 137, the 1000G, the Exome Variant Server (http://evs.gs.washington.edu), and an in-house exome variant database to filter out both common benign variants and recurrent artifact variant calls, especially in the duplicated PKD1 regions. However, since these databases also contain known disease-associated mutations, all detected variants were compared to gene mutation databases (The Human Gene Mutation Database [HGMD], http://www.hgmd.cf.ac.uk and ADPKD Database [PKDB], http://pkdb.mayo.edu). Mutations that could give rise to premature truncated proteins, that is, stop mutations, exonic deletions/insertions, and large genomic rearrangements were classified as definitely pathogenic. Missense and noncanonical splicing variants were considered a priori Unclassified Sequence Variants (UCV) and their potential pathogenicity was evaluated using an in silico scoring system developed for PKD1 and PKD2 genes as previously described (Rossetti et al. 2007). This scoring system takes into consideration a number of in silico predictors (Grantham 1974; Tavtigian et al. 2006; Rossetti et al. 2007) and population data. We scored each of these factors, the sum of which resulted in an overall Variant Score (VS). The UCV were classified into four groups: highly likely pathogenic (VS ≥ 11); likely pathogenic (5 ≤ VS ≤ 10), indeterminate (0 ≤ VS ≤ 4), and highly likely neutral (VS ≤ −1) (Rossetti et al. 2007). We considered to be pathogenic mutations those sequence variants predicted to result in a truncated protein (classified as definitely pathogenic) and those not found in healthy controls, that segregated with the disease in families and expected to severely alter the protein sequence using in silico predictors (classified as highly likely pathogenic and likely pathogenic variants). If no pathogenic mutations were identified, the bioinformatics pipeline automatically reported the target sequences that presented low or inexistent sequence coverage. These regions were screened by Sanger sequencing since they were more likely to contain the pathogenic variants missed by our NGS approach. Validation of newly identified single-nucleotide variants (SNVs) was performed by Sanger sequencing.

Results

PKD1 and PKD2 enrichment

Eighty-one percent of the PKD1- and PKD2-targeted bases could be covered with capture baits for a final targeted region of 98,524 bp divided into 99 individual regions, with lengths ranging from 65 to 6,493 bp (average of 995 bp) (Table S1). Noteworthy, 100% of all coding sequences, that is, the complete 46 and 15 exons of PKD1 and PKD2, respectively, were covered by capture baits. The target regions that precluded bait tilling correspond only to intronic and intergenic sequences.

Sequencing statistics

In the validation cohort, an evenly distributed mean depth of coverage of 331X and 481X for PKD1 and PKD2 was achieved, respectively, on average across samples (Table 1). We achieved a sequencing depth of 289X for the 46 exons of PKD1 and 453X for the 15 exons of PKD2, on average across samples (Table S2). Ninety-five percent of the coding base pairs of PKD1 and 94% of PKD2 were covered by more than 20 reads, which is enough for an accurate detection of known and novel mutations. Only exons 1 and 42 of PKD1 and exon 1 of PKD2 were not captured and sequenced at an adequate read depth (Fig. 1).
Table 1

Average sequencing quality control and coverage statistics of PKD1 and PKD2 in the validation and discovery cohorts

ValidationDiscovery


CohortAverageSDAverageSD
QC-passed reads14452006.672252761.131303016.25293339.48
Mapped14328976.122236282.411002567.63269009.02
Properly paired14140971.702203337.48780154.25265250.15
PKD1 mean coverage (X)331.1489.2080.6013.60
% PKD1 target bases covered = 0X1.980.353.700.39
% PKD1 target bases covered ≥ 1X98.020.3598.150.20
% PKD1 target bases covered ≥ 20X95.541.9886.692.59
% PKD1 target bases covered ≥ 50X92.404.1365.034.76
% PKD1 target bases covered ≥ 100X84.786.8452.011.65
PKD2 mean coverage (X)480.7387.98174.2228.72
% PKD2 target bases covered = 0X0.360.110.900.20
% PKD2 target bases covered ≥ 1X99.640.1199.550.10
% PKD2 target bases covered ≥ 20X99.190.2098.750.31
% PKD2 target bases covered ≥ 50X98.680.3792.743.46
% PKD2 target bases covered ≥ 100X97.102.2267.518.70
Figure 1

Representation of the average depth of coverage of PKD1 (A) and PKD2 (B) in the validation cohort. Red lines and the numbers underneath represent the exons of the genes. Green lines represent the regions tilled by capture baits.

Average sequencing quality control and coverage statistics of PKD1 and PKD2 in the validation and discovery cohorts Representation of the average depth of coverage of PKD1 (A) and PKD2 (B) in the validation cohort. Red lines and the numbers underneath represent the exons of the genes. Green lines represent the regions tilled by capture baits. Due to the lower throughput of the MiSeq sequencer, the average coverage achieved in the discovery cohort was of 81X and 174X for PKD1 and PKD2, respectively, across the 12 samples (Table 1). For a comprehensive summary of the obtained sequencing results, see also Tables S3, S4.

Detection of PKD1 and PKD2 mutations in the validation cohort

For the validation cohort we selected samples with as many different types of PKD1 and PKD2 mutations as possible, including SNVs, short insertions and deletions (InDels), and large structural variants (SVs). We identified 35 out of 36 previously known different pathogenic mutations (30 in PKD1 and five in PKD2) in their correct heterozygous state (Table 2). These results would have led to a diagnostic rate of 97.2%. Noteworthy, 25 (70%) of these mutations were spread along different exons within the segmentally duplicated regions of the PKD1 gene, highlighting the robustness of our approach even for genes with highly homologous pseudogenes. In addition, two previously known large deletions were correctly detected. Concretely, patient 03-106-P6 presented PKD1 g.2154344-2186386del (Fig. 2A), and patient 11-571-P2 presented PKD2 g.88952828-89050618del (Fig. 2B). For the unique patient with a previously known mutation not identified by our NGS assay, 03-393-P3, manual inspection of the sequence alignment files revealed that the missing p.(Met1fs) was localized in a region of exon 1 of PKD1 for which no NGS reads were available due to problems with the genomic capture.
Table 2

ADPKD mutations in PKD1 and PKD2 identified in the 36 samples of the validation cohort

SampleGeneDuplicated regioncDNA changeProtein changePKDB# PatientsClassificationRef countsVariants counts
03-106-P6PKD1Yesc.1-?_8161+?delp.(Met1fs)Absent0Definitely pathogenic
12-331-P1PKD1Yesc.566C>Gp.(Ser189*)Present1Definitely pathogenic18862
12-382-P1PKD1Yesc.736_737delp.(Ser246fs)Absent0Definitely pathogenic1914
04-016-P6PKD1Yesc.1831C>Tp.(Arg611Trp)Present1Likely pathogenic2513
12-235-P1PKD1Yesc.2329C>Tp.(Gln777*)Absent0Definitely pathogenic5937
12-366-P1PKD1Yesc.2478delCp.(Ile827 fs)Absent0Definitely pathogenic11483
12-010-P1PKD1Yesc.4888C>Tp.(Gln1630*)Present1Definitely pathogenic136114
02-010-P6PKD1Yesc.6583_6589del7p.(Cys2195fs)Present1Definitely pathogenic7479
10-326-P3PKD1Yesc.6778_6780delATTp.(Ile2260del)Present1Highly likely Pathogenic190148
11-220-P2PKD1Yesc.6221delAp.(Asn2074fs)Absent0Definitely Pathogenic123113
11-247-P7PKD1Yesc.6384C>Ap.(Asn2128Lys)Absent0Highly likely pathogenic181131
12-161-P1PKD1Yesc.6586C>Tp.(Gln2196*)Present1Definitely pathogenic8650
11-517-P1PKD1Yesc.6736C>Tp.(Gln2246*)Present1Definitely pathogenic185124
11-525-P2PKD1Yesc.6827T>Cp.(Leu2276Pro)Absent0Highly likely pathogenic280245
10-388-P3PKD1Yesc.8161+1G>Cp.(?)Absent0Definitely pathogenic2125
11-468-P1PKD1Yesc.8251C>Tp.(Gln2751*)Absent0Definitely Pathogenic9292
12-363-P1PKD1Yesc.8285delTp.(Ile2762fs)Absent0Definitely Pathogenic12089
10-463-P3PKD1Yesc.8311G>Ap.(Glu2771Lys)Present18Highly likely pathogenic5863
11-457-P2PKD1Yesc.8858A>Gp.(Asn2953Ser)Absent0Highly likely pathogenic236191
11-287-P2PKD1Yesc.9240_9241delATp.(Ala3082fs)Present3Definitely pathogenic16480
10-193-P3PKD1Yesc.9412G>Ap.(Val3138Met)Present2Likely Pathogenic208188
11-595-P2PKD1Yesc.9455_9456insCp.(Arg3152fs)Absent0Definitely pathogenic283184
07-172-P5PKD1Yesc.9889G>Ap.(Val3297Met)Absent0Likely pathogenic130114
09-403-P3PKD1Yesc.10170+25_+45del19p.(Gln3390fs)Present2Highly likely pathogenic9626
10-182-P3PKD1c.11017-10C>Ap.(Arg3672fs)Present7Highly likely pathogenic10983
12-144-P1PKD1c.10847C>Ap.(Ser3616*)Absent0Definitely Pathogenic224177
10-353-P3PKD1c.11359_11360delp.(Pro3788fs)Absent0Definitely pathogenic292203
11-256-P2PKD1c.11471G>Tp.(Gly3824Val)Absent0Likely pathogenic8563
11-168-P8PKD1c.12004-2A>Gp.(?)Absent0Definitely pathogenic143113
09-446-P3PKD1c.12031C>Tp.(Gln4011*)Present4Definitely pathogenic10685
11-133-P8PKD2c.224delCp.(Pro75fs)Absent0Definitely pathogenic4125
11-008-P3PKD2c.637C>Tp.(Arg213*)Absent0Definitely pathogenic198172
11-170-P2PKD2c.965G>Ap.(Arg322Gln)Present4Highly likely pathogenic396330
12-149-P1PKD2c.2050_2053del4p.(Tyr684fs)Present1Definitely pathogenic212181
11-571-P2PKD2c.709-?_2907+?delp.(Leu237_Val968del)Absent0Definitely pathogenic
09-393-P3

#Patients in previous studies, NM_001009944.2 for PKD1 and NM_000297.2 for PKD2

Figure 2

Detection of large deletions in the PKD1 and PKD2 genes by normalized depth of coverage analysis. Representation of the SVD-ZRPKM values calculated by Conifer for the 36 samples and 5 controls of the validation cohort. Yellow peaks indicate the two large deletions identified in this study. (A) Sample 03-106-P6 PKD1′s deletion (g.2154344-2186386del). (B) Sample 11-571-P2 PKD2/ABCG2′s deletion (g.88952828-89050618del).

ADPKD mutations in PKD1 and PKD2 identified in the 36 samples of the validation cohort #Patients in previous studies, NM_001009944.2 for PKD1 and NM_000297.2 for PKD2 Detection of large deletions in the PKD1 and PKD2 genes by normalized depth of coverage analysis. Representation of the SVD-ZRPKM values calculated by Conifer for the 36 samples and 5 controls of the validation cohort. Yellow peaks indicate the two large deletions identified in this study. (A) Sample 03-106-P6 PKD1′s deletion (g.2154344-2186386del). (B) Sample 11-571-P2 PKD2/ABCG2′s deletion (g.88952828-89050618del). We included in this study five control individuals without personal or family history of ADPKD to determine the clinical specificity of our assay. These controls had been previously genotyped with a HumanOmni 2.5-8 BeadChip (Illumina, Inc.) and were also used to determine the analytic sensitivity of our assay to detect heterozygous and homozygous SNVs. Genotype data were available for a total of 80 and 269 sites within the targeted regions of PKD1 and PKD2, respectively. Sensitivity was of 100% both for PKD1 (20/20) and PKD2 (103/103). Analytic specificity was 100% both for PKD1 (60/60) and PKD2 (166/166) (Table S5). Of note, no spurious pathogenic calls were detected in either the control individuals or the validation cohort.

Identification of PKD1 and PKD2 mutations in the discovery cohort

We detected pathogenic mutations in 10 out of 12 patients carrying a total of 11 different pathogenic mutations (10 in PKD1 and one in PKD2), which lead to a diagnostic rate of 83.3%. All variants were confirmed by Sanger sequencing (Table 3). Interestingly, one patient (12–444) harbored one definitively pathogenic mutation in PKD2 and one highly likely pathogenic mutation in PKD1, presenting a more severe phenotype compared to the rest of the family. For the two samples in which no pathogenic variants were identified with our NGS assay, the bioinformatics pipeline proposed a list of candidate regions with low sequence coverage that were screened by Sanger sequencing and the two causal mutations were identified. Then, by manually inspecting the alignment files of the NGS reads, we realized that we had lost p.(Val2768Met) in patient 13-102 and p.(Arg4021fs) in patient 07-335 because their locations were in poorly covered areas of PKD1 and the variant calling and filtering algorithms had discarded them as potential false positives calls. Noteworthy, no spurious pathogenic calls were reported in any of the samples of the discovery cohort.
Table 3

ADPKD mutations in PKD1 and PKD2 identified in the 12 samples of the discovery cohort

SampleGeneDuplicated regioncDNA changeProtein changePKDB# PatientsClassificationRef countsVariants counts
06-056PKD1Yesc.348_352delTTTAAp.(Asn116fs)Present1Definitely pathogenic2820
06-122PKD1Yesc.7204C>Tp.(Arg2402*)Present2Definitely pathogenic2412
07-032PKD1Yesc.8421_8422insCp.(Ile2808fs)Absent0Definitely pathogenic2218
11-444PKD1Yesc.8041C>Tp.(Arg2681Cys)Absent0Highly likely pathogenic3820
12-444PKD2c.1532_1533insATp.(Asp511fs)Absent0Definitely pathogenic15670
PKD1c.10921C>Tp.(Arg3642Cys)Absent0Highly likely pathogenic4052
12-505PKD1Yesc.50174_5015delAGp.(Arg1672fs)Present28Definitely pathogenic11888
13-199PKD1Yesc.7039delCp.(Arg2347fs)Absent0Definitely pathogenic3432
12-628PKD1Yesc.2180T>Cp.(Leu727Pro)Absent0Highly likely pathogenic208
08-258PKD1Yesc.7925C>Tp.(Arg2639*)Present5Definitely pathogenic2814
10-484PKD1c.12010C>Tp.(Gln4004*)Present4Definitely pathogenic2638
13-102
07-335

# patients in previous studies, NM_001009944.2 for PKD1 and NM_000297.2 for PKD2.

ADPKD mutations in PKD1 and PKD2 identified in the 12 samples of the discovery cohort # patients in previous studies, NM_001009944.2 for PKD1 and NM_000297.2 for PKD2.

Discussion

It has been suggested that the impact that NGS technologies will have on clinical genetics during the upcoming years will be comparable to the introduction of X-rays to medicine many decades ago (Hennekam and Biesecker 2012). After the tremendous impact of NGS technologies in the discovery of disease-causing genes during the last 4 years, we are witnessing the introduction of these technologies for diagnostic applications, with the aim of rapidly revolutionize the field of genetic diagnostics, making it much more cost- and time-effective, advance accuracy, and point to unsuspected yet treatable conditions. The purpose of this study was to develop a cost-effective method for the molecular diagnostics of ADPKD applying targeted NGS. First, we validated the assay in a cohort of 36 previously characterized ADPKD patients in which we detected 35 out of 36 known mutations. Second, we analyzed a discovery cohort of uncharacterized ADPKD patients and we reached a diagnostic rate of 83% (10 out of 12 patients), allowing test reporting 5 days after receiving the DNA samples. Although the size of our cohort is modest, these results are very encouraging since these numbers represent a diagnostic rate comparable to data obtained by Sanger sequencing (Audrezet et al. 2012; Cornec-Le Gall et al. 2013) and NGS (Rossetti et al. 2012). Recently, targeted sequencing by NGS has been used in the identification of mutations in ADPKD. Rossetti et al. (2012) did not apply a capture protocol for PKD1 and PKD2 enrichment since they speculated that the duplicated genomic regions of PKD1 would lead to concurrent capture of the six PKD1 pseudogenes making very difficult the identification of the ADPKD causal variants. Instead, these authors used a strategy of pooling equimolar LR-PCR amplicons and multiplexing barcoded libraries. Their approach showed a high sensitivity, specificity and accuracy, but it is a very laborious task more amenable to characterize large ADPKD populations than for routine genetic diagnosis. Moreover, their approach did not allow detecting large genomic rearrangements. Here, we do not only demonstrate that genome enrichment by in-solution hybridization using an elaborated probe design is an accurate strategy for mutation identification in the duplicated regions and the rest of PKD1 and PKD2, but also that this strategy is ready to substitute LR-PCR-based methods in the routine genetic diagnostics of ADPKD to detect all sorts of sequence variants, including SVs. When we conceived this study, we assumed that it would be extremely difficult to specifically capture the genuine PKD1, that is, there would always be residual enrichment of the six pseudogenes. Therefore, instead of excluding this region from our assay we decided to include unspecific probes to the duplicated region of PKD1 in our capture library. In this regard, the mutation calls are on average lower than the reference calls, most likely due to the pseudogenes background (Tables 2, 3). From our point of view, the critical point of the assay was not the presence of sequencing reads coming from both the genuine PKD1 and its pseudogenes. Instead, the major challenge was to map the reads coming from duplicated regions unambiguously to the genuine PKD1 or to the six pseudogenes. In order to minimize the impact of sequence reads coming from the pseudogenes we allowed mapping to the whole genome, instead of restricting the mapping to the targeted region. Moreover, the length of the millions of overlapping sequencing reads produced in this study (2 × 100 bp and 2 × 250 bp in the validation and discovery cohorts, respectively) in combination with the 300 bp insert sizes in the DNA libraries provided enough sequence specificity for accurate mapping and pseudogene discrimination, allowing us to unambiguously map a large proportion of the sequencing reads to PKD1 (Table 1). Furthermore, we also assume that the alignment algorithm is not 100% reliable and some reads coming from the pseudogenes could have been erroneously aligned to PKD1. In the worst scenario, the accumulation of these misaligned reads could lead to spurious variant calls but, as we have observed, none of these potential false positive variant calls passed the stringent filters of our variant prioritization pipeline in any of the patients of the validation and discovery cohorts neither in the five control samples. The low sequencing coverage obtained for exons 1 and 42 of PKD1 and exon 1 of PKD2, likely due to a high GC content, is the main limitation of this study as all variants that were missed were located in these poorly covered regions highlighting the importance of achieving sufficient depth of coverage for the optimal performance of the assay. However, we think that this can be fixed by rebalancing and adding new and replicate probes hybridizing with these poorly covered regions in the capture design. In the discovery cohort, the low average depth of coverage yielded by the MiSeq (Illumina, Inc.) in some samples was the cause of the lower mutation detection rate of the assay. However, we plan in the future to produce an optimized capture design including only PKD1/PKD2 and a few cystic genes that would help in the differential diagnosis, such as HNF1B (17p12; OMIM ID: 189907) and PKHD1 (6p12.3-p12.2; OMIM ID: 606702). This would significantly reduce the total captured DNA per sample, allowing multiplexing more samples per MiSeq (Illumina, Inc.) run, and to achieve higher depths of coverage (comparable to those obtained for the validation cohort) that will allow more confident variant calling. We estimate that with our NGS-based assay a 60% of cost savings per sample could be achieved, and the whole diagnostics process could be a minimum of five times faster than with the conventional techniques currently used for the genetic diagnostics of ADPKD. In addition, our strategy offers a complete definition of the captured genes, without the need for stepwise testing anymore and having to choose which gene to sequence first, and is capable to detect large genomic rearrangements and deep intronic variants. In the discovery cohort, the complete process of library preparation, genomic enrichment, NGS using a MiSeq (Illumina, Inc.), and bioinformatics analysis was completed in 5 days after reception of the DNA samples. In conclusion, we illustrate here the first successful study using in-solution hybridization enrichment coupled to NGS to detect ADPKD pathogenic mutations, both in the duplicated regions of PKD1 and the rest of PKD1 and PKD2 genes. Our approach is cost- and time-effective, and meets the sensitivity and specificity criteria required for genetic diagnostics, providing NGS experimental and bioinformatics approaches ready to substitute classic molecular tools in routine genetic diagnostics of ADPKD.
  22 in total

1.  SSAHA: a fast search method for large DNA databases.

Authors:  Z Ning; A J Cox; J C Mullikin
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

Review 2.  Molecular diagnostics for autosomal dominant polycystic kidney disease.

Authors:  Peter C Harris; Sandro Rossetti
Journal:  Nat Rev Nephrol       Date:  2010-02-23       Impact factor: 28.314

3.  Type of PKD1 mutation influences renal outcome in ADPKD.

Authors:  Emilie Cornec-Le Gall; Marie-Pierre Audrézet; Jian-Min Chen; Maryvonne Hourmant; Marie-Pascale Morin; Régine Perrichot; Christophe Charasse; Bassem Whebe; Eric Renaudineau; Philippe Jousset; Marie-Paule Guillodo; Anne Grall-Jezequel; Philippe Saliou; Claude Férec; Yannick Le Meur
Journal:  J Am Soc Nephrol       Date:  2013-02-21       Impact factor: 10.121

4.  The mutation, a key determinant of phenotype in ADPKD.

Authors:  Peter C Harris; Katharina Hopp
Journal:  J Am Soc Nephrol       Date:  2013-05-16       Impact factor: 10.121

5.  Identification of gene mutations in autosomal dominant polycystic kidney disease through targeted resequencing.

Authors:  Sandro Rossetti; Katharina Hopp; Robert A Sikkink; Jamie L Sundsbak; Yean Kit Lee; Vickie Kubly; Bruce W Eckloff; Christopher J Ward; Christopher G Winearls; Vicente E Torres; Peter C Harris
Journal:  J Am Soc Nephrol       Date:  2012-03-01       Impact factor: 10.121

6.  Comprehensive statistical study of 452 BRCA1 missense substitutions with classification of eight recurrent substitutions as neutral.

Authors:  S V Tavtigian; A M Deffenbaugh; L Yin; T Judkins; T Scholl; P B Samollow; D de Silva; A Zharkikh; A Thomas
Journal:  J Med Genet       Date:  2005-07-13       Impact factor: 6.318

7.  Mutations in multiple PKD genes may explain early and severe polycystic kidney disease.

Authors:  Carsten Bergmann; Jennifer von Bothmer; Nadina Ortiz Brüchle; Andreas Venghaus; Valeska Frank; Henry Fehrenbach; Tobias Hampel; Lars Pape; Annegret Buske; Jon Jonsson; Nanette Sarioglu; Antónia Santos; Jose Carlos Ferreira; Jan U Becker; Reinhold Cremer; Julia Hoefele; Marcus R Benz; Lutz T Weber; Reinhard Buettner; Klaus Zerres
Journal:  J Am Soc Nephrol       Date:  2011-10-27       Impact factor: 10.121

8.  Unified criteria for ultrasonographic diagnosis of ADPKD.

Authors:  York Pei; James Obaji; Annie Dupuis; Andrew D Paterson; Riccardo Magistroni; Elizabeth Dicks; Patrick Parfrey; Benvon Cramer; Eliecer Coto; Roser Torra; Jose L San Millan; Robert Gibson; Martijn Breuning; Dorien Peters; David Ravine
Journal:  J Am Soc Nephrol       Date:  2008-10-22       Impact factor: 10.121

9.  Comprehensive molecular diagnostics in autosomal dominant polycystic kidney disease.

Authors:  Sandro Rossetti; Mark B Consugar; Arlene B Chapman; Vicente E Torres; Lisa M Guay-Woodford; Jared J Grantham; William M Bennett; Catherine M Meyers; Denise L Walker; Kyongtae Bae; Qin Jean Zhang; Paul A Thompson; J Philip Miller; Peter C Harris
Journal:  J Am Soc Nephrol       Date:  2007-06-20       Impact factor: 10.121

Review 10.  Autosomal dominant polycystic kidney disease in the 1980's.

Authors:  O Z Dalgaard; S Nørby
Journal:  Clin Genet       Date:  1989-11       Impact factor: 4.438

View more
  28 in total

1.  Detection and characterization of mosaicism in autosomal dominant polycystic kidney disease.

Authors:  Katharina Hopp; Emilie Cornec-Le Gall; Sarah R Senum; Iris B A W Te Paske; Sonam Raj; Sravanthi Lavu; Saurabh Baheti; Marie E Edwards; Charles D Madsen; Christina M Heyer; Albert C M Ong; Kyongtae T Bae; Richard Fatica; Theodore I Steinman; Arlene B Chapman; Berenice Gitomer; Ronald D Perrone; Frederic F Rahbari-Oskoui; Vicente E Torres; Peter C Harris
Journal:  Kidney Int       Date:  2019-10-09       Impact factor: 10.612

2.  Whole-genome sequencing overcomes pseudogene homology to diagnose autosomal dominant polycystic kidney disease.

Authors:  Amali C Mallawaarachchi; Yvonne Hort; Mark J Cowley; Mark J McCabe; André Minoche; Marcel E Dinger; John Shine; Timothy J Furlong
Journal:  Eur J Hum Genet       Date:  2016-05-11       Impact factor: 4.246

3.  System analysis of gene mutations and clinical phenotype in Chinese patients with autosomal-dominant polycystic kidney disease.

Authors:  Meiling Jin; Yuansheng Xie; Zhiqiang Chen; Yujie Liao; Zuoxiang Li; Panpan Hu; Yan Qi; Zhiwei Yin; Qinggang Li; Ping Fu; Xiangmei Chen
Journal:  Sci Rep       Date:  2016-10-26       Impact factor: 4.379

Review 4.  Ciliopathies: Genetics in Pediatric Medicine.

Authors:  Machteld M Oud; Ideke J C Lamers; Heleen H Arts
Journal:  J Pediatr Genet       Date:  2016-11-10

5.  Comprehensive genetic testing approach for major inherited kidney diseases, using next-generation sequencing with a custom panel.

Authors:  Takayasu Mori; Kazuyoshi Hosomichi; Motoko Chiga; Shintaro Mandai; Hirofumi Nakaoka; Eisei Sohara; Tomokazu Okado; Tatemitsu Rai; Sei Sasaki; Ituro Inoue; Shinichi Uchida
Journal:  Clin Exp Nephrol       Date:  2016-02-26       Impact factor: 2.801

6.  Prevalence Estimates of Polycystic Kidney and Liver Disease by Population Sequencing.

Authors:  Matthew B Lanktree; Amirreza Haghighi; Elsa Guiard; Ioan-Andrei Iliuta; Xuewen Song; Peter C Harris; Andrew D Paterson; York Pei
Journal:  J Am Soc Nephrol       Date:  2018-08-22       Impact factor: 10.121

7.  Analysis of mutations in six Chinese families with autosomal dominant polycystic kidney disease.

Authors:  Hanlu Wang; Sen Dai; Jianhui Zhang; Yi Li; Yumian Gan; Tao Lu; Yaobin Zhu; Jiabin Wu; Ning Lin; Faqiang Tang; Jiewei Luo
Journal:  Am J Transl Res       Date:  2020-12-15       Impact factor: 4.060

Review 8.  Genetic Complexity of Autosomal Dominant Polycystic Kidney and Liver Diseases.

Authors:  Emilie Cornec-Le Gall; Vicente E Torres; Peter C Harris
Journal:  J Am Soc Nephrol       Date:  2017-10-16       Impact factor: 10.121

Review 9.  Polycystic kidney disease.

Authors:  Carsten Bergmann; Lisa M Guay-Woodford; Peter C Harris; Shigeo Horie; Dorien J M Peters; Vicente E Torres
Journal:  Nat Rev Dis Primers       Date:  2018-12-06       Impact factor: 52.329

Review 10.  Progress in the understanding of polycystic kidney disease.

Authors:  Vicente E Torres; Peter C Harris
Journal:  Nat Rev Nephrol       Date:  2019-02       Impact factor: 28.314

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.