Literature DB >> 34627256

Noninvasive prenatal diagnosis of monogenic disorders based on direct haplotype phasing through targeted linked-read sequencing.

Chao Chen1,2, Min Chen3, Yaping Zhu1,2, Lu Jiang1,2, Jia Li1, Yaoshen Wang1,2, Zhe Lu1,2, Fengyu Guo1,2, Hairong Wang1,4, Zhiyu Peng1, Yun Yang5,6,7, Jun Sun8,9.   

Abstract

BACKGROUND: Though massively parallel sequencing has been widely applied to noninvasive prenatal screen for common trisomy, the clinical use of massively parallel sequencing to noninvasive prenatal diagnose monogenic disorders is limited. This study was to develop a method for directly determining paternal haplotypes for noninvasive prenatal diagnosis of monogenic disorders without requiring proband's samples.
METHODS: The study recruited 40 families at high risk for autosomal recessive diseases. The targeted linked-read sequencing was performed on high molecular weight (HMW) DNA of parents using customized probes designed to capture targeted genes and single-nucleotide polymorphisms (SNPs) distributed within 1Mb flanking region of targeted genes. Plasma DNA from pregnant mothers also underwent targeted sequencing using the same probes to determine fetal haplotypes according to parental haplotypes. The results were further confirmed by invasive prenatal diagnosis.
RESULTS: Seventy-eight parental haplotypes of targeted gene were successfully determined by targeted linked-read sequencing. The predicted fetal inheritance of variant was correctly deduced in 38 families in which the variants had been confirmed by invasive prenatal diagnosis. Two families were determined to be no-call.
CONCLUSIONS: Targeted linked-read sequencing method demonstrated to be an effective means to phase personal haplotype for noninvasive prenatal diagnosis of monogenic disorders.
© 2021. The Author(s).

Entities:  

Keywords:  Direct haplotype phasing; Monogenic disease; Noninvasive prenatal diagnosis; Targeted linked-read sequencing

Mesh:

Year:  2021        PMID: 34627256      PMCID: PMC8502361          DOI: 10.1186/s12920-021-01091-x

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

The discovery of cell-free fetal circulating DNA (cff-DNA) in maternal blood and the rapid advances of massively parallel sequencing (MPS) have provided an unprecedented opportunity to perform the prenatal genetic testing of common fetal aneuploidies and single-gene diseases. Though MPS has been widely applied to screen for fetal trisomy 21, 18 and 13 [1], the clinical use of MPS to diagnose monogenic disorders is limited [2]. Several studies have been conducted to develop noninvasive prenatal diagnosis (NIPD) for monogenic disease using various technologies such as real-time polymerase chain reaction (PCR), amplification at lower denaturation temperature-PCR, digital PCR, circulating single-molecule amplification and resequencing technology [3, 4] and MPS. These studies were confined to exclude paternally inherited [5] and detect de novo variants[6] based on variant-specific assays due to the strong interference of maternal background signal. The relative haplotype dosage approach has been demonstrated to detect parental inherited variants at the same time. Our group has employed a proband-based method for resolving parental haplotypes and successfully applied this method to NIPD of Duchenne muscular dystrophy (DMD) [7], congenital adrenal hyperplasia (CAH) [8], maple syrup urine disease (MSUD) [9], hyperphenylalaninemia [10] and spinal muscular atrophy (SMA) [11]. This phasing information makes it possible to measure the haplotype dosage imbalance in maternal plasma DNA. The advantage of relative haplotype dosage approach is that analysis is independent of variant types. While, the method needs proband’s samples to phase parental haplotypes, which hampers the application of NIPD to monogenic diseases in clinical practice. The haplotype phasing is a critical step for haplotype-based NIPD of monogenic disorders. Serval studies have reported specific haplotype building methods such as clone pool dilution sequencing [12], contiguity-preserving transposition sequencing [13], targeted locus amplification (TLA) [14], HaploSeq [15] and long fragment read (LFR) technology [16]. These approaches need complex experimental operations and are time consuming and associated with a low success rate. These limitations can be problematic for identifying single gene disorders. Population data-based personal haplotype phasing overcomes the above drawbacks. The population-based method is based on reference population with genotyping data of unrelated individuals and the accuracy of NIPD is only 80%, which is lower than the experimental methods [17]. In order to further improve the success rate and accuracy of haplotype phasing, microfluidics-based linked-read sequencing technology and TLA-based phasing were utilized to phase parental DNA directly [18, 19]. The former approach combined the whole-genome sequencing (WGS) and linked-read sequencing method and succeeded in predicting fetal inherited variants in 12 of 13 pregnancies. The informative sequencing depth (40x) of WGS and the expensive experimental reagents restricted its clinical practice for NIPD [18]. Targeted TLA-based phasing approach is also subject to the complex acquisition of TLA template and customized target kit for NIPD which is inconvenient. A customized probe which covers dozens of common single gene disorders in China is used for haplotype-based NIPD. Therefore, we speculated that the linked-read sequencing combined with targeted sequencing using the above probes would expand the list of single gene disorders and reduce the cost compared with the whole-genome sequencing. In our previous study, we demonstrated direct haplotyping of NIPD based on linked‐read sequencing is accurate for the prediction of fetal pathogenic variants of DMD [20]. The objectives of this study are to investigate the feasibility and accuracy of targeted linked-read sequencing in six different types of autosomal recessive diseases. We analyzed 40 families at high risk for six kinds of autosomal recessive diseases and showed that direct haplotype phasing of parental high molecular weight (HMW) DNA is feasible using targeted linked-read sequencing of target genes. Targeted sequencing of maternal plasma DNA combined with the parental haplotype information were interpreted to determine the inherited variants in fetus. Our approach might be a cost-effective and applicable method for NIPD of autosomal recessive monogenic disorders in clinical settings.

Methods

Sample collection

We recruited 40 families at high risk for autosomal recessive diseases, including 13 methylmalonic acidemia (MMA) families, 12 β-thalassemia families, 8 phenylketonuria (PKU) families, 5 α-thalassemia families, 1 autosomal recessive polycystic kidney disease (ARPKD) family and 1 autosomal recessive deafness-1A (DFNB1A) family caused by pathogenic variants of GJB2 gene. The variants have been identified in all families (Table 1). All participants provided written informed consent to join in the study. The ethics committee of the participating hospitals and the Institutional Review Board of BGI approved the conduct of this study (BGI-IRB No 17080-T1).
Table 1

Clinical information of the participating families

FamilyDiseaseGeneGenotypes of the TriosGAFF (%)
MatPatFetus (Mat/Pat)
F01β-thalassemiaHBBc.316-197C>T /Nc.-78A>G/NN/N12+49.3
F02β-thalassemiaHBBc.126_129delCTTT/Nc.126_129delCTTT/NN/N20+515.9
F03β-thalassemiaHBBc.126_129delCTTT/Nc.-78A>G/Nc.126_129delCTTT/c.-78A>G12+315.4
F04β-thalassemiaHBBc.316-197C>T /Nc.126_129delCTTT/NN/c.126_129delCTTT1812.1
F05β-thalassemiaHBBc.126_129delCTTT/Nc.316-197C>T /Nc.126_129delCTTT/N13+620.6
F06β-thalassemiaHBBc.216_217insA/T/Nc.126_129delCTTT/Nc.216_217insA/T/ c.126_129delCTTT13+226.8
F07β-thalassemiaHBBc.79G>A/Nc.126_129delCTTT/NN/c.126_129delCTTT11+312.3
F08β-thalassemiaHBBc.126_129delCTTT/Nc.316-197C>T/Nc.126_129delCTTT/N12+316.5
F09β-thalassemiaHBBc.52A>T/Nc.84_85insC/Nc.52A>T/N12+127.7
F10β-thalassemiaHBBc.126_129delCTTT/Nc.79G>A/Nc.126_129delCTTT/c.79G>A11+117.7
F11β-thalassemiaHBBc.126_129delCTTT/Nc.126_129delCTTT/Nc.126_129delCTTT/c.126_129delCTTT178.1
F12β-thalassemiaHBBc.126_129delCTTT/Nc.126_129delCTTT/NN/c.126_129delCTTT179.7
F13α-thalassemiaHBA- -SEA/N- -SEA/N- -SEA/- -SEA13+315.7
F14α-thalassemiaHBA- -SEA/N- -SEA/NN/N11+613.7
F15α-thalassemiaHBA- -SEA/N- -SEA/NN/- -SEA12+417.5
F16α-thalassemiaHBA- -SEA/N- -SEA/N- -SEA/- -SEA11+323.5
F17α-thalassemiaHBA- -SEA/Nc.369C>G/N- -SEA/c.369C>G186.7
F18MMAMMACHCc.609G>A/Nc.609G>A/Nc.609G>A/N1916.5
F19MMAMMACHCc.656-658delAGA/Nc.609G>A/NN/c.609G>A1814.2
F20MMAMMACHCc.609G>A/Nc.656-658delAGA/NN/N1612.8
F21MMAMMACHCc.656-658delAGA/Nc.609G>A/NN/N1710.4
F22MMAMMACHCc.80A>G/Nc.609G>A/Nc.80A>G/N1710.2
F23MMAMMACHCc.609G>A/Nc.441TG[2]/Nc.609G>A/c.441TG[2]1710.1
F24MMAMMACHCc.609G>A/Nc.609G>A/NN/N1817.8
F25MMAMMACHCc.80A>G/Nc.609G>A/NN/N1713.7
F26MMAMMACHCc.609G>A/Nc.658-660delAAG/Nc.609G>A/c.658-660delAAG179.8
F27MMAMMACHCc.609G>A/Nc.445-446delTG/NN/N1710.4
F28MMAMMACHCc.482G>A/Nc.445-446delTG/NN/N178.2
F29MMAMMACHCc.315C>G/Nc.609G>A/Nc.315C>G/N166.5
F30MMAMMACHCc.609G>A/Nc.609G>A/NN/N17+58.0
F31PKUPAHc.1197A>T/Nc.764T>C/Nc.1197A>T/c.764T>C187.3
F32PKUPAHc.992T>C/Nc.770G>T/NN/c.770G>T177.5
F33PKUPAHc.1045T>G/Nc.728G>A/NN/N1811.3
F34PKUPAHc.728G>A/Nc.611A>G/NN/N205.9
F35PKUPAHc.977G>A/Nc.1238G>C/Nc.977G>A/N1721.2
F36PKUPAHc.473G>A/Nc.208_210delTCTc.473G>A/c.208_210delTCT1812.8
F37PKUPAHc.1223G>A/Nc.727C>T/NN/N128.5
F38PKUPAHc.728G>A/Nc.721C>T/Nc.728G>A/c.721C>T127.2
F39ARPKDPKHD1c.11042T>G/Nc.5137G>T /NN/c.5137G>T12+615.0
F40DFNB1AGJB2c.235delC/Nc.299-300delAT/Nc.235delC/N13+115.3

FF fetal fraction, GA gestational age, N Normal, PKU phenylketonuria, MMA methylmalonic academia, ARPKD autosomal recessive polycystic kidney disease, DFNB1A autosomal recessive deafness-1A

Clinical information of the participating families FF fetal fraction, GA gestational age, N Normal, PKU phenylketonuria, MMA methylmalonic academia, ARPKD autosomal recessive polycystic kidney disease, DFNB1A autosomal recessive deafness-1A

Target capture probe design

The targeted enrichment of DNA libraries was performed according to the custom-designed SeqCap EZ Choice Library (NimbleGen, Roche) protocol. The capture probes (NimbleGen, Roche) targeting the whole genes of HBB, HBA1, HBA2, and highly heterozygous SNPs within 1Mb flanking region of target genes were designed for NIPD of β-thalassemia and α-thalassemia. Another set of target capture probe was designed to cover the coding region and SNPs within 1Mb upstream and downstream regions of the interested genes, including MMACHC (MMA), PAH (PKU), PKHD1 (ARPKD) and GJB2 (DFNB1A).

Targeted linked-read sequencing

HMW genomic DNA (gDNA) was extracted from stored blood using the Mag Attract HMW Kit (Qiagen, Germany). The size of HMW gDNA should be more than 50kb according to the pulse electrophoresis results. Then gDNA was processed with Chromium™ Genome v2 libraries (10x Genomics, USA). Long gDNA strands were partitioned in barcoded gel beads through a microfluidic device. Barcoded oligonucleotides in a gel bead bind randomly onto the long molecules and generate short fragments with the same barcode. The chance that two molecules were covering the same genomic locus on each gel bead is low, and the short fragments with the same barcode were considered to come from the same long molecule. Libraries of the barcoded fragments were prepared and captured using the customized probe. The prepared DNA library was then sequenced using an Illumina HiSeq2500 sequencer with a paired-end format of 101 bp or 150 bp.

Variant calling and direct haplotype phasing

The barcoded libraries read were then processed with the Long Ranger pipeline (v.2.2.2) provided by 10x Genomics [21]. Reads associated with valid barcodes were aligned against the human genome 19 (Hg19) by using the Burrows-Wheeler Aligner (BWA) software [22]. Output files annotated with barcode and phasing information were generated and served as the reference haplotypes of the family for downstream analysis. The maternal plasma DNA sequencing reads were aligned against the reference hg19 using BWA. After duplicated reads were marked by the Picard Mark Duplicates tool, the GATK tools were applied to perform local realignment and base quality score recalibration [23]. The free Long Ranger (v.2.2.2) software was utilized to determine the parental haplotype in the interested region. Barcode information provides the clue to associate short reads to the original long input molecules. Variant-linked haplotype referred to those reads whose barcodes were consistent with the ones with variant alleles. In contrast, wild-linked haplotype denoted the reads carrying same barcode with the ones with wild-type alleles. The different haplotype blocks were linked with identified SNPs using the overlapping region. SNPs associated with the same haplotypes carrying the wild-type and variant alleles were used for the maternal plasma DNA analysis.

The estimation of fetal fraction and NIPD of monogenic disorders

The evaluation of fetal fraction could be conducted according to the procedure reported before [8]. The haplotype related to variant and wild alleles was constructed based on targeted linked-read sequencing. The informative SNPs that were heterozygous in the mother but homozygous in the father were analyzed for maternal inheritance. On the contrary, the paternal inheritance analysis followed the opposite strategy with maternal inheritance analysis. We used hidden Markov model (HMM) to predict the most likely inherited haplotype using our previously reported algorithm [24]. The probabilities that the fetus inherited the pathogenic and non-pathogenic alleles were evaluated using the number of reads in maternal plasma and then considered as the HMM emission probabilities. The genetic map from the National Center for Biotechnology Information provided the genetic position of the SNPs in centimorgan (cM) and recombination rates between SNPs, these probabilities were regarded as HMM transition probabilities. Lastly, the Viterbi algorithm was utilized to predict the inherited haplotype in the fetus.

Validation of NIPD

The samples obtained through invasive procedures including chorionic villus sampling (CVS) and amniocentesis were used for prenatal genetic diagnosis. After DNA extraction, Sanger sequencing, gap-PCR and reverse dot blot PCR for target variations were performed in a blind manner to further validate the accuracy of NIPD.

Results

Clinical information of the monogenic families

40 families at high risk for autosomal recessive diseases, including 13 MMA families, 12 β-thalassemia families, 8 PKU families, 5 α-thalassemia families, 1 ARPKD family and 1 DFNB1A family caused by pathogenic variants of GJB2 gene were recruited. The gestational age (GA) of 40 pregnant women varied from 11+1 to 20+5 weeks, with a median GA of 15.5 weeks. The clinical information, variant loci and variant status of the 40 families are presented in Table 1.

Targeted linked-reads sequencing

Targeted sequencing on the interested gene region was performed in plasma DNA samples from 40 pregnant women at different gestational weeks. The fetal fraction varied from 5.9 to 27.7%, with a mean fetal fraction of 13.2%, showing significant differences between individuals (Table 1). The targeted sequencing of gDNA samples showed the coverage was relatively consistent in the targeted genes, with a mean read depth of 402× (Additional file 1: Table S1). After data pre-processing and alignment, over 98% of the linked-reads were aligned to the hg19, an average of 50% of the bases were on-target (Additional file 1: Table S1). The summary statistics of alignment are presented in detail in Additional file 1: Table S1.

Direct haplotype phasing

The 10x genomics barcoding technology allowed us to obtain long-range information by linking the short sequencing reads produced. There were two haplotypes, the pathogenic haplotype (P) and normal haplotype (N). The former referred to the reads whose alleles or barcodes were in consistence with variant-supporting reads at heterozygous SNP positions. While the latter represented those reads whose alleles were opposite to the variant-supporting reads at heterozygous SNP positions. The two haplotypes of were directly determined by linking the haplotype blocks assembled by the barcoded reads for all parental gDNA. N50 phase-block length represents the contiguity achieved in the experimental haplotyping, the average length of N50 phase-block was 1Mb (range 413.04 kb~3.54 Mb). N50 phase block, phase block across the target region and longest phase block for the 40 families is depicted in Table 2 and Additional file 1: Table S1. The number of SNPs in the phase blocks used for phasing ranged from 3 to 2418 SNPs, with a mean of 1006 (Table 2). All variants carried by family members were initially detected by the targeted linked-read sequencing and verified to be concordant with those from the MPS data. The paternal haplotypes phasing of F27 and F36 failed, because the haplotype block cannot cover the pathogenic variants. Therefore, the NIPD analysis is not required for failed phasing individuals (pF27 and pF36).
Table 2

Parental haplotypes phasing data

FamilySampleGenePhase block across target regionPhasing block size(kb)No. of SNPs across the block
F01matHBBchr11:4249489-62389601989.52367
patHBBchr11:4269280-57617971492.51469
F02matHBBchr11:4366798-62463831879.61803
patHBBchr11:4366798-62375651870.81655
F03matHBBchr11:4249238-58845951635.41716
patHBBchr11:4346064-61212711775.21972
F04matHBBchr11:4587676-62439821656.32308
patHBBchr11:4905140-62163041311.21644
F05matHBBchr11:5192535-5900085707.6955
patHBBchr11:4249095-54504931201.41359
F06matHBBchr11:4852009-5555972704.0741
patHBBchr11:5196669-6082903886.21541
F07matHBBchr11:4697080-62393441542.31810
patHBBchr11:4306665-62460511939.42043
F08matHBBchr11:4936613-61161421179.51544
patHBBchr11:4249126-57719151522.81369
F09matHBBchr11:4436676-62393441802.71681
patHBBchr11:4249163-60903721841.22247
F10matHBBchr11:4249271-62375651988.31666
patHBBchr11:4249031-60378031788.81733
F11matHBBchr11:4345701-56471661301.51202
patHBBchr11:4389404-57192511329.81450
F12matHBBchr11:4249095-62393441990.22301
patHBBchr11:4387760-61214281733.72418
F13matHBAchr16:60185-679412619.2284
patHBAchr16:60185-12256281165.4937
F14matHBAchr16:186950-12169971030.0606
patHBAchr16:132246-612607480.4251
F15matHBAchr16:94080-12251841131.1899
patHBAchr16:74039-11976121123.6687
F16matHBAchr16:79811-12237221143.9883
patHBAchr16:60185-460830400.6339
F17matHBAchr16:60185-11926201132.41045
patHBAchr16:60291-12251841164.91010
F18matMMACHCchr1: 44966837-469521641985.31599
patMMACHCchr1: 44972309-469729582000.6926
F19matMMACHCchr1: 45513754-469734541459.7440
patMMACHCchr1: 44979498-469758771996.3831
F20matMMACHCchr1: 45767431-46206444439.0119
patMMACHCchr1: 45386861-465032171116.3247
F21matMMACHCchr1: 45765523-469752941209.8457
patMMACHCchr1:44967323-469758772008.6812
F22matMMACHCchr1:45762749-46722939960.2445
patMMACHCchr1:45701916-46097939396.0161
F23matMMACHCchr1:45738336-469754501237.1729
patMMACHCchr1:44967431-469758772008.41228
F24matMMACHCchr1:45947353-46095125147.827
patMMACHCchr1:45775550-46605728830.2609
F25matMMACHCchr1:45765523-46053981288.5156
patMMACHCchr1:45765523-45982693217.241
F26matMMACHCchr1:45767431-469758771208.4691
patMMACHCchr1:45762749-469758771213.1684
F27matMMACHCchr1:45683746-46645681961.9572
patMMACHCchr1:45962137-4597440712.33
F28matMMACHCchr1:44967323-459745201007.2595
patMMACHCchr1:44967323-466912451723.91149
F29matMMACHCchr1:45640368-469758771335.5599
patMMACHCchr1:44973546-469758772002.31185
F30matMMACHCchr1:44967825-469758772008.11082
patMMACHCchr1:45683419-469245631241.1685
F31matPAHchr12:103214192-104013534799.3301
patPAHchr12:102252463-1042253031972.81299
F32matPAHchr12:102241500-1043095592068.11300
patPAHchr12:102240964-1042613742020.41094
F33matPAHchr12:102240964-1032764411035.5555
patPAHchr12:102241500-1041738801932.41048
F34matPAHchr12:102618568-1043097121691.11095
patPAHchr12:102728895-1042721131543.2696
F35matPAHchr12:102894838-103267467372.6136
patPAHchr12:103075411-1043093831234.01069
F36matPAHchr12:102248565-1042757212027.21189
patPAHchr12:103105959-103274915169.0106
F37matPAHchr12:102321986-1037912201469.2984
patPAHchr12:102710699-1043004411589.71062
F38matPAHchr12:102240964-1036238551382.9619
patPAHchr12:102240964-1043047052063.71246
F39matPKHD1chr6:50968947-529500471981.11347
patPKHD1chr6:50982112-529055921923.5985
F40matGJB2chr13:20687773-20802900115.193
patGJB2chr13:20676993-21122165445.2279
Parental haplotypes phasing data

Noninvasive prenatal diagnosis

As shown in the NIPD flowchart (Figure 1), maternal and paternal haplotypes were first established using target-region sequencing data and the HMM and Viterbi algorithm was then applied to predict fetal haplotypes. Our goal was to precisely infer the fetal genotypes at pathogenic sites, not to correctly infer the haplotypes of all SNP markers flanking the target gene. Therefore, the specific rules [25] were set to determine the fetal genotype at the pathogenic site after obtaining the optimal path of the fetal haplotype block via the Viterbi algorithm. If the path contains only one halotype block (pathogenic or normal) and the block spans the target gene, the fetal genotype at the pathogenic site is the state of the haplotype block that spans the target gene. If the path contains two haplotype blocks (pathogenic and normal) and only one haplotype block spans the target gene, the fetal genotype at the pathogenic site is the state of the haplotype block that spans the target gene (for example, mF04 and mF06). If two haplotype block (pathogenic and normal) exists inside the target gene, the fetal genotype at the pathogenic site is determined as no-call (for example, mF36). A confidence score (CS) [25] was introduced into our algorithm to quantify the probability of obtaining the correct results for NIPD. The CS was calculated using the fetal fraction, sequencing depth of maternal plasma and number of parental informative SNPs as inputs for computational simulation. The detailed method can be referred to the published literature [25]. The condition that the CS was less than 0.99 was defined as no-call.
Fig. 1.

The flow charts of targeted linked-read sequencing and proband-based indirect phasing. a Parental genotypes and haplotype determination, prediction of fetal haplotype and noninvasive prenatal diagnosis of monogenic diseases using the targeted linked-read sequencing method. b Parental and proband's genotype and haplotype determination, prediction of fetal haplotype and noninvasive prenatal diagnosis of monogenic diseases using the proband-based indirect phasing method

The flow charts of targeted linked-read sequencing and proband-based indirect phasing. a Parental genotypes and haplotype determination, prediction of fetal haplotype and noninvasive prenatal diagnosis of monogenic diseases using the targeted linked-read sequencing method. b Parental and proband's genotype and haplotype determination, prediction of fetal haplotype and noninvasive prenatal diagnosis of monogenic diseases using the proband-based indirect phasing method The NIPD results exhibited that 38 fetuses had both alleles detected; of these 38 fetuses, 11 were affected, 15 were carriers and 12 were normal. (Table 3, Additional file 2: Figure S1, Additional file 3: Figure S2 and Additional file 4: Figure S3). For F27, only one normal haplotype inherited from mother can be inferred by NIPD. For F36, we cannot predict fetal haplotypes inherited from parents.
Table 3

The NIPD results

FamilyGeneNo. of Maternal Informative SNPsNo. of Paternal Informative SNPsCSmat (%)CSpat (%)NIPD (mat/pat)Invasive prenatal diagnosis (mat/pat)
F01HBB1260305100100N/NN/N
F02HBB1073607100100N/NN/N
F03HBB521566100100c.126_129delCTTT/c.-78A>Gc.126_129delCTTT/c.-78A>G
F04HBB394317100100N/c.126_129delCTTTN/c.126_129delCTTT
F05HBB255555100100c.126_129delCTTT/Nc.126_129delCTTT/N
F06HBB268453100100c.216_217insA/T/ c.126_129delCTTTc.216_217insA/T/ c.126_129delCTTT
F07HBB697695100100N/c.126_129delCTTTN/c.126_129delCTTT
F08HBB636442100100c.126_129delCTTT/Nc.126_129delCTTT/N
F09HBB669553100100c.52A>T/Nc.52A>T/N
F10HBB908594100100c.126_129delCTTT/c.79G>Ac.126_129delCTTT/c.79G>A
F11HBB603380100100c.126_129delCTTT/c.126_129delCTTTc.126_129delCTTT/c.126_129delCTTT
F12HBB1029550100100N/c.126_129delCTTTN/c.126_129delCTTT
F13HBA5318100100- -SEA/- -SEA- -SEA/- -SEA
F14HBA23552100100N/NN/N
F15HBA11884100100N/- -SEAN/- -SEA
F16HBA19378100100- -SEA/- -SEA- -SEA/- -SEA
F17HBA361140100100- -SEA/c.369C>G- -SEA/c.369C>G
F18MMACHC775228100100c.609G>A/Nc.609G>A/N
F19MMACHC298424100100N/c.609G>AN/c.609G>A
F20MMACHC97175100100N/NN/N
F21MMACHC348361100100N/NN/N
F22MMACHC28549100100c.80A>G/Nc.80A>G/N
F23MMACHC531300100100c.609G>A/c.441TG[2]c.609G>A/c.441TG[2]
F24MMACHC15420100100N/NN/N
F25MMACHC797100100N/NN/N
F26MMACHC492107100100c.609G>A/c.658-660delAAGc.609G>A/c.658-660delAAG
F27MMACHC353NA100NAN/NAN/N
F28MMACHC474457100100N/NN/N
F29MMACHC319469100100c.315C>G/Nc.315C>G/N
F30MMACHC77642100100N/NN/N
F31PAH69321100100c.1197A>T/c.764T>Cc.1197A>T/c.764T>C
F32PAH362185100100N/c.770G>TN/c.770G>T
F33PAH161147100100N/NN/N
F34PAH26295100100N/NN/N
F35PAH13174100100c.977G>A/Nc.977G>A/N
F36PAH188NA100NANC [*]/NAc.473G>A/c.208_210delTCT
F37PAH261164100100N/NN/N
F38PAH406561100100c.728G>A/c.721C>Tc.728G>A/c.721C>T
F39PKHD1971267100100N/c.5137G>TN/c.5137G>T
F40GJB22953100100c.235delC/Nc.235delC/N

*NC no-call, NA not applicable, No. number, CS confidence score for fetal inheritance from maternal haplotype, CS confidence score for fetal inheritance from paternal haplotype

The NIPD results *NC no-call, NA not applicable, No. number, CS confidence score for fetal inheritance from maternal haplotype, CS confidence score for fetal inheritance from paternal haplotype The fetal genotypes inferred by NIPD were compared with direct sequencing results of fetal gDNA extracted from CVS or amniotic fluid cells to further validate the accuracy of NIPD. The results of NIPD were in concordant with invasive diagnosis and the standard genotype of captured sequencing (Table 3).

Discussion

In our study, we applied the targeted linked-read sequencing method to resolve the parental haplotypes across a range of disease loci and successfully determined the fetal genotypes in 38 families, at risk for various single gene diseases. The previous method of NIPD needs the input of the genomics data of an affected family member and involves complex computational resources for indirectly phasing proband-based haplotype. As compared to the previous NIPD method, our targeted linked-read sequencing method may show certain advantages. Either genomics data from a proband or other family members may not be obligatory for deducing fetal variant status, or an additional capture probe. The new method may in particularly benefit the first pregnancy for those women carrying disease variants, due to lack of genomics information from other affected family members. In recent years, several studies have utilized the direct haplotyping method to perform NIPD of single gene disorders [18, 23]. Hui et al conducted whole genome haplotyping method and resolved the parental haplotypes with the use of linked-read sequencing technology. They correctly deduced the fetal variant profiles in 12 out of 13 families at risk for a number of autosomal and X-linked diseases. However, the cost of whole genome haplotyping method is relatively high, which might limit its wide use in clinical settings. Vermeulen et al established the targeted locus amplification approach and phased heterozygous variants in selected genes, the method reduced the cost of whole genome haplotyping method and predicted fetal variant status with a high accuracy. Michael Parks utilized targeted capture enrichment of SNPs across a 6 Mb genomic window on chromosome 5 containing the SMN1 gene and successfully deduced fetal variants by relative haplotype dosage with 100% accuracy [11]. However, customizing the targeted region might be a complex task, due to population frequency difference of SNPs across different ethnicities [26]. Our method is advantageous to the above-mentioned 2 direct phasing methods with respect to the cost-effectiveness and recombination prediction. The current NIPD practically requires maternal, paternal DNA and proband’s DNA samples, therefore, the cost of the current proband-dependent method is approximately $830. The major advantage of our method is that it bypassed the availability of the proband’s DNA which considerably reduced the cost to $700. Moreover, multiplexing of a barcoded library further reduces the cost of linked-read sequencing. The turnaround time of linked-read sequencing is 3 weeks, that is more time-consuming than that of the proband-based method but is still affordable for noninvasive prenatal diagnosis. One potential application of our method is NIPD of cystic fibrosis variants which are more relevant to other ethnicity. As demonstrated in this study, the capture probes should cover the whole CF transmembrane regulator (CFTR) gene and highly heterozygous SNPs within 1Mb flanking region of CFTR. With reduced cost, the targeted linked-read sequencing method is capable of NIPD of a wide range of monogenic disorders independently of proband sample. Despite the advantages as mentioned above, our method still has certain limitations. First, the average percentage of bases on target is approximately 50%, the low on-target rate is a potential limitation of this linked-read target sequencing and may increase the sequencing cost. However, as compared to two other studies, in which the authors reported mean on-target rates of 30.7% and 32% [7, 19], our linked-read target sequencing outperformed the previously published methods. Second, the design of target region and capture probe is critical to successfully conduct targeted linked-read sequencing. There is no existent recommended guideline on the design of capture probes. Additionally, it’s essential to evaluate recombination hot spots surrounding the target region and include the results in the recombination adjustment [27]. Given the clinical applicability of linked-read sequencing hasn’t fully characterized, more researches are required to validate the readiness and effectiveness of this technique in the future.

Conclusions

In summary, we have provided solid evidence that targeted linked-read sequencing method could be applied to the noninvasive assessment of a variety of fetal single gene diseases. The method is a cost-effective and could be widely adopted in clinical practice. Additional file 1: Table S1. Summary statistics of alignment Additional file 2: Figure S1. The NIPD results of α-thalassemia and β-thalassemia Additional file 3: Figure S2. The NIPD results of MMA Additional file 4: Figure S3. The NIPD results of PKU, ARPKD and DFNB1A
  27 in total

1.  Noninvasive prenatal diagnosis of monogenic diseases by targeted massively parallel sequencing of maternal plasma: application to β-thalassemia.

Authors:  Kwan-Wood G Lam; Peiyong Jiang; Gary J W Liao; K C Allen Chan; Tak Y Leung; Rossa W K Chiu; Y M Dennis Lo
Journal:  Clin Chem       Date:  2012-08-15       Impact factor: 8.327

2.  Noninvasive prenatal diagnosis of duchenne muscular dystrophy: comprehensive genetic diagnosis in carrier, proband, and fetus.

Authors:  Seong-Keun Yoo; Byung Chan Lim; Jiyoung Byeun; Hee Hwang; Ki Joong Kim; Yong Seung Hwang; JoonHo Lee; Joong Shin Park; Yong-Sun Lee; Junghyun Namkung; Jungsun Park; Seungbok Lee; Jong-Yeon Shin; Jeong-Sun Seo; Jong-Il Kim; Jong Hee Chae
Journal:  Clin Chem       Date:  2015-04-06       Impact factor: 8.327

Review 3.  Haplotype phasing: existing methods and new developments.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Nat Rev Genet       Date:  2011-09-16       Impact factor: 53.242

4.  Whole-genome haplotype reconstruction using proximity-ligation and shotgun sequencing.

Authors:  Siddarth Selvaraj; Jesse R Dixon; Vikas Bansal; Bing Ren
Journal:  Nat Biotechnol       Date:  2013-11-03       Impact factor: 54.908

5.  Haplotype-based approach for noninvasive prenatal tests of Duchenne muscular dystrophy using cell-free fetal DNA in maternal plasma.

Authors:  Yan Xu; Xuchao Li; Hui-Juan Ge; Bing Xiao; Yan-Yan Zhang; Xiao-Min Ying; Xiao-Yu Pan; Lei Wang; Wei-Wei Xie; Lin Ni; Sheng-Pei Chen; Wen-Ting Jiang; Ping Liu; Hui Ye; Ying Cao; Jing-Min Zhang; Yu Liu; Zu-Jing Yang; Ying-Wei Chen; Fang Chen; Hui Jiang; Xing Ji
Journal:  Genet Med       Date:  2015-02-05       Impact factor: 8.822

6.  Non-invasive prenatal diagnosis for cystic fibrosis: detection of paternal mutations, exploration of patient preferences and cost analysis.

Authors:  Melissa Hill; Philip Twiss; Talitha I Verhoef; Suzanne Drury; Fiona McKay; Sarah Mason; Lucy Jenkins; Stephen Morris; Lyn S Chitty
Journal:  Prenat Diagn       Date:  2015-04-05       Impact factor: 3.050

7.  Haplotype-based Noninvasive Prenatal Diagnosis of Hyperphenylalaninemia through Targeted Sequencing of Maternal Plasma.

Authors:  Jun Ye; Chao Chen; Yuan Yuan; Lianshu Han; Yaoshen Wang; Wenjuan Qiu; Huiwen Zhang; Xuefan Gu
Journal:  Sci Rep       Date:  2018-01-09       Impact factor: 4.379

8.  Targeted linked-read sequencing for direct haplotype phasing of maternal DMD alleles: a practical and reliable method for noninvasive prenatal diagnosis.

Authors:  Se Song Jang; Byung Chan Lim; Seong-Keun Yoo; Jong-Yeon Shin; Ki-Joong Kim; Jeong-Sun Seo; Jong-Il Kim; Jong Hee Chae
Journal:  Sci Rep       Date:  2018-06-06       Impact factor: 4.379

9.  Resolving the full spectrum of human genome variation using Linked-Reads.

Authors:  Patrick Marks; Sarah Garcia; Alvaro Martinez Barrio; Kamila Belhocine; Jorge Bernate; Rajiv Bharadwaj; Keith Bjornson; Claudia Catalanotti; Josh Delaney; Adrian Fehr; Ian T Fiddes; Brendan Galvin; Haynes Heaton; Jill Herschleb; Christopher Hindson; Esty Holt; Cassandra B Jabara; Susanna Jett; Nikka Keivanfar; Sofia Kyriazopoulou-Panagiotopoulou; Monkol Lek; Bill Lin; Adam Lowe; Shazia Mahamdallie; Shamoni Maheshwari; Tony Makarewicz; Jamie Marshall; Francesca Meschi; Christopher J O'Keefe; Heather Ordonez; Pranav Patel; Andrew Price; Ariel Royall; Elise Ruark; Sheila Seal; Michael Schnall-Levin; Preyas Shah; David Stafford; Stephen Williams; Indira Wu; Andrew Wei Xu; Nazneen Rahman; Daniel MacArthur; Deanna M Church
Journal:  Genome Res       Date:  2019-03-20       Impact factor: 9.043

10.  Non-invasive prenatal diagnosis of achondroplasia and thanatophoric dysplasia: next-generation sequencing allows for a safer, more accurate, and comprehensive approach.

Authors:  Lyn S Chitty; Sarah Mason; Angela N Barrett; Fiona McKay; Nicholas Lench; Rebecca Daley; Lucy A Jenkins
Journal:  Prenat Diagn       Date:  2015-05-26       Impact factor: 3.050

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.