Literature DB >> 31115175

Effect quantification and value prediction of factors in noninvasive detection for specific fetal copy number variants by semiconductor sequencing.

Chunhua Zhang1, Bo Liang2, Longwei Qiao1, Liming Xuan3, Hong Li1, Quanze He1, Xiaojuan Wu1, Jiafeng Lu1, Bin Yu4, Ting Wang1.   

Abstract

BACKGROUND: The detection limit of noninvasive prenatal testing (NIPT) by next generation sequencing for any given fetal copy number variants (CNV) can be influenced by several factors. In this study, we quantified the effects and predicted the value of parameters for CNVs detection by NIPT.
METHODS: Genomic DNA from patient's leucocytes with 3.16 Mb microdeletion in 22q11.21 was mixed with DNA from aborted fetal tissues without CNV at various concentrations by an enzyme digestion method. Abnormal DNA at 0% served as negative control. Sequencing of mixture samples (at 0%, 4%, 12%, and 20%) by Ion Proton Sequencer was performed at flow 500, with WISECONDOR as the pipeline in CNV-calling and bin of 500, 750 and 1,000 kb for counting unique reads. The parameters were evaluated with Box-Behnken design. The region with Z score ≦-3 was marked as a potential microdeletion.
RESULTS: The equation of Z score depending on fetal fraction, unique read number and bin size was obtained by Box-Behnken design. The negative effect was quantified as the coefficient in the equation. The smallest values of these parameters were defined as 4 M unique read number, and 10.08% fetal DNA concentration at bin of 750 kb for detecting subchromosomal microdeletion of 3.16 Mb.
CONCLUSION: The quantification of effect and value of parameters as well as the method used in this study can benefit the establishment of quality standards for CNVs detection and interpretation of CNVs detection results.
© 2019 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

Entities:  

Keywords:  fetal copy number variants; microdeletion; next generation sequencing; noninvasive prenatal testing

Mesh:

Substances:

Year:  2019        PMID: 31115175      PMCID: PMC6625335          DOI: 10.1002/mgg3.718

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


What's already known about this topic?

The application of Noninvasive prenatal testing (NIPT) in detection of fetal copy number variation (CNV) has been reported. In general, NIPT in aneuploidy screening could detect the smallest microdeletion of 3 Mb in size, and has good performance for detecting CNV larger than 5 Mb. Fetal fraction, unique read number, size of fetal CNV, and the biological and technical variability of event region are reported as the factors affecting the sensitivity of CNV detection. The limit of CNV detection can be improved by increasing the fraction of abnormal DNA as well as the sequencing depth.

What does this study add?

In this study, the Box–Behnken design was utilized to obtain the equation of Z score depending on fetal fraction, unique read number and bin. The negative effect of these parameters was quantified as the coefficient in the equation, through which the smallest values of these parameters could be defined as 4 M unique read number, and 10.08% fetal DNA concentration at bin of 750 kb for detecting subchromosomal microdeletion of 3.16 Mb. Thus, our study quantifies the effects and predicts the value of important parameters in NIPT for fetal CNV detection, which is extremely valuable for clinical application.

INTRODUCTION

Fluorescence in situ hybridization (Buysse et al., 2009) and chromosomal microarray analysis (CMA) (Carter, 2007) are widely used classical prenatal diagnosis methods for fetal copy number variants (CNVs) detection. However, invasive procedures (through chorionic villous sampling or amniocentesis) used for these classical prenatal diagnosis methods are associated with pregnancy loss risk (Buysse et al., 2009). Noninvasive prenatal testing (NIPT) by next generation sequencing provides a viable alternative to the existing methods for CNVs detection with higher accuracy and resolution (Zhao, Wang, Wang, Jia, & Zhao, 2013). NIPT used for aneuploidy screening to detect CNV ≥5 Mb shows good performance with sensitivity of 90.0%, while sensitivity for detecting CNV <5 Mb is only 14.3% (Li et al., 2016). The detection limit of NIPT for any given CNV is influenced mostly by fetal fraction, CNV size, coverage, as well as biological and technical variability of the CNV region (Zhao et al., 2015). In this study, using simulated DNA samples, we tried to quantify the effects and predict the value of parameters when NIPT is applied for CNV detection.

MATERIALS AND METHODS

Ethical compliance

This study was approved by the Institutional Review Boards of The Affiliated Suzhou Hospital of Nanjing Medical University (No. 011) in May 2016. Written informed consent for this study was obtained from all participants.

DNA Preparation

DNA extraction from leukocytes and aborted tissues was performed with QIAGEN DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany), according to the manufacturer's protocol. To simulate the DNA mixture derived from maternal peripheral blood, genomic DNA obtained from the leukocyte of patients (XY) with 3.16 Mb microdeletion in 22q11.21 [the DiGeorge syndrome chromosome region, Online Mendelian Inheritance in Man 188400] was mixed with DNA from aborted fetal tissues (XX) without CNV at concentrations of 0%, 4%, 12%, and 20% (w/w %) (Table S1). Abnormal DNA at concentration of 0% was used as negative control in each run of sequencing.

Setting main parameters of sequencing

The artificial mixture of genomic DNA at different ratios was subjected to gDNA fragment library preparation with Ion XpressTM Plus Fragment Library Kit (Life Technologies, Foster, CA) according to the manufacturer's protocol (Knierim, Lucke, Schwarz, Schuelke, & Seelow, 2011). The library was subjected to sequencing using an Ion Proton sequencer at 500 flows according to the manufacturer's instructions (Life Technologies, Carlsbad, CA). Raw reads obtained from the Ion Torrent Suite Software were of different lengths and were trimmed according to the sequencing quality value of >15. After raw reads with read length <50 bp was filtered out, the remaining raw reads were aligned to the human reference genome (hg19) using BWA software. Reads that were unmapped, had multiple primary alignment records, or duplicated, were removed. The number of unique read obtained was counted from each bin on each chromosome. The LOESS regression was used to compute the corrected number of unique reads in each bin, to eliminate the effect of GC bias in different simulated samples (Yin et al., 2015). The Z score method was used to compare the normalized read frequencies of the simulated samples to the normalized read frequencies in the same bin in a reference set of the control normal samples (Straver et al., 2014). Sample preparation, sequencing, and data processing were performed according to previous publication (Yin et al., 2015) with a few modifications: Firstly, sequencing of four mixture samples (at 0%, 4%, 12%, 20%) by Ion Proton Sequencer (Life Technologies, Carlsbad, CA) was performed at flow 500 respectively; Secondly WISECONDOR (Straver et al., 2014) was used as the pipeline implemented in CNV‐calling, and thirdly, bin of 500, 750, and 1000 kb was used for counting unique read. The region with Z score ≦−3 was marked as a potential microdeletion.

Evaluation of parameters

The following parameters were evaluated with Box–Behnken design: unique read number (A, related to coverage), fetal DNA concentration (B), and window width (C, also named as bin). Design Expert Software (version 8.0.6 trial, Stat‐Ease, Inc) was employed to perform the Box–Behnken design. The significance of each coefficient was statistically analyzed by p < 0.05.

RESULTS

The read length distribution of the artificial DNA mixture is provided in Figure S1. In the artificial DNA mixture, DNA derived from leukocytes of patient with 3.16 Mb deletion in chr22q11.21 was mixed in some ratios to simulate the fetal fraction in DNA derived from maternal plasma at known concentration, resulting in the observed patterns of read length distribution of the artificial mixture. Similar to the read length distribution of DNA derived from maternal plasma for aneuploidy screening, the median of the read length distribution for the artificial mixture was also located within 150–200 bp, in addition to similar read length distribution pattern. The Box–Behnken results are shown in Table S2. The simulated fetal fraction ranged from 4% to 20% with unique read number ranging from 4 to 10 M and bin size ranging from 500 to 1,000 kb. The mean Z score and cv % was obtained at different conditions. A multiple quadratic regression model was established based on the results: R1=−4.72–0.83×A–0.88×B–0.81×C–0.59×A×B–0.80×A×C+0.84×A2+0.42×C2 (Final Equation in Terms of Coded Factors). The variance analysis of the model is as shown in Table 1. A, B, and C had negative effects on the Z score. The interactions between A and B as well as A and C also presented negative effects. The existence of A2 and C2 had positive effects on Z score with bigger p value. The statistical results implied that this model could be used to navigate the design space. (Table 1).
Table 1

ANOVA for response surface reduced quadratic model analysis of variance table [Partial sum of squares ‐ Type III]

SourceSum of Squares df Mean Square F Value p‐value Prob > F Model status R 2 Adjusted R 2 Predicted R 2
Model24.777.003.5437.81<0.0001Significant0.970.940.76
A–A5.491.005.4958.71<0.0001    
B–B6.141.006.1465.63<0.0001    
C–C5.281.005.2856.43<0.0001    
AB1.421.001.4215.130.0037    
AC2.541.002.5427.180.0006    
A2 2.971.002.9731.690.0003    
C2 0.751.000.757.980.0199    
Residual0.849.000.09      
Lack of fit0.845.000.17      
Pure error0.004.000.00      
Cor total25.6116.00       

The coefficients were considered statistically significant at p < 0.05.

ANOVA for response surface reduced quadratic model analysis of variance table [Partial sum of squares ‐ Type III] The coefficients were considered statistically significant at p < 0.05. The simulated models by Box–Behnken design are shown in Figure 1. When B stayed in the region ranging from 4% to 20% with A changing in the scope of 4–10 M, the response surface diagram showed the highest Z score (Figure 1a). There were many solutions for the target Z score (−3) with a desirability of 1 (Figure 1b).
Figure 1

(a) Response surface diagram of Z score corresponding to fetal DNA concentration and unique read number for sample CMA0217‐P1. (b) Desirability and 95% prediction intervals of Z score corresponding to fetal DNA concentration and unique read number

(a) Response surface diagram of Z score corresponding to fetal DNA concentration and unique read number for sample CMA0217‐P1. (b) Desirability and 95% prediction intervals of Z score corresponding to fetal DNA concentration and unique read number Based on the response surface model, we obtained 24 optimization solutions of the Z score −3 for microdeletion of 3.16 Mb (Table S3). The solution of fetal DNA concentration (code −0.24, predicted value 10.08%) and unique read number (code −1, predicted value 4 M) was recommended by the software, which can be set as the lowest quality specifications that must be met, when detecting fetal CNV ≧3.16 Mb.

DISCUSSION

In this study, the equation of Z score depending on the fetal fraction, unique read number and bin size was obtained by Box–Behnken design. The negative effects of A, B and C were quantified as coefficients in the equation, and the smallest values of these parameters were defined as unique read number of 4 Mb and fetal DNA concentration of 10.08% at fixed bin of 750 kb for detecting microdeletion of 3.16 Mb. The smaller the Z score, the larger the absolute value, and the more favorable it is to report the CNV. When the Z score is greater than −3, the pipeline WISECONDOR reports absence of CNV. The negative effect of fetal DNA concentration and unique read number has been reported in a previous study (Zhao et al., 2015). We further found that bin size had significant negative effect on Z score for microdeletions, interactions of A and B as well as A and C were also significant negative factors. In contrast to other publications that characterized the effect of factors as positive or negative (Lo et al., 2016), this study quantified the effect of fetal DNA concentration, unique read number and bin as coefficients of −0.83, −0.88, and −0.81. Moreover, the value of B (fetal DNA concentration) can be as large as possible because there is no B2 existing in the equation. Clinically, we used this conclusion to verify specious CNV. As shown in Figure S2, sample 1 and 2 showed plausible CNV; re‐sequencing was performed after enrichment of fetal fraction by cfDNA size selection, Z scores increased with the increasing fetal fraction correspondingly. Thus, the plausible CNVs should be existing reliably and finally were confirmed by CMA. It is indicated that resequencing after DNA size selection by gel electrophoresis could be an effective validation method for CNV detection. The simulated sample in this study was not prepared by mixing the DNA from the fetal plasma and maternal plasma due to sampling difficulty, but rather was prepared similar to that used for negative control preparation in aneuploidy screening. In addition, we only assessed a single genotype (3.16 Mb deletion at 22q11.2). Nonetheless, the quality control specification obtained from such mixtures is expected to extend the application scope to CNVs ≥3.16 Mb since CNV size has a negative effect on CNV detection (Zhao et al., 2015). Taking CNVs ≥3.16 Mb in Yin's research as examples (Yin et al., 2015), among the samples with sequencing results satisfying the quality control specification, the sensitivity for CNVs ≥3.16 Mb reaches 97.3% (36/37, with one deletion of 4.95 M in Chr21 not reported) higher than 75% (when values of parameters meeting the original quality control standard of unique read number ≥2.5 M, fetal DNA concentration ≥3.5% in aneuploidy screening). Since samples representing the biological and technical variability of the event region are difficult to mimic and quantify (Li et al., 2016), they were not involved in this study. Taken together, the predicted quality control specification for detecting CNVs ≥3.16 Mb by the simulated model needs to be verified and adjusted by more clinical samples. This method is expected to predict detection conditions for smaller size of CNVs with specific locations, sizes and genotype. Further, nested deletions, such as those in 22q11.2 present as a significant fraction of Velocardiofacial syndrome (Kong, Cheng, To, & Leung, 2019), will be further explored in a subsequent study. Most studies retrospectively reported the clinical performance of general NIPT in CNV screening (Liu et al., 2016), while in this study, we positively designed experiments to quantify the effect of key factors involved in microdeletion detection and determine the value of parameters. Such a method can benefit the establishment of quality standards for CNV detection and help doctors and technicians in interpreting the CNV detection results.

COMPETING INTEREST

None declared. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  12 in total

Review 1.  Methods and strategies for analyzing copy number variation using DNA microarrays.

Authors:  Nigel P Carter
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

2.  Challenges for CNV interpretation in clinical molecular karyotyping: lessons learned from a 1001 sample experience.

Authors:  Karen Buysse; Barbara Delle Chiaie; Rudy Van Coster; Bart Loeys; Anne De Paepe; Geert Mortier; Frank Speleman; Björn Menten
Journal:  Eur J Med Genet       Date:  2009-09-16       Impact factor: 2.708

3.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

Authors:  Min Zhao; Qingguo Wang; Quan Wang; Peilin Jia; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2013-09-13       Impact factor: 3.169

4.  Detection of fetal copy number variants by non-invasive prenatal testing for common aneuploidies.

Authors:  R Li; J Wan; Y Zhang; F Fu; Y Ou; X Jing; J Li; D Li; C Liao
Journal:  Ultrasound Obstet Gynecol       Date:  2016-01       Impact factor: 7.299

5.  Detection of fetal subchromosomal abnormalities by sequencing circulating cell-free DNA from maternal plasma.

Authors:  Chen Zhao; John Tynan; Mathias Ehrich; Gregory Hannum; Ron McCullough; Juan-Sebastian Saldivar; Paul Oeth; Dirk van den Boom; Cosmin Deciu
Journal:  Clin Chem       Date:  2015-02-20       Impact factor: 8.327

6.  Prevalence of chromosomal abnormalities and 22q11.2 deletion in conotruncal and non-conotruncal antenatally diagnosed congenital heart diseases in a Chinese population.

Authors:  C W Kong; Y K Y Cheng; W W K To; T Y Leung
Journal:  Hong Kong Med J       Date:  2019-01-18       Impact factor: 2.227

7.  Noninvasive detection of fetal subchromosomal abnormalities by semiconductor sequencing of maternal plasma DNA.

Authors:  Ai-hua Yin; Chun-fang Peng; Xin Zhao; Bennett A Caughey; Jie-xia Yang; Jian Liu; Wei-wei Huang; Chang Liu; Dong-hong Luo; Hai-liang Liu; Yang-yi Chen; Jing Wu; Rui Hou; Mindy Zhang; Michael Ai; Lianghong Zheng; Rachel Q Xue; Ming-qin Mai; Fang-fang Guo; Yi-ming Qi; Dong-mei Wang; Michal Krawczyk; Daniel Zhang; Yu-nan Wang; Quan-fei Huang; Michael Karin; Kang Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-09       Impact factor: 11.205

8.  Systematic comparison of three methods for fragmentation of long-range PCR products for next generation sequencing.

Authors:  Ellen Knierim; Barbara Lucke; Jana Marie Schwarz; Markus Schuelke; Dominik Seelow
Journal:  PLoS One       Date:  2011-11-30       Impact factor: 3.240

9.  Performance Evaluation of NIPT in Detection of Chromosomal Copy Number Variants Using Low-Coverage Whole-Genome Sequencing of Plasma DNA.

Authors:  Hongtai Liu; Ya Gao; Zhiyang Hu; Linhua Lin; Xuyang Yin; Jun Wang; Dayang Chen; Fang Chen; Hui Jiang; Jinghui Ren; Wei Wang
Journal:  PLoS One       Date:  2016-07-14       Impact factor: 3.240

10.  Effect quantification and value prediction of factors in noninvasive detection for specific fetal copy number variants by semiconductor sequencing.

Authors:  Chunhua Zhang; Bo Liang; Longwei Qiao; Liming Xuan; Hong Li; Quanze He; Xiaojuan Wu; Jiafeng Lu; Bin Yu; Ting Wang
Journal:  Mol Genet Genomic Med       Date:  2019-05-21       Impact factor: 2.183

View more
  3 in total

1.  Effect quantification and value prediction of factors in noninvasive detection for specific fetal copy number variants by semiconductor sequencing.

Authors:  Chunhua Zhang; Bo Liang; Longwei Qiao; Liming Xuan; Hong Li; Quanze He; Xiaojuan Wu; Jiafeng Lu; Bin Yu; Ting Wang
Journal:  Mol Genet Genomic Med       Date:  2019-05-21       Impact factor: 2.183

2.  A fetal fraction enrichment method reduces false negatives and increases test success rate of fetal chromosome aneuploidy detection in early pregnancy loss.

Authors:  Longwei Qiao; Bin Zhang; Xiaojuan Wu; Chunhua Zhang; Ying Xue; Hui Tang; Haoyu Tang; Jingye Shi; Yuting Liang; Bin Yu; Ting Wang
Journal:  J Transl Med       Date:  2022-08-02       Impact factor: 8.440

3.  Clinical value for the detection of fetal chromosomal deletions/duplications by noninvasive prenatal testing in clinical practice.

Authors:  Lingshan Gou; Feng Suo; Yi Wang; Na Wang; Qin Wu; Shunan Hu; Peng Wang; Lize Gu; Man Zhang; Chuanxia Wang; Yan Zhang; Xin Yin; Peng Zhang; Jian Xu; Xingqi Wang; Maosheng Gu
Journal:  Mol Genet Genomic Med       Date:  2021-05-05       Impact factor: 2.183

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.