| Literature DB >> 31165590 |
Jianchao Zheng1,2,3, Hongyun Zhang2,3, Santasree Banerjee4, Yun Li2, Junyu Zhou2, Qian Yang2, Xuemei Tan2, Peng Han2, Qinmei Fu2, Xiaoli Cui2, Yuying Yuan2, Meiyan Zhang2, Ruiqin Shen2, Haifeng Song2, Xiuqing Zhang1,4, Lijian Zhao2,3, Zhiyu Peng3, Wei Wang2,3, Ye Yin3.
Abstract
BACKGROUND: Recently, increasing innovations improved the accuracy of next generation sequencing (NGS) data. However, the validation of all NGS variants increased the cost and turn-around time of clinical diagnosis, and therefore limited the further development of clinical applications. We aimed to comprehensively assess the necessity of validating NGS variants.Entities:
Keywords: NGS; Sanger sequencing; quality threshold; target enrichment; validation
Mesh:
Year: 2019 PMID: 31165590 PMCID: PMC6625156 DOI: 10.1002/mgg3.748
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Information of the five capture panels used in this study
| Panel name | Targeted region (kb) | Number of genes | Number of exons | Panel provider | Average coverage | Average depth | ||
|---|---|---|---|---|---|---|---|---|
| HiSeq2500 | BGISEQ‐500 | HiSeq2500 | BGISEQ‐500 | |||||
| HC | 839 | 115 | 1,734 | Roche | 99.20 | 99.18 | 426 | 261 |
| HD | 1,565 | 313 | 4,899 | Roche | 92.75 | 91.69 | 393 | 159 |
| 4.8M | 4,760 | 1,130 | 17,108 | Roche | 99.84 | 99.78 | 294 | 210 |
| C2181 | 5,199 | 2,182 | 31,632 | Roche | 99.84 | 99.84 | 234 | 176 |
| Exo | 58,882 | 18,979 | 197,748 | BGI | 99.45 | 99.74 | 123 | 122 |
HC, panel for hereditary cancer; HD, panel for hereditary deafness; 4.8M and C2181, two panels for Mendelian disease; Exo, human exome.
Variants distribution used in this study
| Panel | Sequence chemistry | Sample | Gene | Variant | SNV | indel |
|---|---|---|---|---|---|---|
| HC | HiSeq2500 | 110 | 8 | 114 | 67 | 47 |
| HC | BGISEQ‐500 | 216 | 10 | 223 | 119 | 104 |
| HD | HiSeq2500 | 662 | 73 | 1,125 | 842 | 283 |
| HD | BGISEQ‐500 | 291 | 8 | 507 | 416 | 91 |
| 4.8M | HiSeq2500 | 1,328 | 231 | 1,882 | 1,471 | 411 |
| 4.8M | BGISEQ‐500 | 134 | 5 | 184 | 152 | 32 |
| C2181 | HiSeq2500 | 1,584 | 295 | 2,042 | 1,553 | 489 |
| C2181 | BGISEQ‐500 | 523 | 32 | 669 | 508 | 161 |
| Exo | HiSeq2500 | 154 | 203 | 379 | 368 | 11 |
| Exo | BGISEQ‐500 | 188 | 180 | 476 | 460 | 16 |
| Total | 5,190 | 1,045 | 7,601 | 5,956 | 1,645 |
HC, panel for hereditary cancer; HD, panel for hereditary deafness; 4.8M and C2181, two panels for Mendelian disease; Exo, human exome.
Figure 1Characterization of variants and genes in this study. (a) The GC content distribution of the 5,345 unique variants. GC content was calculated using the 100 bp flanking sequence of each variant. (b) The chromosome distribution of the 5,345 unique variants. (c) The chromosome distribution of the 1,045 unique genes
Figure 2Validation results of NGS variants. The scattergrams show the NGS depth and heterozygous ratio of true‐positive variants with high quality (a), true‐positive variants with low quality (b), and false‐positive variants (c). The depth of variant was calculated as 630 (A) or 400 (B) (C), when it was up to 630 or 400, respectively
The NGS quality distribution of 662 variants with low‐quality score
| Total | With homology | |||||||
|---|---|---|---|---|---|---|---|---|
| True | True % | False | False % | True | True % | False | False % | |
| ≥35 × depth & <35% Het ratio | 171 | 40.1% | 142 | 60.2% | 64 | 40.8% | 73 | 77.7% |
| <35 × depth & <35% Het ratio | 28 | 6.6% | 71 | 30.1% | 14 | 8.9% | 14 | 14.9% |
| <35 × depth &≥35% Het ratio | 227 | 53.3% | 23 | 9.7% | 79 | 50.3% | 7 | 7.4% |
| Total | 426 | 100.0% | 236 | 100.0% | 157 | 100.0% | 94 | 100.0% |