| Literature DB >> 31105557 |
Yu Liang1, Li He2, Yiru Zhao3, Yinyi Hao1, Yifan Zhou1, Menglong Li1, Chuan Li3, Xuemei Pu1, Zhining Wen1.
Abstract
Despite of the low occurrence rate in the entire genomes, de novo mutation is proved to be deleterious and will lead to severe genetic diseases via impacting on the gene function. Considering the fact that the traditional family based linkage approaches and the genome-wide association studies are unsuitable for identifying the de novo mutations, in recent years, several pipelines have been proposed to detect them based on the whole-genome or whole-exome sequencing data and were used for calling them in the rare diseases. However, how the performance of these variant calling pipelines on detecting the de novo mutations is still unexplored. For the purpose of facilitating the appropriate choice of the pipelines and reducing the false positive rate, in this study, we thoroughly evaluated the performance of the commonly used trio calling methods on the detection of the de novo single-nucleotide variants (DNSNVs) by conducting a comparative analysis for the calling results. Our results exhibited that different pipelines have a specific tendency to detect the DNSNVs in the genomic regions with different GC contents. Additionally, to refine the calling results for a single pipeline, our proposed filter achieved satisfied results, indicating that the read coverage at the mutation positions can be used as an effective index to identify the high-confidence DNSNVs. Our findings should be good support for the committees to choose an appropriate way to explore the de novo mutations for the rare diseases.Entities:
Keywords: de novo mutation; gene function; rare diseases; variant calling pipelines evaluation; whole-exon sequencing
Year: 2019 PMID: 31105557 PMCID: PMC6499170 DOI: 10.3389/fphar.2019.00358
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
FIGURE 1The numbers of transitions and transversions in the calling results generated by GATK, RTG and VarScan.
FIGURE 2The distribution of GC contents around the de novo SNVs identified by GATK, RTG and VarScan.
FIGURE 3The distribution of SNV densities around the de novo SNVs identified by GATK, RTG, and VarScan.
FIGURE 4The number of de novo SNVs kept in the calling results when filtered by using different cut-offs in our proposed filter.
FIGURE 5Results on the filtered de novo SNVs identified by GATK, RTG and VarScan. (A) The number of transitions and transversions in the filtered calling results. (B) The chromosomal distribution of the filtered de novo SNVs. (C) The overlaps of the filtered de novo SNVs among three pipelines.
FIGURE 6The distribution of GQ around the de novo SNVs identified by GATK, RTG, and VarScan.
The 22 overlapped DNSNVs identified by all the trio calling methods and the corresponding genes associated with the diseases.
| Chromosome | Position | Substitution | Gene symbol | phenotype |
|---|---|---|---|---|
| 2 | 25457155 | C→T | DTNB∗ | Muscular dystrophy |
| 2 | 197791183 | C→T | – | – |
| 2 | 241835203 | G→T | – | – |
| 3 | 4669342 | A→T | ITPR1∗ | Gillespie syndrome Spinocerebellar ataxia 15 Spinocerebellar ataxia 29, congenital non-progressive |
| 3 | 48603870 | G→A | UQCRC1 | Predisposition of Alzheimer’s disease |
| 3 | 52547912 | C→G | PBRM1∗ | Clear cell renal cell carcinoma |
| 6 | 109740508 | T→ C | FIG4∗ | Polymicrogyria, bilateral temporooccipital Amyotrophic lateral sclerosis 11 Charcot-Marie-Tooth disease, type 4J Yunis-Varon syndrome |
| 5 | 140730969 | A→G | – | – |
| 8 | 30703949 | C→G | GSR∗ | Hemolytic anemia due to glutathione reductase deficiency |
| 9 | 131041047 | G→T | LAMC3∗ | Cortical malformations, occipital |
| 9 | 131851319 | C→G | – | – |
| 11 | 82698724 | T→C | – | – |
| 14 | 77245340 | C→T | TMEM63C | |
| 14 | 93170535 | G→ A | – | – |
| 14 | 96180196 | C→G | – | – |
| 14 | 106330036 | T→A | – | – |
| 14 | 106993919 | A→G | – | – |
| 15 | 79852462 | G→T | MTHFS | Heart defects; lung cancer |
| 17 | 53638886 | G→A | – | – |
| 19 | 45720935 | C→T | FBXO46 | – |
| 22 | 23029544 | A→C | – | – |
| 22 | 23029596 | A→T | – | – |