| Literature DB >> 33983414 |
José Marcos Moreno-Cabrera1,2,3, Jesús Del Valle2,3, Elisabeth Castellanos1,4, Lidia Feliubadaló2,3, Marta Pineda2,3, Eduard Serra1,3, Gabriel Capellá2,3, Conxi Lázaro2,3, Bernat Gel1.
Abstract
Germline copy-number variants (CNVs) are relevant mutations for multiple genetics fields, such as the study of hereditary diseases. However, available benchmarks show that all next-generation sequencing (NGS) CNV calling tools produce false positives. We developed CNVfilteR, an R package that uses the single nucleotide variant calls usually obtained in germline NGS pipelines to identify those false positives. The package can detect both false deletions and false duplications. We evaluated CNVfilteR performance on callsets generated by 13 CNV calling tools on 3 whole-genome sequencing and 541 panel samples, showing a decrease of up to 44.8% in false positives and consistent F1-score increase. Using CNVfilteR to detect false-positive calls can improve the overall performance of existing CNV calling pipelines. AVAILABILITY: CNVfilteR is released under Artistic-2.0 License. Source code and documentation are freely available at Bioconductor (http://www.bioconductor.org/packages/CNVfilteR). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.Entities:
Year: 2021 PMID: 33983414 PMCID: PMC9502136 DOI: 10.1093/bioinformatics/btab356
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Fig. 1.(A) CNV deletion example, adapted from CNVfilteR output. (B) CNV duplication example, adapted from CNVfilteR output. (C) Scoring model for CNV duplications, plotted by CNVfilteR. (D–F) F1-score differences before (light blue) and after (dark blue) removing the false-positive CNVs identified by CNVfilteR in the HuRef, AK1 and NA12878 WGS samples