Literature DB >> 30222580

Preprocessing Sequence Coverage Data for More Precise Detection of Copy Number Variations.

Fatima Zare, Sardar Ansari, Kayvan Najarian, Sheida Nabavi.   

Abstract

Copy number variation (CNV) is a type of genomic/genetic variation that plays an important role in phenotypic diversity, evolution, and disease susceptibility. Next generation sequencing (NGS) technologies have created an opportunity for more accurate detection of CNVs with higher resolution. However, efficient and precise detection of CNVs remains challenging due to high levels of noise and biases, data heterogeneity, and the "big data" nature of NGS data. Sequence coverage (readcount) data are mostly used for detecting CNVs, specially for whole exome sequencing data. Readcount data are contaminated with several types of biases and noise that hinder accurate detection of CNVs. In this work, we introduce a novel preprocessing pipeline for reducing noise and biases to improve the detection accuracy of CNVs in heterogeneous NGS data, such as cancer whole exome sequencing data. We have employed several normalization methods to reduce readcount's biases that are due to GC content of reads, read alignment problems, and sample impurity. We have also developed a novel efficient and effective smoothing approach based on Taut String to reduce noise and increase CNV detection power. Using simulated and real data we showed that employing the proposed preprocessing pipeline significantly improves the accuracy of CNV detection.

Entities:  

Mesh:

Year:  2018        PMID: 30222580      PMCID: PMC7278033          DOI: 10.1109/TCBB.2018.2869738

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  37 in total

1.  Circular binary segmentation for the analysis of array-based DNA copy number data.

Authors:  Adam B Olshen; E S Venkatraman; Robert Lucito; Michael Wigler
Journal:  Biostatistics       Date:  2004-10       Impact factor: 5.899

2.  A fast and flexible method for the segmentation of aCGH data.

Authors:  Erez Ben-Yaacov; Yonina C Eldar
Journal:  Bioinformatics       Date:  2008-08-15       Impact factor: 6.937

3.  Sensitive and accurate detection of copy number variants using read depth of coverage.

Authors:  Seungtai Yoon; Zhenyu Xuan; Vladimir Makarov; Kenny Ye; Jonathan Sebat
Journal:  Genome Res       Date:  2009-08-05       Impact factor: 9.043

Review 4.  A copy number variation map of the human genome.

Authors:  Mehdi Zarrei; Jeffrey R MacDonald; Daniele Merico; Stephen W Scherer
Journal:  Nat Rev Genet       Date:  2015-02-03       Impact factor: 53.242

5.  Noninvasive prenatal diagnosis of common aneuploidies by semiconductor sequencing.

Authors:  Can Liao; Ai-hua Yin; Chun-fang Peng; Fang Fu; Jie-xia Yang; Ru Li; Yang-yi Chen; Dong-hong Luo; Yong-ling Zhang; Yan-mei Ou; Jian Li; Jing Wu; Ming-qin Mai; Rui Hou; Frances Wu; Hongrong Luo; Dong-zhi Li; Hai-liang Liu; Xiao-zhuang Zhang; Kang Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-05       Impact factor: 11.205

6.  Software for computing and annotating genomic ranges.

Authors:  Michael Lawrence; Wolfgang Huber; Hervé Pagès; Patrick Aboyoun; Marc Carlson; Robert Gentleman; Martin T Morgan; Vincent J Carey
Journal:  PLoS Comput Biol       Date:  2013-08-08       Impact factor: 4.475

7.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.

Authors:  Günter Klambauer; Karin Schwarzbauer; Andreas Mayr; Djork-Arné Clevert; Andreas Mitterecker; Ulrich Bodenhofer; Sepp Hochreiter
Journal:  Nucleic Acids Res       Date:  2012-02-01       Impact factor: 16.971

8.  A Signal Processing Approach for Detection of Hemodynamic Instability before Decompensation.

Authors:  Ashwin Belle; Sardar Ansari; Maxwell Spadafore; Victor A Convertino; Kevin R Ward; Harm Derksen; Kayvan Najarian
Journal:  PLoS One       Date:  2016-02-12       Impact factor: 3.240

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data.

Authors:  Layla Oesper; Ahmad Mahmoody; Benjamin J Raphael
Journal:  Genome Biol       Date:  2013-07-29       Impact factor: 13.583

View more
  1 in total

1.  Copy Number Variation Detection Using Total Variation.

Authors:  Fatima Zare; Sheida Nabavi
Journal:  ACM BCB       Date:  2019-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.