Literature DB >> 30779024

SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples.

Yaoyao Li1, Xiguo Yuan2, Junying Zhang3, Liying Yang1, Jun Bai4, Shan Jiang5.   

Abstract

BACKGROUND: Copy number variation (CNV) is an important form of genomic structural variation and is linked to dozens of human diseases. Using next-generation sequencing (NGS) data and developing computational methods to characterize such structural variants is significant for understanding the mechanisms of diseases.
OBJECTIVE: The objective of this study is to develop a new statistical method of detection recurrent CNVs across multiple samples from genomic sequences.
METHODS: A statistical method is carried out to detect recurrent CNVs, referred to as SM-RCNV. This method uses a statistic associated with each location by combining the frequency of variation at one location across whole samples and the correlation among consecutive locations. The weights of the frequency and correlation are trained using real datasets with known CNVs. P-value is assessed for each location on the genome by permutation testing.
RESULTS: Compared with six peer methods, SM-RCNV outperforms the peer methods under receiver operating characteristic curves. SM-RCNV successfully identifies many consistent recurrent CNVs, most of which are known to be of biological significance and associated with diseased genes. The validation rate of SM-RCNV in the CEU call set and YRI call set with Database of Genomic Variants are 258/328 (79%) and (157/309) 51%, respectively.
CONCLUSION: SM-RCNV is a well-grounded statistical framework for detecting recurrent CNVs from multiple genomic sequences, providing valuable information to study genomes in human diseases. The source code is freely available at https://sourceforge.net/projects/sm-rcnv/ .

Entities:  

Keywords:  Correlation; Permutation test; Read depth; Recurrent copy number variations

Mesh:

Year:  2019        PMID: 30779024     DOI: 10.1007/s13258-019-00788-9

Source DB:  PubMed          Journal:  Genes Genomics        ISSN: 1976-9571            Impact factor:   1.839


  30 in total

1.  Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.

Authors:  Ruibin Xi; Angela G Hadjipanayis; Lovelace J Luquette; Tae-Min Kim; Eunjung Lee; Jianhua Zhang; Mark D Johnson; Donna M Muzny; David A Wheeler; Richard A Gibbs; Raju Kucherlapati; Peter J Park
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-07       Impact factor: 11.205

2.  Sensitive and accurate detection of copy number variants using read depth of coverage.

Authors:  Seungtai Yoon; Zhenyu Xuan; Vladimir Makarov; Kenny Ye; Jonathan Sebat
Journal:  Genome Res       Date:  2009-08-05       Impact factor: 9.043

Review 3.  Copy number variation: new insights in genome diversity.

Authors:  Jennifer L Freeman; George H Perry; Lars Feuk; Richard Redon; Steven A McCarroll; David M Altshuler; Hiroyuki Aburatani; Keith W Jones; Chris Tyler-Smith; Matthew E Hurles; Nigel P Carter; Stephen W Scherer; Charles Lee
Journal:  Genome Res       Date:  2006-06-29       Impact factor: 9.043

4.  Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients.

Authors:  Xiaohui Ni; Minglei Zhuo; Zhe Su; Jianchun Duan; Yan Gao; Zhijie Wang; Chenghang Zong; Hua Bai; Alec R Chapman; Jun Zhao; Liya Xu; Tongtong An; Qi Ma; Yuyan Wang; Meina Wu; Yu Sun; Shuhang Wang; Zhenxiang Li; Xiaodan Yang; Jun Yong; Xiao-Dong Su; Youyong Lu; Fan Bai; X Sunney Xie; Jie Wang
Journal:  Proc Natl Acad Sci U S A       Date:  2013-12-09       Impact factor: 11.205

5.  Common copy number variation detection from multiple sequenced samples.

Authors:  Junbo Duan; Hong-Wen Deng; Yu-Ping Wang
Journal:  IEEE Trans Biomed Eng       Date:  2014-03       Impact factor: 4.538

Review 6.  The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Authors:  Peter J A Cock; Christopher J Fields; Naohisa Goto; Michael L Heuer; Peter M Rice
Journal:  Nucleic Acids Res       Date:  2009-12-16       Impact factor: 16.971

7.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.

Authors:  Günter Klambauer; Karin Schwarzbauer; Andreas Mayr; Djork-Arné Clevert; Andreas Mitterecker; Ulrich Bodenhofer; Sepp Hochreiter
Journal:  Nucleic Acids Res       Date:  2012-02-01       Impact factor: 16.971

8.  CNV-TV: a robust method to discover copy number variation from short sequencing reads.

Authors:  Junbo Duan; Ji-Gang Zhang; Hong-Wen Deng; Yu-Ping Wang
Journal:  BMC Bioinformatics       Date:  2013-05-02       Impact factor: 3.169

9.  CNV-seq, a new method to detect copy number variation using high-throughput sequencing.

Authors:  Chao Xie; Martti T Tammi
Journal:  BMC Bioinformatics       Date:  2009-03-06       Impact factor: 3.169

10.  A survey of tools for variant analysis of next-generation genome sequencing data.

Authors:  Stephan Pabinger; Andreas Dander; Maria Fischer; Rene Snajder; Michael Sperk; Mirjana Efremova; Birgit Krabichler; Michael R Speicher; Johannes Zschocke; Zlatko Trajanoski
Journal:  Brief Bioinform       Date:  2013-01-21       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.