Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples.

Literature DB >> 30779024

SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples.

Yaoyao Li¹, Xiguo Yuan², Junying Zhang³, Liying Yang¹, Jun Bai⁴, Shan Jiang⁵.

Abstract

BACKGROUND: Copy number variation (CNV) is an important form of genomic structural variation and is linked to dozens of human diseases. Using next-generation sequencing (NGS) data and developing computational methods to characterize such structural variants is significant for understanding the mechanisms of diseases.
OBJECTIVE: The objective of this study is to develop a new statistical method of detection recurrent CNVs across multiple samples from genomic sequences.
METHODS: A statistical method is carried out to detect recurrent CNVs, referred to as SM-RCNV. This method uses a statistic associated with each location by combining the frequency of variation at one location across whole samples and the correlation among consecutive locations. The weights of the frequency and correlation are trained using real datasets with known CNVs. P-value is assessed for each location on the genome by permutation testing.
RESULTS: Compared with six peer methods, SM-RCNV outperforms the peer methods under receiver operating characteristic curves. SM-RCNV successfully identifies many consistent recurrent CNVs, most of which are known to be of biological significance and associated with diseased genes. The validation rate of SM-RCNV in the CEU call set and YRI call set with Database of Genomic Variants are 258/328 (79%) and (157/309) 51%, respectively.
CONCLUSION: SM-RCNV is a well-grounded statistical framework for detecting recurrent CNVs from multiple genomic sequences, providing valuable information to study genomes in human diseases. The source code is freely available at https://sourceforge.net/projects/sm-rcnv/ .

Entities: Species

Keywords: Correlation; Permutation test; Read depth; Recurrent copy number variations

Mesh：

Year: 2019 PMID： 30779024 DOI： 10.1007/s13258-019-00788-9

Source DB: PubMed Journal: Genes Genomics ISSN： 1976-9571 Impact factor: 1.839

Keyword Cloud
References

30 in total

1. Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.

Authors: Ruibin Xi; Angela G Hadjipanayis; Lovelace J Luquette; Tae-Min Kim; Eunjung Lee; Jianhua Zhang; Mark D Johnson; Donna M Muzny; David A Wheeler; Richard A Gibbs; Raju Kucherlapati; Peter J Park
Journal: Proc Natl Acad Sci U S A Date: 2011-11-07 Impact factor: 11.205

2. Sensitive and accurate detection of copy number variants using read depth of coverage.

Authors: Seungtai Yoon; Zhenyu Xuan; Vladimir Makarov; Kenny Ye; Jonathan Sebat
Journal: Genome Res Date: 2009-08-05 Impact factor: 9.043

Review 3. Copy number variation: new insights in genome diversity.

Authors: Jennifer L Freeman; George H Perry; Lars Feuk; Richard Redon; Steven A McCarroll; David M Altshuler; Hiroyuki Aburatani; Keith W Jones; Chris Tyler-Smith; Matthew E Hurles; Nigel P Carter; Stephen W Scherer; Charles Lee
Journal: Genome Res Date: 2006-06-29 Impact factor: 9.043

4. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients.

Authors: Xiaohui Ni; Minglei Zhuo; Zhe Su; Jianchun Duan; Yan Gao; Zhijie Wang; Chenghang Zong; Hua Bai; Alec R Chapman; Jun Zhao; Liya Xu; Tongtong An; Qi Ma; Yuyan Wang; Meina Wu; Yu Sun; Shuhang Wang; Zhenxiang Li; Xiaodan Yang; Jun Yong; Xiao-Dong Su; Youyong Lu; Fan Bai; X Sunney Xie; Jie Wang
Journal: Proc Natl Acad Sci U S A Date: 2013-12-09 Impact factor: 11.205

5. Common copy number variation detection from multiple sequenced samples.

Authors: Junbo Duan; Hong-Wen Deng; Yu-Ping Wang
Journal: IEEE Trans Biomed Eng Date: 2014-03 Impact factor: 4.538

Review 6. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Authors: Peter J A Cock; Christopher J Fields; Naohisa Goto; Michael L Heuer; Peter M Rice
Journal: Nucleic Acids Res Date: 2009-12-16 Impact factor: 16.971

7. cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.

Authors: Günter Klambauer; Karin Schwarzbauer; Andreas Mayr; Djork-Arné Clevert; Andreas Mitterecker; Ulrich Bodenhofer; Sepp Hochreiter
Journal: Nucleic Acids Res Date: 2012-02-01 Impact factor: 16.971