Literature DB >> 23393425

Robust Detection and Identification of Sparse Segments in Ultra-High Dimensional Data Analysis.

T Tony Cai1, X Jessie Jeng, Hongzhe Li.   

Abstract

Copy number variants (CNVs) are alternations of DNA of a genome that results in the cell having a less or more than two copies of segments of the DNA. CNVs correspond to relatively large regions of the genome, ranging from about one kilobase to several megabases, that are deleted or duplicated. Motivated by CNV analysis based on next generation sequencing data, we consider the problem of detecting and identifying sparse short segments hidden in a long linear sequence of data with an unspecified noise distribution. We propose a computationally efficient method that provides a robust and near-optimal solution for segment identification over a wide range of noise distributions. We theoretically quantify the conditions for detecting the segment signals and show that the method near-optimally estimates the signal segments whenever it is possible to detect their existence. Simulation studies are carried out to demonstrate the efficiency of the method under different noise distributions. We present results from a CNV analysis of a HapMap Yoruban sample to further illustrate the theory and the methods.

Entities:  

Year:  2012        PMID: 23393425      PMCID: PMC3563068          DOI: 10.1111/j.1467-9868.2012.01028.x

Source DB:  PubMed          Journal:  J R Stat Soc Series B Stat Methodol        ISSN: 1369-7412            Impact factor:   4.488


  29 in total

Review 1.  Structural variation in the human genome.

Authors:  Lars Feuk; Andrew R Carson; Stephen W Scherer
Journal:  Nat Rev Genet       Date:  2006-02       Impact factor: 53.242

2.  Copy number variation at 1q21.1 associated with neuroblastoma.

Authors:  Sharon J Diskin; Cuiping Hou; Joseph T Glessner; Edward F Attiyeh; Marci Laudenslager; Kristopher Bosse; Kristina Cole; Yaël P Mossé; Andrew Wood; Jill E Lynch; Katlyn Pecor; Maura Diamond; Cynthia Winter; Kai Wang; Cecilia Kim; Elizabeth A Geiger; Patrick W McGrady; Alexandra I F Blakemore; Wendy B London; Tamim H Shaikh; Jonathan Bradfield; Struan F A Grant; Hongzhe Li; Marcella Devoto; Eric R Rappaport; Hakon Hakonarson; John M Maris
Journal:  Nature       Date:  2009-06-18       Impact factor: 49.962

Review 3.  Computational methods for discovering structural variation with next-generation sequencing.

Authors:  Paul Medvedev; Monica Stanciu; Michael Brudno
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

4.  Accurate and exact CNV identification from targeted high-throughput sequence data.

Authors:  Alex S Nord; Ming Lee; Mary-Claire King; Tom Walsh
Journal:  BMC Genomics       Date:  2011-04-12       Impact factor: 3.969

5.  Optimal Sparse Segment Identification with Application in Copy Number Variation Analysis.

Authors:  X Jessie Jeng; T Tony Cai; Hongzhe Li
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

6.  Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly.

Authors:  Yingrui Li; Hancheng Zheng; Ruibang Luo; Honglong Wu; Hongmei Zhu; Ruiqiang Li; Hongzhi Cao; Boxin Wu; Shujia Huang; Haojing Shao; Hanzhou Ma; Fan Zhang; Shuijian Feng; Wei Zhang; Hongli Du; Geng Tian; Jingxiang Li; Xiuqing Zhang; Songgang Li; Lars Bolund; Karsten Kristiansen; Adam J de Smith; Alexandra I F Blakemore; Lachlan J M Coin; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nat Biotechnol       Date:  2011-07-24       Impact factor: 54.908

7.  Rare chromosomal deletions and duplications increase risk of schizophrenia.

Authors: 
Journal:  Nature       Date:  2008-07-30       Impact factor: 49.962

8.  Large-scale copy number polymorphism in the human genome.

Authors:  Jonathan Sebat; B Lakshmi; Jennifer Troge; Joan Alexander; Janet Young; Pär Lundin; Susanne Månér; Hillary Massa; Megan Walker; Maoyen Chi; Nicholas Navin; Robert Lucito; John Healy; James Hicks; Kenny Ye; Andrew Reiner; T Conrad Gilliam; Barbara Trask; Nick Patterson; Anders Zetterberg; Michael Wigler
Journal:  Science       Date:  2004-07-23       Impact factor: 47.728

9.  Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia.

Authors:  Tom Walsh; Jon M McClellan; Shane E McCarthy; Anjené M Addington; Sarah B Pierce; Greg M Cooper; Alex S Nord; Mary Kusenda; Dheeraj Malhotra; Abhishek Bhandari; Sunday M Stray; Caitlin F Rippey; Patricia Roccanova; Vlad Makarov; B Lakshmi; Robert L Findling; Linmarie Sikich; Thomas Stromberg; Barry Merriman; Nitin Gogtay; Philip Butler; Kristen Eckstrand; Laila Noory; Peter Gochman; Robert Long; Zugen Chen; Sean Davis; Carl Baker; Evan E Eichler; Paul S Meltzer; Stanley F Nelson; Andrew B Singleton; Ming K Lee; Judith L Rapoport; Mary-Claire King; Jonathan Sebat
Journal:  Science       Date:  2008-03-27       Impact factor: 47.728

10.  ReadDepth: a parallel R package for detecting copy number alterations from short sequencing reads.

Authors:  Christopher A Miller; Oliver Hampton; Cristian Coarfa; Aleksandar Milosavljevic
Journal:  PLoS One       Date:  2011-01-31       Impact factor: 3.240

View more
  5 in total

1.  A Statistical Method for Identifying Trait-Associated Copy Number Variants.

Authors:  Jessie Jeng; Qian Wu; Hongzhe Li
Journal:  Hum Hered       Date:  2015-07-28       Impact factor: 0.444

2.  Parametric modeling of whole-genome sequencing data for CNV identification.

Authors:  Saran Vardhanabhuti; X Jessie Jeng; Yinghua Wu; Hongzhe Li
Journal:  Biostatistics       Date:  2014-01-28       Impact factor: 5.899

3.  A super scalable algorithm for short segment detection.

Authors:  Ning Hao; Yue Selena Niu; Feifei Xiao; Heping Zhang
Journal:  Stat Biosci       Date:  2020-04-18

4.  SomatiCA: identifying, characterizing and quantifying somatic copy number aberrations from cancer genome sequencing data.

Authors:  Mengjie Chen; Murat Gunel; Hongyu Zhao
Journal:  PLoS One       Date:  2013-11-12       Impact factor: 3.240

5.  iSeg: an efficient algorithm for segmentation of genomic and epigenomic data.

Authors:  Senthil B Girimurugan; Yuhang Liu; Pei-Yau Lung; Daniel L Vera; Jonathan H Dennis; Hank W Bass; Jinfeng Zhang
Journal:  BMC Bioinformatics       Date:  2018-04-11       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.