Literature DB >> 23421794

CNVeM: copy number variation detection using uncertainty of read mapping.

Zhanyong Wang1, Farhad Hormozdiari, Wen-Yun Yang, Eran Halperin, Eleazar Eskin.   

Abstract

Copy number variations (CNVs) are widely known to be an important mediator for diseases and traits. The development of high-throughput sequencing (HTS) technologies has provided great opportunities to identify CNV regions in mammalian genomes. In a typical experiment, millions of short reads obtained from a genome of interest are mapped to a reference genome. The mapping information can be used to identify CNV regions. One important challenge in analyzing the mapping information is the large fraction of reads that can be mapped to multiple positions. Most existing methods either only consider reads that can be uniquely mapped to the reference genome or randomly place a read to one of its mapping positions. Therefore, these methods have low power to detect CNVs located within repeated sequences. In this study, we propose a probabilistic model, CNVeM, that utilizes the inherent uncertainty of read mapping. We use maximum likelihood to estimate locations and copy numbers of copied regions and implement an expectation-maximization (EM) algorithm. One important contribution of our model is that we can distinguish between regions in the reference genome that differ from each other by as little as 0.1%. As our model aims to predict the copy number of each nucleotide, we can predict the CNV boundaries with high resolution. We apply our method to simulated datasets and achieve higher accuracy compared to CNVnator. Moreover, we apply our method to real data from which we detected known CNVs. To our knowledge, this is the first attempt to predict CNVs at nucleotide resolution and to utilize uncertainty of read mapping.

Entities:  

Mesh:

Year:  2013        PMID: 23421794      PMCID: PMC3590897          DOI: 10.1089/cmb.2012.0258

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  21 in total

1.  Detection of large-scale variation in the human genome.

Authors:  A John Iafrate; Lars Feuk; Miguel N Rivera; Marc L Listewnik; Patricia K Donahoe; Ying Qi; Stephen W Scherer; Charles Lee
Journal:  Nat Genet       Date:  2004-08-01       Impact factor: 38.330

2.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

3.  HAPLOFREQ--estimating haplotype frequencies efficiently.

Authors:  Eran Halperin; Elad Hazan
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

Review 4.  Methods and strategies for analyzing copy number variation using DNA microarrays.

Authors:  Nigel P Carter
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

5.  CNVDetector: locating copy number variations using array CGH data.

Authors:  Peng-An Chen; Hsiao-Fei Liu; Kun-Mao Chao
Journal:  Bioinformatics       Date:  2008-10-07       Impact factor: 6.937

6.  CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.

Authors:  Alexej Abyzov; Alexander E Urban; Michael Snyder; Mark Gerstein
Journal:  Genome Res       Date:  2011-02-07       Impact factor: 9.043

7.  Global variation in copy number in the human genome.

Authors:  Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2006-11-23       Impact factor: 49.962

8.  High-resolution mapping of copy-number alterations with massively parallel sequencing.

Authors:  Derek Y Chiang; Gad Getz; David B Jaffe; Michael J T O'Kelly; Xiaojun Zhao; Scott L Carter; Carsten Russ; Chad Nusbaum; Matthew Meyerson; Eric S Lander
Journal:  Nat Methods       Date:  2008-11-30       Impact factor: 28.547

9.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

10.  Strong association of de novo copy number mutations with autism.

Authors:  Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler
Journal:  Science       Date:  2007-03-15       Impact factor: 47.728

View more
  11 in total

1.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

Authors:  Min Zhao; Qingguo Wang; Quan Wang; Peilin Jia; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2013-09-13       Impact factor: 3.169

2.  A genome-wide approach for detecting novel insertion-deletion variants of mid-range size.

Authors:  Li C Xia; Sukolsak Sakshuwong; Erik S Hopmans; John M Bell; Susan M Grimes; David O Siegmund; Hanlee P Ji; Nancy R Zhang
Journal:  Nucleic Acids Res       Date:  2016-06-20       Impact factor: 16.971

Review 3.  Statistical Considerations on NGS Data for Inferring Copy Number Variations.

Authors:  Jie Chen
Journal:  Methods Mol Biol       Date:  2021

4.  Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

Authors:  WeiBo Wang; Wei Wang; Wei Sun; James J Crowley; Jin P Szatkiewicz
Journal:  Nucleic Acids Res       Date:  2015-04-16       Impact factor: 16.971

5.  GIP: an open-source computational pipeline for mapping genomic instability from protists to cancer cells.

Authors:  Gerald F Späth; Giovanni Bussotti
Journal:  Nucleic Acids Res       Date:  2022-04-08       Impact factor: 16.971

6.  CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

Authors:  Rituparna Sinha; Sandip Samaddar; Rajat K De
Journal:  PLoS One       Date:  2015-08-20       Impact factor: 3.240

7.  Identification of copy number variants in whole-genome data using Reference Coverage Profiles.

Authors:  Gustavo Glusman; Alissa Severson; Varsha Dhankani; Max Robinson; Terry Farrah; Denise E Mauldin; Anna B Stittrich; Seth A Ament; Jared C Roach; Mary E Brunkow; Dale L Bodian; Joseph G Vockley; Ilya Shmulevich; John E Niederhuber; Leroy Hood
Journal:  Front Genet       Date:  2015-02-17       Impact factor: 4.599

Review 8.  An NGS Workflow Blueprint for DNA Sequencing Data and Its Application in Individualized Molecular Oncology.

Authors:  Jian Li; Aarif Mohamed Nazeer Batcha; Björn Grüning; Ulrich R Mansmann
Journal:  Cancer Inform       Date:  2016-04-10

9.  mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

Authors:  Faraz Hach; Iman Sarrafi; Farhad Hormozdiari; Can Alkan; Evan E Eichler; S Cenk Sahinalp
Journal:  Nucleic Acids Res       Date:  2014-05-08       Impact factor: 16.971

10.  SRBreak: A Read-Depth and Split-Read Framework to Identify Breakpoints of Different Events Inside Simple Copy-Number Variable Regions.

Authors:  Hoang T Nguyen; James Boocock; Tony R Merriman; Michael A Black
Journal:  Front Genet       Date:  2016-09-15       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.