Literature DB >> 20805290

Detecting copy number variation with mated short reads.

Paul Medvedev1, Marc Fiume, Misko Dzamba, Tim Smith, Michael Brudno.   

Abstract

The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can also be used for the reliable identification of large copy-variable regions. Such methods, however, are hindered by sequencing biases that lead certain regions of the genome to be over- or undersampled, lowering their resolution and ability to accurately identify the exact breakpoints of the variants. In this work, we develop a method for CNV detection that supplements the depth-of-coverage with paired-end mapping information, where mate pairs mapping discordantly to the reference serve to indicate the presence of variation. Our algorithm, called CNVer, combines this information within a unified computational framework called the donor graph, allowing us to better mitigate the sequencing biases that cause uneven local coverage and accurately predict CNVs. We use CNVer to detect 4879 CNVs in the recently described genome of a Yoruban individual. Most of the calls (77%) coincide with previously known variants within the Database of Genomic Variants, while 81% of deletion copy number variants previously known for this individual coincide with one of our loss calls. Furthermore, we demonstrate that CNVer can reconstruct the absolute copy counts of segments of the donor genome and evaluate the feasibility of using CNVer with low coverage datasets.

Entities:  

Mesh:

Year:  2010        PMID: 20805290      PMCID: PMC2963824          DOI: 10.1101/gr.106344.110

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  34 in total

1.  End-sequence profiling: sequence-based analysis of aberrant genomes.

Authors:  Stanislav Volik; Shaying Zhao; Koei Chin; John H Brebner; David R Herndon; Quanzhou Tao; David Kowbel; Guiqing Huang; Anna Lapuk; Wen-Lin Kuo; Gregg Magrane; Pieter De Jong; Joe W Gray; Colin Collins
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-04       Impact factor: 11.205

2.  Detection of large-scale variation in the human genome.

Authors:  A John Iafrate; Lars Feuk; Miguel N Rivera; Marc L Listewnik; Patricia K Donahoe; Ying Qi; Stephen W Scherer; Charles Lee
Journal:  Nat Genet       Date:  2004-08-01       Impact factor: 38.330

3.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Paul A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

4.  Savant: genome browser for high-throughput sequencing data.

Authors:  Marc Fiume; Vanessa Williams; Andrew Brook; Michael Brudno
Journal:  Bioinformatics       Date:  2010-06-20       Impact factor: 6.937

5.  A novel method for multiple alignment of sequences with repeated and shuffled elements.

Authors:  Benjamin Raphael; Degui Zhi; Haixu Tang; Pavel Pevzner
Journal:  Genome Res       Date:  2004-11       Impact factor: 9.043

6.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

Review 7.  Methods and strategies for analyzing copy number variation using DNA microarrays.

Authors:  Nigel P Carter
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

8.  Global variation in copy number in the human genome.

Authors:  Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2006-11-23       Impact factor: 49.962

9.  Large-scale copy number polymorphism in the human genome.

Authors:  Jonathan Sebat; B Lakshmi; Jennifer Troge; Joan Alexander; Janet Young; Pär Lundin; Susanne Månér; Hillary Massa; Megan Walker; Maoyen Chi; Nicholas Navin; Robert Lucito; John Healy; James Hicks; Kenny Ye; Andrew Reiner; T Conrad Gilliam; Barbara Trask; Nick Patterson; Anders Zetterberg; Michael Wigler
Journal:  Science       Date:  2004-07-23       Impact factor: 47.728

10.  Reconstructing tumor genome architectures.

Authors:  Benjamin J Raphael; Stanislav Volik; Colin Collins; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

View more
  88 in total

1.  Reconstructing cancer genomes from paired-end sequencing data.

Authors:  Layla Oesper; Anna Ritz; Sarah J Aerni; Ryan Drebin; Benjamin J Raphael
Journal:  BMC Bioinformatics       Date:  2012-04-19       Impact factor: 3.169

2.  MATE-CLEVER: Mendelian-inheritance-aware discovery and genotyping of midsize and long indels.

Authors:  Tobias Marschall; Iman Hajirasouliha; Alexander Schönhuth
Journal:  Bioinformatics       Date:  2013-09-25       Impact factor: 6.937

3.  Toward Recovering Allele-specific Cancer Genome Graphs.

Authors:  Ashok Rajaraman; Jian Ma
Journal:  J Comput Biol       Date:  2018-04-16       Impact factor: 1.479

Review 4.  Direct mutation analysis by high-throughput sequencing: from germline to low-abundant, somatic variants.

Authors:  Michael Gundry; Jan Vijg
Journal:  Mutat Res       Date:  2011-10-12       Impact factor: 2.433

5.  Efficient algorithms for tandem copy number variation reconstruction in repeat-rich regions.

Authors:  Dan He; Farhad Hormozdiari; Nicholas Furlotte; Eleazar Eskin
Journal:  Bioinformatics       Date:  2011-04-19       Impact factor: 6.937

6.  CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.

Authors:  Alexej Abyzov; Alexander E Urban; Michael Snyder; Mark Gerstein
Journal:  Genome Res       Date:  2011-02-07       Impact factor: 9.043

7.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

Authors:  Min Zhao; Qingguo Wang; Quan Wang; Peilin Jia; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2013-09-13       Impact factor: 3.169

8.  Copy number variant analysis using genome-wide mate-pair sequencing.

Authors:  James B Smadbeck; Sarah H Johnson; Stephanie A Smoley; Athanasios Gaitatzes; Travis M Drucker; Roman M Zenka; Farhad Kosari; Stephen J Murphy; Nicole Hoppman; Umut Aypar; William R Sukov; Robert B Jenkins; Hutton M Kearney; Andrew L Feldman; George Vasmatzis
Journal:  Genes Chromosomes Cancer       Date:  2018-07-30       Impact factor: 5.006

9.  Common copy number variation detection from multiple sequenced samples.

Authors:  Junbo Duan; Hong-Wen Deng; Yu-Ping Wang
Journal:  IEEE Trans Biomed Eng       Date:  2014-03       Impact factor: 4.538

10.  Detecting highly differentiated copy-number variants from pooled population sequencing.

Authors:  Daniel R Schrider; David J Begun; Matthew W Hahn
Journal:  Pac Symp Biocomput       Date:  2013
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.