Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes.

Fereydoun Hormozdiari, Can Alkan, Evan E Eichler, S Cenk Sahinalp.

Abstract

Recent studies show that along with single nucleotide polymorphisms and small indels, larger structural variants among human individuals are common. The Human Genome Structural Variation Project aims to identify and classify deletions, insertions, and inversions (>5 Kbp) in a small number of normal individuals with a fosmid-based paired-end sequencing approach using traditional sequencing technologies. The realization of new ultra-high-throughput sequencing platforms now makes it feasible to detect the full spectrum of genomic variation among many individual genomes, including cancer patients and others suffering from diseases of genomic origin. Unfortunately, existing algorithms for identifying structural variation (SV) among individuals have not been designed to handle the short read lengths and the errors implied by the "next-gen" sequencing (NGS) technologies. In this paper, we give combinatorial formulations for the SV detection between a reference genome sequence and a next-gen-based, paired-end, whole genome shotgun-sequenced individual. We describe efficient algorithms for each of the formulations we give, which all turn out to be fast and quite reliable; they are also applicable to all next-gen sequencing methods (Illumina, 454 Life Sciences [Roche], ABI SOLiD, etc.) and traditional capillary sequencing technology. We apply our algorithms to identify SV among individual genomes very recently sequenced by Illumina technology.
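As a rough illustration of the paired-end signal that SV detection of this kind relies on, the sketch below labels a single mapped read pair by comparing its mapped span and strand orientation against the expected library geometry. It is not the combinatorial formulation or the algorithms of the paper, which the abstract does not spell out. The names `PairedEnd` and `classify`, the "+/-" orientation convention, and the span thresholds are all hypothetical choices made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PairedEnd:
    """One paired-end read mapped to the reference (hypothetical record layout)."""
    left_pos: int      # reference coordinate of the leftmost end
    right_pos: int     # reference coordinate of the rightmost end
    left_strand: str   # '+' or '-'
    right_strand: str  # '+' or '-'


def classify(pair: PairedEnd, min_span: int, max_span: int) -> str:
    """Label a pair as concordant or as evidence for a candidate SV type.

    min_span and max_span bound the mapped span expected from the library's
    insert-size distribution; both are assumptions supplied by the caller.
    """
    strands = (pair.left_strand, pair.right_strand)
    if strands != ("+", "-"):
        # Both ends on the same strand is the classic inversion signature;
        # any other unexpected orientation is left unresolved in this sketch.
        if pair.left_strand == pair.right_strand:
            return "inversion"
        return "other"
    span = pair.right_pos - pair.left_pos
    if span > max_span:
        # Ends map farther apart than expected: sequence missing in the individual.
        return "deletion"
    if span < min_span:
        # Ends map closer than expected: extra sequence in the individual.
        return "insertion"
    return "concordant"


if __name__ == "__main__":
    # Hypothetical library whose mapped span is expected to fall in 150..250 bp.
    for p in (
        PairedEnd(1000, 1200, "+", "-"),
        PairedEnd(1000, 6000, "+", "-"),
        PairedEnd(1000, 1050, "+", "-"),
        PairedEnd(1000, 1200, "+", "+"),
    ):
        print(p, classify(p, 150, 250))
```

A single discordant pair is weak evidence on its own, especially with short, error-prone reads that map to several locations. In practice, calls are made from clusters of pairs that support the same event.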

Year:  2009        PMID: 19447966      PMCID: PMC2704429          DOI: 10.1101/gr.088633.108

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  31 in total

Review 1.  Structural variation in the human genome.

Authors:  Lars Feuk; Andrew R Carson; Stephen W Scherer
Journal:  Nat Rev Genet       Date:  2006-02       Impact factor: 53.242

2.  Completing the map of human genetic variation.

Authors:  Evan E Eichler; Deborah A Nickerson; David Altshuler; Anne M Bowcock; Lisa D Brooks; Nigel P Carter; Deanna M Church; Adam Felsenfeld; Mark Guyer; Charles Lee; James R Lupski; James C Mullikin; Jonathan K Pritchard; Jonathan Sebat; Stephen T Sherry; Douglas Smith; David Valle; Robert H Waterston
Journal:  Nature       Date:  2007-05-10       Impact factor: 49.962

Review 3.  The impact of next-generation sequencing technology on genetics.

Authors:  Elaine R Mardis
Journal:  Trends Genet       Date:  2008-02-11       Impact factor: 11.639

Review 4.  Bioinformatics challenges of new sequencing technology.

Authors:  Mihai Pop; Steven L Salzberg
Journal:  Trends Genet       Date:  2008-02-11       Impact factor: 11.639

5.  Whole-genome sequencing and variant discovery in C. elegans.

Authors:  LaDeana W Hillier; Gabor T Marth; Aaron R Quinlan; David Dooling; Ginger Fewell; Derek Barnett; Paul Fox; Jarret I Glasscock; Matthew Hickenbotham; Weichun Huang; Vincent J Magrini; Ryan J Richt; Sacha N Sander; Donald A Stewart; Michael Stromberg; Eric F Tsung; Todd Wylie; Tim Schedl; Richard K Wilson; Elaine R Mardis
Journal:  Nat Methods       Date:  2008-01-20       Impact factor: 28.547

6.  Global variation in copy number in the human genome.

Authors:  Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2006-11-23       Impact factor: 49.962

Review 7.  Mutational and selective effects on copy-number variants in the human genome.

Authors:  Gregory M Cooper; Deborah A Nickerson; Evan E Eichler
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

8.  Paired-end mapping reveals extensive structural variation in the human genome.

Authors:  Jan O Korbel; Alexander Eckehart Urban; Jason P Affourtit; Brian Godwin; Fabian Grubert; Jan Fredrik Simons; Philip M Kim; Dean Palejev; Nicholas J Carriero; Lei Du; Bruce E Taillon; Zhoutao Chen; Andrea Tanzer; A C Eugenia Saunders; Jianxiang Chi; Fengtang Yang; Nigel P Carter; Matthew E Hurles; Sherman M Weissman; Timothy T Harkins; Mark B Gerstein; Michael Egholm; Michael Snyder
Journal:  Science       Date:  2007-09-27       Impact factor: 47.728

9.  Evaluation of paired-end sequencing strategies for detection of genome rearrangements in cancer.

Authors:  Ali Bashir; Stanislav Volik; Colin Collins; Vineet Bafna; Benjamin J Raphael
Journal:  PLoS Comput Biol       Date:  2008-04-25       Impact factor: 4.475

10.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

  146 in total

1.  Simultaneous structural variation discovery among multiple paired-end sequenced genomes.

Authors:  Fereydoun Hormozdiari; Iman Hajirasouliha; Andrew McPherson; Evan E Eichler; S Cenk Sahinalp
Journal:  Genome Res       Date:  2011-11-02       Impact factor: 9.043

2.  Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.

Authors:  Ruibin Xi; Angela G Hadjipanayis; Lovelace J Luquette; Tae-Min Kim; Eunjung Lee; Jianhua Zhang; Mark D Johnson; Donna M Muzny; David A Wheeler; Richard A Gibbs; Raju Kucherlapati; Peter J Park
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-07       Impact factor: 11.205

Review 3.  Uncovering the roles of rare variants in common disease through whole-genome sequencing.

Authors:  Elizabeth T Cirulli; David B Goldstein
Journal:  Nat Rev Genet       Date:  2010-06       Impact factor: 53.242

4.  Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome.

Authors:  Aaron R Quinlan; Royden A Clark; Svetlana Sokolova; Mitchell L Leibowitz; Yujun Zhang; Matthew E Hurles; Joshua C Mell; Ira M Hall
Journal:  Genome Res       Date:  2010-03-22       Impact factor: 9.043

5.  Savant: genome browser for high-throughput sequencing data.

Authors:  Marc Fiume; Vanessa Williams; Andrew Brook; Michael Brudno
Journal:  Bioinformatics       Date:  2010-06-20       Impact factor: 6.937

Review 6.  Annotating non-coding regions of the genome.

Authors:  Roger P Alexander; Gang Fang; Joel Rozowsky; Michael Snyder; Mark B Gerstein
Journal:  Nat Rev Genet       Date:  2010-07-13       Impact factor: 53.242

Review 7.  Detecting structural variations in the human genome using next generation sequencing.

Authors:  Ruibin Xi; Tae-Min Kim; Peter J Park
Journal:  Brief Funct Genomics       Date:  2011-01-06       Impact factor: 4.241

8.  Reconstructing cancer genomes from paired-end sequencing data.

Authors:  Layla Oesper; Anna Ritz; Sarah J Aerni; Ryan Drebin; Benjamin J Raphael
Journal:  BMC Bioinformatics       Date:  2012-04-19       Impact factor: 3.169

9.  Using Genome Query Language to uncover genetic variation.

Authors:  Christos Kozanitis; Andrew Heiberg; George Varghese; Vineet Bafna
Journal:  Bioinformatics       Date:  2013-06-10       Impact factor: 6.937

10.  MATE-CLEVER: Mendelian-inheritance-aware discovery and genotyping of midsize and long indels.

Authors:  Tobias Marschall; Iman Hajirasouliha; Alexander Schönhuth
Journal:  Bioinformatics       Date:  2013-09-25       Impact factor: 6.937
