Literature DB >> 20385726

Detection and characterization of novel sequence insertions using paired-end next-generation sequencing.

Iman Hajirasouliha1, Fereydoun Hormozdiari, Can Alkan, Jeffrey M Kidd, Inanc Birol, Evan E Eichler, S Cenk Sahinalp.   

Abstract

MOTIVATION: In the past few years, human genome structural variation discovery has enjoyed increased attention from the genomics research community. Many studies were published to characterize short insertions, deletions, duplications and inversions, and associate copy number variants (CNVs) with disease. Detection of new sequence insertions requires sequence data, however, the 'detectable' sequence length with read-pair analysis is limited by the insert size. Thus, longer sequence insertions that contribute to our genetic makeup are not extensively researched.
RESULTS: We present NovelSeq: a computational framework to discover the content and location of long novel sequence insertions using paired-end sequencing data generated by the next-generation sequencing platforms. Our framework can be built as part of a general sequence analysis pipeline to discover multiple types of genetic variation (SNPs, structural variation, etc.), thus it requires significantly less-computational resources than de novo sequence assembly. We apply our methods to detect novel sequence insertions in the genome of an anonymous donor and validate our results by comparing with the insertions discovered in the same genome using various sources of sequence data. AVAILABILITY: The implementation of the NovelSeq pipeline is available at http://compbio.cs.sfu.ca/strvar.htm CONTACT: eee@gs.washington.edu; cenk@cs.sfu.ca

Entities:  

Mesh:

Year:  2010        PMID: 20385726      PMCID: PMC2865866          DOI: 10.1093/bioinformatics/btq152

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

2.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

3.  MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions.

Authors:  Seunghak Lee; Fereydoun Hormozdiari; Can Alkan; Michael Brudno
Journal:  Nat Methods       Date:  2009-05-31       Impact factor: 28.547

4.  ABySS: a parallel assembler for short read sequence data.

Authors:  Jared T Simpson; Kim Wong; Shaun D Jackman; Jacqueline E Schein; Steven J M Jones; Inanç Birol
Journal:  Genome Res       Date:  2009-02-27       Impact factor: 9.043

Review 5.  Computational methods for discovering structural variation with next-generation sequencing.

Authors:  Paul Medvedev; Monica Stanciu; Michael Brudno
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

6.  Base-calling of automated sequencer traces using phred. II. Error probabilities.

Authors:  B Ewing; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

7.  Characterization of missing human genome sequences and copy-number polymorphic insertions.

Authors:  Jeffrey M Kidd; Nick Sampas; Francesca Antonacci; Tina Graves; Robert Fulton; Hillary S Hayden; Can Alkan; Maika Malig; Mario Ventura; Giuliana Giannuzzi; Joelle Kallicki; Paige Anderson; Anya Tsalenko; N Alice Yamada; Peter Tsang; Rajinder Kaul; Richard K Wilson; Laurakay Bruhn; Evan E Eichler
Journal:  Nat Methods       Date:  2010-05       Impact factor: 28.547

8.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

9.  Personalized copy number and segmental duplication maps using next-generation sequencing.

Authors:  Can Alkan; Jeffrey M Kidd; Tomas Marques-Bonet; Gozde Aksay; Francesca Antonacci; Fereydoun Hormozdiari; Jacob O Kitzman; Carl Baker; Maika Malig; Onur Mutlu; S Cenk Sahinalp; Richard A Gibbs; Evan E Eichler
Journal:  Nat Genet       Date:  2009-08-30       Impact factor: 38.330

10.  Mapping and sequencing of structural variation from eight human genomes.

Authors:  Jeffrey M Kidd; Gregory M Cooper; William F Donahue; Hillary S Hayden; Nick Sampas; Tina Graves; Nancy Hansen; Brian Teague; Can Alkan; Francesca Antonacci; Eric Haugen; Troy Zerr; N Alice Yamada; Peter Tsang; Tera L Newman; Eray Tüzün; Ze Cheng; Heather M Ebling; Nadeem Tusneem; Robert David; Will Gillett; Karen A Phelps; Molly Weaver; David Saranga; Adrianne Brand; Wei Tao; Erik Gustafson; Kevin McKernan; Lin Chen; Maika Malig; Joshua D Smith; Joshua M Korn; Steven A McCarroll; David A Altshuler; Daniel A Peiffer; Michael Dorschner; John Stamatoyannopoulos; David Schwartz; Deborah A Nickerson; James C Mullikin; Richard K Wilson; Laurakay Bruhn; Maynard V Olson; Rajinder Kaul; Douglas R Smith; Evan E Eichler
Journal:  Nature       Date:  2008-05-01       Impact factor: 49.962

View more
  55 in total

1.  Simultaneous structural variation discovery among multiple paired-end sequenced genomes.

Authors:  Fereydoun Hormozdiari; Iman Hajirasouliha; Andrew McPherson; Evan E Eichler; S Cenk Sahinalp
Journal:  Genome Res       Date:  2011-11-02       Impact factor: 9.043

Review 2.  Detecting structural variations in the human genome using next generation sequencing.

Authors:  Ruibin Xi; Tae-Min Kim; Peter J Park
Journal:  Brief Funct Genomics       Date:  2011-01-06       Impact factor: 4.241

3.  MATE-CLEVER: Mendelian-inheritance-aware discovery and genotyping of midsize and long indels.

Authors:  Tobias Marschall; Iman Hajirasouliha; Alexander Schönhuth
Journal:  Bioinformatics       Date:  2013-09-25       Impact factor: 6.937

Review 4.  Direct mutation analysis by high-throughput sequencing: from germline to low-abundant, somatic variants.

Authors:  Michael Gundry; Jan Vijg
Journal:  Mutat Res       Date:  2011-10-12       Impact factor: 2.433

5.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

Authors:  Min Zhao; Qingguo Wang; Quan Wang; Peilin Jia; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2013-09-13       Impact factor: 3.169

Review 6.  Sequence assembly demystified.

Authors:  Niranjan Nagarajan; Mihai Pop
Journal:  Nat Rev Genet       Date:  2013-01-29       Impact factor: 53.242

Review 7.  Copy number variation in the cattle genome.

Authors:  George E Liu; Derek M Bickhart
Journal:  Funct Integr Genomics       Date:  2012-07-13       Impact factor: 3.410

8.  Detection of genomic variations and DNA polymorphisms and impact on analysis of meiotic recombination and genetic mapping.

Authors:  Ji Qi; Yamao Chen; Gregory P Copenhaver; Hong Ma
Journal:  Proc Natl Acad Sci U S A       Date:  2014-06-23       Impact factor: 11.205

Review 9.  The overdue promise of short tandem repeat variation for heritability.

Authors:  Maximilian O Press; Keisha D Carlson; Christine Queitsch
Journal:  Trends Genet       Date:  2014-08-30       Impact factor: 11.639

Review 10.  Sequencing XMET genes to promote genotype-guided risk assessment and precision medicine.

Authors:  Yaqiong Jin; Geng Chen; Wenming Xiao; Huixiao Hong; Joshua Xu; Yongli Guo; Wenzhong Xiao; Tieliu Shi; Leming Shi; Weida Tong; Baitang Ning
Journal:  Sci China Life Sci       Date:  2019-05-20       Impact factor: 6.038

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.