Literature DB >> 21929371

Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

Song Gao1, Wing-Kin Sung, Niranjan Nagarajan.   

Abstract

Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/ ).

Mesh:

Year:  2011        PMID: 21929371      PMCID: PMC3216105          DOI: 10.1089/cmb.2011.0170

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  20 in total

1.  Comparative genome assembly.

Authors:  Mihai Pop; Adam Phillippy; Arthur L Delcher; Steven L Salzberg
Journal:  Brief Bioinform       Date:  2004-09       Impact factor: 11.622

2.  De novo fragment assembly with short mate-paired reads: Does the read length matter?

Authors:  Mark J Chaisson; Dumitru Brinza; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-12-03       Impact factor: 9.043

3.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

4.  A genomic survey of positive selection in Burkholderia pseudomallei provides insights into the evolution of accidental virulence.

Authors:  Tannistha Nandi; Catherine Ong; Arvind Pratap Singh; Justin Boddey; Timothy Atkins; Mitali Sarkar-Tyson; Angela E Essex-Lopresti; Hui Hoon Chua; Talima Pearson; Jason F Kreisberg; Christina Nilsson; Pramila Ariyaratne; Catherine Ronning; Liliana Losada; Yijun Ruan; Wing-Kin Sung; Donald Woods; Richard W Titball; Ifor Beacham; Ian Peak; Paul Keim; William C Nierman; Patrick Tan
Journal:  PLoS Pathog       Date:  2010-04-01       Impact factor: 6.823

5.  SOPRA: Scaffolding algorithm for paired reads via statistical optimization.

Authors:  Adel Dayarian; Todd P Michael; Anirvan M Sengupta
Journal:  BMC Bioinformatics       Date:  2010-06-24       Impact factor: 3.169

6.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

7.  The phusion assembler.

Authors:  James C Mullikin; Zemin Ning
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

8.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

9.  Multiplex sequencing of paired-end ditags (MS-PET): a strategy for the ultra-high-throughput analysis of transcriptomes and genomes.

Authors:  Patrick Ng; Jack J S Tan; Hong Sain Ooi; Yen Ling Lee; Kuo Ping Chiu; Melissa J Fullwood; Kandhadayar G Srinivasan; Clotilde Perbost; Lei Du; Wing-Kin Sung; Chia-Lin Wei; Yijun Ruan
Journal:  Nucleic Acids Res       Date:  2006-07-13       Impact factor: 16.971

10.  Scaffolding and validation of bacterial genome assemblies using optical restriction maps.

Authors:  Niranjan Nagarajan; Timothy D Read; Mihai Pop
Journal:  Bioinformatics       Date:  2008-03-20       Impact factor: 6.937

View more
  95 in total

1.  Faustovirus, an asfarvirus-related new lineage of giant viruses infecting amoebae.

Authors:  Dorine Gaëlle Reteno; Samia Benamar; Jacques Bou Khalil; Julien Andreani; Nicholas Armstrong; Thomas Klose; Michael Rossmann; Philippe Colson; Didier Raoult; Bernard La Scola
Journal:  J Virol       Date:  2015-07       Impact factor: 5.103

2.  Bambus 2: scaffolding metagenomes.

Authors:  Sergey Koren; Todd J Treangen; Mihai Pop
Journal:  Bioinformatics       Date:  2011-09-16       Impact factor: 6.937

3.  Genome sequence of Afipia birgiae, a rare bacterium associated with Amoebae.

Authors:  Isabelle Pagnier; Olivier Croce; Catherine Robert; Didier Raoult; Bernard La Scola
Journal:  J Bacteriol       Date:  2012-12       Impact factor: 3.490

Review 4.  Sequence assembly demystified.

Authors:  Niranjan Nagarajan; Mihai Pop
Journal:  Nat Rev Genet       Date:  2013-01-29       Impact factor: 53.242

5.  The combination of direct and paired link graphs can boost repetitive genome assembly.

Authors:  Wenyu Shi; Peifeng Ji; Fangqing Zhao
Journal:  Nucleic Acids Res       Date:  2017-04-07       Impact factor: 16.971

6.  Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

Authors:  Nathan D Olson; Todd J Treangen; Christopher M Hill; Victoria Cepeda-Espinoza; Jay Ghurye; Sergey Koren; Mihai Pop
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

7.  Repeat-aware evaluation of scaffolding tools.

Authors:  Igor Mandric; Sergey Knyazev; Alex Zelikovsky
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

8.  Genome sequence of Legionella tunisiensis strain LegM(T), a new Legionella species isolated from hypersaline lake water.

Authors:  Isabelle Pagnier; Mondher Boughalmi; Olivier Croce; Catherine Robert; Didier Raoult; Bernard La Scola
Journal:  J Bacteriol       Date:  2012-11       Impact factor: 3.490

9.  Genome sequence of Bartonella rattimassiliensis, a bacterium isolated from European Rattus norvegicus.

Authors:  Vicky Merhej; Olivier Croce; Catherine Robert; Jean-Marc Rolain; Didier Raoult
Journal:  J Bacteriol       Date:  2012-12       Impact factor: 3.490

10.  Exact approaches for scaffolding.

Authors:  Mathias Weller; Annie Chateau; Rodolphe Giroudeau
Journal:  BMC Bioinformatics       Date:  2015-10-02       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.