Literature DB >> 25294974

Enabling large-scale next-generation sequence assembly with Blacklight.

M Brian Couger1, Lenore Pipes2, Fabio Squina3, Rolf Prade1, Adam Siepel2, Robert Palermo4, Michael G Katze4, Christopher E Mason5, Philip D Blood6.   

Abstract

A variety of extremely challenging biological sequence analyses were conducted on the XSEDE large shared memory resource Blacklight, using current bioinformatics tools and encompassing a wide range of scientific applications. These include genomic sequence assembly, very large metagenomic sequence assembly, transcriptome assembly, and sequencing error correction. The data sets used in these analyses included uncategorized fungal species, reference microbial data, very large soil and human gut microbiome sequence data, and primate transcriptomes, composed of both short-read and long-read sequence data. A new parallel command execution program was developed on the Blacklight resource to handle some of these analyses. These results, initially reported previously at XSEDE13 and expanded here, represent significant advances for their respective scientific communities. The breadth and depth of the results achieved demonstrate the ease of use, versatility, and unique capabilities of the Blacklight XSEDE resource for scientific analysis of genomic and transcriptomic sequence data, and the power of these resources, together with XSEDE support, in meeting the most challenging scientific problems.

Entities:  

Keywords:  NGS; RNA-seq; bioinformatics; data-intensive computing; de novo assembly; genome; genomics; high-performance computing; large shared memory computing; metagenome; primates; transcriptome

Year:  2014        PMID: 25294974      PMCID: PMC4185199          DOI: 10.1002/cpe.3231

Source DB:  PubMed          Journal:  Concurr Comput        ISSN: 1532-0626            Impact factor:   1.536


  18 in total

1.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

Review 2.  Next-generation sequencing data interpretation: enhancing reproducibility and accessibility.

Authors:  Anton Nekrutenko; James Taylor
Journal:  Nat Rev Genet       Date:  2012-09       Impact factor: 53.242

3.  Lignocellulose: A chewy problem.

Authors:  Katharine Sanderson
Journal:  Nature       Date:  2011-06-22       Impact factor: 49.962

4.  Genome sequencing in clinical microbiology.

Authors:  Jacqueline Z-M Chan; Mark J Pallen; Beryl Oppenheim; Chrystala Constantinidou
Journal:  Nat Biotechnol       Date:  2012-11       Impact factor: 54.908

5.  The case for cloud computing in genome informatics.

Authors:  Lincoln D Stein
Journal:  Genome Biol       Date:  2010-05-05       Impact factor: 13.583

6.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

7.  Remarkably AT-rich genomic DNA from the anaerobic fungus Neocallimastix.

Authors:  A G Brownlee
Journal:  Nucleic Acids Res       Date:  1989-02-25       Impact factor: 16.971

8.  dbCAN: a web resource for automated carbohydrate-active enzyme annotation.

Authors:  Yanbin Yin; Xizeng Mao; Jincai Yang; Xin Chen; Fenglou Mao; Ying Xu
Journal:  Nucleic Acids Res       Date:  2012-05-29       Impact factor: 16.971

9.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

10.  The non-human primate reference transcriptome resource (NHPRTR) for comparative functional genomics.

Authors:  Lenore Pipes; Sheng Li; Marjan Bozinoski; Robert Palermo; Xinxia Peng; Phillip Blood; Sara Kelly; Jeffrey M Weiss; Jean Thierry-Mieg; Danielle Thierry-Mieg; Paul Zumbo; Ronghua Chen; Gary P Schroth; Christopher E Mason; Michael G Katze
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

View more
  1 in total

1.  Computational Strategies for Scalable Genomics Analysis.

Authors:  Lizhen Shi; Zhong Wang
Journal:  Genes (Basel)       Date:  2019-12-06       Impact factor: 4.096

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.