Literature DB >> 35037200

CyVerse for Reproducible Research: RNA-Seq Analysis.

Jason Williams1.   

Abstract

Posing complex research questions poses complex reproducibility challenges. Datasets may need to be managed over long periods of time. Reliable and secure repositories are needed for data storage. Sharing big data requires advance planning and becomes complex when collaborators are spread across institutions and countries. Many complex analyses require the larger compute resources only provided by cloud and high-performance computing infrastructure. Finally at publication, funder and publisher requirements must be met for data availability and accessibility and computational reproducibility. For all of these reasons, cloud-based cyberinfrastructures are an important component for satisfying the needs of data-intensive research. Learning how to incorporate these technologies into your research skill set will allow you to work with data analysis challenges that are often beyond the resources of individual research institutions. One of the advantages of CyVerse is that there are many solutions for high-powered analyses that do not require knowledge of command line (i.e., Linux) computing. In this chapter we will highlight CyVerse capabilities by analyzing RNA-Seq data. The lessons learned will translate to doing RNA-Seq in other computing environments and will focus on how CyVerse infrastructure supports reproducibility goals (e.g., metadata management, containers), team science (e.g., data sharing features), and flexible computing environments (e.g., interactive computing, scaling).
© 2022. The Author(s).

Entities:  

Keywords:  Cloud computing; Containers; Cyberinfrastructure; Data life cycle; Kallisto; Metadata; RNA-Seq; Reproducible research; Workflow management

Mesh:

Year:  2022        PMID: 35037200     DOI: 10.1007/978-1-0716-2067-0_3

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  10 in total

Review 1.  RNA sequencing: the teenage years.

Authors:  Rory Stark; Marta Grzelak; James Hadfield
Journal:  Nat Rev Genet       Date:  2019-07-24       Impact factor: 53.242

2.  Differential analysis of RNA-seq incorporating quantification uncertainty.

Authors:  Harold Pimentel; Nicolas L Bray; Suzette Puente; Páll Melsted; Lior Pachter
Journal:  Nat Methods       Date:  2017-06-05       Impact factor: 28.547

3.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

4.  [Indications for piperacillin in pediatrics].

Authors:  T May; P Canton
Journal:  Presse Med       Date:  1986-12-20       Impact factor: 1.228

5.  The sequence read archive.

Authors:  Rasko Leinonen; Hideaki Sugawara; Martin Shumway
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

6.  The European Nucleotide Archive.

Authors:  Rasko Leinonen; Ruth Akhtar; Ewan Birney; Lawrence Bower; Ana Cerdeno-Tárraga; Ying Cheng; Iain Cleland; Nadeem Faruque; Neil Goodgame; Richard Gibson; Gemma Hoad; Mikyung Jang; Nima Pakseresht; Sheila Plaister; Rajesh Radhakrishnan; Kethi Reddy; Siamak Sobhany; Petra Ten Hoopen; Robert Vaughan; Vadim Zalunin; Guy Cochrane
Journal:  Nucleic Acids Res       Date:  2010-10-23       Impact factor: 16.971

7.  The iPlant Collaborative: Cyberinfrastructure for Plant Biology.

Authors:  Stephen A Goff; Matthew Vaughn; Sheldon McKay; Eric Lyons; Ann E Stapleton; Damian Gessler; Naim Matasci; Liya Wang; Matthew Hanlon; Andrew Lenards; Andy Muir; Nirav Merchant; Sonya Lowry; Stephen Mock; Matthew Helmke; Adam Kubach; Martha Narro; Nicole Hopkins; David Micklos; Uwe Hilgert; Michael Gonzales; Chris Jordan; Edwin Skidmore; Rion Dooley; John Cazes; Robert McLay; Zhenyuan Lu; Shiran Pasternak; Lars Koesterke; William H Piel; Ruth Grene; Christos Noutsos; Karla Gendler; Xin Feng; Chunlao Tang; Monica Lent; Seung-Jin Kim; Kristian Kvilekval; B S Manjunath; Val Tannen; Alexandros Stamatakis; Michael Sanderson; Stephen M Welch; Karen A Cranston; Pamela Soltis; Doug Soltis; Brian O'Meara; Cecile Ane; Tom Brutnell; Daniel J Kleibenstein; Jeffery W White; James Leebens-Mack; Michael J Donoghue; Edgar P Spalding; Todd J Vision; Christopher R Myers; David Lowenthal; Brian J Enquist; Brad Boyle; Ali Akoglu; Greg Andrews; Sudha Ram; Doreen Ware; Lincoln Stein; Dan Stanzione
Journal:  Front Plant Sci       Date:  2011-07-25       Impact factor: 5.753

8.  The iPlant Collaborative: Cyberinfrastructure for Enabling Data to Discovery for the Life Sciences.

Authors:  Nirav Merchant; Eric Lyons; Stephen Goff; Matthew Vaughn; Doreen Ware; David Micklos; Parker Antin
Journal:  PLoS Biol       Date:  2016-01-11       Impact factor: 8.029

9.  Direct comparison of Arabidopsis gene expression reveals different responses to melatonin versus auxin.

Authors:  Sajal F Zia; Oliver Berkowitz; Frank Bedon; James Whelan; Ashley E Franks; Kim M Plummer
Journal:  BMC Plant Biol       Date:  2019-12-19       Impact factor: 4.215

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

  10 in total
  1 in total

1.  Rapid and simple analysis of short and long sequencing reads using DuesselporeTM.

Authors:  Christian Vogeley; Thach Nguyen; Selina Woeste; Jean Krutmann; Thomas Haarmann-Stemmann; Andrea Rossi
Journal:  Front Genet       Date:  2022-08-11       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.