Literature DB >> 18428782

Using galaxy to perform large-scale interactive data analyses.

James Taylor1, Ian Schenck, Dan Blankenberg, Anton Nekrutenko.   

Abstract

While most experimental biologists know where to download genomic data, few have a concrete plan on how to analyze it. This situation can be corrected by: (1) providing unified portals serving genomic data and (2) building Web applications to allow flexible retrieval and on-the-fly analyses of the data. Powerful resources, such as the UCSC Genome Browser already address the first issue. The second issue, however, remains open. For example, how to find human protein-coding exons with the highest density of single nucleotide polymorphisms (SNPs) and extract orthologous sequences from all sequenced mammals? Indeed, one can access all relevant data from the UCSC Genome Browser. But once the data is downloaded how would one deal with millions of SNPs and gigabytes of alignments? Galaxy (http://g2.bx.psu.edu) is designed specifically for that purpose. It amplifies the strengths of existing resources (such as UCSC Genome Browser) by allowing the user to access and, most importantly, analyze data within a single interface in an unprecedented number of ways. Copyright 2007 by John Wiley & Sons, Inc.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18428782      PMCID: PMC3418382          DOI: 10.1002/0471250953.bi1005s19

Source DB:  PubMed          Journal:  Curr Protoc Bioinformatics        ISSN: 1934-3396


  26 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  The UCSC Table Browser data retrieval tool.

Authors:  Donna Karolchik; Angela S Hinrichs; Terrence S Furey; Krishna M Roskin; Charles W Sugnet; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Ensembl 2004.

Authors:  E Birney; D Andrews; P Bevan; M Caccamo; G Cameron; Y Chen; L Clarke; G Coates; T Cox; J Cuff; V Curwen; T Cutts; T Down; R Durbin; E Eyras; X M Fernandez-Suarez; P Gane; B Gibbins; J Gilbert; M Hammond; H Hotz; V Iyer; A Kahari; K Jekosch; A Kasprzyk; D Keefe; S Keenan; H Lehvaslaiho; G McVicker; C Melsopp; P Meidl; E Mongin; R Pettett; S Potter; G Proctor; M Rae; S Searle; G Slater; D Smedley; J Smith; W Spooner; A Stabenau; J Stalker; R Storey; A Ureta-Vidal; C Woodwark; M Clamp; T Hubbard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  The UCSC Genome Browser Database.

Authors:  D Karolchik; R Baertsch; M Diekhans; T S Furey; A Hinrichs; Y T Lu; K M Roskin; M Schwartz; C W Sugnet; D J Thomas; R J Weber; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

5.  Galaxy: a platform for interactive large-scale genome analysis.

Authors:  Belinda Giardine; Cathy Riemer; Ross C Hardison; Richard Burhans; Laura Elnitski; Prachi Shah; Yi Zhang; Daniel Blankenberg; Istvan Albert; James Taylor; Webb Miller; W James Kent; Anton Nekrutenko
Journal:  Genome Res       Date:  2005-09-16       Impact factor: 9.043

6.  A framework for collaborative analysis of ENCODE data: making large-scale analyses biologist-friendly.

Authors:  Daniel Blankenberg; James Taylor; Ian Schenck; Jianbin He; Yi Zhang; Matthew Ghent; Narayanan Veeraraghavan; Istvan Albert; Webb Miller; Kateryna D Makova; Ross C Hardison; Anton Nekrutenko
Journal:  Genome Res       Date:  2007-06       Impact factor: 9.043

7.  Structured Query Language (SQL) fundamentals.

Authors:  D Curtis Jamison
Journal:  Curr Protoc Bioinformatics       Date:  2003-02

8.  The UCSC Archaeal Genome Browser.

Authors:  Kevin L Schneider; Katherine S Pollard; Robert Baertsch; Andy Pohl; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.

Authors:  Kim D Pruitt; Tatiana Tatusova; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  54 in total

Review 1.  Next-generation genomics: an integrative approach.

Authors:  R David Hawkins; Gary C Hon; Bing Ren
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

2.  microRNA-34c is a novel target to treat dementias.

Authors:  Athanasios Zovoilis; Hope Y Agbemenyah; Roberto C Agis-Balboa; Roman M Stilling; Dieter Edbauer; Pooja Rao; Laurent Farinelli; Ivana Delalle; Andrea Schmitt; Peter Falkai; Sanaz Bahari-Javan; Susanne Burkhardt; Farahnaz Sananbenesi; Andre Fischer
Journal:  EMBO J       Date:  2011-09-23       Impact factor: 11.598

3.  Chromosomal position effects on AAV-mediated gene targeting.

Authors:  Anda M Cornea; David W Russell
Journal:  Nucleic Acids Res       Date:  2010-02-25       Impact factor: 16.971

Review 4.  Application of a systems approach to study developmental gene regulation.

Authors:  Joshua W K Ho
Journal:  Biophys Rev       Date:  2012-09-01

5.  The case for cloud computing in genome informatics.

Authors:  Lincoln D Stein
Journal:  Genome Biol       Date:  2010-05-05       Impact factor: 13.583

6.  Manipulation of FASTQ data with Galaxy.

Authors:  Daniel Blankenberg; Assaf Gordon; Gregory Von Kuster; Nathan Coraor; James Taylor; Anton Nekrutenko
Journal:  Bioinformatics       Date:  2010-06-18       Impact factor: 6.937

7.  Context dependent substitution biases vary within the human genome.

Authors:  P Andrew Nevarez; Christopher M DeBoever; Benjamin J Freeland; Marissa A Quitt; Eliot C Bush
Journal:  BMC Bioinformatics       Date:  2010-09-15       Impact factor: 3.169

8.  Orphan CpG islands identify numerous conserved promoters in the mammalian genome.

Authors:  Robert S Illingworth; Ulrike Gruenewald-Schneider; Shaun Webb; Alastair R W Kerr; Keith D James; Daniel J Turner; Colin Smith; David J Harrison; Robert Andrews; Adrian P Bird
Journal:  PLoS Genet       Date:  2010-09-23       Impact factor: 5.917

9.  SEQADAPT: an adaptable system for the tracking, storage and analysis of high throughput sequencing experiments.

Authors:  David B Burdick; Chris C Cavnor; Jeremy Handcock; Sarah Killcoyne; Jake Lin; Bruz Marzolf; Stephen A Ramsey; Hector Rovira; Ryan Bressler; Ilya Shmulevich; John Boyle
Journal:  BMC Bioinformatics       Date:  2010-07-14       Impact factor: 3.169

10.  Next-generation sequencing and de novo assembly, genome organization, and comparative genomic analyses of the genomes of two Helicobacter pylori isolates from duodenal ulcer patients in India.

Authors:  Narender Kumar; Asish K Mukhopadhyay; Rajashree Patra; Ronita De; Ramani Baddam; Sabiha Shaik; Jawed Alam; Suma Tiruvayipati; Niyaz Ahmed
Journal:  J Bacteriol       Date:  2012-11       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.