Literature DB >> 17568012

A framework for collaborative analysis of ENCODE data: making large-scale analyses biologist-friendly.

Daniel Blankenberg1, James Taylor, Ian Schenck, Jianbin He, Yi Zhang, Matthew Ghent, Narayanan Veeraraghavan, Istvan Albert, Webb Miller, Kateryna D Makova, Ross C Hardison, Anton Nekrutenko.   

Abstract

The standardization and sharing of data and tools are the biggest challenges of large collaborative projects such as the Encyclopedia of DNA Elements (ENCODE). Here we describe a compact Web application, Galaxy2(ENCODE), that effectively addresses these issues. It provides an intuitive interface for the deposition and access of data, and features a vast number of analysis tools including operations on genomic intervals, utilities for manipulation of multiple sequence alignments, and molecular evolution algorithms. By providing a direct link between data and analysis tools, Galaxy2(ENCODE) allows addressing biological questions that are beyond the reach of existing software. We use Galaxy2(ENCODE) to show that the ENCODE regions contain >2000 unannotated transcripts under strong purifying selection that are likely functional. We also show that the ENCODE regions are representative of the entire genome by estimating the rate of nucleotide substitution and comparing it to published data. Although each of these analyses is complex, none takes more than 15 min from beginning to end. Finally, we demonstrate how new tools can be added to Galaxy2(ENCODE) with almost no effort. Every section of the manuscript is supplemented with QuickTime screencasts. Galaxy2(ENCODE) and the screencasts can be accessed at http://g2.bx.psu.edu.

Mesh:

Year:  2007        PMID: 17568012      PMCID: PMC1891355          DOI: 10.1101/gr.5578007

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  16 in total

1.  Combining phylogenetic and hidden Markov models in biosequence analysis.

Authors:  Adam Siepel; David Haussler
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

2.  The ENCODE (ENCyclopedia Of DNA Elements) Project.

Authors: 
Journal:  Science       Date:  2004-10-22       Impact factor: 47.728

3.  HyPhy: hypothesis testing using phylogenies.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost; Spencer V Muse
Journal:  Bioinformatics       Date:  2004-10-27       Impact factor: 6.937

4.  Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution.

Authors:  Jill Cheng; Philipp Kapranov; Jorg Drenkow; Sujit Dike; Shane Brubaker; Sandeep Patel; Jeffrey Long; David Stern; Hari Tammana; Gregg Helt; Victor Sementchenko; Antonio Piccolboni; Stefan Bekiranov; Dione K Bailey; Madhavan Ganesh; Srinka Ghosh; Ian Bell; Daniela S Gerhard; Thomas R Gingeras
Journal:  Science       Date:  2005-03-24       Impact factor: 47.728

5.  The general stochastic model of nucleotide substitution.

Authors:  F Rodríguez; J L Oliver; A Marín; J R Medina
Journal:  J Theor Biol       Date:  1990-02-22       Impact factor: 2.691

6.  Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation.

Authors:  Z Yang; N Goldman; A Friday
Journal:  Mol Biol Evol       Date:  1994-03       Impact factor: 16.240

7.  Conservation and functional significance of gene topology in the genome of Caenorhabditis elegans.

Authors:  Nansheng Chen; Lincoln D Stein
Journal:  Genome Res       Date:  2006-04-10       Impact factor: 9.043

8.  Male-biased mutation rate and divergence in autosomal, z-linked and w-linked introns of chicken and Turkey.

Authors:  Erik Axelsson; Nick G C Smith; Hannah Sundström; Sofia Berlin; Hans Ellegren
Journal:  Mol Biol Evol       Date:  2004-05-12       Impact factor: 16.240

9.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution.

Authors:  Richard A Gibbs; George M Weinstock; Michael L Metzker; Donna M Muzny; Erica J Sodergren; Steven Scherer; Graham Scott; David Steffen; Kim C Worley; Paula E Burch; Geoffrey Okwuonu; Sandra Hines; Lora Lewis; Christine DeRamo; Oliver Delgado; Shannon Dugan-Rocha; George Miner; Margaret Morgan; Alicia Hawes; Rachel Gill; Robert A Holt; Mark D Adams; Peter G Amanatides; Holly Baden-Tillson; Mary Barnstead; Soo Chin; Cheryl A Evans; Steve Ferriera; Carl Fosler; Anna Glodek; Zhiping Gu; Don Jennings; Cheryl L Kraft; Trixie Nguyen; Cynthia M Pfannkoch; Cynthia Sitter; Granger G Sutton; J Craig Venter; Trevor Woodage; Douglas Smith; Hong-Mei Lee; Erik Gustafson; Patrick Cahill; Arnold Kana; Lynn Doucette-Stamm; Keith Weinstock; Kim Fechtel; Robert B Weiss; Diane M Dunn; Eric D Green; Robert W Blakesley; Gerard G Bouffard; Pieter J De Jong; Kazutoyo Osoegawa; Baoli Zhu; Marco Marra; Jacqueline Schein; Ian Bosdet; Chris Fjell; Steven Jones; Martin Krzywinski; Carrie Mathewson; Asim Siddiqui; Natasja Wye; John McPherson; Shaying Zhao; Claire M Fraser; Jyoti Shetty; Sofiya Shatsman; Keita Geer; Yixin Chen; Sofyia Abramzon; William C Nierman; Paul H Havlak; Rui Chen; K James Durbin; Amy Egan; Yanru Ren; Xing-Zhi Song; Bingshan Li; Yue Liu; Xiang Qin; Simon Cawley; Kim C Worley; A J Cooney; Lisa M D'Souza; Kirt Martin; Jia Qian Wu; Manuel L Gonzalez-Garay; Andrew R Jackson; Kenneth J Kalafus; Michael P McLeod; Aleksandar Milosavljevic; Davinder Virk; Andrei Volkov; David A Wheeler; Zhengdong Zhang; Jeffrey A Bailey; Evan E Eichler; Eray Tuzun; Ewan Birney; Emmanuel Mongin; Abel Ureta-Vidal; Cara Woodwark; Evgeny Zdobnov; Peer Bork; Mikita Suyama; David Torrents; Marina Alexandersson; Barbara J Trask; Janet M Young; Hui Huang; Huajun Wang; Heming Xing; Sue Daniels; Darryl Gietzen; Jeanette Schmidt; Kristian Stevens; Ursula Vitt; Jim Wingrove; Francisco Camara; M Mar Albà; Josep F Abril; Roderic Guigo; Arian Smit; Inna Dubchak; Edward M Rubin; Olivier Couronne; Alexander Poliakov; Norbert Hübner; Detlev Ganten; Claudia Goesele; Oliver Hummel; Thomas Kreitler; Young-Ae Lee; Jan Monti; Herbert Schulz; Heike Zimdahl; Heinz Himmelbauer; Hans Lehrach; Howard J Jacob; Susan Bromberg; Jo Gullings-Handley; Michael I Jensen-Seaman; Anne E Kwitek; Jozef Lazar; Dean Pasko; Peter J Tonellato; Simon Twigger; Chris P Ponting; Jose M Duarte; Stephen Rice; Leo Goodstadt; Scott A Beatson; Richard D Emes; Eitan E Winter; Caleb Webber; Petra Brandt; Gerald Nyakatura; Margaret Adetobi; Francesca Chiaromonte; Laura Elnitski; Pallavi Eswara; Ross C Hardison; Minmei Hou; Diana Kolbe; Kateryna Makova; Webb Miller; Anton Nekrutenko; Cathy Riemer; Scott Schwartz; James Taylor; Shan Yang; Yi Zhang; Klaus Lindpaintner; T Dan Andrews; Mario Caccamo; Michele Clamp; Laura Clarke; Valerie Curwen; Richard Durbin; Eduardo Eyras; Stephen M Searle; Gregory M Cooper; Serafim Batzoglou; Michael Brudno; Arend Sidow; Eric A Stone; J Craig Venter; Bret A Payseur; Guillaume Bourque; Carlos López-Otín; Xose S Puente; Kushal Chakrabarti; Sourav Chatterji; Colin Dewey; Lior Pachter; Nicolas Bray; Von Bing Yap; Anat Caspi; Glenn Tesler; Pavel A Pevzner; David Haussler; Krishna M Roskin; Robert Baertsch; Hiram Clawson; Terrence S Furey; Angie S Hinrichs; Donna Karolchik; William J Kent; Kate R Rosenbloom; Heather Trumbower; Matt Weirauch; David N Cooper; Peter D Stenson; Bin Ma; Michael Brent; Manimozhiyan Arumugam; David Shteynberg; Richard R Copley; Martin S Taylor; Harold Riethman; Uma Mudunuri; Jane Peterson; Mark Guyer; Adam Felsenfeld; Susan Old; Stephen Mockrin; Francis Collins
Journal:  Nature       Date:  2004-04-01       Impact factor: 49.962

10.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution.

Authors:  Ross C Hardison; Krishna M Roskin; Shan Yang; Mark Diekhans; W James Kent; Ryan Weber; Laura Elnitski; Jia Li; Michael O'Connor; Diana Kolbe; Scott Schwartz; Terrence S Furey; Simon Whelan; Nick Goldman; Arian Smit; Webb Miller; Francesca Chiaromonte; David Haussler
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

View more
  78 in total

1.  Temporal uncoupling of the DNA methylome and transcriptional repression during embryogenesis.

Authors:  Ozren Bogdanovic; Steven W Long; Simon J van Heeringen; Arie B Brinkman; Jose Luis Gómez-Skarmeta; Hendrik G Stunnenberg; Peter L Jones; Gert Jan C Veenstra
Journal:  Genome Res       Date:  2011-06-02       Impact factor: 9.043

Review 2.  Next-generation genomics: an integrative approach.

Authors:  R David Hawkins; Gary C Hon; Bing Ren
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

3.  CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing.

Authors:  Guoguang Zhao; Dechao Bu; Changning Liu; Jing Li; Jian Yang; Zhiyong Liu; Yi Zhao; Runsheng Chen
Journal:  Protein Cell       Date:  2012-03-17       Impact factor: 14.870

4.  Using Galaxy to perform large-scale interactive data analyses.

Authors:  Jennifer Hillman-Jackson; Dave Clements; Daniel Blankenberg; James Taylor; Anton Nekrutenko
Journal:  Curr Protoc Bioinformatics       Date:  2012-06

5.  Mutation biases and mutation rate variation around very short human microsatellites revealed by human-chimpanzee-orangutan genomic sequence alignments.

Authors:  William Amos
Journal:  J Mol Evol       Date:  2010-08-11       Impact factor: 2.395

6.  Processing and analyzing ChIP-seq data: from short reads to regulatory interactions.

Authors:  Marion Leleu; Grégory Lefebvre; Jacques Rougemont
Journal:  Brief Funct Genomics       Date:  2010-09-22       Impact factor: 4.241

7.  Coactivation of GR and NFKB alters the repertoire of their binding sites and target genes.

Authors:  Nagesha A S Rao; Melysia T McCalman; Panagiotis Moulos; Kees-Jan Francoijs; Aristotelis Chatziioannou; Fragiskos N Kolisis; Michael N Alexis; Dimitra J Mitsiou; Hendrik G Stunnenberg
Journal:  Genome Res       Date:  2011-07-12       Impact factor: 9.043

8.  Ride the wavelet: A multiscale analysis of genomic contexts flanking small insertions and deletions.

Authors:  Erika M Kvikstad; Francesca Chiaromonte; Kateryna D Makova
Journal:  Genome Res       Date:  2009-06-05       Impact factor: 9.043

9.  Does base-pairing strength play a role in microRNA repression?

Authors:  Ido Carmel; Noam Shomron; Yael Heifetz
Journal:  RNA       Date:  2012-09-27       Impact factor: 4.942

10.  28-way vertebrate alignment and conservation track in the UCSC Genome Browser.

Authors:  Webb Miller; Kate Rosenbloom; Ross C Hardison; Minmei Hou; James Taylor; Brian Raney; Richard Burhans; David C King; Robert Baertsch; Daniel Blankenberg; Sergei L Kosakovsky Pond; Anton Nekrutenko; Belinda Giardine; Robert S Harris; Svitlana Tyekucheva; Mark Diekhans; Thomas H Pringle; William J Murphy; Arthur Lesk; George M Weinstock; Kerstin Lindblad-Toh; Richard A Gibbs; Eric S Lander; Adam Siepel; David Haussler; W James Kent
Journal:  Genome Res       Date:  2007-11-05       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.