Literature DB >> 15123589

The Ensembl analysis pipeline.

Simon C Potter1, Laura Clarke, Val Curwen, Stephen Keenan, Emmanuel Mongin, Stephen M J Searle, Arne Stabenau, Roy Storey, Michele Clamp.   

Abstract

The Ensembl pipeline is an extension to the Ensembl system which allows automated annotation of genomic sequence. The software comprises two parts. First, there is a set of Perl modules ("Runnables" and "RunnableDBs") which are 'wrappers' for a variety of commonly used analysis tools. These retrieve sequence data from a relational database, run the analysis, and write the results back to the database. They inherit from a common interface, which simplifies the writing of new wrapper modules. On top of this sits a job submission system (the "RuleManager") which allows efficient and reliable submission of large numbers of jobs to a compute farm. Here we describe the fundamental software components of the pipeline, and we also highlight some features of the Sanger installation which were necessary to enable the pipeline to scale to whole-genome analysis.

Mesh:

Substances:

Year:  2004        PMID: 15123589      PMCID: PMC479123          DOI: 10.1101/gr.1859804

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  16 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  Genescript: DNA sequence annotation pipeline.

Authors:  Alexander K Hudek; Joseph Cheung; Andrew P Boright; Stephen W Scherer
Journal:  Bioinformatics       Date:  2003-06-12       Impact factor: 6.937

3.  Biopipe: a flexible framework for protocol-based bioinformatics analysis.

Authors:  Shawn Hoon; Kiran Kumar Ratnapu; Jer-Ming Chia; Balamurugan Kumarasamy; Xiao Juguang; Michele Clamp; Arne Stabenau; Simon Potter; Laura Clarke; Elia Stupka
Journal:  Genome Res       Date:  2003-07-17       Impact factor: 9.043

Review 4.  The Ensembl core software libraries.

Authors:  Arne Stabenau; Graham McVicker; Craig Melsopp; Glenn Proctor; Michele Clamp; Ewan Birney
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

5.  The Ensembl automatic gene annotation system.

Authors:  Val Curwen; Eduardo Eyras; T Daniel Andrews; Laura Clarke; Emmanuel Mongin; Steven M J Searle; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

6.  The Ensembl computing architecture.

Authors:  James A Cuff; Guy M P Coates; Tim J R Cutts; Mark Rae
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

7.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

8.  Sequence mapping by electronic PCR

Authors:  Gregory D Schuler
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

Review 9.  An integrated computational pipeline and database to support whole-genome sequence annotation.

Authors:  C J Mungall; S Misra; B P Berman; J Carlson; E Frise; N Harris; B Marshall; S Shu; J S Kaminker; S E Prochnik; C D Smith; E Smith; J L Tupy; C Wiel; G M Rubin; S E Lewis
Journal:  Genome Biol       Date:  2002-12-23       Impact factor: 13.583

10.  ASAP, a systematic annotation package for community analysis of genomes.

Authors:  Jeremy D Glasner; Paul Liss; Guy Plunkett; Aaron Darling; Tejasvini Prasad; Michael Rusch; Alexis Byrnes; Michael Gilson; Bryan Biehl; Frederick R Blattner; Nicole T Perna
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

View more
  47 in total

1.  The Ensembl Web site: mechanics of a genome browser.

Authors:  James Stalker; Brian Gibbins; Patrick Meidl; James Smith; William Spooner; Hans-Rudolf Hotz; Antony V Cox
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

2.  The Ensembl automatic gene annotation system.

Authors:  Val Curwen; Eduardo Eyras; T Daniel Andrews; Laura Clarke; Emmanuel Mongin; Steven M J Searle; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

Review 3.  An overview of Ensembl.

Authors:  Ewan Birney; T Daniel Andrews; Paul Bevan; Mario Caccamo; Yuan Chen; Laura Clarke; Guy Coates; James Cuff; Val Curwen; Tim Cutts; Thomas Down; Eduardo Eyras; Xose M Fernandez-Suarez; Paul Gane; Brian Gibbins; James Gilbert; Martin Hammond; Hans-Rudolf Hotz; Vivek Iyer; Kerstin Jekosch; Andreas Kahari; Arek Kasprzyk; Damian Keefe; Stephen Keenan; Heikki Lehvaslaiho; Graham McVicker; Craig Melsopp; Patrick Meidl; Emmanuel Mongin; Roger Pettett; Simon Potter; Glenn Proctor; Mark Rae; Steve Searle; Guy Slater; Damian Smedley; James Smith; Will Spooner; Arne Stabenau; James Stalker; Roy Storey; Abel Ureta-Vidal; K Cara Woodwark; Graham Cameron; Richard Durbin; Anthony Cox; Tim Hubbard; Michele Clamp
Journal:  Genome Res       Date:  2004-04-12       Impact factor: 9.043

Review 4.  A basal deuterostome genome viewed as a natural experiment.

Authors:  R Andrew Cameron; Eric H Davidson
Journal:  Gene       Date:  2007-05-06       Impact factor: 3.688

5.  The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes.

Authors:  Kim D Pruitt; Jennifer Harrow; Rachel A Harte; Craig Wallin; Mark Diekhans; Donna R Maglott; Steve Searle; Catherine M Farrell; Jane E Loveland; Barbara J Ruef; Elizabeth Hart; Marie-Marthe Suner; Melissa J Landrum; Bronwen Aken; Sarah Ayling; Robert Baertsch; Julio Fernandez-Banet; Joshua L Cherry; Val Curwen; Michael Dicuccio; Manolis Kellis; Jennifer Lee; Michael F Lin; Michael Schuster; Andrew Shkeda; Clara Amid; Garth Brown; Oksana Dukhanina; Adam Frankish; Jennifer Hart; Bonnie L Maidak; Jonathan Mudge; Michael R Murphy; Terence Murphy; Jeena Rajan; Bhanu Rajput; Lillian D Riddick; Catherine Snow; Charles Steward; David Webb; Janet A Weber; Laurens Wilming; Wenyu Wu; Ewan Birney; David Haussler; Tim Hubbard; James Ostell; Richard Durbin; David Lipman
Journal:  Genome Res       Date:  2009-06-04       Impact factor: 9.043

6.  Bringing Web 2.0 to bioinformatics.

Authors:  Zhang Zhang; Kei-Hoi Cheung; Jeffrey P Townsend
Journal:  Brief Bioinform       Date:  2008-10-08       Impact factor: 11.622

7.  Hypothalamic differences in expression of genes involved in monoamine synthesis and signaling pathways after insulin injection in chickens from lines selected for high and low body weight.

Authors:  Wei Zhang; Sungwon Kim; Robert Settlage; Wyatt McMahon; Lindsay H Sumners; Paul B Siegel; Benjamin J Dorshorst; Mark A Cline; Elizabeth R Gilbert
Journal:  Neurogenetics       Date:  2015-01-13       Impact factor: 2.660

8.  xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

Authors:  Jon Duvick; Daniel S Standage; Nirav Merchant; Volker P Brendel
Journal:  Plant Cell       Date:  2016-03-28       Impact factor: 11.277

9.  Language-related Cntnap2 gene is differentially expressed in sexually dimorphic song nuclei essential for vocal learning in songbirds.

Authors:  S Carmen Panaitof; Brett S Abrahams; Hongmei Dong; Daniel H Geschwind; Stephanie A White
Journal:  J Comp Neurol       Date:  2010-06-01       Impact factor: 3.215

10.  Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome.

Authors:  Clara Amid; Linda M Rehaume; Kelly L Brown; James G R Gilbert; Gordon Dougan; Robert E W Hancock; Jennifer L Harrow
Journal:  BMC Genomics       Date:  2009-12-15       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.