Literature DB >> 15123590

The Ensembl automatic gene annotation system.

Val Curwen1, Eduardo Eyras, T Daniel Andrews, Laura Clarke, Emmanuel Mongin, Steven M J Searle, Michele Clamp.   

Abstract

As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, and EST sequences. The gene-building system rests on top of the core Ensembl (MySQL) database schema and Perl Application Programming Interface (API), and the data generated are accessible through the Ensembl genome browser (http://www.ensembl.org). To date, the Ensembl predicted gene sets are available for the A. gambiae, C. briggsae, zebrafish, mouse, rat, and human genomes and have been heavily relied upon in the publication of the human, mouse, rat, and A. gambiae genome sequence analysis. Here we describe in detail the gene-building system and the algorithms involved. All code and data are freely available from http://www.ensembl.org.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15123590      PMCID: PMC479124          DOI: 10.1101/gr.1858004

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  33 in total

1.  Introducing RefSeq and LocusLink: curated human genome resources at the NCBI.

Authors:  K D Pruitt; K S Katz; H Sicotte; D R Maglott
Journal:  Trends Genet       Date:  2000-01       Impact factor: 11.639

2.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

3.  Analysis of compositionally biased regions in sequence databases.

Authors:  J C Wootton; S Federhen
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

4.  The EMBL Nucleotide Sequence Database.

Authors:  G Stoesser; P Sterk; M A Tuli; P J Stoehr; G N Cameron
Journal:  Nucleic Acids Res       Date:  1997-01-01       Impact factor: 16.971

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

Review 6.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

7.  EST_GENOME: a program to align spliced DNA sequences to unspliced genomic DNA.

Authors:  R Mott
Journal:  Comput Appl Biosci       Date:  1997-08

8.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

9.  The DNA sequence and comparative analysis of human chromosome 20.

Authors:  P Deloukas; L H Matthews; J Ashurst; J Burton; J G Gilbert; M Jones; G Stavrides; J P Almeida; A K Babbage; C L Bagguley; J Bailey; K F Barlow; K N Bates; L M Beard; D M Beare; O P Beasley; C P Bird; S E Blakey; A M Bridgeman; A J Brown; D Buck; W Burrill; A P Butler; C Carder; N P Carter; J C Chapman; M Clamp; G Clark; L N Clark; S Y Clark; C M Clee; S Clegg; V E Cobley; R E Collier; R Connor; N R Corby; A Coulson; G J Coville; R Deadman; P Dhami; M Dunn; A G Ellington; J A Frankland; A Fraser; L French; P Garner; D V Grafham; C Griffiths; M N Griffiths; R Gwilliam; R E Hall; S Hammond; J L Harley; P D Heath; S Ho; J L Holden; P J Howden; E Huckle; A R Hunt; S E Hunt; K Jekosch; C M Johnson; D Johnson; M P Kay; A M Kimberley; A King; A Knights; G K Laird; S Lawlor; M H Lehvaslaiho; M Leversha; C Lloyd; D M Lloyd; J D Lovell; V L Marsh; S L Martin; L J McConnachie; K McLay; A A McMurray; S Milne; D Mistry; M J Moore; J C Mullikin; T Nickerson; K Oliver; A Parker; R Patel; T A Pearce; A I Peck; B J Phillimore; S R Prathalingam; R W Plumb; H Ramsay; C M Rice; M T Ross; C E Scott; H K Sehra; R Shownkeen; S Sims; C D Skuce; M L Smith; C Soderlund; C A Steward; J E Sulston; M Swann; N Sycamore; R Taylor; L Tee; D W Thomas; A Thorpe; A Tracey; A C Tromans; M Vaudin; M Wall; J M Wallis; S L Whitehead; P Whittaker; D L Willey; L Williams; S A Williams; L Wilming; P W Wray; T Hubbard; R M Durbin; D R Bentley; S Beck; J Rogers
Journal:  Nature       Date:  2001 Dec 20-27       Impact factor: 49.962

10.  Identification of human gene structure using linear discriminant functions and dynamic programming.

Authors:  V V Solovyev; A A Salamov; C B Lawrence
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1995
View more
  190 in total

1.  Terpene Synthases and Terpene Variation in Cannabis sativa.

Authors:  Judith K Booth; Macaire M S Yuen; Sharon Jancsik; Lufiani L Madilao; Jonathan E Page; Jörg Bohlmann
Journal:  Plant Physiol       Date:  2020-06-26       Impact factor: 8.340

2.  The Ensembl analysis pipeline.

Authors:  Simon C Potter; Laura Clarke; Val Curwen; Stephen Keenan; Emmanuel Mongin; Stephen M J Searle; Arne Stabenau; Roy Storey; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

3.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

4.  ESTGenes: alternative splicing from ESTs in Ensembl.

Authors:  Eduardo Eyras; Mario Caccamo; Val Curwen; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

Review 5.  An overview of Ensembl.

Authors:  Ewan Birney; T Daniel Andrews; Paul Bevan; Mario Caccamo; Yuan Chen; Laura Clarke; Guy Coates; James Cuff; Val Curwen; Tim Cutts; Thomas Down; Eduardo Eyras; Xose M Fernandez-Suarez; Paul Gane; Brian Gibbins; James Gilbert; Martin Hammond; Hans-Rudolf Hotz; Vivek Iyer; Kerstin Jekosch; Andreas Kahari; Arek Kasprzyk; Damian Keefe; Stephen Keenan; Heikki Lehvaslaiho; Graham McVicker; Craig Melsopp; Patrick Meidl; Emmanuel Mongin; Roger Pettett; Simon Potter; Glenn Proctor; Mark Rae; Steve Searle; Guy Slater; Damian Smedley; James Smith; Will Spooner; Arne Stabenau; James Stalker; Roy Storey; Abel Ureta-Vidal; K Cara Woodwark; Graham Cameron; Richard Durbin; Anthony Cox; Tim Hubbard; Michele Clamp
Journal:  Genome Res       Date:  2004-04-12       Impact factor: 9.043

Review 6.  Generating and navigating proteome maps using mass spectrometry.

Authors:  Christian H Ahrens; Erich Brunner; Ermir Qeli; Konrad Basler; Ruedi Aebersold
Journal:  Nat Rev Mol Cell Biol       Date:  2010-10-14       Impact factor: 94.444

Review 7.  Reproductive and developmental toxicity of dioxin in fish.

Authors:  Tisha C King-Heiden; Vatsal Mehta; Kong M Xiong; Kevin A Lanham; Dagmara S Antkiewicz; Alissa Ganser; Warren Heideman; Richard E Peterson
Journal:  Mol Cell Endocrinol       Date:  2011-09-21       Impact factor: 4.102

8.  A predicted interactome for Arabidopsis.

Authors:  Jane Geisler-Lee; Nicholas O'Toole; Ron Ammar; Nicholas J Provart; A Harvey Millar; Matt Geisler
Journal:  Plant Physiol       Date:  2007-08-03       Impact factor: 8.340

9.  The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.

Authors:  Raffaele Mazza; Francesco Strozzi; Andrea Caprera; Paolo Ajmone-Marsan; John L Williams
Journal:  BMC Genomics       Date:  2009-12-14       Impact factor: 3.969

10.  Comparative genomic analysis of teleost fish bmal genes.

Authors:  Han Wang
Journal:  Genetica       Date:  2008-10-14       Impact factor: 1.082

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.