Literature DB >> 14576296

Exegesis: a procedure to improve gene predictions and its use to find immunoglobulin superfamily proteins in the human and mouse genomes.

Bernard de Bono1, Cyrus Chothia.   

Abstract

Exegesis is a procedure to refine the gene predictions that are produced for complex genomes, e.g. those of humans and mice. It uses the program Genewise, sequences determined by experiment, experimental maps of gene segment libraries and a new browser that allows the user to rapidly inspect and compare multiple gene maps to regions of genomic sequences. The procedure should be of general use. Here, we use the procedure to find members of the immunoglobulin superfamily in the human and mouse genomes. To do this, Exegesis was used to process the original gene predictions from the automated Ensembl annotation pipeline. Exegesis produced (i) many more complete genes and new transcripts and (ii) a mapping of the immunoglobulin and T cell receptor gene libraries to the genome, which are largely absent in the Ensembl set.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14576296      PMCID: PMC275470          DOI: 10.1093/nar/gkg828

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  32 in total

1.  IMGT, the international ImMunoGeneTics database.

Authors:  M P Lefranc
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Clustering of highly homologous sequences to reduce the size of large protein databases.

Authors:  W Li; L Jaroszewski; A Godzik
Journal:  Bioinformatics       Date:  2001-03       Impact factor: 6.937

3.  The 5' part of the mouse immunoglobulin kappa locus.

Authors:  F Röschenthaler; T Kirschbaum; V Heim; V Kirschbaum; K F Schäble; J Schwendinger; I Zocher; H G Zachau
Journal:  Eur J Immunol       Date:  1999-07       Impact factor: 5.532

4.  SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.

Authors:  Julian Gough; Cyrus Chothia
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

5.  The EMBL Nucleotide Sequence Database.

Authors:  Guenter Stoesser; Wendy Baker; Alexandra van den Broek; Evelyn Camon; Maria Garcia-Pastor; Carola Kanz; Tamara Kulikova; Rasko Leinonen; Quan Lin; Vincent Lombard; Rodrigo Lopez; Nicole Redaschi; Peter Stoehr; Mary Ann Tuli; Katerina Tzouvara; Robert Vaughan
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  The Ensembl genome database project.

Authors:  T Hubbard; D Barker; E Birney; G Cameron; Y Chen; L Clark; T Cox; J Cuff; V Curwen; T Down; R Durbin; E Eyras; J Gilbert; M Hammond; L Huminiecki; A Kasprzyk; H Lehvaslaiho; P Lijnzaad; C Melsopp; E Mongin; R Pettett; M Pocock; S Potter; A Rust; E Schmidt; S Searle; G Slater; J Smith; W Spooner; A Stabenau; J Stalker; E Stupka; A Ureta-Vidal; I Vastrik; M Clamp
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  The Protein Information Resource: an integrated public resource of functional annotation of proteins.

Authors:  Cathy H Wu; Hongzhan Huang; Leslie Arminski; Jorge Castro-Alvear; Yongxing Chen; Zhang-Zhi Hu; Robert S Ledley; Kali C Lewis; Hans-Werner Mewes; Bruce C Orcutt; Baris E Suzek; Akira Tsugita; C R Vinayaka; Lai-Su L Yeh; Jian Zhang; Winona C Barker
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

8.  The Protein Data Bank.

Authors:  Helen M Berman; Tammy Battistuz; T N Bhat; Wolfgang F Bluhm; Philip E Bourne; Kyle Burkhardt; Zukang Feng; Gary L Gilliland; Lisa Iype; Shri Jain; Phoebe Fagan; Jessica Marvin; David Padilla; Veerasamy Ravichandran; Bohdan Schneider; Narmada Thanki; Helge Weissig; John D Westbrook; Christine Zardecki
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2002-05-29

9.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

10.  A maximum likelihood method for analyzing pseudogene evolution: implications for silent site evolution in humans and rodents.

Authors:  Carlos D Bustamante; Rasmus Nielsen; Daniel L Hartl
Journal:  Mol Biol Evol       Date:  2002-01       Impact factor: 16.240

View more
  2 in total

1.  From IMGT-ONTOLOGY to IMGT/LIGMotif: the IMGT standardized approach for immunoglobulin and T cell receptor gene identification and description in large genomic sequences.

Authors:  Jérôme Lane; Patrice Duroux; Marie-Paule Lefranc
Journal:  BMC Bioinformatics       Date:  2010-04-30       Impact factor: 3.169

2.  A HaemAtlas: characterizing gene expression in differentiated human blood cells.

Authors:  Nicholas A Watkins; Arief Gusnanto; Bernard de Bono; Subhajyoti De; Diego Miranda-Saavedra; Debbie L Hardie; Will G J Angenent; Antony P Attwood; Peter D Ellis; Wendy Erber; Nicola S Foad; Stephen F Garner; Clare M Isacke; Jennifer Jolley; Kerstin Koch; Iain C Macaulay; Sarah L Morley; Augusto Rendon; Kate M Rice; Niall Taylor; Daphne C Thijssen-Timmer; Marloes R Tijssen; C Ellen van der Schoot; Lorenz Wernisch; Thilo Winzer; Frank Dudbridge; Christopher D Buckley; Cordelia F Langford; Sarah Teichmann; Berthold Göttgens; Willem H Ouwehand
Journal:  Blood       Date:  2009-02-19       Impact factor: 22.113

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.