Paul Kersey, Lawrence Bower, Lorna Morris, Alan Horne, Robert Petryszak, Carola Kanz, Alexander Kanapin, Ujjwal Das, Karine Michoud, Isabelle Phan, Alexandre Gattiker, Tamara Kulikova, Nadeem Faruque, Karyn Duggan, Peter Mclaren, Britt Reimholz, Laurent Duret, Simon Penel, Ingmar Reuter, Rolf Apweiler.
Abstract
Integr8 is a new web portal for exploring the biology of organisms with completely deciphered genomes. For over 190 species, Integr8 provides access to general information, recent publications, and a detailed statistical overview of the genome and proteome of the organism. The preparation of this analysis is supported through Genome Reviews, a new database of bacterial and archaeal DNA sequences in which annotation has been upgraded (compared to the original submission) through the integration of data from many sources, including the EMBL Nucleotide Sequence Database, the UniProt Knowledgebase, InterPro, CluSTr, GOA and HOGENOM. Integr8 also allows users to customize their own interactive analysis, and to download both customized and prepared datasets for their own use. Integr8 is available at http://www.ebi.ac.uk/integr8.
Year: 2005 PMID: 15608201 PMCID: PMC539993 DOI: 10.1093/nar/gki039
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Incorporation of new data and data types in Genome Reviews (compared with parent entries in the EMBL nucleotide sequence database)
| | Original EMBL entries | Genome Reviews entries |
|---|---|---|
| Number of feature types | 30 | 11 |
| Number of qualifier types | 42 | 28 |
| Number of feature qualifiers | 4 649 864 | 6 783 847 |
| Number of external databases cross-referenced | 6 | 18 |
| Number of ‘mat_peptide’ features | 0 | 3825 |
| Number of ‘/db_xref’ qualifiers | 631 881 | 2 527 269 |
| Number of ‘/locus_tag’ qualifiers | 367 771 | 384 899 |
| Number of evidence tags | 0 | 5 474 235 |
The table shows how the total quantity of annotation has been increased (with some examples), while the number of feature and feature qualifier types has been reduced, with the remaining types used more consistently across all entries. Statistics were compiled by comparing Genome Reviews release 7.0 with EMBL release 79, incrementally updated to August 10, 2004.
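Counts of this kind can be obtained by tallying feature keys and qualifier names in the feature table (FT lines) of each flat-file entry. The following is a minimal sketch under stated assumptions, not the procedure used by the authors: it assumes the standard EMBL flat-file layout (feature key in columns 6-20, location or qualifier text from column 22), and the function name `tally_feature_table`, the helper `ft`, and all identifiers in the sample entry are hypothetical.

```python
from collections import Counter


def tally_feature_table(lines):
    """Count feature keys and qualifier names in EMBL-style FT lines.

    Assumes the usual fixed-column layout: the feature key occupies
    columns 6-20 and the location or qualifier text starts at column 22.
    """
    features = Counter()
    qualifiers = Counter()
    for line in lines:
        if not line.startswith("FT"):
            continue  # skip ID, DE, SQ and other non-feature lines
        key = line[5:20].strip()
        body = line[21:].strip()
        if key:
            # A non-empty key column starts a new feature.
            features[key] += 1
        elif body.startswith("/"):
            # Qualifier line: "/name=value" or a bare "/name".
            # Limitation: a wrapped qualifier value whose continuation
            # line happens to begin with "/" would be miscounted.
            qualifiers[body[1:].split("=", 1)[0]] += 1
    return features, qualifiers


def ft(key, text):
    """Build one FT line with the key padded to the fixed column width."""
    return "FT   {:<16}{}".format(key, text)


# Hypothetical entry, for illustration only (not taken from Genome Reviews).
sample = [
    "ID   hypothetical entry, for illustration only",
    ft("CDS", "1..300"),
    ft("", '/locus_tag="XYZ_0001"'),
    ft("", '/db_xref="ExampleDB:0001"'),
    ft("mat_peptide", "10..297"),
    ft("CDS", "400..900"),
    ft("", '/locus_tag="XYZ_0002"'),
]

features, qualifiers = tally_feature_table(sample)
print("feature types:", len(features))
print("qualifier types:", len(qualifiers))
print("total qualifiers:", sum(qualifiers.values()))
```

Running the same tally over an original entry and over its upgraded counterpart, then comparing the two pairs of counters, gives per-type figures analogous to the rows of the table.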
Figure 1. Incorporation of new data into Genome Reviews. The figure shows a portion of Genome Reviews entry AL009126_GR from release 7.0. Data in boldface have been added to the corresponding portion of the original submission to the EMBL/GenBank/DDBJ nucleotide sequence databases.
Resources integrated in Integr8
| Database | Brief description of content/purpose | URL |
|---|---|---|
| CleanEx | Gene expression data | |
| CluSTr | Clusters of proteins with similar sequences | |
| EMBL nucleotide sequence database | Nucleotide sequences and annotation | |
| Ensembl | Predictions of gene structure and protein sequence | |
| Eukaryotic Promoter Database | Promoters | |
| Genome Reviews | Genome sequences with upgraded annotation | |
| Gene Ontology | Gene product classification hierarchy | |
| GOA | GO annotations for proteins | |
| HAMAP | Bacterial gene families and annotation specifications | |
| HOGENOM | Phylogenetically analysed protein clusters | |
| HSSP | Tertiary structure inferred from secondary structure | |
| InterPro | Protein domains, families and repeats | |
| IPI | Protein sequences | |
| PDB | Macromolecular structures | |
| ReAlSplice | Splice sites and events | |
| RefSeq | Nucleotide and protein sequences and annotation | |
| S/MARt Db | Scaffold/matrix attachment regions | |
| UniProt Archive | Protein sequences | |
| UniProt Knowledgebase | Protein sequences and annotation | |
| RZPD Clone Database | DNA clones | |
| TRANSFAC | Transcription factors | |
| UTRdb | UTRs | |
Figure 2. Genome Statistics for the fission yeast Schizosaccharomyces pombe, as represented in the Integr8 browser.