| Literature DB >> 17071959 |
Joshua L Heazlewood1, Robert E Verboom, Julian Tonti-Filippini, Ian Small, A Harvey Millar.
Abstract
Knowledge of protein localisation contributes towards our understanding of protein function and of biological inter-relationships. A variety of experimental methods are currently being used to produce localisation data that need to be made accessible in an integrated manner. Chimeric fluorescent fusion proteins have been used to define subcellular localisations with at least 1100 related experiments completed in Arabidopsis. More recently, many studies have employed mass spectrometry to undertake proteomic surveys of subcellular components in Arabidopsis yielding localisation information for approximately 2600 proteins. Further protein localisation information may be obtained from other literature references to analysis of locations (AmiGO: approximately 900 proteins), location information from Swiss-Prot annotations (approximately 2000 proteins); and location inferred from gene descriptions (approximately 2700 proteins). Additionally, an increasing volume of available software provides location prediction information for proteins based on amino acid sequence. We have undertaken to bring these various data sources together to build SUBA, a SUBcellular location database for Arabidopsis proteins. The localisation data in SUBA encompasses 10 distinct subcellular locations, >6743 non-redundant proteins and represents the proteins encoded in the transcripts responsible for 51% of Arabidopsis expressed sequence tags. The SUBA database provides a powerful means by which to assess protein subcellular localisation in Arabidopsis (http://www.suba.bcs.uwa.edu.au).Entities:
Mesh:
Substances:
Year: 2006 PMID: 17071959 PMCID: PMC1635339 DOI: 10.1093/nar/gkl863
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The SUBA graphical user interface showing a search (A) where complex queriescan be built from pull down menus and visualised in the sentence structure represented in the text box. In this case for proteins localised to mitochondria by three different experimental data sets. The results (B) of this search is shown, and the AGI page (C) for one of these identifications giving links and information on the requested protein.
Compiled data resources collected in the Arabidopsis Subcellular Database (SUBA)
| Location | FP | MS | AmiGO | Swiss-Prot | Description |
|---|---|---|---|---|---|
| Cell plate | 15 | — | 5 | 6 | 12 |
| Cytoskeleton | 34 | — | 27 | 2 | 50 |
| Cytosol | 192 | — | 124 | 248 | 117 |
| ER | 58 | 179 | 34 | 43 | 16 |
| Extracellular | 25 | 187 | 17 | 214 | 80 |
| Golgi | 41 | 88 | 28 | 41 | 32 |
| Mitochondria | 93 | 547 | 97 | 227 | 417 |
| Nucleus | 403 | 367 | 275 | 551 | 1392 |
| Peroxisome | 52 | 42 | 25 | 11 | 69 |
| PM | 119 | 597 | 76 | 117 | 117 |
| Plastid | 142 | 1110 | 237 | 323 | 560 |
| Vacuole | 49 | 383 | 35 | 42 | 32 |
| Unclear | 115 | — | — | 238 | — |
| Any Location | 1093 | 2604 | 877 | 1981 | 2701 |
Locations are 13 subcellular localisation groups; FP—data from visualisation in vivo of fluorescent protein constructs; MS—data from mass spectrometry analysis of proteins from isolated subcellular fractions, AmiGO—inferred from direct assay data in the GO database from Arabidopsis; Swiss-Prot—Swiss-Prot database localisation of Arabidopsis proteins; Description—text search of TAIR gene annotation for location. Numbers are non-redundant Arabidopsis proteins in each category.