| Literature DB >> 25725059 |
Simarjeet Negi1, Sanjit Pandey2, Satish M Srinivasan1, Akram Mohammed1, Chittibabu Guda3.
Abstract
LocSigDB (http://genome.unmc.edu/LocSigDB/) is a manually curated database of experimental protein localization signals for eight distinct subcellular locations; primarily in a eukaryotic cell with brief coverage of bacterial proteins. Proteins must be localized at their appropriate subcellular compartment to perform their desired function. Mislocalization of proteins to unintended locations is a causative factor for many human diseases; therefore, collection of known sorting signals will help support many important areas of biomedical research. By performing an extensive literature study, we compiled a collection of 533 experimentally determined localization signals, along with the proteins that harbor such signals. Each signal in the LocSigDB is annotated with its localization, source, PubMed references and is linked to the proteins in UniProt database along with the organism information that contain the same amino acid pattern as the given signal. From LocSigDB webserver, users can download the whole database or browse/search for data using an intuitive query interface. To date, LocSigDB is the most comprehensive compendium of protein localization signals for eight distinct subcellular locations. Database URL: http://genome.unmc.edu/LocSigDB/Entities:
Mesh:
Substances:
Year: 2015 PMID: 25725059 PMCID: PMC4343182 DOI: 10.1093/database/bav003
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Representation of the database statistics showing the number of localization signals for distinct subcellular location along with the count of the research articles elucidating these signals. As clearly seen, most studies have been done on inferring the protein localization signals for nucleus followed by mitochondria and all the other six organelles with a wide margin.
Figure 2.An overview of the frequency distribution of amino acids in the signal set for each of the eight subcellular organelles. As seen from the radar plots, there are clear differences in the frequency occurrence of amino acids for each distinct organelle. Nucleus is dominated by positively charged residues like lysines and arginines; whereas, mitochondrial signals have frequent occurrence of hydrophobic amino acids like: glycine, leucine and alanine as well as positively charged amino acids like arginine. Also, negatively charged amino acids like aspartic acid and glutamic acid are present only in the signals of organelles like Golgi and ER.
Figure 3.An overview of the query and the search results interface in LocSigDB. LocSigDB provides three search functions to retrieve the localization signal information. (A) Query using a signal displays links to all the available descriptors for the signal in question; (B) Query by a protein ID or protein sequence (FASTA format) will retrieve corresponding sequence from the public database. The sequence is displayed in the results interface and the user can mouse over the corresponding signals identified in the protein, while highlighting the signal coordinates in red on the protein sequence. Each signal is in turn linked to the signal attribute window that displays the annotations for that signal.