| Literature DB >> 29544487 |
Konstantinos C Tsolis1, Evridiki-Pandora Tsare2,3, Georgia Orfanoudaki4, Tobias Busche5, Katerina Kanaki4, Reshmi Ramakrishnan6, Frederic Rousseau6, Joost Schymkowitz6, Christian Rückert5, Jörn Kalinowski5, Jozef Anné1, Spyridoula Karamanou1, Maria I Klapa2, Anastassios Economou7.
Abstract
BACKGROUND: Members of the genus Streptomyces are Gram-positive bacteria that are used as important cell factories to produce secondary metabolites and secrete heterologous proteins. They possess some of the largest bacterial genomes and thus proteomes. Understanding their complex proteomes and metabolic regulation will improve any genetic engineering approach.Entities:
Keywords: Database; Membranome; Peptidoglycan; Protein subcellular localization; Protein subcellular topology; Proteome annotation; S. lividans TK24; Sec system; Secretome; Signal peptide; Sortase; TAT system
Mesh:
Substances:
Year: 2018 PMID: 29544487 PMCID: PMC5853079 DOI: 10.1186/s12934-018-0892-0
Source DB: PubMed Journal: Microb Cell Fact ISSN: 1475-2859 Impact factor: 5.328
Fig. 1SToPSdb subcellular topology nomenclature. Single letter formalism in SToPSdb for the designation of the different subcellular compartments, corresponding to the equivalent GO term (see Additional file 3: Table S3) for both Gram-negative (a) and Gram-positive (b) bacteria. For Gram-positive bacteria, proteins are classified into 10 categories: N: nucleoid-associated, r: ribosomal, A: cytoplasmic, F1: peripherally associated with the membrane facing the cytoplasm, B: integral membrane proteins, E: secreted lipoproteins, F2: secreted peripherally associated proteins, P: peptidoglycan binding proteins; X: extracellular secreted proteins; V: extracellular vesicle proteins
Fig. 2Overview of the topology annotation workflow. Protein sequences as derived after the re-annotation of S. lividans TK24 genome (Buche et al. in preparation) were used. The protein topology annotation in Uniprot was evaluated, revealing a large proportion of non-annotated proteins. De novo protein topology was assigned using prediction tools for topology or structural features followed by manual curation. Topology conflicts were manually curated. N: Nucleoid; r: Ribosome; A: Cytoplasmic; F1: Peripheral inner membrane protein facing the cytoplasm; B: Integral Membrane; F2 Peripheral inner membrane protein facing the periplasm; E: Inner Membrane Lipoprotein; P: Peptidoglycan binding; X: Extracellular; V: Extracellular vesicle; U: Uncharacterized
Summary of previously existing subcellular annotation and comparison with SToPSdb
| Proteome annotation | Unit of measure | Uniprot | SToPSdb | |
|---|---|---|---|---|
| Number of proteins | 7322 | 7505 | ||
| Subcellular topology annotations | Existing | Number or proteins | 2153 | 7494 |
| % of reference proteome | 29% | >99% | ||
| Contradicting topology annotations between Uniprot and SToPSdb | Number or proteins | 12 | 12 | |
| Missing/unresolved | Number or proteins | 5169 | 11 | |
| % of reference proteome | 71% | < 1% |
Approximately 30% of the S. lividans TK24 proteins contains proteome annotation for topology in Uniprot (2153 out of 7322). For 12 proteins Uniprot and SToPSdb were not in agreement. Protein topology for this strain was assigned de novo in SToPSdb leaving a small fraction of 11 proteins without topological annotation. Proteins annotated only in one of the two resources are labeled as “Unique (total)”
Fig. 3Distribution of proteins across the subcellular compartments. a Percentage and absolute numbers of proteins corresponding in each subcellular compartment in S. lividans TK24. b Summarized proportion of proteome corresponding in each topology group for the three model bacterial strains S. lividans TK24, S. coelicolor and E. coli K-12. Topology of the S. coelicolor proteome was extrapolated by the S. lividans homologues and additional bioinformatics analyses