| Literature DB >> 21092245 |
Miaomiao Zhou1, Daniel Theunissen, Michiel Wels, Roland J Siezen.
Abstract
BACKGROUND: In Lactic Acid Bacteria (LAB), the extracellular and surface-associated proteins can be involved in processes such as cell wall metabolism, degradation and uptake of nutrients, communication and binding to substrates or hosts. A genome-scale comparative study of these proteins (secretomes) can provide vast information towards the understanding of the molecular evolution, diversity, function and adaptation of LAB to their specific environmental niches.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21092245 PMCID: PMC3017865 DOI: 10.1186/1471-2164-11-651
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The predicted LAB secretomes (genomes included in the original LaCOG analysis 43 are marked by *).
| Secretome proteins (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| LAB species and strains | Total proteins | A | B | C | D | E | F | G | Total |
| 3186 | 2.32 | 1.26 | 3.36 | 0.97 | 0.16 | 1.6 | 0.13 | 9.8 | |
| 1834 | 2.24 | 0.65 | 4.09 | 0.93 | 0 | 2.45 | 0.05 | 10.41 | |
| 1733 | 1.85 | 0.69 | 3.92 | 0.52 | 0.12 | 0.69 | 0 | 7.79 | |
| 1789 | 2.07 | 0.89 | 4.3 | 0.56 | 0.39 | 0.06 | 0 | 8.27 | |
| 1536 | 1.56 | 0.13 | 3.45 | 1.04 | 0.07 | 2.02 | 0 | 8.27 | |
| 1681 | 1.43 | 0.06 | 3.15 | 0.95 | 0.18 | 2.08 | 0 | 7.85 | |
| 2693 | 1.63 | 0.78 | 3.79 | 0.78 | 0.15 | 1.41 | 0.07 | 8.61 | |
| 2973 | 1.68 | 0.77 | 3.4 | 0.84 | 0 | 1.35 | 0.13 | 8.17 | |
| 1973 | 0.91 | 0.25 | 3.4 | 0.61 | 0.15 | 1.27 | 0.1 | 6.69 | |
| 1845 | 1.52 | 0.33 | 3.36 | 0.76 | 0.05 | 2.06 | 0.27 | 8.35 | |
| 2981 | 1.61 | 1.11 | 3.99 | 0.91 | 0.3 | 0.1 | 0 | 8.02 | |
| 2178 | 1.29 | 0.55 | 3.35 | 1.52 | 0.14 | 2.53 | 0.09 | 9.47 | |
| 1826 | 0.66 | 0.22 | 2.96 | 0.55 | 0 | 1.15 | 0.05 | 5.59 | |
| 1597 | 1.38 | 0.13 | 4.51 | 0.44 | 0 | 2.13 | 0 | 8.59 | |
| 1881 | 0.74 | 0.21 | 3.67 | 0.85 | 0 | 1.01 | 0 | 6.48 | |
| 1803 | 0.78 | 0.28 | 3.55 | 1 | 0 | 1.22 | 0 | 6.83 | |
| 2393 | 1.46 | 0.46 | 3.01 | 0.79 | 0 | 1.96 | 0 | 7.68 | |
| 2459 | 1.38 | 0.41 | 3.17 | 1.02 | 0.12 | 1.67 | 0.08 | 7.85 | |
| 2284 | 1.4 | 0.61 | 4.29 | 0.74 | 0.04 | 1.62 | 0.18 | 8.88 | |
| 1784 | 0.06 | 0.28 | 4.43 | 1.23 | 1.23 | 0 | 0.06 | 7.29 | |
| 1872 | 1.28 | 0.05 | 3.47 | 0.53 | 0.27 | 0.43 | 0.05 | 6.08 | |
| 1669 | 1.5 | 0.24 | 3.89 | 0.54 | 0.18 | 0.84 | 0 | 7.19 | |
| 1854 | 1.29 | 0.11 | 3.78 | 0.54 | 0.49 | 0.65 | 0 | 6.86 | |
| 1966 | 0.1 | 0.31 | 4.93 | 1.12 | 0.31 | 1.22 | 0.15 | 8.14 | |
| 1664 | 0.12 | 0.06 | 4.33 | 0.9 | 1.56 | 0 | 0.06 | 7.03 | |
| 1727 | 1.1 | 0.17 | 3.88 | 0.35 | 0.17 | 0.98 | 0.12 | 6.77 | |
A: Lipid anchored; B: LPxTG Cell-wall anchored; C: N-terminally anchored (No cleavage site); D: N-terminally anchored (with cleavage site); E: Secreted via minor pathways (bacteriocin) (no cleavage site); F: Extracellular (with cleavage site); G: C-terminally anchored (with cleavage site)
The SCL prediction was made by LocateP.
Figure 1The flowchart for constructing the secretome LaCOGs. The completely sequenced LAB genomes are used as input data. No plasmid sequences were used for the Inparanoid search. The squares with dash-line frames are intermediate products that are not user-queryable from the LAB-Secretome interface; the squares with full-line frames are the final information stored in LAB-Secretome database. The upper left frame shows the processes that produce new LACOGs; the upper right frame shows the processes that extend existing LaCOGs. The new LaCOGs are coded starting with "9", the extended existing LaCOGs retain the original names from Makarova et.al [43]. BlastP1: the Blast results were processed by a revised criterion "uniform top 3" (see Material and Methods); BlastP2: the Blast results were processed by cut-off of 1e-3 and aligned sequence coverage of 60% for distant homolog identification. This work scheme can be used to update the LAB-Secretome database when new bacterial genomes are available.
Figure 2overview of distribution of secretome proteins in LaCOGs. The central pie depicts the distribution of secretome proteins in LaCOGs according to their functional classes. The percentage was calculated as the number of proteins in the category divided by the total of 3357 secretome proteins that were clustered into LaCOGs. The number of LaCOGs in each category is listed in the pie chart legend behind the name of the functional class. The separate yellow-red-green piecharts for each functional class represents the distribution of this LaCOG in the LAB genomes, i.e. ubiquitous, .species/strain-specific, or variable.
Overview of the LaCOGs (genomes included in the original LaCOG analysis 43 are marked by *).
| LAB species and strains | Secretome size | Proteins in LaCOG | Distant Homologs | ORFans | LaCOGs |
|---|---|---|---|---|---|
| 281 | 232 | 22 | 27 | 131 | |
| 171 | 161 | 2 | 8 | 108 | |
| 177 | 154 | 5 | 18 | 113 | |
| 192 | 187 | 3 | 2 | 148 | |
| 205 | 197 | 0 | 8 | 153 | |
| 112 | 112 | 0 | 0 | 93 | |
| 115 | 113 | 0 | 2 | 94 | |
| 87 | 79 | 3 | 5 | 68 | |
| 112 | 112 | 0 | 0 | 89 | |
| 115 | 113 | 0 | 2 | 88 | |
| 131 | 123 | 2 | 6 | 97 | |
| 236 | 209 | 6 | 21 | 131 | |
| 105 | 103 | 0 | 2 | 86 | |
| 105 | 105 | 0 | 0 | 87 | |
| 136 | 114 | 4 | 18 | 80 | |
| 112 | 94 | 5 | 13 | 77 | |
| 160 | 151 | 5 | 4 | 123 | |
| 159 | 156 | 1 | 2 | 124 | |
| 171 | 156 | 2 | 13 | 123 | |
| 114 | 103 | 4 | 7 | 80 | |
| 135 | 126 | 3 | 6 | 103 | |
| 95 | 90 | 0 | 5 | 70 | |
| 99 | 89 | 1 | 9 | 79 | |
| 90 | 90 | 0 | 0 | 77 | |
| 97 | 94 | 1 | 2 | 84 | |
| 94 | 94 | 0 | 0 | 81 |
Figure 3Variations in domain composition. Examples of LaCOGs families showing different domain types, domain compositions and repeats.
Figure 4Screen shot of the LAB-Secretome database. A: Overview page of the database showing statistical information of the predicted LAB secretomes with active links to their corresponding pages; B: The search engine in LAB-Secretome which can browse the database by various types of queries; C: The BlastP search page of LAB-Secretome; D: An example page depicting parts of the detailed information that LAB-Secretome presents for each LaCOG.
Figure 5Domain structure variation of enzymes within a family. Examples of an enzyme family (N-acetyl-glucosaminidase) with variations in the type and number of cell-envelope binding domains.
Examples of specific enzyme and binding-protein sub-families
| Product | LaCOG | Functional domain | Distribution | Special features |
|---|---|---|---|---|
| Subtilisin-like serine protease | LaCOG02153 | Subtilase family | PA domain (PF02225) inserted in the subtilase family domain | |
| LaCOG90024 | Subtilase family | no PA insert domain | ||
| Trans-glycosylase | LaCOG01094 | Transglycosylase-like domain, | mainly in | different domains for PG binding |
| LaCOG01589 | aggregation promoting factor related surface protein | not in | PG bound by LysM domain; highly conserved C-terminal domain ending in GWY | |
| LaCOG02932 | aggregation promoting factor related surface protein | only in | highly conserved C-terminal domain ending in WY | |
| LaCOG90005 | aggregation promoting factor related surface protein | only in | highly conserved C-terminal domain ending in GWY | |
| Dextran sucrase | LaCOG90016 | glycosyl hydrolase family 70 | only in | |
| alpha/beta hydrolase | LaCOG01137 | alpha/beta hydrolase of unknown function (DUF915) | ubiquitous | |
| LaCOG01138 | alpha/beta hydrolase (DUF915) | Ubiquitous | ||
| LacOG01920 | alpha/beta hydrolase (DUF915) | only in | ||
| LaCOG02785 | alpha/beta hydrolase (DUF915) | only in | ||
| lipase/Acyl-hydrolase | LaCOG00342 | GDSL-like Lipase/Acylhydrolase | not in | with GDSL-like motif |
| general cell surface hydrolase | LacOG02019 | cell surface hydrolase membrane-bound (putative) | only in | |
| LaCOG01618 | cell-surface hydrolase; | only | ||
| mannose-specific adhesion | LaCOG01741 | MUB domain, Gram positive anchor | only in | |
| collagen-binding protein | LaCOG00092 | Collagen binding domain, Gram positive anchor | not in | |
| mucus-binding protein | LaCOG00885 | MucBP domain (Classical), Gram positive anchor | not in | Leucine Rich Repeat, PT repeat |
| LaCOG01470 | MUB domain, Gram positive anchor | many pseudogenes, most | ||
| LacOG02280 | MUB domain, Gram positive anchor | only in | very large, YSIRK-type signal peptide | |
| LaCOG03211 | MUB domain, Gram positive anchor | 5 of 10 are pseudogenes; YSIRK SP in | ||
| LacOG99309 | MUB domain, Gram positive anchor | only in | all pseudogenes; YSIRK type signal peptide | |
| chitin-binding protein | LaCOG01300 | Chitin binding domain | maybe related to niche | |
| adherence protein | LaCOG01366 | von Willebrand factor type A domain, Cna protein B-type domain | only in | |