| Literature DB >> 20459862 |
Marnix H Medema1, Miaomiao Zhou, Sacha A F T van Hijum, Jolein Gloerich, Hans J C T Wessels, Roland J Siezen, Marc Strous.
Abstract
BACKGROUND: Anaerobic ammonium-oxidizing (anammox) bacteria perform a key step in global nitrogen cycling. These bacteria make use of an organelle to oxidize ammonia anaerobically to nitrogen (N2) and so contribute approximately 50% of the nitrogen in the atmosphere. It is currently unknown which proteins constitute the organellar proteome and how anammox bacteria are able to specifically target organellar and cell-envelope proteins to their correct final destinations. Experimental approaches are complicated by the absence of pure cultures and genetic accessibility. However, the genome of the anammox bacterium Candidatus "Kuenenia stuttgartiensis" has recently been sequenced. Here, we make use of these genome data to predict the organellar sub-proteome and address the molecular basis of protein sorting in anammox bacteria.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20459862 PMCID: PMC2881027 DOI: 10.1186/1471-2164-11-299
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Performance comparison of the RF model trained on different types of input data. 500 RF models with randomly generated P1 and P2 sets, to correct for class A and P inbalance, were trained on each of the following 6 types of data: the full-length amino acid sequences, the signal peptides (SP) and the mature protein amino acid sequences, each analyzed with either the residue frequency of single amino acids or the frequency of 2 adjacent amino acids. When the 6 top-performing models of each input type are compared, the model trained with full-length protein sequences with the 2 adjacent amino acids combination shows the highest overall accuracy (89%) and A protein recall (90%).
Figure 2Signal peptide predictions on the whole proteome of . Signal peptide predictions on the whole proteome (4663 proteins) of K. stuttgartiensis by fifteen signal peptide prediction algorithms. The y-axis shows the number of proteins predicted to carry a signal peptide. Abbreviations: G+: predict option of Gram-positive; G-: predict option of Gram-negative; Euk: predict option of Eukaryote. The number of predicted Tat and Type IV prepilin substrates using TatFind and PilFind were nine and ten, respectively.
Figure 3Experimental validation of signal peptides in the Candidatus . Identification of cleavage sites from seven Candidatus Kuenenia stuttgartiensis mature proteins. The peptides that were identified in the tryptic digest are coloured red; their N-terminal sides were non-tryptic. Underlined sequences represent the putative signal peptides, and the putative SPase 1 recognition sites adjacent to the non-tryptic side of the peptides are printed in bold. The left column indicates whether the protein is present in either the A or the P training set.
Composition of the predicted organellar proteome of K. stuttgartiensis.
| Protein family | Examples of the predicted organellar proteins | Locus tag | # proteins |
|---|---|---|---|
| Cytochrome C proteins | cytochrome c551 peroxidase | kuste2905 | 49 |
| cytochrome c6 | kustc0563 | ||
| cbb3-type cytochrome c oxidase subunit 1 (CcoN) | kustc0429 | ||
| hepta heme protein | kuste2855 | ||
| Respiratory complex proteins | cd1 nitrite reductase (NirS) | kuste4136 | 50 |
| NAD(P)H:quinone oxidoreductase chain 5 | kustc0838 | ||
| proton-translocating NADH dehydrogenase I chain A (NuoA) | kustc0822 | ||
| NADH:ubiquinone oxidoreductase subunit M | kustc0840 | ||
| Transporters | Ammonium transporter 1 | kustc0381 | 27 |
| Nitrite Transporter 1 (FocA) | kustd1720 | ||
| Nitrite/nitrate antiporter (NarK) | kuste2335 | ||
| copper-transporting ATPase | kuste2247 | ||
| Protein translocation system | SecDF-YajC accessory complex (YajC) | kustd1963 | 5 |
| SecYEG translocation complex (SecE) | kuste2951 | ||
| Transmembrane pore (TatC) | kustc0286 | ||
| Tat signal recognition (TatA/B) | kuste2348 | ||
| Cytochrome C maturation system | thiol-disulfide oxidoreductase (ResA) | kustc0860 | 6 |
| cytochrome c-type biogenesis protein (ResC) | kustd1760 | ||
| thiol:disulfide interchange protein (DsbD) | kustc0946 | ||
| TPR proteins | N-acetylglucosaminyltransferase (O-GlcNAc transferase) | kuste2787 | 32 |
| kinesin light chain KLC (putative) | kuste2807 | ||
| Hypothetical and other proteins | 392 | ||
| Total proteins | 562 | ||
A summary of the predicted anammoxosomal sub-proteome of K. stuttgartiensis. In this set, 371 proteins are hypothetical proteins of unknown function. Some examples of proteins are listed, especially those with functions known to be related to anammox. Details of the predicted sub-proteome are stored in Additional file 2.
Protein sorting components encoded in the K. stuttgartiensis genome
| SecY | SecYEG translocation complex | Membrane | kuste2983 | CAJ73737 |
| SecE | SecYEG translocation complex | Membrane | kuste2951 | CAJ73704 |
| SecG | SecYEG translocation complex | Membrane | kuste4254 | CAJ75016 |
| SecB | chaperone | Cytoplasm | - | - |
| SecA | ATPase motor protein | Cytoplasm | kustb0170 | CAJ70915 |
| SecDF | SecDF-YajC accessory complex | Membrane | kustd1962 | CAJ72707 |
| YajC | SecDF-YajC accessory complex | Membrane | kustd1963 | CAJ72708 |
| YidC | membrane protein assembly | Membrane | kustd1734 | CAJ72479 |
| TatA/B | Transmembrane pore | Membrane | kuste2348 | CAJ73093 |
| TatC | Tat signal recognition | Membrane | kustc0286 | CAJ71031 |
| Signal Peptidase I | Sec signal peptidase | Membrane | kuste3749 | CAJ74512 |
| Signal Peptidase II | lipoprotein signal peptidase | Membrane | kuste4338 | CAJ75100 |
| Signal Peptidase IV | prepilin signal peptidase | Membrane | kustc0984 | CAJ71729 |
| FtsY | SRP receptor | Membrane | kustc0279 | CAJ71024 |
| Ffh | Major SRP subunit | Cytoplasm | kuste3317 | CAJ74078 |
Putative Candidatus Kuenenia stuttgartiensis orthologues of proteins that are known to be involved in protein sorting. Orthology is based on reciprocal best Blast hits and on the unique presence of PFAM functional domains.
Figure 4Sequence composition of the signal peptides of the anammoxosome and cell envelope protein sets. Weblogos of the signal peptides of protein sets A (anammoxosomal) and P (cell envelope) are shown. Both the hydrophobic h-regions (residues -6 to -17) and the signal peptidase AxA consensus (residues -1 to -3) preceding the cleavage site are clearly visible. The weblogos were created from sequences aligned to the cleavage site, using Weblogo [47].
Figure 5Physicochemical differences between anammoxosomal and cell envelope proteins. Two physicochemical parameters are plotted against each other: GRAVY index (grand average of hydropathy) and aliphatic index (relative volume occupied by aliphatic side chains of I, L, V, and A), which can both be calculated from amino acid compositions. These two parameters separate sets A and P into two largely distinct clusters. Purple dots: set P. Blue dots: set A.