| Literature DB >> 34730408 |
Sook-Ha Fan1, Elisa Liberini1, Friedrich Götz1.
Abstract
In recent years, it became apparent that not only autotrophic but also most other bacteria require CO2 or bicarbonate for growth. Two systems are available for the acquisition of dissolved inorganic carbon supply (DICS): the cytoplasmic localized carbonic anhydrase (CA) and the more recently described bicarbonate transporter MpsAB (membrane potential generating system). In the pathogenic species Staphylococcus aureus, there are contradictions in the literature regarding the presence of a CA or MpsAB. Here, we address these contradictions in detail. We could demonstrate by careful BLASTp analyses with 259 finished and 4,590 unfinished S. aureus genomes that S. aureus does not contain CA and that the bicarbonate transporter MpsAB is the only DICS system in this species. This finding is further supported by two further pieces of evidence: (i) mpsAB deletion mutants in four different S. aureus strains failed to grow under atmospheric air, which should not be the case if they possess CAs, since we have previously shown that both CA and MpsAB can substitute for each other, and (ii) S. aureus is completely resistant to CA inhibitors, whereas Staphylococcus carnosus, which has been shown to have only CA, was inhibited by ethoxyzolamide (EZA). Taken together, we demonstrate beyond doubt that the species S. aureus possesses only the bicarbonate transporter MpsAB as its sole DICS system. IMPORTANCE The discrepancies in the current literature and even in NCBI database, which listed some protein sequences annotated as Staphylococcus aureus carbonic anhydrase (CA), are misleading. One of the existing problems in publicly available sequence databases is the presence of incorrectly annotated genes, especially if they originated from unfinished genomes. Here, we demonstrate that some of these unfinished genomes are of poor quality and should be interpreted with caution. In the present study, we aimed to address these discrepancies and correct the current literature about S. aureus CA, considering the medical relevance of S. aureus. If left unchecked, these misleading studies and wrongly annotated genes might lead to a continual propagation of wrong annotation and, consequently, wrong interpretations and wasted time. In addition, we also show that bicarbonate transporter MpsAB-harboring bacteria are resistant to CA inhibitor, suggesting that pathogens possessing both MpsAB and CA are not treatable with CA inhibitors.Entities:
Keywords: Firmicutes; MpsAB; Staphylococcus aureus; Staphylococcus carnosus; bicarbonate transporter; carbonic anhydrase
Mesh:
Substances:
Year: 2021 PMID: 34730408 PMCID: PMC8567241 DOI: 10.1128/Spectrum.00970-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
The presence of CAs inferred from Pfam motif correlates with the CA protein identity of S. carnosus in selected finished Staphylococcus genomes
| Genome | CA | Identity (%) (amino acids aligned based on BLASTp) | |
|---|---|---|---|
| Pro | Euk | ||
| + | − | 73 (135/186) | |
| − | − | ||
| − | − | ||
| − | − | ||
| − | − | ||
| + | − | 100 (192/192) | |
| − | − | ||
| + | − | 97 (187/192) | |
| − | − | ||
| − | − | ||
| + | − | 73 (135/186) | |
| − | − | ||
| − | − | ||
| + | − | 72 (134/185) | |
| − | − | ||
| + | − | 70 (133/191) | |
| + | − | 70 (130/188) | |
| − | − | ||
| − | − | ||
| + | − | 75 (140/192) | |
| + | − | 96 (185/192) | |
| + | − | 70 (131/188) | |
| − | − | ||
| + | − | 73 (132/185) | |
| + | − | 67 (126/188) | |
| − | − | ||
| + | − | 87 (167/192) | |
| + | − | 66 (125/190) | |
| − | − | ||
| − | − | ||
| − | − | ||
The presence of the proteins was inferred based on the following protein families (Pfam) domains search from finished bacterial genomes in the Integrated Microbial Genomes and Microbiomes (IGM/M) database: prokaryotic type-carbonic anhydrase (CA) (pro) (PFam00484), eukaryotic-type CA (euk) (PFam00194), and PFam10563 for putative CA-like domain. The symbols + and − indicate the presence or absence of the protein domains. Identity refers to identical residues shared with CA from S. carnosus (WP_015900702.1) using protein-protein Basic Local Alignment Search Tool (BLASTp).
Analysis of proteins wrongly annotated as S. aureus CA in NCBI
| NCBI accession no./length (amino acids) | Annotation in NCBI | Source (strain) | Comment |
|---|---|---|---|
| MBO8619751.1 (64) | Carbonic anhydrase family protein, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest of the hits were from multiple organisms with the highest identity from one | |
| MBO8666615.1 (77) | Carbonic anhydrase family protein, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest of the hits were from multiple organisms with the highest identity from | |
| MVW54107.1 (151) | Carbonic anhydrase, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest of the 99 hits were from different bacteria like | |
| NGB42162.1 (184) | Gamma-carbonic anhydrase family protein ( | When this sequence was subjected to BLASTp search in NCBI, there was no hit even against its own sequence or any | |
| NGG14433.1 (97) | Gamma-carbonic anhydrase family protein, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest of the 99 hits were all from | |
|
| Carbonic anhydrase, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit with 100% identity but it was annotated as SulP family inorganic anion transporter, partial from | |
| Carbonic anhydrase ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest were multiple hits from other | ||
| Carbonic anhydrase ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity). The rest were multiple hits from other | ||
| Gamma-carbonic anhydrase family protein, partial ( | When this sequence was subjected to BLASTp search in NCBI, there was only one hit against its own sequence (100% identity) and almost all the hits were from |
SulP, sulfate permease.
Submitted by the same group.
Submitted by the same group.
Protein sequence similarity search for selected α-CAs in the genomes of S. aureus, S. carnosus, and S. pseudintermedius using BLASTp
| Species | UniProt ID/length (amino acids) | Ref | Identity (%) | Comment | ||
|---|---|---|---|---|---|---|
|
|
| No significant similarity found | No significant similarity found | No significant similarity found | ||
|
|
| No significant similarity found | No significant similarity found | 98 (53/54) ( | When | |
|
| No significant similarity found | No significant similarity found | No significant similarity found | |||
|
|
| No significant similarity found | No significant similarity found | 37 (18/49) ( | Same comment as in | |
| Human CA1 | 38 (27/72) (MBO8666615.1) in only one unfinished genome of | No significant similarity found | No significant similarity found | See | ||
| Human CA2 | 50 (27/54) (MBO8619751.1) in one unfinished genome of | No significant similarity found | No significant similarity found | See | ||
Identity refers to shared identical residues with each of the CA proteins (UniProt ID) from selected bacteria and the indicated Staphylococcus species using BLASTp.
Protein sequence similarity search for selected γ-CAs in the genomes of S. aureus, S. carnosus, and S. pseudintermedius using BLASTp
| Species | UniProt ID/length (amino acids) | Ref | Identity (%) | ||
|---|---|---|---|---|---|
|
|
| 3 hits in | No significant similarity found | No significant similarity found | |
|
| 3 hits in | No significant similarity found | No significant similarity found | ||
|
|
| 3 hits in | No significant similarity found | No significant similarity found | |
|
|
| 31 (51/163) (MVW54107.1) | No significant similarity found | No significant similarity found | |
Identity refers to shared identical residues with each of the carbonic anhydrase (CA) protein (UniProt ID) from selected bacteria and the indicated Staphylococcus species using BLASTp.
See Table 5 for comments regarding these protein sequences.
Homology of protein annotated as S. aureus CAs in all finished and permanent genomes sequences of S. aureus
| Accession no./protein length (amino acids) | Identity (%) | Comment |
|---|---|---|
| Finished genomes in IMG/M (259 strains) | ||
| MBO8619751.1 (64) | No significant similarity found | |
| MBO8666615.1 (77) | No significant similarity found | |
| MVW54107.1 (151) | No significant similarity found | |
| NGB42162.1 (184) | No significant similarity found | |
| NGG14433.1 (97) | No significant similarity found | |
| No identity in 253 strains, except in 5 strains which showed 27 (46/170) identity in proteins annotated as sulfate permease | ||
| No significant similarity found | ||
| No significant similarity found | ||
| All strains show 33 (40/122) identity in proteins annotated as acetyltransferase (isoleucine patch superfamily), acetyltransferase-like (isoleucine patch superfamily), galactoside O-acetyltransferase, or hypothetical protein | Annotated as such because these sequences have the related COG, KOG, or Pfam motifs | |
| Permanent draft genomes in IMG/M (4,590 strains) | ||
| MBO8619751.1 (64) | No significant similarity found | |
| MBO8666615.1 (77) | No significant similarity found | |
| MVW54107.1 (151) | No identity in all 4,586 strains except 4 strains: DEU37 (30 [49/163] as CA, partial gene [“no stop”]), DEU28 (30 [49/163] as CA), DEU35 (33 [39/117] as CA, partial gene [“no start”]), DEU41 (33 [39/117] as CA, partial gene [“no start”]) | All the 4 strains listed here were marked as “anomalous assembly: contaminated” by NCBI |
| NGB42162.1 (184) | No identity in 4,584 strains except DEU28, DEU35, DEU37, DEU41, and DEU42 (37 [57/156] as CA or acetyltransferase), DEU39 (36% [31/86] as transferase hexapeptide [six repeat-containing protein]), C0673 (39 [67/171] as CA or acetyltransferase encoded by gene V070_00826) | All the 6 strains listed here were marked as “anomalous assembly: contaminated” by NCBI. C0673 is wrongly annotated as |
| NGG14433.1 (97) | No identity in all 4,583 strains except DEU28, DEU35, DEU37, DEU41, and DEU42 (35 [27/77] as CA or acetyltransferase), DEU39 (partial gene [“no start”]), C0673 (41 [38/92] as CA or acetyltransferase encoded by gene V070_00826) | All the 6 strains listed here were marked as “anomalous assembly: contaminated” by NCBI. C0673 is wrongly annotated as |
| No identity in 4,514 strains except in 76 strains which showed 27 (46/170) identity in proteins annotated as sulfate permease | ||
| No identity in all 4,589 strains except C0673 (66 [35/53] as CA encoded by gene V070_02709) | C0673 is wrongly annotated as | |
| No identity in all 4,589 strains except C0673 (71 [95/133] as CA encoded by gene V070_02709 and another 25 [32/126] as CA encoded by V070_01492) | C0673 is wrongly annotated as | |
| All 4,590 strains showed 33 (40/122) as acetyltransferase (isoleucine patch superfamily), galactoside O-transferase, or hypothetical protein. A few strains have unspecific hits, for example: NRS384 (35 [33/95] as hexapeptide repeat of succinyltransferase), OCMM6067 (32 [23/66] as 2,3,4,5-tetrahydropyridine-2-6, dicarboxylate | These strains have the same similarity as all the finished genomes. As there only a few strains out of 4,590 permanent draft sequences with unspecific and low identity, the origins of each of these strains were not examined. C0673 is wrongly annotated as |
COG, Clusters of Orthologous Genes; KOG, Eukaryotic Orthologous Groups; IGM/M, Integrated Microbial Genomes and Microbiomes; Pfam, protein families.
Identity refers to shared identical residues with each of the CA proteins (NCBI accession number) against the indicated S. aureus genomes in IMG/M database using BLASTp.
MIC values of CA inhibitors against selected staphylococcal strains
| Dissolved inorganic carbon supply (DICS) system | Strain | MIC (μM) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| AZA | EZA | DOZ | MEZ | CT | FAM | S0859 | CEL | ||
| MpsAB | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | |
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| CA | >1,000 | 64 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | |
| >1,000 | 250 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | 250 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
| >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | >1,000 | ||
AZA, acetazolamide; EZA, ethoxyzolamide; DOR, dorzolamide; MEZ, methazolamide; CT, chlorthalidone; FAM, famotidine; S0859, N-cyanosulphonamide; CEL, celecoxib.
MIC values were obtained from three independent biological replicates. Vancomycin and oxacillin were used as positive controls with a MIC of <2 μM for both. (EZA 64 μM = 16.5 μg/ml, 250 μM = 64.6 μg/ml).
FIG 1Disk diffusion results showing inhibition zones of CA inhibitor ethoxyzolamide (EZA) against selected staphylococcal strains. The Mueller-Hinton agar plates were inoculated with S. aureus HG001, S. carnosus TM300, and S. carnosus TM300 (pRB473-can), in which the CA was overexpressed, and S. pseudintermedius ED99. Paper disks impregnated with 10 μl of EZA, oxacillin (OXA), and vancomycin (VAN) as positive controls at concentrations of 1 mM each and appropriate concentration of DMSO as negative control were incubated at 37°C overnight in atmospheric and CO2 conditions.