| Literature DB >> 17324287 |
Abstract
BACKGROUND: The genome of Mycobacterium tuberculosis harbors four copies of a cluster of genes termed mce operons. Despite extensive research that has demonstrated the importance of these operons on infection outcome, their physiological function remains obscure. Expanding databases of complete microbial genome sequences facilitate a comparative genomic approach that can provide valuable insight into the role of uncharacterized proteins.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17324287 PMCID: PMC1810536 DOI: 10.1186/1471-2164-8-60
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic representation of the . Proximal transcription regulators are colored in purple, yrbE genes in blue, mce genes in green, and genes encoding 'conserved mce-associated proteins' in yellow [44].
Distribution of Mce and YrbE proteins within the Order Actinomycetalesa
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 6 | 2 | UniProt | |||
| 18 | 7 | UniProt | |||
| 24 | 7 | TIGR | |||
| 24 | 8 | TIGR | |||
| 48 | 14 | UniProt | |||
| 34 | 11 | TIGR | |||
| 38 | 11 | JGI | |||
| 38 | 12 | JGI | |||
| 50 | 16 | JGI | |||
| 48 | 13 | UniProt | |||
| 66 | 24 | UniProt | |||
| 36 | 12 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | JGI | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 6 | 2 | NCBI | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 0 | 0 | UniProt | |||
| 12 | 3 | UniProt | |||
| 0 | 0 | UniProt | |||
| 6 | 2 | Pfam | |||
| 6 | 2 | UniProt | |||
| 6 | 2 | UniProt | |||
| 0 | 0 | UniProt |
a Taxonomy from Bergey's Manual of Systematic Bacteriology [107]
b Number of proteins classified as PF02470
c Number of proteins classified as PF02405
d Incomplete genome, EMBL Accession AF040570
Figure 2Schematic representation of the organization of . Genes encoding proteins belonging to Pfam family PF02470 (Mce) are depicted as green boxes, and to family PF02405 (DUF140) as blue boxes. Dashes indicate gaps in gene numbering.
Figure 3Conserved proteins encoded in the neighborhood of . Coloring reflects conserved domains identified in the key. Protein families shown are: NBD, an ABC transporter ATPase (IPR003439); DUF140 (IPR003453); Mce (IPR003399); Tol, a Ttg2 toluene tolerance protein (IPR008869); STAS, a domain found in sulfate transporters and anti-sigma factor antagonists (IPR002645); VacJ, a lipoprotein of unknown function (IPR007428); BolA, a possible regulator induced by stress (IPR002634); MurA, UDP-N-acetylglucosamine-1-carboxyvinyltransferase (IPR005750); DUF330 (IPR005586); PqiA, an integral membrane protein inducible by superoxide generators (IPR007498); SAM, an S-adenosyl methionine binding methyltransferase (IPR000051); and ABC2, an ABC-2 type permease (IPR013525).
Actinomycetales mce-linked ATPases and mycobacterial orthologs
| TrEMBL: Q7BUF5 | |
| JNB_08429 | |
| nfa51100 | |
| NocaDRAFT_4321 | |
| SAV5902 | |
| SCO2422 | |
| Mb0674 | |
| MflvDRAFT_3283 | |
| ML1892 | |
| MAP4129 | |
| MSMEG1359 | |
| MjlsDRAFT_1757 | |
| MkmsDRAFT_1059 | |
| MmcsDRAFT_0968 | |
| MT0684 | |
| Rv0655 | |
| MvanDRAFT_5200 |
Figure 4Phylogenetic tree showing relationship between . ATPases encoded within mce operons in Actinomycetales species are colored blue; those in Gram-negative bacterial mce operons are colored green. The sequences most similar to nfa51100, SAV5902 and SCO2422 (indicated in bold), in the Actinomycetales genomes listed in Table 1, were identified by BLASTP searches and included in the tree. All of the best hits from mycobacterial species cluster within the Mkl family and are colored red. For comparison, sequences of all M. tuberculosis H37Rv ATPases of ABC uptake transporters were included [20]. All of the top hits from Actinomycetales that do not possess mce operons are rooted among these non-mce-linked ATPases, as are all of the second hits from mycobacterial species. ORFs are designated by (UniProt gene name | protein name).
Classification of Actinomycetales yrbE and mce genes a
| 0167 | 0168 | 0169 | 0170 | 0171 | 0172 | 0173 | 0174 | |
| 0176 | 0177rc | 0178 | 0179 | 0180 | 0181 | 0182 | 0183 | |
| 0173 | 0174 | 0175 | 0176 | 0177 | 0178 | 0179 | 0180 | |
| 2587 | 2588 | 2589 | 2590 | 2591 | 2592 | 2593 | 2594 | |
| 3602 | 3603 | 3604 | 3605 | 3606 | 3607 | 3608 | 3609 | |
| 0126 | 0127 | 0128 | 0129 | 0130 | 0131 | 0132 | 0133 | |
| 0587 | 0588 | 0589 | 0590 | 0591 | 0592 | 0593 | 0594 | |
| 0616 | 0617 | 0618 | 0619 | 0621 | 0622 | 0623 | 0624 | |
| 0602 | 0603 | 0604 | 0605 | 0606 | 0607 | 0609 | 0610 | |
| 4082 | 4083 | 4084 | 4085 | 4086 | 4087 | 4088 | 4089 | |
| 1964 | 1965 | 1966 | 1967 | 1968 | 1969 | 1970 | 1971 | |
| 2016 | 2017 | 2018 | 2019 | 2020 | 2021 | 2022 | 2023 | |
| 1999 | ||||||||
| 2117c | 2117c.1d | 2116c | 2115c | 2114c | 2113c | 2112c | 2111c | |
| 0335 | 0336e | 0337 | 0338 | 0339 | 0340 | 0341 | 0342 | |
| 3451c | 3450c | 3499c | 3498c | 3497c | 3496c | 3495c | 3494c | |
| 3605 | 3604 | 3603 | 3602 | 3601 | 3600 | 3599 | 3598 | |
| 3531c | 3530c | 3529c | 3528c | 3527c | 3526c | 3525c | 3524c | |
| 0562 | 0563 | 0564 | 0565 | 0566 | 0567 | 0568 | 0569 | |
| 5861 | 5860 | 5859.3e | 5859.2e | 5859.1e | 5859 | 5858 | 5857.1e | |
| 5350 | 5360 | 5370 | 5380 | 5390 | 5400 | 5410 | 5420 | |
| 0757 | 0758 | 0759 | 0760 | 0761 | 0762/3g | 0764 | 0765 | |
| 2189 | 2190 | 2191 | 2192 | 2193h | 2194 | |||
| 2855 | 2856 | 2857 | 2858 | 2859 | 2860 | 2861 | 2862 | |
| 4785 | 4784 | 4783 | 4782 | mei | 4777 | 4776 | 4775 | |
| 51090 | 51080 | 51070 | 51060 | 51050 | 51040 | 51030 | 51020 | |
| 5901 | 5900 | 5899 | 5898 | 5897 | 5896 | 5895 | 5894 | |
| 2421 | 2420 | 2419 | 2418 | 2417 | 2416 | 2415 | 2514 | |
| mei | 0107 | 0108 | 0109 | 0110 | 0111 | 0112 | 0113 | |
| 1849 | 1850 | 1851 | 1852 | 1853 | 1854 | 1855 | 1856 | |
| 1131 | 1132 | 1133 | 1134 | 1135 | 1136 | 1137 | 1138 | |
| 50540 | 50530 | 50520 | 50510 | 50500 | 50490 | 50480 | 50470 | |
| 56330 | 56320 | 56310 | 56300 | 56290 | 56280 | 56270 | 56260 | |
| 11130 | 11140 | 11150 | 11160 | 11170 | 11180 | 11190 | 11200 | |
| 29780 | 29770 | 29760h | 29750 | 29740 | 29730 | 29720 | 29710 | |
a Operons mce1-4 designated as in TubercuList; mce5-8 designated herein. Gene names in organisms other than M. tuberculosis do not correspond to those given in genome annotation.
b Organism specific gene number prefix: Rv, M. tuberculosis H37Rv; MT, M. tuberculosis CDC1551; Mb, M. bovis; ML, M. leprae; MAP, M. paratuberculosis; MSMEG, M. smegmatis; nfa, N. farcinica; SCO, S. coelicolor; SAV, S. avermitilis.
c Orthologous sequence present, but ORF annotated in reverse direction.
d Orthologous sequence present, but not annotated. ORF extends ~400 bp at 5'end.
e Orthologous sequence present, but not annotated.
f Orthology inferred from synteny.
g Contains frameshift mutation, resulting in two ORFs.
h Not a member of IPR003399 or IPR005693.
i Insertion of mobile element.
j Orthology inferred from synteny.
Figure 5Phylogenetic tree of . A non-redundant set of Mce protein sequences were aligned and an unrooted neighbor-joining tree was computed by MEGA. Coloring corresponds to the classification scheme specified in Table 3. ORFs are designated by [gene locus name | operon number (1–8) and gene position (A-F)]. Where operon orthology cannot be inferred, operons are designated: -1, -2.
Figure 6Illustration of conserved regions and predicted secondary structure of . Six separate alignments of the Mce proteins (A-F) listed in Table 3 were submitted to JPred and the consensus secondary structure prediction estimated manually. White boxes represent α-helices and grey arrows β-strands. The C-terminal proline-rich region had low complexity and varied in length from 10–250 amino acids. Signal sequences were identified by SignalP and lipid attachment sites matched the ProSite motif PS00013.
Figure 7Phylogenetic tree of . A non-redundant set of YrbE protein sequences were aligned and an unrooted neighbor-joining tree was computed by MEGA. Coloring corresponds to the classification scheme specified in Table 3. ORFs are designated by [gene locus name | operon number (1–8) and gene position (A, B)]. Where operon orthology cannot be inferred, operons are designated: -1, -2.
Figure 8Predicted topology and conserved sequence motif of . (A) The consensus topology prediction of Actinomycetales YrbE proteins analysis is shown compared to that of a typical ABC permease [42]. (B) WebLogo illustration of the conserved YrbE EExDA sequence motif identified through MEME analysis.
Mas Homologs in Selected Actinomycetales Genomesab
| 0175 (213) | 2595 (182) | 3610 (213) | 0134 (202) | ||||
| 0176 (322) | 2596 (325) | 3611 (323) | 0135 (288) | ||||
| 0177 (184) | 2597 (184) | 3612 (184) | 0136 (182) | ||||
| 0178 (244) | 2598 (184) | 3613 (252) | 0137 (296) | ||||
| 1972 (191) | 2110c (203) | 0343 (200) | |||||
| 1973 (160) | 2109/7cc | 0344 (202) | |||||
| 3493c (242) | 0570 (243) | 5857 (233) | 5430 (315) | ||||
| 3492c (160) | 0571 (164) | 5856 (161) | 5440 (162) | ||||
| 51010 (248) | 5893 (177) | 2413 (170) | |||||
| 51000 (274) | 5892 (272) | 2412 (219) | |||||
| 2411 (184) | |||||||
| 2410 (253) | |||||||
| 0114 (198) | 1139 (230) | 50460 (246) | |||||
| 1857 (227) | 56250 (321) | ||||||
| 1363c (261) | 0751c (295) | 4759.2 (303) | |||||
| 1362c (220) | 0750c (187) | 4759 (200) | |||||
| 0768c (298) | 2867 (190) | ||||||
| 0767c (224) | 2868 (218) | ||||||
| 0199 (219) | 2614 (229) | 0225 (206) | 6070 (197) | ||||
| 0200 (229) | 2615 (224) | 0226 (229) | |||||
| 2390c (185) | 0090c (212) | ||||||
| 0878 (167) | |||||||
| 0879 (496) | |||||||
| 5189 (231) | |||||||
| 5190 (192) | |||||||
a Organism specific gene number prefix: Rv, M. tuberculosis H37Rv; ML, M. leprae; MAP, M. paratuberculosis; MSMEG, M. smegmatis; nfa, N. farcinica; SCO, S. coelicolor; SAV, S. avermitilis.
b Each row contains putative orthologs. Length of protein in amino acids shown in parentheses.
c ORF is interrupted by a transposase, MAP2108.
Figure 9Phylogenetic tree of mycobacterial Mas domain sequences. The conserved Mas domains of mycobacterial proteins listed in Table 4 were aligned and an unrooted neighbor-joining tree was computed by MEGA. Coloring corresponds to the classification scheme specified in Table 3. ORFs are designated by [gene locus name | operon number (1, 3, 4, 7) and gene position (A-D)]. Where operon orthology cannot be inferred, operons are designated: -1, -2.
Figure 10Representative architectures of Mas domain-containing proteins. Membrane topology predictions for the 66 Mas proteins listed in Table 4 indicated that the conserved domain was located on the extracellular side of the cytoplasmic membrane. The Mas domain was predicted to remain anchored in the majority of proteins (A), but cleaved in eight (B). Three transmembrane segments were identified in seven proteins and four of these were classified as RDD domains (C, D). Five proteins contained an N-terminal coiled-coil region (E), and one, a serine-threonine protein kinase domain (STPK; F).