Literature DB >> 25370496

Comparative analyses of nonpathogenic, opportunistic, and totally pathogenic mycobacteria reveal genomic and biochemical variabilities and highlight the survival attributes of Mycobacterium tuberculosis.

Syed Asad Rahman1, Yadvir Singh2, Sakshi Kohli2, Javeed Ahmad3, Nasreen Z Ehtesham, Anil K Tyagi4, Seyed E Hasnain5.   

Abstract

UNLABELLED: Mycobacterial evolution involves various processes, such as genome reduction, gene cooption, and critical gene acquisition. Our comparative genome size analysis of 44 mycobacterial genomes revealed that the nonpathogenic (NP) genomes were bigger than those of opportunistic (OP) or totally pathogenic (TP) mycobacteria, with the TP genomes being smaller yet variable in size--their genomic plasticity reflected their ability to evolve and survive under various environmental conditions. From the 44 mycobacterial species, 13 species, representing TP, OP, and NP, were selected for genomic-relatedness analyses. Analysis of homologous protein-coding genes shared between Mycobacterium indicus pranii (NP), Mycobacterium intracellulare ATCC 13950 (OP), and Mycobacterium tuberculosis H37Rv (TP) revealed that 4,995 (i.e., ~95%) M. indicaus pranii proteins have homology with M. intracellulare, whereas the homologies among M. indicus pranii, M. intracellulare ATCC 13950, and M. tuberculosis H37Rv were significantly lower. A total of 4,153 (~79%) M. indicus pranii proteins and 4,093 (~79%) M. intracellulare ATCC 13950 proteins exhibited homology with the M. tuberculosis H37Rv proteome, while 3,301 (~82%) and 3,295 (~82%) M. tuberculosis H37Rv proteins showed homology with M. indicus pranii and M. intracellulare ATCC 13950 proteomes, respectively. Comparative metabolic pathway analyses of TP/OP/NP mycobacteria showed enzymatic plasticity between M. indicus pranii (NP) and M. intracellulare ATCC 13950 (OP), Mycobacterium avium 104 (OP), and M. tuberculosis H37Rv (TP). Mycobacterium tuberculosis seems to have acquired novel alternate pathways with possible roles in metabolism, host-pathogen interactions, virulence, and intracellular survival, and by implication some of these could be potential drug targets. IMPORTANCE: The complete sequence analysis of Mycobacterium indicus pranii, a novel species of Mycobacterium shown earlier to have strong immunomodulatory properties and currently in use for the treatment of leprosy, places it evolutionarily at the point of transition to pathogenicity. With the purpose of establishing the importance of M. indicus pranii in providing insight into the virulence mechanism of tuberculous and nontuberculous mycobacteria, we carried out comparative genomic and proteomic analyses of 44 mycobacterial species representing nonpathogenic (NP), opportunistic (OP), and totally pathogenic (TP) mycobacteria. Our results clearly placed M. indicus pranii as an ancestor of the M. avium complex. Analyses of comparative metabolic pathways between M. indicus pranii (NP), M. tuberculosis (TP), and M. intracellulare (OP) pointed to the presence of novel alternative pathways in M. tuberculosis with implications for pathogenesis and survival in the human host and identification of new drug targets.
Copyright © 2014 Rahman et al.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25370496      PMCID: PMC4222108          DOI: 10.1128/mBio.02020-14

Source DB:  PubMed          Journal:  MBio            Impact factor:   7.867


INTRODUCTION

The evolution of Mycobacterium species is usually driven by processes, including deletion (nonfunctional genes are deleted/inactivated and subsequently eroded), insertion (horizontal transfer and gene duplication), or a combination of these events, which aid in survival under different environmental conditions or geographic niches (1–8). In nature, the free-living species require larger genomes than parasitic species (9, 10). This trend is also clearly evident from analyses of mycobacterial genomes where a distinct pattern of decreasing genomic content is seen as one moves from nonpathogenic pathogens (NP) to opportunistic pathogens (OP) to true pathogens (TP). We therefore performed genome size analysis with 44 Mycobacterium strains (Table 1) that represented NP, OP, and TP, and our analysis revealed that NP strains on average are bigger than those of OP and TP strains. One of the largest genomes in the Mycobacterium genus is that of Mycobacterium smegmatis, a nonpathogenic mycobacterium with approximately 6,717 protein-coding genes (genome size, 6.9 Mb). On the other extreme is a true pathogenic mycobacterium, Mycobacterium leprae (the leprosy bacterium), with the smallest genome, of approximately 2,770 protein-coding genes (genome size, 3.3 Mb) and approximately 1,600 functional and 1,100 nonfunctional/inactive genes (11).
TABLE 1 

Mycobacterial genomes selected for analysis

OrganismKEGG nameYr of sequencingNo. ofgenesGenomesize (bp)Pathogenicity
Mycobacterium smegmatis MC2 155 uid57701[a]msm20066,9386,988,209NP
Mycobacterium smegmatis MC2 155 uid171958msg20126,7426,988,208NP
Mycobacterium vanbaalenii PYR 1[a]mva20066,1366,491,865NP
Mycobacterium sp. KMSmkm20066,0796,256,079NP
Mycobacterium sp. JLSmjl20075,8426,048,425NP
Mycobacterium gilvum PYR-GCK[a]mgi20075,6695,982,829NP
Mycobacterium sp. MCSmmc20065,6985,920,523NP
Mycobacterium gilvum Spyr1msp20105,5525,783,292NP
Mycobacterium indicus prani[a]mid20125,3185,589,007NP
Mycobacterium sp. JDM601mjd20114,3984,643,668NP
Mycobacterium tuberculosis H37Ramra20074,0844,419,977NP
Mycobacterium bovis BCG Pasteur 1173P2mbb20074,0334,374,522NP
Mycobacterium bovis BCG Tokyo 172mbt20094,0274,371,711NP
Mycobacterium bovis BCG Mexicombm20124,0314,350,386NP
Mycobacterium rhodesiaemrh20126,3366,415,739OP
Mycobacterium chubuensemcb20126,0686,342,624OP
Mycobacterium sp. MOTT36Ymmm20125,1775,613,626OP
Mycobacterium intracellulare MOTT-64mir20125,2975,501,090OP
Mycobacterium avium 104[a]mav20065,3135,475,491OP
Mycobacterium intracellulare MOTT-02mit20125,1985,409,696OP
Mycobacterium intracellulare ATCC 13950[a]mia20125,1935,402,402OP
Mycobacterium abscessus ATCC 19977[a]MAb20084,9915,090,491OP
Mycobacterium massiliensemmv20122,6805,068,807OP
Mycobacterium avium paratuberculosis K-10[a]mpa20044,3994,829,781OP
Mycobacterium marinum M[a]mmi20085,5706,660,144TP
Mycobacterium ulcerans[a]mul20065,0625,805,761TP
Mycobacterium canettiimce20113,9824,482,059TP
Mycobacterium tuberculosis F11mtf20073,9984,424,435TP
Mycobacterium tuberculosis UT205mtd20123,8594,418,088TP
Mycobacterium tuberculosis H37Rv uid170532mtv20124,1704,411,708TP
Mycobacterium tuberculosis H37Rv uid57777[a]mtu19984,0624,411,532TP
Mycobacterium tuberculosis RGTB423mti20123,6704,406,587TP
Mycobacterium tuberculosis CCDC5180mtl20123,6384,405,981TP
Mycobacterium tuberculosis CDC1551mtc20014,2934,403,837TP
Mycobacterium tuberculosis KZN 605mtz20124,0714,399,120TP
Mycobacterium tuberculosis CCDC5079mte20123,6954,398,812TP
Mycobacterium tuberculosis CTRI-2mto20124,0014,398,525TP
Mycobacterium tuberculosis KZN 1435mtb20094,1074,398,250TP
Mycobacterium tuberculosis KZN 4207mtk20124,0444,394,985TP
Mycobacterium africanummaf20113,9834,389,314TP
Mycobacterium tuberculosis RGTB327mtg20123,7394,380,119TP
Mycobacterium bovis AF2122/97 [a]mbo20034,0014,345,492TP
Mycobacterium leprae TN[a]mle20012,7703,268,203TP
Mycobacterium leprae Br4923mlb20092,7703,268,071TP

Strain used for further analyses.

Mycobacterial genomes selected for analysis Strain used for further analyses. Mycobacterium indicus pranii (12) has been shown to have novel immunomodulatory properties (13–17) and proven therapeutic value in the treatment of leprosy (13, 14). The evolution of this clinically “benevolent” bacterium has been suggested to be at the point of transition to pathogenicity (12), despite earlier data from DNA sequence analysis of select genes, hsp70 (EU688981), gyrA (EU688980), and dnaJ (EU688982) of M. indicus pranii, which showed identity (99%) with corresponding genes of M. intracellulare. Comparative proteomic analyses of virulence factors of M. tuberculosis and their homologs in 12 different mycobacterial species, including M. indicus pranii, point toward gene cooption as an important mechanism in the evolution of mycobacteria (18). We now describe comparative proteomic analyses of 13 species of Mycobacterium, including M. indicus pranii (Table 2). The 13 Mycobacterium species were selected because they represented TP, OP, and NP. True pathogens, the most virulent mycobacteria, include Mycobacterium tuberculosis, the causative agent of human tuberculosis; Mycobacterium bovis, the causative agent of bovine tuberculosis; Mycobacterium leprae, the causative agent of leprosy, and a virulent nontuberculous mycobacterium (NTM), Mycobacterium ulcerans, which causes Buruli ulcers, which are the third most common mycobacterial disease in humans (19). Mycobacterium marinum, the causative agent of fish tank granuloma in humans and granulomatous lesions similar to those of M. tuberculosis in zebrafish, was also included in the true pathogen group for our analyses (20, 21). Opportunistic pathogens belong to the NTM group and cause pulmonary and other disseminated infections in immunocompromised individuals (22). Members of the Mycobacterium avium complex (MAC), Mycobacterium avium and Mycobacterium avium-M. intracellulare, cause opportunistic pulmonary infections in humans, whereas Mycobacterium avium subsp. paratuberculosis, the third member of the MAC group, is the suspected causative agent of Crohn’s disease in humans (22, 23). Mycobacterium abscessus is a rapid-growing mycobacterium which causes pulmonary and cutaneous infections in immunocompromised hosts (24). The nonpathogenic group includes Mycobacterium gilvum, Mycobacterium vanbaalenii, and Mycobacterium smegmatis, which rarely cause disseminated infections, even in immunocompromised individuals (25–27). Our results convincingly establish the very upstream evolutionary position of M. indicus pranii and also highlight some important differences in the metabolic pathway of M. tuberculosis H37Rv which are of possible significance in virulence and pathogenesis.
TABLE 2 

The 13 Mycobacterium species included in the analyses

Mycobacterium speciesKEGG aliasCategorization based on virulenceNCBI RefSeq accession no.No. of proteins
Mycobacterium tuberculosis H37RvMYCTUTrue pathogenNC_0009624,003
Mycobacterium bovis subsp. bovis AF2122/97MYCBOTrue pathogenNC_0029453,918
Mycobacterium leprae TNMYCLETrue pathogenNC_0026771,605
Mycobacterium ulcerans Agy99MYCUATrue pathogenNC_005916, NC_0086114,241
Mycobacterium marinum MMYCMMTrue pathogenNC_010604, NC_0106125,452
Mycobacterium avium 104MYCA1Opportunistic pathogenNC_0085955,120
Mycobacterium intracellulare ATCC 13950MIAOpportunistic pathogenNC_0169465,144
Mycobacterium avium subsp. paratuberculosis K-10MYCPAOpportunistic pathogenNC_0029444,350
Mycobacterium abscessus ATCC 19977MYCABOpportunistic pathogenNC_010394, NC_0103974,941
Mycobacterium indicus pranii MTCC 9506MIPNonpathogenNC_0186125,254
Mycobacterium smegmatis MC2 155MYCS2NonpathogenNC_0085966,717
Mycobacterium gilvum PYR-GCKMYCGINonpathogenNC_009338, NC_009339, NC_009340, NC_0093415,579
Mycobacterium vanbaalenii PYR-1MYCVPNonpathogenNC_0087265,979
The 13 Mycobacterium species included in the analyses

RESULTS AND DISCUSSION

Reannotation of the M. indicus pranii proteome.

InterPro/Pfam domain knowledge for M. indicus pranii proteins was used to assign potential functions to 4,363 M. indicus pranii open reading frames (ORFs; ~83% of the M. indicus pranii proteome) (Fig. 1). Of the remaining 891 proteins, 164 were annotated using the phylogenetic classification of proteins encoded in complete genomes known as COG (Cluster of Orthologous Groups classification), but they failed to match with any domain in Pfam or InterPro. Previously, 3,870 (~70%) of M. indicus pranii ORFs were assigned a putative function on the basis of COG classification (Fig. 1). Out of 1,554 hypothetical proteins in M. indicus pranii based on the COG annotation, 656 have been assigned a putative function based on functional domain knowledge from the InterPro/Pfam database.
FIG 1 

Comparative plot for annotation of M. indicus pranii (MIP) based on annotations in COG and InterPro/Pfam.

Comparative plot for annotation of M. indicus pranii (MIP) based on annotations in COG and InterPro/Pfam. Interestingly, 60 proteins were found to have conflicting COG- and InterPro/Pfam-based annotations. In such ambiguous cases, the protein sequences were further submitted to analysis using GENE3D to further confirm the annotation. GENE3D upheld the Pfam/InterPro annotation for all except two cases (MIP_02898 and MIP_06278), for which no hit was found in GENE3D. COG and Pfam/InterPro annotations of these 2 proteins have no link in the existing literature or protein family knowledge (see Table S1 in the supplemental material). Thus, annotating a new proteome using Interpro not only provides better annotation coverage but also increases the confidence of annotation by providing in-depth knowledge regarding domains, motifs, and a structural annotation of the given protein sequence.

Comparative genome size analysis.

The complete genome sequences of the 44 mycobacterial species used in our analyses were available in the public domain. The Mycobacterium sp. MOTT36Y (5,613,626 bp) represents the OP group of mycobacteria closest to M. indicus pranii (5,589,007 bp) in terms of genome size. Among the OP group of mycobacteria, those closest to Mycobacterium intracellulare (28) (5,402,402 to 5,501,090 bp) are Mycobacterium sp. MOTT36Y (5,613,626 bp), Mycobacterium avium 104 (5,475,491 bp), and Mycobacterium abscessus ATCC 19977 (5,090,491 bp). It is interesting that based on the genome size, the genome of M. avium 104, an OP, fits between M. intracellulare MOTT-64 (5,501,090 bp) and M. intracellulare MOTT-02 (5,409,696 bp) (Fig. 2).
FIG 2 

The genome size of NP mycobacteria is larger than that of OP and TP mycobacteria. The green box plot represents the NP genome sizes, the orange box represents OP genomes, and TP genomes are represented by the pink box, which is very tight. The red line is the regression line, and the blue line is the lowess line. The small circles denote outliers. The TP genomes were on average smaller yet variable in size. The genome plasticity for TP mycobacteria possibly highlights their abilities to evolve and survive under various environmental conditions.

The genome size of NP mycobacteria is larger than that of OP and TP mycobacteria. The green box plot represents the NP genome sizes, the orange box represents OP genomes, and TP genomes are represented by the pink box, which is very tight. The red line is the regression line, and the blue line is the lowess line. The small circles denote outliers. The TP genomes were on average smaller yet variable in size. The genome plasticity for TP mycobacteria possibly highlights their abilities to evolve and survive under various environmental conditions.

Sequence-based functional analysis.

Homologs obtained by BLASTp analysis were assigned functional relationships by comparing their Interpro/Pfam functional domains. About 90% functional similarity between proteins can be observed if their sequences are at least 60% identical (29); neither the percentage of sequence identity nor expectation value can give complete insight into the relationship between two proteins (30). Taking these reports into account, we performed an analysis to establish functional assignments based on InterPro/Pfam domain hits to the sequence identity data of BLASTp between proteins of M. indicus pranii and 12 other Mycobacterium species (Table 2; Fig. 3A). Our analyses revealed that Interpro/Pfam hits indicated M. indicus pranii to be most closely related to members of the Mycobacterium avium complex, with M. intracellulare and M. avium 104 having 77.9% and 74.9% of proteins functionally similar to those in the M. indicus pranii proteome, respectively. The functional relatedness of homologs fits well into the upper left corner of the receiver operating characteristics (ROC) space, indicating high sensitivity and specificity, which qualifies the functional relatedness analyses as an optimal model (Fig. 3B).
FIG 3 

(A) Homologs of M. indicus pranii proteins in 12 other Mycobacterium species were determined to be functionally related using InterPro/Pfam domain knowledge. (B) The ROC points are in the upper left corner, suggesting the findings are statistically significant. MIP, M. indicus pranii.

(A) Homologs of M. indicus pranii proteins in 12 other Mycobacterium species were determined to be functionally related using InterPro/Pfam domain knowledge. (B) The ROC points are in the upper left corner, suggesting the findings are statistically significant. MIP, M. indicus pranii. This analysis was further used to find a BLASTp sequence identity cutoff score below which no homologs shared a functional domain. In order to correlate the BLASTp sequence identities of homologs with their functional relatedness, numbers of true positives (tp), false positives (fp), true negatives (tn), and false negatives (fn) were distributed as per BLASTp sequence identity, ranging from 0 to 100% and using 10% as the bin size. When we plotted the true-positive rate (TPR) versus a range of lower sequence identity cutoffs (0 to 100%), it was observed that below a 20% sequence identity no homologs shared functional domains (see Fig. S1 in the supplemental material). This cutoff was further confirmed when we plotted the Matthews correlation coefficient (MCC) for a range of sequence identity cutoffs (0 to 100%). MCC values were zero, as calculated for the sequence identity cutoff of 20%, and exhibited a sharp rise at 30% sequence identity (see Fig. S2 in the supplemental material). For further comparative analyses, a 20% sequence identity cutoff with an e value of <1e−4 was used to analyze BLASTp results. This cutoff was used to determine the number of homologous protein-coding genes shared between M. indicus pranii, M. intracellulare ATCC 13950, and M. tuberculosis H37Rv. Our analysis of homologous protein-coding genes shared between M. indicus pranii (NP), M. intracellulare ATCC 13950 (OP), and M. tuberculosis H37Rv (TP) revealed 4,995 (~95%) M. indicus pranii proteins had homology with M. intracellulare ATCC 13950, whereas homologies between M. indicus pranii, M. intracellulare ATCC 13950, and M. tuberculosis H37Rv were significantly lower. A total of 4,153 (~79%) M. indicus pranii proteins and 4,093 (~79%) M. intracellulare ATCC 13950 proteins exhibited homology with the M. tuberculosis H37Rv proteome, while 3,301 (~82%) and 3,295 (~82%) M. tuberculosis H37Rv proteins showed homology with M. indicus pranii and M. intracellulare ATCC 13950 proteomes, respectively (Fig. 4; see also Table S2 in the supplemental material).
FIG 4 

Comparative genomics of selected mycobacterial genomes. The genomes of Mycobacterium indicus pranii (shown in green; an NP), Mycobacterium intracellulare ATCC 13950 (orange; an OP), and Mycobacterium tuberculosis H37Rv (pink; a TP) were selected for comparative genomic analyses. We used BLASTp, with a cutoff of 20% identity and e value of 10e–4, to determine the number of homologous protein-coding genes common between them (shown as edge labels between the nodes). The arrowhead represents the query genome, whereas the arrow tail represents the subject genome.

Comparative genomics of selected mycobacterial genomes. The genomes of Mycobacterium indicus pranii (shown in green; an NP), Mycobacterium intracellulare ATCC 13950 (orange; an OP), and Mycobacterium tuberculosis H37Rv (pink; a TP) were selected for comparative genomic analyses. We used BLASTp, with a cutoff of 20% identity and e value of 10e–4, to determine the number of homologous protein-coding genes common between them (shown as edge labels between the nodes). The arrowhead represents the query genome, whereas the arrow tail represents the subject genome.

Comparative metabolic pathway analyses.

There were 387 enzymes (EC numbers) common between M. intracellulare ATCC 13950 and M. indicus pranii (part of the MAC complex). When these two genomes were compared to M. tuberculosis (part of the MTB complex), only 17 enzymes remained uniquely shared between the M. intracellulare ATCC 13950 and M. indicus pranii genomes (Fig. 5a). Compared to M. avium 104, only 12 enzymes remained uniquely shared between M. intracellulare ATCC 13950 and M. indicus pranii (Fig. 5b). Three enzymes, EC 1.8.7.1 (sulfite reductase [ferredoxin]) (31), EC 2.7.1.6 (galactokinase [phosphorylating]), and EC 5.4.2.8 (phospho mannose mutase) (32), were shared both between M. intracellulare ATCC 13950 and M. tuberculosis H37Rv and between M. intracellulare ATCC 13950 and M. avium 104. As these enzymes were absent from the M. indicus pranii genome and were shared between OP and TP, they may be linked to the pathogenesis of Mycobacterium tuberculosis.
FIG 5 

The enzymatic similarities between M. indicus pranii (MIP), M. intracellulare ATCC 13950 (MIA), M. avium 104 (MYCA1), and M. tuberculosis H37Rv (MYCTU) highlight interesting enzymatic plasticity properties. M. intracellulare ATCC 13950 (OP; orange) shares three enzymes (EC 1.8.7.1, sulfite reductase [ferredoxin]; EC 2.7.1.6, galactokinase [phosphorylating]; EC 5.4.2.8, phosphor mannose mutase) with M. avium 104 (OP; blue) and M. tuberculosis H37Rv (TP; red), which are absent in M. indicus pranii (NP; green). M. intracellulare ATCC 13950 and M. indicus pranii share 17 enzymes between them that are absent in M. tuberculosis H37Rv (a), while they share 12 enzymes between them that are absent in M. avium 104 (b).

The enzymatic similarities between M. indicus pranii (MIP), M. intracellulare ATCC 13950 (MIA), M. avium 104 (MYCA1), and M. tuberculosis H37Rv (MYCTU) highlight interesting enzymatic plasticity properties. M. intracellulare ATCC 13950 (OP; orange) shares three enzymes (EC 1.8.7.1, sulfite reductase [ferredoxin]; EC 2.7.1.6, galactokinase [phosphorylating]; EC 5.4.2.8, phosphor mannose mutase) with M. avium 104 (OP; blue) and M. tuberculosis H37Rv (TP; red), which are absent in M. indicus pranii (NP; green). M. intracellulare ATCC 13950 and M. indicus pranii share 17 enzymes between them that are absent in M. tuberculosis H37Rv (a), while they share 12 enzymes between them that are absent in M. avium 104 (b). Although the genome sizes of OPs (M. intracellulare ATCC 13950 and M. avium 104) and TP (M. tuberculosis) are reduced compared to the genome size of M. indicus pranii (an NP), our analysis indicated that the OP and TP genomes have acquired few enzyme-coding genes. It is tempting to suggest a likely association between these acquired enzymes and the virulence of these OPs and TPs. One of the three shared enzymes, EC 1.8.7.1, which encodes a ferredoxin-dependent sulfite reductase (encoded by the nirA gene), is active during the dormant phase and has been reported to be a potential drug target for Mycobacterium tuberculosis (33). Comparative metabolic pathway analysis (Fig. 6) between M. tuberculosis, M. intracellulare ATCC 13950, and M. indicus pranii showed the presence of alternate pathways, such as those in the fatty acid elongation pathway (fabH and fabK) and lipid biosynthesis. M. tuberculosis has acquired some novel pathways which involve 23 enzymes that are not present in M. indicus pranii or M. intracellulare ATCC 13950, such as those for butanoate metabolism, amino acid biosynthesis pathways, etc. (Fig. 6, shown in red). Alternate metabolic pathways must have evolved during mycobacterial evolution. M. tuberculosis H37Rv has few unique enzymes, which might be part of its evolutionary adaptation, and they thereby present potential drug targets. For example, gene Rv1771 (EC 1.1.3.8) is found in the ascorbate and aldarate metabolism pathways. Gene Rv3097c (EC 3.1.1.3) is an important precursor enzyme in the fatty acid pathway (Fig. 6b and c, highlighted by the cyan circle), which is absent in M. indicus pranii and M. intracellulare ATCC 13950. A few other examples include the genes Rv0069c (EC 4.3.1.17), Rv1905c (EC 1.4.3.3), Rv2192c (EC 2.4.2.18), Rv2006 (EC 3.2.1.28), Rv3393 (EC 3.2.2.1), and Rv0091 (EC 3.2.2.9), which are present in M. tuberculosis but absent in M. indicus pranii and M. intracellulare ATCC 13950. These genes might play an important role in the metabolism of M. tuberculosis as well as in the host-pathogen interaction and as a virulence factor.
FIG 6 

A comparative metabolic pathway analysis between M. tuberculosis H37Rv, M. intracellulare ATCC 13950, and M. indicus pranii reveals the presence of novel pathways in Mycobacterium tuberculosis that are not present in M. indicus pranii or M. intracellulare ATCC 13950. The comparative analysis of metabolic enzymes present in M. tuberculosis H37Rv, M. intracellulare ATCC 13950, and M. indicus pranii based on KEGG pathways are shown. (a) The unique pathways in M. tuberculosis H37Rv are shown in red. The common pathways between M. intracellulare ATCC 13950 and M. indicus pranii are shown in pink. The common pathways present in these three organisms are shown in black. Note that M. tuberculosis H37Rv has acquired alternate pathways (red) for its survival. There are few common pathways between M. intracellulare ATCC 13950 and M. indicus pranii (pink) that are absent in M. tuberculosis H37Rv. (b) A section (circled) of the lipid biosynthesis subpathway (glycerolipid metabolism) highlights the presence of an alternate enzyme (EC 3.1.1.3, Rv3097c) that performs molecular transformations in M. tuberculosis H37Rv. (c) A section (circled) of the lipid biosynthesis subpathway (fatty acid metabolism) highlights an alternate enzyme (FabH gene) present in M. intracellulare ATCC 13950 and M. indicus pranii but absent in M. tuberculosis H37Rv.

A comparative metabolic pathway analysis between M. tuberculosis H37Rv, M. intracellulare ATCC 13950, and M. indicus pranii reveals the presence of novel pathways in Mycobacterium tuberculosis that are not present in M. indicus pranii or M. intracellulare ATCC 13950. The comparative analysis of metabolic enzymes present in M. tuberculosis H37Rv, M. intracellulare ATCC 13950, and M. indicus pranii based on KEGG pathways are shown. (a) The unique pathways in M. tuberculosis H37Rv are shown in red. The common pathways between M. intracellulare ATCC 13950 and M. indicus pranii are shown in pink. The common pathways present in these three organisms are shown in black. Note that M. tuberculosis H37Rv has acquired alternate pathways (red) for its survival. There are few common pathways between M. intracellulare ATCC 13950 and M. indicus pranii (pink) that are absent in M. tuberculosis H37Rv. (b) A section (circled) of the lipid biosynthesis subpathway (glycerolipid metabolism) highlights the presence of an alternate enzyme (EC 3.1.1.3, Rv3097c) that performs molecular transformations in M. tuberculosis H37Rv. (c) A section (circled) of the lipid biosynthesis subpathway (fatty acid metabolism) highlights an alternate enzyme (FabH gene) present in M. intracellulare ATCC 13950 and M. indicus pranii but absent in M. tuberculosis H37Rv.

Conclusions.

The COG method of proteome annotation is based on assignment of a sequence-based orthology, whereas function prediction tools like InterPro add to in-depth annotation of a gene by utilizing the domain and signature knowledge. We found that COG-based annotation of the M. indicus pranii proteome consisted of some ambiguous cases compared with other protein domain databases that are used for annotation. The combination of homology-based COG and a functional domain database like InterPro/Pfam provided the maximum coverage for annotating a proteome. The protein domain knowledge available using InterPro/Pfam and the Conserved Domains Database (CDD) can help associate sequence-based homologs with the functional orthologs. From the above approach, we found that among mycobacterial species, for a protein to be a homolog, its sequence identity should be above 20%. We have also highlighted here the importance of comparative genomics and protein domains by curating 60 misannotated M. indicus pranii genes in the public database (see Table S1 in the supplemental material). Our comparative genomic and proteomic analyses of pathogenic and nonpathogenic mycobacterial species provided strong evidence suggesting that despite having identical rRNA genes (except for notable differences in the 23S rRNA gene) with M. intracellulare, M. indicus pranii (an NP with strong immunomodulatory properties) is a predecessor of the M. avium complex (12) and is at an evolutionarily transitory position with respect to a fast versus slow grower and as a saprophyte versus a seasoned pathogen (6, 12). During the process of evolution, M. indicus pranii evolved into M. intracellulare ATCC 13950 (an OP) when a few genes were deleted and a few enzyme-encoding genes were acquired (which may provide a common/evolutionary link between M. avium 104 [OP] and M. intracellulare ATCC 13950 [OP]). A similar pattern as with M. intracellulare ATCC 13950 is exhibited by Mycobacterium tuberculosis, where a large portion of genes with a conserved proline-glutamate (PE) motif or proline-proline-glutamate motif (PPE) family have been acquired (5), although most of them are still hypothetical proteins. Although we know that members of the PE/PPE gene family code for virulence factors (4, 34–36), it will be interesting if some of these hypothetical proteins have any enzymatic function, as until now only one PE protein, PE30 (Rv3097c), has been reported to exhibit enzymatic activity (37). Such proteins can be exploited for antituberculosis drug therapy. The host-pathogen interaction network between M. tuberculosis and humans (38) might provide some insight into the evolutionary pressure under which M. tuberculosis obtained a new set of pathways for its survival, which can be exploited again for antituberculosis interventions. Furthermore, our findings on the presence of alternative metabolic pathways in Mycobacterium tuberculosis pose important questions about their role in virulence and the consequent implications for designing new interventions against tuberculosis.

MATERIALS AND METHODS

The prediction of protein function domains for the ORFs of M. indicus pranii was carried out using InterPro (39, 40) and Pfam (41). The domain hits of individual proteins were compared to annotations of the COG database (42). As InterProscan uses CATH GENE3D version 3.3.0 (43), in cases of ambiguity the protein sequences were submitted to GENE3D v11.0 and the Domain Enhanced Lookup Time Accelerated BLAST (DELTA-BLAST) system (44) to confirm the annotation.

Functional relatedness of homologs.

A sequence identity above 60% between two proteins is required to have 90% functional similarity (29); however, neither the sequence identity nor the expectation value can give complete insight into the relationship between two proteins (30). We therefore tried to relate the functional similarity based on Interpro/Pfam domain hits to the coverage and sequence identity of BLASTp results between M. indicus pranii and other Mycobacterium species. To show the functional relationships between homologs, we assigned a value of 1 for a positive functional relationship if the homologs shared at least one InterPro/Pfam ID and a value of 0 if they shared none, indicating no functional relationship. The statistical significance of our approach was determined using data from DELTA-BLAST, which returns the domain hits of a protein from the CDD. For each homolog pair, we assigned a value of 1 if they shared at least one conserved domain and a value of 0 if they did not share any conserved domain listed in the CDD. The findings from the above two lists (InterPro/Pfam and CDD) were compared to calculate the number of tp, tn, fp, and fn (see Chart S1 in the supplemental material), which were used to plot ROC curves. In order to obtain a sequence identity cutoff below which no functional similarity or homologs were observed in the Mycobacterium genus (13 species included in the analyses), we plotted the number of tp against sequence identity (varying from 0% to 100%) for M. indicus pranii versus all 12 Mycobacterium other species (see Fig. S1 in the supplemental material). Furthermore, MCC values were calculated for sequence identity cutoffs ranging from 0 to 100% (see Fig. S2 in the supplemental material). We also performed all-against-all BLASTp-based homology searches for 13 Mycobacterium species, using a sequence identity cutoff of 20% and an e value of <1e−04 (45). Analysis of metabolic enzymes was carried out based on the IUBMB EC numbers (46) in the KEGG database (47) (accessed in December 2012) for M. indicus pranii (387 EC enzymes), M. intracellulare ATCC 13950 (394 EC), M. avium 104 (413 EC), and M. tuberculosis (396 EC) genomes (48). Comparative metabolic pathway analysis between M. tuberculosis, M. intracellulare ATCC 13950, and M. indicus pranii was performed using iPath2.0 (49). Chart summarizing the bases for statistical analyses. Download Chart S1, TIF file, 1.3 MB Correlation of BLASTp sequence identities between homologs of M. indicus pranii proteins in 12 other Mycobacterium species with their functional relatedness (Interpro/Pfam). Plots of the number of true positives among the total number of homologs versus various sequence identities showed that below a 20% sequence identity cutoff, no functional similarity between homologs was observed for Mycobacterium species. Download Figure S1, TIF file, 1.8 MB Correlation for BLASTp sequence identities between homologs of M. indicus pranii proteins in 12 other Mycobacterium species, with their functional relatedness (Interpro/Pfam). Mathews correlation coefficient values were plotted for various sequence identity cutoffs. MCC values are 0 for sequence identity cutoffs of 0 to 20% and show a steep rise at 30% sequence identity, suggesting that below the 20% sequence identity cutoff no functional similarity between homologs can be observed for Mycobacterium species. Download Figure S2, TIF file, 1.6 MB List of ambiguous cases found while reannotating the M. indicus pranii proteome based on InterPro domains Table S1, DOCX file, 0.02 MB. Mycobacterial genomes selected for comparative genomics analysis Table S2, DOCX file, 0.01 MB.
  48 in total

1.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

Review 2.  Nontuberculous mycobacteria: opportunistic environmental pathogens for predisposed hosts.

Authors:  James L Cook
Journal:  Br Med Bull       Date:  2010-10-25       Impact factor: 4.291

Review 3.  The nature and dynamics of bacterial genomes.

Authors:  Howard Ochman; Liliana M Davalos
Journal:  Science       Date:  2006-03-24       Impact factor: 47.728

4.  Effect of an immunomodulator containing Mycobacterium w on sputum conversion in pulmonary tuberculosis.

Authors:  Naresh Patel; M M Deshpande; Maya Shah
Journal:  J Indian Med Assoc       Date:  2002-03

5.  Computational reconstruction of metabolic networks from KEGG.

Authors:  Tingting Zhou
Journal:  Methods Mol Biol       Date:  2013

6.  Siroheme- and [Fe4-S4]-dependent NirA from Mycobacterium tuberculosis is a sulfite reductase with a covalent Cys-Tyr bond in the active site.

Authors:  Robert Schnell; Tatyana Sandalova; Ulf Hellman; Ylva Lindqvist; Gunter Schneider
Journal:  J Biol Chem       Date:  2005-05-24       Impact factor: 5.157

Review 7.  Buruli ulcer: reductive evolution enhances pathogenicity of Mycobacterium ulcerans.

Authors:  Caroline Demangel; Timothy P Stinear; Stewart T Cole
Journal:  Nat Rev Microbiol       Date:  2009-01       Impact factor: 60.633

8.  A study of some fast-growing scotochromogenic mycobacteria including species descriptions of Mycobacterium gilvum (new species) and Mycobacterium duv alii (new species).

Authors:  J L Stanford; W J Gunthorpe
Journal:  Br J Exp Pathol       Date:  1971-12

9.  Comparative genomic and proteomic analyses of PE/PPE multigene family of Mycobacterium tuberculosis H₃₇Rv and H₃₇Ra reveal novel and interesting differences with implications in virulence.

Authors:  Sakshi Kohli; Yadvir Singh; Khushbu Sharma; Aditya Mittal; Nasreen Z Ehtesham; Seyed E Hasnain
Journal:  Nucleic Acids Res       Date:  2012-05-22       Impact factor: 16.971

10.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

View more
  26 in total

Review 1.  Clinical implication of novel drug resistance-conferring mutations in resistant tuberculosis.

Authors:  N P Mnyambwa; D-J Kim; E S Ngadaya; R Kazwala; P Petrucka; S G Mfinanga
Journal:  Eur J Clin Microbiol Infect Dis       Date:  2017-06-07       Impact factor: 3.267

Review 2.  Survey of (Meta)genomic Approaches for Understanding Microbial Community Dynamics.

Authors:  Anukriti Sharma; Rup Lal
Journal:  Indian J Microbiol       Date:  2016-11-11       Impact factor: 2.461

3.  Comprehensive profiling of functional attributes, virulence potential and evolutionary dynamics in mycobacterial secretomes.

Authors:  Ayan Roy; Arnab Sen; Sourav Chakrobarty; Indrani Sarkar
Journal:  World J Microbiol Biotechnol       Date:  2017-12-04       Impact factor: 3.312

Review 4.  Cutaneous Mycobacterial Infections.

Authors:  Carlos Franco-Paredes; Luis A Marcos; Andrés F Henao-Martínez; Alfonso J Rodríguez-Morales; Wilmer E Villamil-Gómez; Eduardo Gotuzzo; Alexandro Bonifaz
Journal:  Clin Microbiol Rev       Date:  2018-11-14       Impact factor: 26.132

5.  Biofilms on Hospital Shower Hoses: Characterization and Implications for Nosocomial Infections.

Authors:  Maria J Soto-Giron; Luis M Rodriguez-R; Chengwei Luo; Michael Elk; Hodon Ryu; Jill Hoelle; Jorge W Santo Domingo; Konstantinos T Konstantinidis
Journal:  Appl Environ Microbiol       Date:  2016-04-18       Impact factor: 4.792

Review 6.  Macrophage: A Cell With Many Faces and Functions in Tuberculosis.

Authors:  Faraz Ahmad; Anshu Rani; Anwar Alam; Sheeba Zarin; Saurabh Pandey; Hina Singh; Seyed Ehtesham Hasnain; Nasreen Zafar Ehtesham
Journal:  Front Immunol       Date:  2022-05-06       Impact factor: 8.786

7.  Mycobacterium tuberculosis Methyltransferase Rv1515c Can Suppress Host Defense Mechanisms by Modulating Immune Functions Utilizing a Multipronged Mechanism.

Authors:  Anshu Rani; Anwar Alam; Faraz Ahmad; Manjunath P; Abhinav Saurabh; Sheeba Zarin; Dipendra Kumar Mitra; Seyed E Hasnain; Nasreen Z Ehtesham
Journal:  Front Mol Biosci       Date:  2022-06-24

8.  The many lives of nontuberculous mycobacteria.

Authors:  Tiffany A Claeys; Richard T Robinson
Journal:  J Bacteriol       Date:  2018-02-26       Impact factor: 3.490

Review 9.  Application of Volatilome Analysis to the Diagnosis of Mycobacteria Infection in Livestock.

Authors:  Pablo Rodríguez-Hernández; Vicente Rodríguez-Estévez; Lourdes Arce; Jaime Gómez-Laguna
Journal:  Front Vet Sci       Date:  2021-05-24

10.  Chloroquine Analogues as Leads against Pneumocystis Lung Pathogens.

Authors:  Ana Gomes; Ricardo Ferraz; Lauren Ficker; Margaret S Collins; Cristina Prudêncio; Melanie T Cushion; Cátia Teixeira; Paula Gomes
Journal:  Antimicrob Agents Chemother       Date:  2018-10-24       Impact factor: 5.191

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.