| Literature DB >> 29971057 |
Sneha M Pinto1, Renu Verma2, Jayshree Advani2,3, Oishi Chatterjee1,2,4, Arun H Patil1,2,5, Saketh Kapoor1, Yashwanth Subbannayya1, Remya Raja2, Sheetal Gandotra6, T S Keshava Prasad1,2.
Abstract
H37Ra is a virulence attenuated strain of Mycobacterium tuberculosis widely employed as a model to investigate virulence mechanisms. Comparative high-throughput studies have earlier correlated its avirulence to the presence of specific mutations or absence of certain proteins. However, a recent sequencing study of H37Ra, has disproved several genomic differences earlier reported to be associated with virulence. This warrants further investigations on the H37Ra proteome as well. In this study, we carried out an integrated analysis of the genome, transcriptome, and proteome of H37Ra. In addition to confirming single nucleotide variations (SNVs) and insertion-deletions that were reported earlier, our study provides novel insights into the mutation spectrum in the promoter regions of 7 genes. We also provide transcriptional and proteomic evidence for 3,900 genes representing ~80% of the total predicted gene count including 408 proteins that have not been identified previously. We identified 9 genes whose coding potential was hitherto reported to be absent in H37Ra. These include 2 putative virulence factors belonging to ESAT-6 like family of proteins. Furthermore, proteogenomic analysis enabled us to identify 63 novel proteins coding genes and correct 25 existing gene models in H37Ra genome. A majority of these were found to be conserved in the virulent strain H37Rv as well as in other mycobacterial species suggesting that the differences in the virulent and avirulent strains of M. tuberculosis are not entirely dependent on the expression of certain proteins or their absence but may possibly be ascertained to functional changes.Entities:
Keywords: genome annotation; multiomics; next-generation sequencing; orbitrap; virulence attenuation
Year: 2018 PMID: 29971057 PMCID: PMC6018540 DOI: 10.3389/fmicb.2018.01314
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Characterization of Mycobacterium tuberculosis H37Ra using an integrated OMICS analysis. (A) Schematic representation of the workflow employed in the current study to characterize the genome repertoire of Mycobacterium tuberculosis H37Ra. (B) Summary of genomic, transcriptomic and proteomics analysis of Mycobacterium tuberculosis H37Ra.
Figure 2Circos plot depicting the integrated view of the data obtained through multi-OMICs approach.
Figure 3Summary of transcriptomic and proteomic analysis of Mycobacterium tuberculosis H37Ra.(A) Comparison of the mRNA and protein levels reveals ~ 80% of the genes have both transcriptome and proteome evidence. 763 genes were identified based on transcript evidence alone. (B) Correlation plot depicting concordance between RNAseq and proteomic data for Mycobacterium tuberculosis H37Ra strain. (C) Gene Ontology-based classification of identified proteins based on biological process, molecular function and cellular component.
Figure 4(A) Abundance distribution of the proteins based on the iBAQ method. (The most highly expressed 10% of proteins are dominated by proteins relating to protein synthesis, protein folding and energy metabolism. The least abundant 10% of proteins are enriched in proteins with transporter activity). (B) Comparison of proteins identified by Malen et al., Jhingan et al., Verma et al. and our study. We identified 408 proteins which have not been identified in previous datasets.
List of proteins hitherto reported as absent in H37Ra.
| MRA_RS12470 (Rv2346c) | esxO | ESAT-6 like protein EsxO | 31.07 | 10.43 | 1.3 | 0.9 |
| MRA_RS06370 (Rv1197) | esxK | ESAT-6-like protein EsxK | 30.84 | 6.07 | – | – |
| MRA_RS12475 (Rv2347c) | esxP | ESAT-6 like protein EsxP | – | 10.39 | – | – |
| MRA_RS05510 (Rv1038c) | esxJ | ESAT-6 like protein EsxJ | – | 5.87 | 2.1 | 1.7 |
| MRA_RS19185 (Rv3620c) | esxW | ESAT-6 like protein EsxW | – | 5.15 | – | – |
| MRA_RS18460 (Rv3479) | Rv3479 | Membrane protein | 20.08 | 2.90 | 0.8 | 0.3 |
| MRA_RS20115 (Rv3792) | aftA | Arabinofuranosyltransferase | 22.30 | 3.98 | 1.2 | 1.0 |
| MRA_RS12320 (Rv2319c) | Rv2319c | Universal stress protein | 18.67 | 2.66 | 0.5 | 0.8 |
| MRA_RS06305 (Rv1184c) | Rv1184c | Exported protein | – | 2.25 | 0.7 | 1.0 |
Partial list of putative virulence factors identified in H37Ra.
| MRA_RS11925 (Rv2246) | 3-oxoacyl-ACP synthase II | Lipids and Fatty Acid Metabolism | 28.66 | 8.2 | 1.5 | 1.5 | |
| MRA_RS03395 (Rv0642c) | SAM-dependent methyltransferase | Mycolic acid synthesis | 27.33 | 8.62 | 1 | 1.2 | |
| MRA_RS02465 (Rv0470c) | Cyclopropane mycolic acid synthase | 27.23 | 7.65 | 1.3 | 1.7 | ||
| MRA_RS16390 (Rv3083) | FAD-containing monooxygenase MymA | 17.06 | 3.77 | ND | ND | ||
| MRA_RS16395 (Rv3084) | Acetylhydrolase | 23.29 | 2.31 | 0.5 | 1.3 | ||
| MRA_RS16400 (Rv3085) | Acetoin dehydrogenase | 22.18 | 2.91 | 0.1 | 0.5 | ||
| MRA_RS16405 (Rv3086) | Alcohol dehydrogenase | 21.44 | 4.97 | 0.7 | 0.5 | ||
| MRA_RS16410 (Rv3087) | Diacylglycerol O-acyltransferase | 23.33 | 4.62 | 1 | 1 | ||
| MRA_RS16415 (Rv3088) | Diacylglycerol O-acyltransferase | 22.37 | 4.07 | 0.7 | 0.9 | ||
| MRA_RS16420 (Rv3089) | Long-chain-fatty-acid–CoA ligase FadD13 | 22.62 | 4.32 | 0.9 | 0.8 | ||
| MRA_RS15645 (Rv2946c) | Polyketide synthase | Synthesis of complex lipids | 18.69 | 3.56 | ND | ND | |
| MRA_RS15650 (Rv2947c) | Polyketide synthase | 22.5 | 3.65 | ND | ND | ||
| MRA_RS20275 (Rv3823c) | transporter mmpL8 | Sulfolipid synthesis | 15.79 | 3.06 | ND | ND | |
| MRA_RS10130 (Rv1916) | isocitrate lyase subunit B | Others genes related in lipid synthesis | 16.84 | 5.81 | ND | ND | |
| MRA_RS12495 (Rv2351c) | Phospholipase C1 | 24.95 | 5.54 | 0.8 | 1.2 | ||
| MRA_RS12490 (Rv2350c) | Phospholipase C2 | 25.86 | 6.24 | 1 | 1.1 | ||
| MRA_RS18805 (Rv3544c) | acyl-CoA dehydrogenase | Catabolism of cholesterol | 20.68 | 3.22 | ND | ND | |
| MRA_RS18785 (Rv3540c) | Lipid-transfer protein | 20.44 | 2.27 | ND | ND | ||
| MRA_RS00910 (Rv0167) | Membrane protein | Cell wall proteins | 21.46 | 7.88 | ND | ND | |
| MRA_RS18565 (Rv3500c) | ABC transporter permease | 20.37 | 4.72 | ND | ND | ||
| MRA_RS07170 (Rv1348) | Iron import ATP-binding/permease IrtA | Metal importer | 18.06 | 2.78 | ND | ND | |
| MRA_RS06490 (Rv1221) | ECF RNA polymerase sigma factor SigE | Sigma factors | 21.1 | 8.62 | ND | ND | |
| MRA_RS17475 (Rv3286c) | RNA polymerase sigma factor SigF | 24.55 | 4.58 | ND | ND | ||
| MRA_RS03875 (Rv0735) | RNA polymerase sigma factor SigL | 20.98 | 3.12 | ND | ND |
Figure 5Refinement of genome annotation of Mycobacterium tuberculosis H37Ra genome based on transcriptomic and proteomic evidence.(A) Graphical representation of revised gene annotation owing to a frameshift mutation.(B) Orthologous evidence for the revised gene annotation.(C) Representative spectra of one among the 7 peptides supporting the example.