| Literature DB >> 33653882 |
T A Gray1,2, J T Wade1,2, J A Judd3, J Canestrari3, R Clark3, A Joseph3, P Lapierre3, E Lasek-Nesselquist3, M Mir3, M Palumbo3, C Smith3, M Stone3, A Upadhyay3, S E Wirth3, R M Dedrick4, C G Meier4, D A Russell4, A Dills5, E Dove5, J Kester5, I D Wolf5, J Zhu5, E R Rubin5, S Fortune5, G F Hatfull4, K M Derbyshire1,2.
Abstract
Functional characterization of bacterial proteins lags far behind the identification of new protein families. This is especially true for bacterial species that are more difficult to grow and genetically manipulate than model systems such as Escherichia coli and Bacillus subtilis To facilitate functional characterization of mycobacterial proteins, we have established a Mycobacterial Systems Resource (MSR) using the model organism Mycobacterium smegmatis This resource focuses specifically on 1,153 highly conserved core genes that are common to many mycobacterial species, including Mycobacterium tuberculosis, in order to provide the most relevant information and resources for the mycobacterial research community. The MSR includes both biological and bioinformatic resources. The biological resource includes (i) an expression plasmid library of 1,116 genes fused to a fluorescent protein for determining protein localization; (ii) a library of 569 precise deletions of nonessential genes; and (iii) a set of 843 CRISPR-interference (CRISPRi) plasmids specifically targeted to silence expression of essential core genes and genes for which a precise deletion was not obtained. The bioinformatic resource includes information about individual genes and a detailed assessment of protein localization. We anticipate that integration of these initial functional analyses and the availability of the biological resource will facilitate studies of these core proteins in many Mycobacterium species, including the less experimentally tractable pathogens M. abscessus, M. avium, M. kansasii, M. leprae, M. marinum, M. tuberculosis, and M. ulcerans IMPORTANCE Diseases caused by mycobacterial species result in millions of deaths per year globally, and present a substantial health and economic burden, especially in immunocompromised patients. Difficulties inherent in working with mycobacterial pathogens have hampered the development and application of high-throughput genetics that can inform genome annotations and subsequent functional assays. To facilitate mycobacterial research, we have created a biological and bioinformatic resource (https://msrdb.org/) using Mycobacterium smegmatis as a model organism. The resource focuses specifically on 1,153 proteins that are highly conserved across the mycobacterial genus and, therefore, likely perform conserved mycobacterial core functions. Thus, functional insights from the MSR will apply to all mycobacterial species. We believe that the availability of this mycobacterial systems resource will accelerate research throughout the mycobacterial research community.Entities:
Keywords: CRISPRi clones; Mycobacterium; conserved mycobacterial proteins; knockout library; protein localization
Year: 2021 PMID: 33653882 PMCID: PMC8092266 DOI: 10.1128/mBio.02401-20
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1(a) A pan-genome analysis with Roary v.3.13.0 (31) identifies conserved core genes among M. smegmatis mc2155, M. avium 104, M. tuberculosis H37Rv, M. abscessus ATCC 19977, and M. leprae TN. Orthologs were required to share at least 50% global amino acid identity and are indicated by a vertical blue line in each genome along the x axis (the pan genome); white (no blue) line indicates the ortholog is absent in that genome. Approximately 15,000 genes were compared in the analysis, with the conserved core genes indicated on the left of the panel. Results were visualized with Phandango v.1.3.0 (59) and Figtree v.1.4.4 (https://github.com/rambaut/figtree). (b) Schematic representation of the MSR and the constructs generated from the core genes.
Summary of biofilm and colony phenotypes of knockout mutants screened for altered biofilm formation
| MSMEG_gene | Gene name | Putative function | Colony morphology | Biofilm morphology compared with wild type |
|---|---|---|---|---|
| 1342 | ACP dehydratase | Flatter smoother | Delayed formation, less developed architecture | |
| 1824 | LytR family transcriptional regulator | Flat, smooth, round shape | Smooth, no architecture | |
| 2760 | Polyphosphate glucokinase | Similar to wild-type | No biofilm | |
| 4323 | Pyruvate dehydrogenase | Cream colored, smooth surface, round shape | No biofilm | |
| 5256 | UDP diphosphate synthase | Flat with pebbled surface | Hyper-pellicle, extensive architecture | |
| 5439 | Resuscitation-promoting factor | Smooth, flat, irregular shape | Smooth, no architecture | |
| 5487 | Sensor histidine kinase | Similar to wild-type | Smooth little architecture | |
| 5534 | ATP-dependent DNA helicase PcrA | Flat, bumpy surface, irregular shape | Smooth little architecture | |
| 6363 | Cysteine desulfurase | Whiter, smooth, flat doughnut surface | Smooth, no architecture |
See Fig. 2a and b for representative examples of different morphotypes. The symbol “-” indicates no gene name has been assigned to date.
FIG 2Biofilm (a) and colony morphotypes (b) of the indicated deletion mutants after 7 days of biofilm formation or 4 days of colony formation.
FIG 3Growth defects caused by expression of CRISPRi sgRNAs. (a) M. smegmatis containing pJR962 derivatives expressing sgRNAs targeting individual genes were grown to stationary phase in TSB in the absence of Atc. The cells were serially diluted 10-fold and 5 μl of each dilution was spotted onto Sauton’s medium without (left) or with (right) Atc, which induces expression of the CRISPRi system. The strain in column 5 is the control, wild-type mc2155, not containing a pJR962 clone. Genes targeted in each column are from left to right: 1-MSMEG_0029, 2-0244, 3-0317, 4-0482, 6-0709, 7-0789, 8-0832, 9-0956, 10-1019, 11-1066 and 12-1214. (b) Three clones (dnaN, gyrA, and trpG) were selected for further analysis. Cultures were diluted and spotted onto TSA or Sauton’s medium with or without Atc and growth was compared. The effect of repressing trpG can be seen in the more minimal Sauton’s medium. (c) Growth curves of cultures grown in TSB with or without Atc provide a more quantitative assessment on overall growth. MKD10 is the mc2155 parental control containing the empty vector pJR962. All cultures were diluted 100-fold before addition (or not) of Atc; the first readable data point was following overnight growth at 16.5 h.
FIG 4(a) Schematic representation of the vector used for expression of Dendra gene fusions. Digestion with AseI and HindIII facilitates InFusion cloning of open reading frame amplicons, disrupting the ATTAAT AseI site to ATTAtg to form an ATG initiation codon for ORF expression. The AAGCTT HindIII site is regenerated and replaces the native ORF stop codon to allow continued translation into the alanine/glycine linker and Dendra. (b) Examples of protein localization to poles or septa with different Dendra fusion proteins expressed in M. smegmatis. Gene numbers and name are indicated.
FIG 5A unique class of proteins is localized to the intracellular membrane domain (IMD) (53). Each Dendra fusion protein is shown expressed in M. smegmatis and characteristically features two prominent patches at the cell pole (top panel). The heat map below visually reinforces the similarity of protein localization for each fusion protein. The heat map is a representative cell with consolidated mapping data for 50 to 100 cells. Red indicates no fluorescence, while increasing intensity and protein localization is reflected by changes from yellow to green, blue, and purple. There were ∼30 protein fusions that had similar localization patterns. Gene numbers and name (if known) are given.