| Literature DB >> 34239815 |
Amol O Bajaj1, Suraj Saraswat1, Juha E A Knuuttila2, Joanna Freeke3,4, J Benjamin Stielow3,4, Adam P Barker1.
Abstract
Rapid and accurate differentiation of Mycobacterium tuberculosis complex (MTBC) species from other mycobacterium is essential for appropriate therapeutic management, timely intervention for infection control and initiation of appropriate health care measures. However, routine clinical characterization methods for Mycobacterium tuberculosis (Mtb) species remain both, time consuming and labor intensive. In the present study, an innovative liquid Chromatography-Mass Spectrometry method for the identification of clinically most relevant Mycobacterium tuberculosis complex species is tested using a model set of mycobacterium strains. The methodology is based on protein profiling of Mycobacterium tuberculosis complex isolates, which are used as markers of differentiation. To test the resolving power, speed, and accuracy of the method, four ATCC type strains and 37 recent clinical isolates of closely related species were analyzed using this new approach. Using different deconvolution algorithms, we detected hundreds of individual protein masses, with a subpopulation of these functioning as species-specific markers. This assay identified 216, 260, 222, and 201 proteoforms for M. tuberculosis ATCC 27294™, M. microti ATCC 19422™, M. africanum ATCC 25420™, and M. bovis ATCC 19210™ respectively. All clinical strains were identified to the correct species with a mean of 95% accuracy. Our study successfully demonstrates applicability of this novel mass spectrometric approach to identify clinically relevant Mycobacterium tuberculosis complex species that are very closely related and difficult to differentiate with currently existing methods. Here, we present the first proof-of-principle study employing a fast mass spectrometry-based method to identify the clinically most prevalent species within the Mycobacterium tuberculosis species complex.Entities:
Keywords: Mycobacterium tuberculosis; clinical diagnostics; clinical mycobacteriology; mass spectrometry; species delimitation
Year: 2021 PMID: 34239815 PMCID: PMC8259740 DOI: 10.3389/fcimb.2021.656880
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Schematic overview of sample processing, data acquisition and analysis steps employed in stepwise analysis of LC-MS Orbitrap data.
Figure 2Heat map generated for all 4 species analyzed indicating top-200 discriminatory proteoforms. Horizontal bar on top, indicates the color assignment for the type and non-type strains used in the study, horizontal bar below indicates species assignment alone for visual clarity (see legend). Mass spectrometry data is here depicted as a heatmap, where rows represent individual monoisotopic masses (deconvoluted protein masses) and columns individual strains (technical LC-MS replicates for each strain were merged to a single leaf in the horizontal dendrogram) of the study cohort. Data has been converted to a binary state, where presents of a protein mass in each strain (species) is encoded as 1 (red) and absents 0 (blue). The representation indicates proteoform overlap and non-overlap between strains among all species due to their closeness at species level. Unique proteoforms detected among individual species specifically enable the classification down to species level (Specific number of exclusive red blocks for a given taxon). Central red block of proteoforms is shared between the species of interest and is non-species specific (conserved). The dendrogram shows that the M. bovis and M. bovis BCG clinical isolates are split in two separate branches, allowing discrimination of these fundamentally different strains (non-BCG and BCG strains). Additional work is required to understand proteomic diversity of atypical isolates, considering that a reliable identification must ideally capture all clinically relevant variants of a given species as paramount criterion to microbial diagnostics.
Classification success for identification of clinically prevalent MTBC species.
| Species | N | Identification success (Top match only) | Combined top 3 matches for each data file |
|---|---|---|---|
|
| 16 | 88% | 41 hits to |
| 7 hits to | |||
|
| 20 | 90% | 56 hits to |
| 4 hits to | |||
|
| 22 | 100% | 64 hits to |
| 2 hits to | |||
|
| 22 | 100% | 66 hits to |
|
|
|
|
N refers to the total number of mass spectra measured for each species. Total identification success is presented as ‘percent success’ as a result to predict the target species with the employed algorithm. For each data file, the 3 closest matches were collected (additional and including the top match) if applicable, and the species results for these top 3 matches is displayed in the right-hand column. In all cases the best match was to another strain of the same species, but in some few cases the 2nd or 3rd best match was for a different species as detailed in the column on the right. In the analysis we used only the type-strain of the M. microti, accordingly no species predictions are returned in this case as training and test data would be identical.