Literature DB >> 36160200

The shortcut of mycobacterial interspersed repetitive unit-variable number tandem repeat typing for Mycobacterium tuberculosis differentiation.

Shima Hadifar1,2, Mansour Kargarpour Kamakoli1,2, Sana Eybpoosh3, Mehran Nakhaeizadeh4,5, Mohammad Ali Kargarpour Kamakoli6, Nasim Ebrahimifard1,2, Abolfazl Fateh1,2, Seyed Davar Siadat1,2, Farzam Vaziri1,2.   

Abstract

The 24-loci mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) genotyping has been used as an international standard method for Mycobacterium tuberculosis (Mtb) genotyping. However, different optimized VNTR loci sets for improving the discrimination of specific Mtb genotypes have been proposed. In this regard, we investigated the efficacy of accumulation of the percentage differences (APDs) compared with the least absolute shrinkage and selection operator (LASSO) regression strategy to identify a customized genotype-specific VNTR loci set which provides a resolution comparable to 24-loci MIRU-VNTR in divergent Mtb populations. We utilized Spoligotyping and 24-loci MIRU-VNTR typing for genotyping 306 Mtb isolates. The APD and LASSO regression approaches were used to identify a customized VNTR set in our studied isolates. Besides, the Hunter-Gaston discriminatory index (HGDI), sensitivity, and specificity of each selected loci set were calculated based on both strategies. The selected loci based on LASSO regression compared with APD-based loci showed a better discriminatory power for identifying all studied genotypes except for T genotype, which APD-based loci showed promising discriminative power. Our findings suggested the LASSO regression rather than the APD approach is more effective in the determination of possible discriminative VNTR loci set to precise discrimination of our studied Mtb population and may be beneficial to be used in finding reduced number loci sets in other Mtb genotypes or sublineages. Moreover, we proposed customized genotype-specific MIRU-VNTR loci sets based on the LASSO regression and APD approaches for precise Mtb strains identification. As the proposed VNTR sets offered a comparable discriminatory power to the standard 24 MIRU-VNTR loci set could be promising alternatives to the standard genotyping for using in resource-limited settings.
Copyright © 2022 Hadifar, Kargarpour Kamakoli, Eybpoosh, Nakhaeizadeh, Kargarpour Kamakoli, Ebrahimifard, Fateh, Siadat and Vaziri.

Entities:  

Keywords:  LASSO regression; MIRU-VNTR; Mycobacterium tuberculosis; accumulation of the percentage differences; genotyping

Year:  2022        PMID: 36160200      PMCID: PMC9493315          DOI: 10.3389/fmicb.2022.978355

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   6.064


Introduction

Understanding and monitoring the patterns and dynamics in the population structure of Mycobacterium tuberculosis (Mtb), as one of the deadly infectious agents, is imperative for epidemiological and biological purposes. Molecular typing methods play an important role in achieving this purpose and provide powerful tools to measure local and global tuberculosis (TB) surveillance (Allix-Béguec et al., 2008; Hachisu et al., 2013). Over the previous decade, different polymerase chain reaction (PCR)-based genotyping methods, such as the 24-loci mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) typing and Spoligotyping have been widely used as valuable tools for identification of Mtb strain (Allix-Béguec et al., 2008; Ei et al., 2016). Although Whole-Genome Sequencing (WGS) has been suggested as a proxy to trace Mtb transmission and provides a comprehensive identification; however, due to several limitations have not been in resource-limited settings compared with PCR-based methods (Satta et al., 2018). MIRU-VNTR typing is considered a promising genotyping method, providing a comparable or superior resolution compared with other PCR-based genotyping methods like IS6110-RFLP and Spoligotyping depending on the selected VNTR loci set (De Beer et al., 2012; de Beer et al., 2013). Generally, MIRU-VNTR loci are selected based on the discrimination power of each locus, which can result in genotyping errors for different Mtb families due to the different genetic backgrounds of the populations and reduce discrimination power based on specific genotypes (Murase et al., 2008; Luo et al., 2014). It has been proposed that an optimized locus set based on the accumulation of the percentage differences (APDs) approach can efficiently discriminate specific genotypes (Beijing, NEW-1, and CAS) (Pan et al., 2017; Hadifar et al., 2019a). Accordingly, in this study, we adopted another approach to investigate the capacity and efficacy of APD compared with the least absolute shrinkage and selection operator (LASSO) regression approach to identify a customized genotype-specific MIRU-VNTR loci set, which provides a resolution comparable to 24-loci VNTR in divergent Mtb population.

Materials and methods

Clinical isolates collection

A total of 306 Mtb isolates were randomly collected from who had been diagnosed with pulmonary TB from January 2017 to January 2019. All patients had no history of anti-TB treatment. None of the samples were collected from the same patient. In addition, all Mtb isolates were confirmed by PCR for IS6110. Culture-negative pulmonary TB cases were also excluded. This study was approved by the Ethics Committee of the Pasteur Institute of Iran. All experiments were performed in accordance with guidelines approved by Pasteur Institute International networks. Ethical reviews and informed consent approval were granted by the Ethical Committee of the Pasteur Institute of Iran. Informed consent was obtained from all patients in this study. The results of this research did not influence TB patients’ treatment.

Twenty-four-loci mycobacterial interspersed repetitive unit-variable number tandem repeat typing

The extraction of Mtb genomic DNA was performed by a PROBA-NK DNA extraction kit (DNA Technology Company, Moscow, Russia) according to the manufacturer’s instructions and was used for the 24-loci MIRU-VNTR genotyping method as described previously (Supply et al., 2006).

Spoligotyping

Spoligotyping commercial kit (Mapmygenome Genomics Company, India) was used for analyzing all Mtb isolates based on the Spoligotyping protocol described by Kamerbeek et al. (1997).

Statistical analysis

Mtb genotypes were initially identified with the MIRU-VNTR technique and confirmed with the Spoligotyping method. The MIRU-VNTR and Spoligotyping data were analyzed by the MIRU-VNTRplus web server[1] and SITVIT-WEB online tool[2] (Demay et al., 2012), respectively. In addition, TBminer databases (Azé et al., 2015) were used to analyze and confirm the results. Simpson’s diversity index for each VNTR locus and VNTR loci in combination, as described by Hunter and Gaston (1988), was calculated using the online tool at http://www.comparingpartitions.info/index.php. Based on the Hunter-Gaston discriminatory index (HGDI) score, each locus showed a high (h > 0.6), moderate (0.3 ≤ h ≤ 0.6), and poor (h < 0.3) discriminative power (Sola et al., 2003). The data set consisted of variables with more than 30 percent missing values. We checked the mechanisms of missing values which were completely at random. Therefore, the missing values were imputed using Multiple Imputations via the Chained Equations (MICE) algorithm. LASSO regression method was used to provide a reduced, and hence, more cost-effective set of loci that could predict the Mtb genotypes with reasonable accuracy. The LASSO regression model, which is acknowledged to be suitable for high dimensional data regression analysis (Sauerbrei et al., 2007), is a regularization-based regression model that uses shrinkage. The lasso shrinks the coefficient estimates toward zero. It performs as a variable selection method and adds a penalty term (L1) to the cost function. Indeed, in lasso regression, we aim to find estimates of the coefficients β to βp that minimize RSS + where RSS is the residual sum of squares. So, if λ = 0, then we have regular ordinary least squares regression. Accordingly, LASSO regression was performed to select the optimal set of VNTR loci for predicting Mtb genotypes. Afterward, the predicted values, sensitivity, and specificity indices were computed and used for comparison with APD method. Sensitivity and specificity were used to evaluate the accuracy of a test that predicts dichotomous outcomes (e.g., logistic regression). Sensitivity evaluates model performance and measures a model’s capability to identify positive examples correctly. When sensitivity is used to evaluate model performance, it is often compared to specificity. Specificity measures the proportion of true negatives that the model correctly identifies. We assessed internal and external validity to validate the performance of LASSO regression. The fivefold cross-validation (CV) was used for evaluating internal validity. This approach randomly divides the data 5 times as training and test samples and runs the analysis on each fold. Then the sensitivity and specificity indices were calculated. External validity was tested by defining our dataset as a train set and an additional source of data as a test set, including the collection of 225 strains with their 24 MIRU-VNTR loci and genotype assignments provided from the MIRU-VNTRplus reference database (see text footnote 1) and a part of data from two previous studies (Homolka et al., 2008; Shuaib et al., 2022; Supplementary Table 2). Then the sensitivity and specificity indices were also calculated. Analyses were performed in R software. To APD calculation in each VNTR locus, firstly, we categorized our MIRU-VNTR data into eight data groups, including Beijing and Non-Beijing, CAS and Non-CA, NEW-1 and Non-NEW-1, Haarlem and Non-Haarlem, Cameroon and Non- Cameroon, LAM and Non-LAM, T and Non-T, and Ural1 and Non-Ural1. Then, the frequency of repeat numbers (patterns) at any VNTR locus was determined in both groups of each category. Finally, the APD value for each VNTR locus in every category was calculated as follows: Where s is the total number of distinct repeat patterns, Aj (Bj) is the percentage of A (B) category strains with the jth repeat pattern from all the strains of the A (B) category.

Results

According to results of the standard 24-loci MIRU-VNTR genotyping with TBminer database, we identified eight genotypes as follows: 101 CAS1-Delhi (33.00%), 88 NEW-1 (28.76%), 56 Beijing (18.30%), followed by 16 isolates as Haarlem (5.22%), 16 isolates as Cameroon (5.22%), 12 isolates as T (3.92%), 9 isolates as Ural1 (2.94%), and 8 isolates as LAM genotype (2.61%). The Spoligotyping method showed a similar result to MIRU-VNTR data (Supplementary Table 1). HGDI of each VNTR locus was calculated for each genotype. According to the global HGDI, 8 VNTR loci showed a high discriminatory power including QUB26 (4052), MIRU10 (960), MIRU26 (2996), ETRA (2165), MTUB04 (424), MTUB21 (1955), MIRU16 (1644), and QUB11B (2163b); while VNTR MIRU24 (2687), MTUB29 (2347), MIRU02 (154), MIRU27 (3007), MIRU04 (580), and MIRU20 (2059) were poorly discriminative (HGDI < 0.3). We previously defined a combination of loci for specific genotypes (NEW-1 and CAS1-Delhi) based on APD (Hadifar et al., 2019a). Here, we examined an additional strategy to select the customized genotype-specific MIRU-VNTR loci set based on LASSO regression and APD to improve the power of differentiation in the Mtb population. Besides, HGDI, sensitivity, and specificity of each selected loci set were investigated based on the strategies. As shown in Table 1, selected loci based on LASSO regression approach showed a better discriminatory power for identifying all the studied genotypes (especially in locally dominant Mtb genotypes) compared with those based on the APD approach except for T genotype, which APD-based loci showed the better result. Besides, external and internal validity results for the LASSO regression approach showed proper and comparable sensitivity and specificity in most of the studied genotypes (Table 1).
TABLE 1

HGDI values of proposed genotype-specific MIRU-VNTR loci based on APD and LASSO regression approaches.

SpoligotypefamilyLoci(APD)HGDISensitivity%Specificity%Loci(LASSO)HGDISensitivity% aSpecificity% aSensitivit% bSpecificity% bN
Global24loci0.96424loci 0.964 306
CASETRC (577),MIRU10(960),MIRU 16 (1644),MIRU26 (2996),MTUB21(1955)0.923 83.1 99.5 ETRC(577),MIRU10(960),MIRU16(1644),MIRU26(2996),MIRU39(4348) 0.956 9893.710094.4101
NEW−1ETRB(2461),MIRU10(960),MTUB34(3171),MTUB04(424),ETRA(2165)0.92 79.3 100 MIRU10(960),MTUB21(1955),ETRB(2461),MTUB34(3171),MIRU31(3192) 0.943 98.996.810094.188
BeijingQUB11B(2163b),MIRU10(960),MTUB30(2401),ETRA(2165),MIRU31(3192)0.876 80.5 98.9 MTUB21(1955),QUB11B(2163b),ETRA(2165),MTUB30(2401),MIRU31(3192) 0.934 96.498.410083.956
HaarlemMIRU10(960),MTUB21(1955),MTUB30(2401),MIRU26(2996)0.933 25 100 MIRU16(1644),MTUB21(1955),MTUB30(2401),MIRU26(2996) 0.952 81.267.696.485.816
CameroonMIRU10(960),MTUB21(1955),QUB11B(2163b),MIRU26(2996),MIRU31(3192),QUB26(4052)0.859 77.8 97.7 MIRU02(154),MIRU16(1644),MTUB21(1955),QUB11B(2163b),MTUB30(2401),ETRB(2461),MIRU24(2687),QUB26(4052) 0.934 93.878.310061.616
TETRC(577),MIRU40(802),MIRU10(960),MTUB21(1955),MIRU23(2531),MIRU26(2996)0.949 100 98.3 MTUB04(424),MIRU40(802),MIRU10(960),MTUB21(1955),MIRU23(2531),MTUB39(3690),QUB26(4052) 0.917 10071.162.952.112
Ural1MIRU40(802),MIRU10(960),MIRU16(1644),MTUB21(1955),MTUB30(2401),MIRU26(2996),QUB4156(4156)0.924 80 100 MIRU02(154),MTUB04(424),MIRU10(960),MTUB30(2401),MIRU26(2996),QUB4156(4156) 0.94 10097.6100909
LAMMIRU10(960),MTUB21(1955),QUB11B(2163b),,ETRA(2165),MIRU23(2531)0.912 83.3 98.7 MIRU20(2059),ETRA(2165),MIRU23(2531),MIRU31(3192) 0.93 10092.39091.78

Internal validation: The 5-fold cross-validation (CV) was used for evaluating internal validity. This approach randomly divides the data 5 times as training and test samples and runs the analysis on each fold. External validation: External validity was tested by defining our dataset as a train set and an additional source of data as a test set.

HGDI values of proposed genotype-specific MIRU-VNTR loci based on APD and LASSO regression approaches. Internal validation: The 5-fold cross-validation (CV) was used for evaluating internal validity. This approach randomly divides the data 5 times as training and test samples and runs the analysis on each fold. External validation: External validity was tested by defining our dataset as a train set and an additional source of data as a test set. In the CAS-Delhi population, a combination of five VNTR loci [ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MTUB21 (1955)] was found as a proper set based on the descending order of the APD values (Table 2). According to the LASSO regression value, five VNTR loci [ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), MIRU39 (4348)] were identified as the customized genotype-specific MIRU-VNTR loci. Four out of five LASSO regression-based loci were similar to APD-based loci [VNTR ETRC (577), MIRU10 (960), MIRU16 (1644), and MIRU26 (2996)].
TABLE 2

The APDs of 24 loci VNTR in all the studied genotypes.

MIRU-VNTR locusBeijing and non-BeijingCAS and non-CASNEW-1 and non-NEW-1Haarlem and non-HaarlemCameroon and non-CameroonLAM and non-LAMT and non-TUral1 and non-Ural1
APD% APD% APD% APD% APD% APD% APD% APD%
MIRU02 (154)3.011.955.744.5318.4111.495.4432.32
MTUB04 (424)47.2745.6163.3829.6643.6223.9538.6941.92
ETRC (577)41.8693.6154.3624.3343.6224.9274.4040.24
MIRU04 (580)7.212.332.448.738.105.298.163.87
MIRU40 (802)23.5814.4615.4427.5429.6629.1955.5372.73
MIRU10 (960)76.2479.6592.9248.0267.5459.5668.7978.11
MIRU16 (1644)32.4974.3944.7032.6129.3132.5522.8757.24
MTUB21 (1955)36.0054.9546.8058.0077.1651.5160.8887.21
MIRU20 (2059)7.111.301.9011.926.9424.2913.3531.14
QUB11B (2163b)89.0441.3835.9234.6846.0658.1826.7029.29
ETRA (2165)59.3524.2759.2529.0130.8674.9639.6325.42
MTUB29 (2347)1.213.642.384.013.623.526.1220.37
MTUB30 (2401)68.508.4046.5645.8635.0033.7234.5271.89
ETRB (2461)33.2346.0394.0625.8230.6536.2420.9230.64
MIRU23 (2531)13.8012.0820.665.0923.7188.5970.2817.85
MIRU24 (2687)0.891.221.150.8612.330.840.850.84
MIRU26 (2996)28.8957.5638.5040.3253.4138.0059.1894.28
MIRU27 (3007)16.063.526.697.849.8311.669.6917.68
MTUB34 (3171)36.4033.5669.2324.7821.4921.1421.7713.47
MIRU31 (3192)57.0937.5854.4029.8946.3835.4022.8743.94
MTUB39 (3690)5.436.9614.367.5926.3825.6746.5126.09
QUB26 (4052)41.0633.1354.7512.3159.0337.9242.1849.16
QUB4156 (4156)11.3425.0728.1811.0614.8916.0722.2853.87
MIRU39 (4348)34.5548.8944.9237.2037.2042.6234.5231.31
The APDs of 24 loci VNTR in all the studied genotypes. The VNTR loci set obtained based on LASSO regression showed the similar level of discrimination as the 24 loci reference method (HGDI: 0.956). Besides, the estimated sensitivity and specificity of the selected loci set based on LASSO regression were more significant compared with the APD results (Table 1). In the NEW-1 population, the VNTR loci set of ETRB (2461), MIRU10 (960), MTUB34 (3171), MTUB04 (424), and ETRA (2165) were identified based on the APD approach. The APD value of the selected loci was more than 59% (Table 2). Based on the LASSO regression approach, the VNTR loci set of MIRU10 (960), MTUB21 (1955), ETRB (2461), MTUB34 (3171), and MIRU31 (3192) were identified as a customized loci set. According to the HGDI value of each set, the loci set obtained based on LASSO regression exhibited the promising discriminative power in this genotype (HGDI: 0.943). The specificity and sensitivity values of the LASSO-based loci also were more significant compared with the APD results (Table 1). In the Beijing population, as shown in Tables 1, 2, the effective VNTR loci for the discrimination of the genotype had an APD value of more than 50% consistent with our previous result (Hadifar et al., 2019a). Based on LASSO regression, MTUB21 (1955), QUB11B (2163b), ETRA (2165), MTUB30 (2401), and MIRU31 (3192) loci were identified as a customized loci set, for identification of Beijing genotype with a promising discriminatory value (HGDI: 0.934). As depicted in Table 1, in other genotypes, including Haarlem, Cameroon, LAM, and Ural1, the LASSO-based loci set showed a higher HGDI value than the APD-based loci set. The values of sensitivity and specificity exhibited a variation between the two strategies in each genotype. In T genotype, HGDI value obtained based on APD approach showed promising discriminative power.

Discussion

The 24-loci MIRU–VNTR genotyping has been used as an international standard method for Mtb genotyping. However, regarding a specific strain background and polymorphism which can affect the resolution of discrimination, various studies in the different geographical regions have tried to explore optimized VNTR sets for improving the standard typing method and offered different VNTRs combinations (Ali et al., 2007; Asante-Poku et al., 2014). In addition, previous studies have shown that a combination of VNTR loci selected based on the discrimination power method (HGDI) displayed a low capacity for a precise classification in endemic genotypes (Pan et al., 2017; Hadifar et al., 2019a). In this regard, we used two different approaches, including APD and LASSO regression to identify an optimized VNTR set in our studied Mtb population. We first established a specific genotype panel based on two utilized methods. In the next step, a comparison of the HGDI of each VNTR set with HGDI, sensitivity, and specificity of the reference method showed that LASSO regression compared with APD was more effective in discrimination of the studied Mtb genotype, especially for CAS, NEW-1, and Beijing genotypes. Different studies showed that there is a remarkable genotypic diversity among Mtb population in Iran (Merza et al., 2010; Mansoori et al., 2018; Hadifar et al., 2019b,2021; Kochkaksaraei et al., 2019; Vaziri et al., 2019). According to the reports, certain genotypes, such as CAS, NEW-1, and Beijing are more medically important than other genotypes in this geographical region. These genotypes showed a high capacity for drug resistance acquisition (Sinkov et al., 2016). A more recent meta-analysis study using MIRU-VNTR typing and Spoligotyping data demonstrated that NEW-1 and CAS are the most dominant Mtb genotypes in Iran. Besides, Beijing was shown as the prominent genotype circulating among the multidrug-resistant Mtb population (Hadifar et al., 2021). Based on this diversity, an in-depth study for the precise identification of Mtb genotypes can be helpful for better management and control of their infection. Some MIRU-VNTR-based studies have suggested a set of loci including MIRU10 (960), MIRU16 (1644), MIRU26 (2996), MIRU27 (3007), MIRU31 (3192), MIRU39 (4348), MIRU40 (802), MTUB21 (1955), and QUB26 (4052) are more discriminative in the standard 24 MIRU-VNTR loci panel for identification of different Mtb isolates circulating in Iran (Zamani et al., 2016; Mansoori et al., 2018). However, the described loci were also offered based on HGDI, which previously showed that selecting VNTR loci based on this index displayed a low capacity for a precise classification in endemic genotypes (Pan et al., 2017; Hadifar et al., 2019a). We have previously reported that using the APD approach may facilitate efficient discrimination and clustering of NEW-1 and CAS populations (Hadifar et al., 2019a). In the present study, 5 optimized VNTR loci namely, ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MIRU39 (4348) (with a sensitivity of 98% and a specificity of 93.7% in internal validity and sensitivity of 100%, and a specificity of 94,.4% in external validity) and five optimized loci, including ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MTUB21 (1955) (with a sensitivity of 83.1% and a specificity of 99.5%) were identified based on LASSO regression and APD approaches for CAS genotype, respectively. The APD result was consistent with our previous study (Hadifar et al., 2019a), in which VNTR loci with APDs of more than 50% could efficiently discriminate the CAS and non-CAS genotypes. In addition, the selected loci were similar to those reported in our previous study (Hadifar et al., 2019a). Four VNTR loci, including ETRC (577), MIRU10 (960), MIRU16 (1644), and MIRU26 (2996) were also found in LASSO analysis. In a study by Rasoahanitralisoa et al. (2017) a combination of ten VNTR loci [MIRU10 (960), QUB4156 (4156), MTUB04 (424), MIRU16 (1644), MIRU40 (802), ETRC (577), MTUB39 (3690), QUB26 (4052), MIRU31 (3192), and QUB11B (2163b)] have been proposed as discriminative loci for CAS genotype combined principal component analysis (PCA) and HGDI methods, which only five loci (QUB4156 (4156), MIRU10 (960), MIRU16 (1644), MIRU40 (802), and MTUB04 (424) of the panel have been selected based on the PCA method. Overall, these findings suggest that the combination of VNTR ETRC (577), MIRU10 (960), and MIRU16 (1644), might be considered a preliminary marker that facilitates the identification of the CAS population with higher discriminatory power. In addition, we found the LASSO regression-based VNTR selection process offers a more helpful strategy for precise discrimination of CAS from the non-CAS population in different settings. In the NEW-1 population, as a member of the Euro-American super lineage, five combinations of VNTR loci based on APD and five combinations of VNTR loci based on LASSO were found with a high capacity for efficient discrimination. In addition, a combination of VNTR MIRU10 (960), ETRB (2461), and MTUB34 (3171) which were similar between the two methods could be used a primary set for identifying the NEW-1 family. The efficient implementation of these methods for genotype data is helpful for the effective selection of VNTR loci for NEW-1 discrimination. However, based on the specificity of each selected set, we can suggest that using LASSO may be more effective than APD for discrimination of the NEW-1 population. In Beijing population, as the third dominant genotype, a set of loci, which can more effectively classify this genotype were obtained based on the LASSO method, including VNTR MTUB21 (1955), QUB11B (2163b), ETRA (2165), MTUB30 (2401), and MIRU31 (3192). (With a sensitivity of 96.4% and a specificity of 98.4% in internal validity and sensitivity of 100%, and a specificity of 83.9% in external validity, HGDI: 0.934). Five loci set selected based on APD method also showed the high discriminatory power in this genotype (with a sensitivity of 80.5% and a specificity of 98.9%, HGDI: 0.876). In a previous study, 12 optimized loci were suggested for discrimination of Beijing genotype based on the combined PCA and HGDI methods. Among these loci, only five loci [QUB11B (2163b), MIRU26 (2996), QUB26 (4052), MTUB30 (2401), and ETRC (577)] were selected based on the PCA result (Rasoahanitralisoa et al., 2017). A different combination of loci has been described in Pan et al. (2017) study based on the APD method. It can be speculated that these discordances reflect the difference in the specific circulating sub-lineage, geographical region, VNTR loci to identify a suitable set, and sample size. Other genotypes identified in our study were a part of the Euro-American super lineage, which is the most widely dispersed lineage. Different customized VNTR loci sets were offered based on both applied approaches to high effective discrimination of these groups. However, the selected loci sets based on both methods were reliable and promising in most of these genotypes, based on HGDI value, LASSO-based VNTR loci were more discriminative than the APD-based loci. Besides, no relevant study was found on the discriminative power of the loci combination based on APD, LASSO, and/or other statistical methods for Haarlem, Cameroon, LAM, and Ural1. Further investigations are crucial to accredit the efficiency of our implemented approaches in selecting discriminative loci in the other main Mtb genotypes and some genotypes in our study with small sample size like T. Another limitation of this study is that selection of VNTR loci was based on the predefined panel for standard 24 loci MIRU-VNTR typing, so evaluating additional loci could be beneficial to improve the efficiency of the selected customized loci set in discrimination of Mtb isolates in different settings. In addition, more Mtb genotypes should have been included in the current study.

Conclusion

In summary, we proposed customized genotype-specific MIRU-VNTR loci sets based on the LASSO regression and APD approaches for precise Mtb strains identification. As the proposed VNTR set offered a comparable discriminatory power to the standard 24 MIRU-VNTR loci set could be promising alternatives to the 24-loci MIRU–VNTR genotyping for using in resource-limited-settings. Moreover, our findings suggested the LASSO regression rather than the APD approach is more effective in the determination of possible discriminative VNTR loci set to precise discrimination of our studied Mtb population and may be beneficial to be used in finding reduced number loci set in other Mtb genotypes or sublineages.

Data availability statement

The original contributions presented in this study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author/s.

Ethics statement

The studies involving human participants were reviewed and approved by Pasteur Institute of Iran. The patients/participants provided their written informed consent to participate in this study.

Author contributions

SH and FV wrote the manuscript. FV supervised the project. FV, SDS, and AF designed the project. SH, NE, and MKK performed laboratory work. SE, MN, and MAKK performed the statistical analysis. All authors have read and approved the final manuscript.
  28 in total

1.  SITVITWEB--a publicly available international multimarker database for studying Mycobacterium tuberculosis genetic diversity and molecular epidemiology.

Authors:  Christophe Demay; Benjamin Liens; Thomas Burguière; Véronique Hill; David Couvin; Julie Millet; Igor Mokrousov; Christophe Sola; Thierry Zozio; Nalin Rastogi
Journal:  Infect Genet Evol       Date:  2012-02-17       Impact factor: 3.342

Review 2.  [Analytical procedure of variable number of tandem repeats (VNTR) analysis and effective use of analysis results for tuberculosis control].

Authors:  Yushi Hachisu; Ruiko Hashimoto; Kazunori Kishida; Eiji Yokoyama
Journal:  Rinsho Byori       Date:  2013-12

3.  The most predominant spoligopatterns of Mycobacterium tuberculosis isolates among Iranian, Afghan-immigrant, Pakistani and Turkish tuberculosis patients: a comparative analysis.

Authors:  Muayad A Merza; Parissa Farnia; Ahmad M Salih; Mohammad Reza Masjedi; Ali Akbar Velayati
Journal:  Chemotherapy       Date:  2010-06-16       Impact factor: 2.544

4.  Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis.

Authors:  Philip Supply; Caroline Allix; Sarah Lesjean; Mara Cardoso-Oelemann; Sabine Rüsch-Gerdes; Eve Willery; Evgueni Savine; Petra de Haas; Henk van Deutekom; Solvig Roring; Pablo Bifani; Natalia Kurepina; Barry Kreiswirth; Christophe Sola; Nalin Rastogi; Vincent Vatin; Maria Cristina Gutierrez; Maryse Fauville; Stefan Niemann; Robin Skuce; Kristin Kremer; Camille Locht; Dick van Soolingen
Journal:  J Clin Microbiol       Date:  2006-09-27       Impact factor: 5.948

5.  Origin and Global Expansion of Mycobacterium tuberculosis Complex Lineage 3.

Authors:  Yassir A Shuaib; Christian Utpatel; Thomas A Kohl; Ivan Barilar; Margo Diricks; Nadia Ashraf; Lothar H Wieler; Glennah Kerubo; Eyob A Mesfin; Awa Ba Diallo; Sahal Al-Hajoj; Perpetua Ndung'u; Margaret M Fitzgibbon; Farzam Vaziri; Vitali Sintchenko; Elena Martinez; Sofia O Viegas; Yang Zhou; Aya Azmy; Khaled Al-Amry; Sylvain Godreuil; Mandira Varma-Basil; Anshika Narang; Solomon Ali; Patrick Beckert; Viola Dreyer; Mwila Kabwe; Matthew Bates; Michael Hoelscher; Andrea Rachow; Andrea Gori; Emmanuel M Tekwu; Larissa K Sidze; Assam A Jean-Paul; Veronique P Beng; Francine Ntoumi; Matthias Frank; Aissatou Gaye Diallo; Souleymane Mboup; Belay Tessema; Dereje Beyene; Sadiq N Khan; Roland Diel; Philip Supply; Florian P Maurer; Harald Hoffmann; Stefan Niemann; Matthias Merker
Journal:  Genes (Basel)       Date:  2022-05-31       Impact factor: 4.141

Review 6.  Mycobacterium tuberculosis and whole-genome sequencing: how close are we to unleashing its full potential?

Authors:  G Satta; M Lipman; G P Smith; C Arnold; O M Kon; T D McHugh
Journal:  Clin Microbiol Infect       Date:  2017-11-03       Impact factor: 8.067

7.  Genotyping of the Mycobacterium tuberculosis complex using MIRUs: association with VNTR and spoligotyping for molecular epidemiology and evolutionary genetics.

Authors:  Christophe Sola; Ingrid Filliol; Eric Legrand; Sarah Lesjean; Camille Locht; Philippe Supply; Nalin Rastogi
Journal:  Infect Genet Evol       Date:  2003-07       Impact factor: 3.342

8.  A quantitative and efficient approach to select MIRU-VNTR loci based on accumulation of the percentage differences of strains for discriminating divergent Mycobacterium tuberculosis sublineages.

Authors:  Xin-Ling Pan; Chun-Lei Zhang; Chie Nakajima; Jin Fu; Chang-Xia Shao; Li-Na Zhao; Jia-Yi Cui; Na Jiao; Chang-Long Fan; Yasuhiko Suzuki; Toshio Hattori; Di Li; Hong Ling
Journal:  Emerg Microbes Infect       Date:  2017-07-26       Impact factor: 7.163

9.  Characterization of Mycobacterium tuberculosis Central Asian Strain 1 using mycobacterial interspersed repetitive unit genotyping.

Authors:  Asho Ali; Zahra Hasan; Mahnaz Tanveer; Amna R Siddiqui; Solomon Ghebremichael; Gunilla Kallenius; Rumina Hasan
Journal:  BMC Microbiol       Date:  2007-08-09       Impact factor: 3.605

10.  Enhancing the differentiation of specific genotypes in Mycobacterium tuberculosis population.

Authors:  Shima Hadifar; Mansour Kargarpour Kamakoli; Abolfazl Fateh; Seyed Davar Siadat; Farzam Vaziri
Journal:  Sci Rep       Date:  2019-11-29       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.