| Literature DB >> 34477155 |
Zhezhe Cui1, Jun Liu2, Yue Chang3, Dingwen Lin1, Dan Luo4, Jing Ou1, Liwen Huang1.
Abstract
ABSTRACT: We aimed to investigate the genetic and demographic differences and interactions between areas where observed genomic variations in Mycobacterium tuberculosis (M. tb) were distributed uniformly in cold and hot spots.The cold and hot spot areas were identified using the reported incidence of TB over the previous 5 years. Whole genome sequencing was performed on 291 M. tb isolates between January and June 2018. Analysis of molecular variance and a multifactor dimensionality reduction (MDR) model was applied to test gene-gene-environment interactions. Adjusted odds ratios (OR) and 95% confidence intervals (CI) were computed to test the extent to which genetic mutation affects the TB epidemic using a multivariate logistic regression model.The percentage of the Beijing family strain in hot spots was significantly higher than that in cold spots (64.63% vs 50.69%, P = .022), among the elderly, people with a low BMI, and those having a history of contact with a TB patient (all P < .05). Individuals from cold spot areas had a higher frequency of out-of-town traveling (P < .05). The mutation of Rv1186c, Rv3900c, Rv1508c, Rv0210, and an Intergenic Region (SNP site: 3847237) showed a significant difference between cold and hot spots. (P < .001). The MDR model displayed a clear negative interaction effect of age groups with BMI (interaction entropy: -3.55%) and mutation of Rv0210 (interaction entropy: -2.39%). Through the mutations of Rv0210 and BMI had a low independent effect (interaction entropy: -1.46%).Our data suggests a statistically significant role of age, BMI and the polymorphisms of Rv0210 genes in the transmission and development of M. tb. The results provide clues for the study of susceptibility genes of M. tb in different populations. The characteristic strains showed a local epidemic. Strengthening genotype monitoring of strains in various regions can be used as an early warning signal of epidemic spillover.Entities:
Mesh:
Year: 2021 PMID: 34477155 PMCID: PMC8415957 DOI: 10.1097/MD.0000000000027125
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Comparison of demographic characteristics of in TB cold and hot spots.
| Variables | Cold spot (n%) | Hot spot (n%) | |
| Age_ group | |||
| <30 | 35 (24.3) | 10 (6.8) | <.001 |
| 30–49 | 40 (27.8) | 47 (32.0) | |
| ≥50 | 69 (47.9) | 90 (61.2) | |
| Gender | |||
| Male | 114 (79.2) | 113 (76.9) | .74 |
| Female | 30 (20.8) | 34 (23.1) | |
| Ethnicity | |||
| Han | 139 (96.5) | 5 (3.4) | <.001 |
| Others∗ | 5 (3.5) | 142 (96.6) | |
| Income (Yuan) | |||
| <3000 | 100 (72.5) | 133 (91.1) | <.001 |
| 3000–4999 | 21 (15.2) | 12 (8.2) | |
| ≥5000 | 17 (12.3) | 1 (0.7) | |
| Migration | |||
| No | 110 (76.4) | 129 (87.8) | .017 |
| Yes | 34 (23.6) | 18 (12.2) | |
| TB patient contact history | |||
| No | 130 (91.5) | 118 (80.8) | .014 |
| Yes | 12 (8.5) | 28 (19.2) | |
| BMI | |||
| <18 | 11 (7.6) | 39 (26.5) | <.001 |
| 18–24.99 | 127 (88.2) | 100 (68) | |
| ≥25 | 6 (4.2) | 8 (5.4) | |
| BCG vaccination | |||
| No | 18 (12.5) | 23 (15.6) | .624 |
| Yes | 112 (77.8) | 113 (76.9) | |
| Unknown | 14 (9.7) | 11 (7.5) | |
| History of DM | |||
| No | 122 (84.7) | 111 (75.5) | .134 |
| Yes | 21 (14.6) | 33 (22.4) | |
| Unknown | 1 (0.7) | 3 (2) | |
| Drinking status | |||
| Never | 98 (68.1) | 87 (59.2) | .289 |
| Former | 28 (19.4) | 36 (24.5) | |
| Current | 18 (12.5) | 24 (16.3) | |
| Smoking status | |||
| Never | 73 (50.7) | 70 (47.6) | .696 |
| Former | 48 (33.3) | 48 (32.7) | |
| Current | 23 (16.0) | 29 (19.7) | |
| Status of MDR | |||
| Yes | 2 | 6 | .296 |
| No | 142 | 141 | |
More than 95% were from the Zhuang ethnic minority group. BCG = Bacillus Calmette–Guérin. BMI = body mass index, DM = diabetes mellitus, MDR = multi-drug resistance, TB = tuberculosis.
Figure 1Fixed index (F-statistics) distribution of M. tb mutations between cold and hot spot areas.
The information of 5 SNP sites with high fixed index.
| Reference sequence | SNP position | F | Gene locus | Gene product |
| AL123456.3 | 1328687 | 0.133141 | Rv1186c | Conserved protein |
| AL123456.3 | 4386228 | 0.11225 | Rv3900c | Conserved hypothetical alanine rich protein |
| AL123456.3 | 3847237 | 0.111604 | IGR∗ | |
| AL123456.3 | 1699849 | 0.107462 | Rv1508c | Probable membrane protein |
| AL123456.3 | 251575 | 0.100608 | Rv0210 | Hypothetical protein |
IGR = intergenic region.
Comparison of high-difference SNP sites.
| SNP locus | Cold spot (n%) | Hot spot (n%) | Odds ratio (OR) | |
| 1328687 (Rv1186c) | ||||
| None (Ref.) | 46 (31.9) | 87 (59.2) | 0.32 (0.2,0.52) | <.001 |
| Mutation | 98 (68.1) | 60 (40.8) | ||
| 4386228 (Rv3900c) | ||||
| None (Ref.) | 85 (59) | 50 (34) | 2.79 (1.74,4.5) | <.001 |
| Mutation | 59 (41) | 97 (66) | ||
| 3847237 (IGR) | ||||
| None (Ref.) | 103 (71.5) | 69 (46.9) | 2.84 (1.75,4.62) | <.001 |
| Mutation | 41 (28.5) | 78 (53.1) | ||
| 1699849 (Rv1508c) | ||||
| None (Ref.) | 95 (66) | 61 (41.5) | 2.73 (1.7,4.4) | <.001 |
| Mutation | 49 (34) | 86 (58.5) | ||
| 251575 (Rv0210) | ||||
| None (Ref.) | 81 (56.2) | 48 (32.7) | 2.65 (1.65,4.27) | <.001 |
| Mutation | 63 (43.8) | 99 (67.3) | ||
The best model for predicting the likelihood of M. tb strains appearing in hot or cold spots.
| Best Model | Training accuracy (%) | Testing accuracy (%) | CVC |
| |
| Rv0210 | 63.8 | 59.8 | 8/10 | 21.75 | <.001 |
| Rv0210, BMI | 67.4 | 61.2 | 8/10 | 35.47 | <.001 |
| Rv0210, Age groups, BMI | 70.1 | 60.5 | 4/10 | 44.04 | <.001 |
CVC = cross-validation consistency
Figure 2Distribution of high-risk and low-risk genotypes in the best model. Dark gray and light gray boxes represent high and low-risk factor combinations, respectively. Bars on the left within each box represent hot spots while those on the right represent cold spots. The numbers 0 and 1 appearing at the top and left of each box represent the classification of variables, respectively. The heights of the bars are proportional to the sample size in each group.
Figure 3Hierarchical interaction graphs and interaction dendrograms. (A) For the hierarchical interaction graphs, the proportions at the bottom of each factor/mutation of SNP sites represent entropy, and the percentage on each line represents the interaction proportion of entropy between the 2 factors/mutations of SNP sites. The blue line represents redundancy interaction and the green line represents weak redundancy interaction. (B) For the interaction dendrograms, the red line represents synergy redundancy interaction and the blue line represents redundancy interaction. From left to right the interaction was more intensive.