| Literature DB >> 34193941 |
Nguyen Thi Le Hang1, Minako Hijikata2, Shinji Maeda3, Akiko Miyabayashi2, Keiko Wakabayashi2, Shintaro Seto2, Nguyen Thi Kieu Diem4, Nguyen Thi Thanh Yen4, Le Van Duc5, Pham Huu Thuong6, Hoang Van Huan6, Nguyen Phuong Hoang7, Satoshi Mitarai8, Naoto Keicho9,10, Seiya Kato11.
Abstract
Mycobacterium tuberculosis (Mtb) has different features depending on different geographic areas. We collected Mtb strains from patients with smear-positive pulmonary tuberculosis in Da Nang, central Vietnam. Using a whole genome sequencing platform, including genome assembly complemented by long-read-sequencing data, genomic characteristics were studied. Of 181 Mtb isolates, predominant Vietnamese EAI4_VNM and EAI4-like spoligotypes (31.5%), ZERO strains (5.0%), and part of EAI5 (11.1%) were included in a lineage-1 (L1) sublineage, i.e., L1.1.1.1. These strains were found less often in younger people, and they genetically clustered less frequently than other modern strains. Patients infected with ZERO strains demonstrated less lung infiltration. A region in RD2bcg spanning six loci, i.e., PE_PGRS35, cfp21, Rv1985c, Rv1986, Rv1987, and erm(37), was deleted in EAI4_VNM, EAI4-like, and ZERO strains, whereas another 118 bp deletion in furA was specific only to ZERO strains. L1.1.1.1-sublineage-specific deletions in PE_PGRS4 and PE_PGRS22 were also identified. RD900, seen in ancestral lineages, was present in majority of the L1 members. All strains without IS6110 (5.0%) had the ZERO spoligo-pattern. Distinctive features of the ancestral L1 strains provide a basis for investigation of the modern versus ancestral Mtb lineages and allow consideration of countermeasures against this heterogeneous pathogen.Entities:
Year: 2021 PMID: 34193941 PMCID: PMC8245516 DOI: 10.1038/s41598-021-92984-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Proportions of Mtb lineages/sublineages by the SNP barcode and clades by in-silico spoligotyping in Da Nang samples (N = 181).
| (Sub)Lineage/spoligotype | The spoligo international type (SIT) numbers | Freq. (n) | Proportion per lineage (%) | Overall proportion (%) |
|---|---|---|---|---|
| L1.1.1 | 9 | 10.3 | 5.0 | |
| EAI5 | 234, 236, 792 | 6 | 6.9 | 3.3 |
| EAI5-like | NA | 3 | 3.4 | 1.7 |
| L1.1.1.1 | 78 | 89.7 | 43.1 | |
| EAI4_VNM | 139, 456, 564, 514, 622, 1731, 2722, Orphan | 49 | 56.3 | 27.1 |
| EAI4_VNM-like | 3196, NA | 8 | 9.2 | 4.4 |
| ZERO | 405, 802 | 9 | 10.3 | 5.0 |
| EAI5 | 236, 458, 618 | 8 | 9.2 | 4.4 |
| EAI5-like | NA | 3 | 3.4 | 1.7 |
| Unknown | NA | 1 | 1.1 | 0.6 |
| L2.1 (proto Beijing) | 623, NA | 3 | 4.7 | 1.7 |
| L2.2.2 (ancient Beijing) | 1, 269 | 2 | 3.1 | 1.1 |
| L2.2.1 (ancient Beijing) | 1, 190 | 14 | 21.9 | 7.7 |
| L2.2.1 (modern Beijing) | 1, 190 | 38 | 59.4 | 21.0 |
| L2.2.1.1 (modern Beijing) | 1 | 7 | 10.9 | 3.9 |
| L4.1 (Unknown, X1) | 336, NA | 8 | 26.7 | 4.4 |
| L4.1.1 (X1) | 119 | 2 | 6.7 | 1.1 |
| L4.1.2 (T) | 73 | 1 | 3.3 | 0.6 |
| L4.1.2.1 (H2, T2) | 2, 888, 52 | 3 | 10.1 | 1.7 |
| L4.2.2. (T1) | 51 | 4 | 13.3 | 2.2 |
| L4.3.1 (LAM9) | 42 | 1 | 3.3 | 0.6 |
| L4.3.3 (LAM9) | 388 | 1 | 3.3 | 0.6 |
| L4.4.1.2 (Unknown) | NA | 3 | 10.0 | 1.7 |
| L4.4.2 (T2, Unknown) | 52, NA | 2 | 6.7 | 1.1 |
| L4.5 (H3) | 50 | 1 | 3.3 | 0.6 |
| L4.8 (T1, Unknown) | 53, NA | 4 | 13.3 | 2.2 |
Mtb Mycobacterium tuberculosis, NA not available, SNP single nucleotide polymorphism.
Updated SNP barcode by Napier et al.[5].
Figure 1Phylogenetic tree of 181 Da Nang strains (a) and 332 Hanoi strains (the northern Vietnam data set) (b), constructed using variants after mapping with H37Rv. Phylogenetic trees were constructed with the maximum likelihood method using RAxML version 8.2.8 (https://github.com/stamatak/standard-RAxML) and visualized with plotTree for python v2.7 (https://github.com/katholt/plotTree). Regions of difference (RDs), deletions, and SNVs in correlation with Mtb clades are depicted. Mtb Mycobacterium tuberculosis, SNV single nucleotide variant, Del deletion, aBJ ancient Beijing, mBJ modern Beijing, NA not available.
Figure 2Phylogenetic tree of 87 lineage-1 strains from the Da Nang cohort. AP018033.1 (EAI4_VNM) was used as a reference genome. Phylogenetic trees were constructed with the maximum likelihood method using RAxML version 8.2.8 (https://github.com/stamatak/standard-RAxML) and visualized with plotTree for python v2.7 (https://github.com/katholt/plotTree). RD239, RD2bcg, deletion in furA, and SNVs in correlation with Mtb clades are shown. Mtb Mycobacterium tuberculosis, SNV single nucleotide variant, Del deletion.
Univariate and multivariate analyses using logistic regression models.
| Factors | n/N (%) | Univariate | Multivariate | ||
|---|---|---|---|---|---|
| OR | 95% CI | aOR** | 95% CI | ||
| Age strata* (increased by one level) | NA | 1.60 | 1.06–2.44 | 1.56 | 1.01–2.42 |
| Gender | |||||
| Male | 67/150 (44.7) | Ref | Ref | ||
| Female | 11/31 (35.5) | 0.68 | 0.31–1.52 | 0.85 | 0.36–1.99 |
| BMI (increased by one unit) | NA | 1.03 | 0.91–1.17 | 1.03 | 0.91–1.17 |
| Lineage/sublineage | |||||
| Other | 7/38 (18.4) | Ref | Ref | ||
| L1.1.1.1 | 15/72 (20.8) | 1.17 | 0.43–3.16 | 1.33 | 0.48–3.69 |
| L2/Beijing**** | 24/56 (42.9) | 3.32 | 1.25–8.82 | 3.57 | 1.30–9.77 |
| Age strata (increased by one level) | NA | 0.66 | 0.40–1.07 | 0.62 | 0.36–1.07 |
| Gender: Male | 39/139 (28.1) | Ref | Ref | ||
| Female | 7/27 (25.9) | 0.90 | 0.35–2.29 | 0.58 | 0.21–1.66 |
| BMI (increased by one unit) | NA | 0.94 | 0.81–1.09 | 0.96 | 0.82–1.13 |
| Lineage/sublineage | |||||
| Other | 16/42 (38.1) | Ref | Ref | ||
| L1.1.1.1 | 5/78 (6.4) | 0.11 | 0.04–0.33 | 0.11 | 0.03–0.32 |
| L2/Beijing**** | 6/61 (9.8) | 0.18 | 0.06–0.51 | 0.16 | 0.06–0.48 |
| Age strata (increased by one level) | NA | 0.89 | 0.51–1.58 | 1.15 | 0.61–2.15 |
| Gender: Male | 21/150 (14.0) | Ref | Ref | ||
| Female | 6/31 (19.4) | 1.47 | 0.54–4.02 | 1.85 | 0.59–5.82 |
| BMI (increased by one unit) | NA | 1.02 | 0.86–1.20 | 1.00 | 0.82–1.21 |
BMI body mass index, NA not applicable, Mtb Mycobacterium tuberculosis, SNV single nucleotide variant.
*Age strata: < 35.0, 35.0–54.9, ≥ 55 years old.
**aOR: adjusted odds ratio.
***Phenotypic resistance to at least one of the four major drugs (Rifampicin, Isoniazid, Ethambutol, and Streptomycin).
****Except L2.1, the proto-Beijing genotype.
Characteristics of ZERO strains (a) and multivariate analysis using logistic regression models to investigate factors possibly associated with ZERO strains (b).
| (a) | Patients ID | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Characteristics | DN-010 | DN-059 | DN-068 | DN-084 | DN-101 | DN-110 | DN-128 | DN-161 | DN-263 |
| Gender, age | M, 48.6 | M, 50.8 | M, 26.9 | M, 60.1 | M, 44.5 | M, 61.2 | M, 42.1 | M, 28.7 | M, 64.6 |
| Body mass index | 18.7 | 17.2 | 19.5 | 20.2 | 14.6 | 17.9 | 20.0 | 20.8 | 15.2 |
| Type of cases | New | New | New | New | New | Retreated | New | New | New |
| HIV | Neg | Neg | Neg | Neg | Neg | Neg | Neg | Neg | Neg |
| Phenotypic drug susceptibility | |||||||||
| Isoniazid 0.2 µg/mL | S | S | S | S | S | S | R | R | S |
| Isoniazid 1 µg/mL | S | S | S | S | S | S | R | S | S |
| Rifampicin | S | S | S | S | S | S | S | S | S |
| Streptomycin | S | S | S | S | S | S | S | R | S |
| Ethambutol | S | S | S | S | S | S | S | S | S |
| Cavity area* on chest X-ray | NA | 1 | 1 | 4 | 0 | 0 | 0 | 0 | NA |
| Infiltrate area* on chest X-ray | NA | 1 | 0 | 0 | 1 | 4 | 1 | 1 | NA |
| Smear grade before treatment | 2 + | 3 + | 1 + | 1 + | 3 + | 1 + | 2 + | 3 + | 2 + |
| Culture grade before treatment | 3 + | 1 + | 2 + | Scanty | 2 + | 2 + | 1 + | 2 + | 1 + |
| Sublineage classification | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 | L1.1.1.1 |
| Spoligotype | ZERO | ZERO | ZERO | ZERO | ZERO | ZERO | ZERO | ZERO | ZERO |
| SIT | 405 | 405 | 405 | 405 | 405 | 405 | 802 | 405 | 405 |
| RD239 | Del | Del | Del | Del | Del | Del | Del | Del | Del |
| RD2bcg | Del | Del | Del | Del | Del | Del | Del | Del | Del |
| No | No | No | No | No | No | Yes | Yes | No | |
| Clustered | No | No | No | No | No | No | No | No | No |
| Copy number of IS | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
HIV human immunodeficiency virus, M male, S sensitive, R resistant, OR odds ratio, 95% CI: 95% confidence interval.
*Number of lung zones.