| Literature DB >> 27149626 |
Erasto V Mbugi1,2, Bugwesa Z Katale2,3, Elizabeth M Streicher4, Julius D Keyyu3, Sharon L Kendall5, Hazel M Dockrell6, Anita L Michel7, Mark M Rweyemamu8, Robin M Warren4, Mecky I Matee2, Paul D van Helden4, David Couvin9, Nalin Rastogi9.
Abstract
The aim of this study was to assess and characterize Mycobacterium tuberculosis complex (MTBC) genotypic diversity in Tanzania, as well as in neighbouring East and other several African countries. We used spoligotyping to identify a total of 293 M. tuberculosis clinical isolates (one isolate per patient) collected in the Bunda, Dar es Salaam, Ngorongoro and Serengeti areas in Tanzania. The results were compared with results in the SITVIT2 international database of the Pasteur Institute of Guadeloupe. Genotyping and phylogeographical analyses highlighted the predominance of the CAS, T, EAI, and LAM MTBC lineages in Tanzania. The three most frequent Spoligotype International Types (SITs) were: SIT21/CAS1-Kili (n = 76; 25.94%), SIT59/LAM11-ZWE (n = 22; 7.51%), and SIT126/EAI5 tentatively reclassified as EAI3-TZA (n = 18; 6.14%). Furthermore, three SITs were newly created in this study (SIT4056/EAI5 n = 2, SIT4057/T1 n = 1, and SIT4058/EAI5 n = 1). We noted that the East-African-Indian (EAI) lineage was more predominant in Bunda, the Manu lineage was more common among strains isolated in Ngorongoro, and the Central-Asian (CAS) lineage was more predominant in Dar es Salaam (p-value<0.0001). No statistically significant differences were noted when comparing HIV status of patients vs. major lineages (p-value = 0.103). However, when grouping lineages as Principal Genetic Groups (PGG), we noticed that PGG2/3 group (Haarlem, LAM, S, T, and X) was more associated with HIV-positive patients as compared to PGG1 group (Beijing, CAS, EAI, and Manu) (p-value = 0.03). This study provided mapping of MTBC genetic diversity in Tanzania (containing information on isolates from different cities) and neighbouring East African and other several African countries highlighting differences as regards to MTBC genotypic distribution between Tanzania and other African countries. This work also allowed underlining of spoligotyping patterns tentatively grouped within the newly designated EAI3-TZA lineage (remarkable by absence of spacers 2 and 3, and represented by SIT126) which seems to be specific to Tanzania. However, further genotyping information would be needed to confirm this specificity.Entities:
Mesh:
Year: 2016 PMID: 27149626 PMCID: PMC4858144 DOI: 10.1371/journal.pone.0154571
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Description of clusters containing 3 or more isolates in this study, and their worldwide distribution in the SITVIT2 database.
| SIT/Lineage | Number in study | % in study | % in study vs. database | Distribution in Regions with > = 3% of a given SITs | Distribution in countries with > = 3% of a given SITs |
|---|---|---|---|---|---|
| 21/CAS1-Kili | 76 | 25.94 | 15.97 | AFRI-E 65.97, EURO-N 10.08, EURO-W 6.3, ASIA-W 5.25, AMER-N 5.25, AFRI-S 4.83 | TZA 34.66, ETH 9.87, MDG 9.66, USA 5.25, ZAF 4.83, ZMB 4.62, SWE 3.99, KEN 3.57 |
| 59/LAM11-ZWE | 22 | 7.51 | 4.54 | AFRI-E 58.35, AFRI-S 15.26, EURO-W 14.64, AMER-N 3.3 | ZMB 21.86, ZWE 15.88, ZAF 15.26, BEL 13.4, TZA 11.13, USA 3.3, MWI 3.3 |
| 126/EAI5 | 18 | 6.14 | 12.08 | ASIA-S 37.58, ASIA-W 16.78, AFRI-E 15.44, EURO-N 13.42, EURO-W 5.37, ASIA-SE 4.03, AMER-N 4.03 | IND 30.2, TZA 14.77, OMN 11.41, GBR 10.07, SAU 5.37, NLD 4.7, LKA 4.7, USA 4.03 |
| 53/T1 | 15 | 5.12 | 0.23 | AMER-S 14.97, EURO-W 14.82, AMER-N 12.77, EURO-S 8.91, EURO-N 7.09, ASIA-W 6.92, AFRI-S 4.7, AFRI-E 4.63, ASIA-E 4.04, AFRI-N 3.33, EURO-E 3.09, CARI 3.06, AMER-C 3.06 | USA 12.5, FXX 7.46, BRA 5.55, ITA 5.05, ZAF 4.6, PER 3.69, TUR 3.29, AUT 3.24 |
| 26/CAS1-Delhi | 14 | 4.78 | 0.8 | ASIA-S 55.97, AMER-N 12.46, ASIA-W 12.12, AFRI-E 5.06, EURO-W 4.84, EURO-N 3.75 | IND 38.91, USA 12.46, PAK 8.19, OMN 6.2, BGD 4.72, SAU 4.44, IRN 3.53 |
| 1/Beijing | 12 | 4.1 | 0.11 | ASIA-E 32.35, AMER-N 19.02, ASIA-SE 10.11, AFRI-S 7.85, ASIA-N 6.59, ASIA-S 5.64, EURO-N 3.46, AMER-S 3.44 | CHN 19.22, USA 18.78, JPN 10.91, ZAF 7.85, RUS 6.59, VNM 3.69, IND 3.26, PER 3.0 |
| 8/EAI5 | 12 | 4.1 | 6 | AFRI-E 21.5, ASIA-W 18.5, AMER-C 17.0, EURO-N 15.0, AMER-N 14.5, ASIA-S 4.0, AFRI-S 3.0 | MEX 17.0, USA 14.5, OMN 14.5, TZA 11.5, DNK 8.0, MOZ 6.0, GBR 5.0, ZMB 4.0, IND 4.0, ZAF 3.0, SAU 3.0 |
| 345/T3-ETH | 7 | 2.39 | 33.33 | AFRI-E 57.14, EURO-N 33.33, AMER-N 9.52 | TZA 33.33, GBR 19.05, ETH 14.29, USA 9.52, KEN 9.52, FIN 9.52, NOR 4.76 |
| 42/LAM9 | 6 | 2.05 | 0.17 | AMER-S 31.94, AMER-N 11.17, EURO-S 10.66, EURO-W 8.92, AFRI-N 8.07, EURO-N 4.62, CARI 4.0, AFRI-E 3.52, AMER-C 3.35 | BRA 12.89, USA 11.17, COL 7.18, MAR 6.64, ITA 6.16, FXX 4.76, PER 3.49, ESP 3.15, VEN 3.12 |
| 420/T2-uganda | 6 | 2.05 | 24 | AFRI-E 40.0, AMER-N 32.0, EURO-W 24.0, EURO-N 4.0 | USA 32.0, TZA 32.0, BEL 20.0, ZMB 4.0, UGA 4.0, SWE 4.0, DEU 4.0 |
| 2196/LAM11-ZWE | 6 | 2.05 | 30 | AFRI-E 70.0, AFRI-S 20.0, EURO-W 5.0, EURO-N 5.0 | TZA 35.0, MOZ 25.0, ZAF 20.0, ZMB 10.0, FIN 5.0, BEL 5.0 |
| 244/T1 | 5 | 1.71 | 4.03 | AFRI-E 16.13, AFRI-S 15.32, EURO-S 14.52, AMER-S 14.52, ASIA-S 12.9, EURO-W 12.1, AFRI-W 6.45, AMER-N 4.03 | ZAF 15.32, PRT 12.9, BRA 12.9, BGD 11.29, FXX 10.48, ZMB 6.45, GNB 5.65, TZA 4.84, USA 4.03 |
| 37/T3 | 4 | 1.37 | 0.73 | AFRI-E 18.98, AMER-S 12.96, EURO-N 11.31, EURO-W 10.95, ASIA-W 10.04, AMER-N 8.21, ASIA-E 5.84, EURO-S 5.47, EURO-E 4.2, ASIA-S 3.47, AFRI-S 3.1 | ETH 15.88, USA 7.3, CHL 5.47, SWE 4.93, SAU 4.75, CHN 4.75, FXX 4.01, ITA 3.83, BRA 3.47, ZAF 3.1, DNK 3.1 |
| 135/T2-uganda | 4 | 1.37 | 11.43 | EURO-W 34.29, AMER-N 17.14, AFRI-S 17.14, AFRI-E 17.14, ASIA-W 5.71 | ZAF 17.14, USA 17.14, TZA 11.43, DEU 11.43, NLD 8.57, AUT 8.57, SAU 5.71, FXX 5.71 |
| 149/T3-ETH | 4 | 1.37 | 0.82 | AFRI-E 58.4, EURO-N 21.52, ASIA-W 6.97, EURO-W 6.35, AMER-N 4.71 | ETH 57.58, DNK 11.68, SAU 6.56, USA 4.71, SWE 4.51, NLD 3.48 |
| 25/CAS1-Delhi | 3 | 1.02 | 0.45 | ASIA-W 25.26, AFRI-E 23.76, ASIA-S 17.59, AFRI-N 12.63, AMER-N 7.82, EURO-N 5.41, EURO-W 4.06 | ETH 22.26, SAU 14.14, IND 13.38, SDN 10.38, USA 7.82, OMN 6.32, IRQ 4.36 |
| 50/H3 | 3 | 1.02 | 0.07 | AMER-S 26.56, EURO-W 14.86, AMER-N 14.86, EURO-S 9.77, CARI 4.92, EURO-E 4.67, EURO-N 4.62, AFRI-N 3.6, AFRI-S 3.42, AFRI-M 3.2 | USA 14.84, PER 13.54, BRA 7.1, FXX 5.82, AUT 5.15, ITA 4.6, ESP 4.6, ZAF 3.42, CMR 3.15, CZE 3.1 |
| 128/T2-uganda | 3 | 1.02 | 8.57 | EURO-W 40.0, AFRI-S 25.71, AFRI-E 25.71 | ZAF 25.71, TZA 17.14, FXX 14.29, DEU 11.43, ZMB 8.57, NLD 8.57, BEL 5.71 |
* Worldwide distribution is reported for regions with more than 3% of a given SITs as compared to their total number in the SITVIT2 database. The definition of macro-geographical regions and sub-regions (http://unstats.un.org/unsd/methods/m49/m49regin.htm) is according to the United Nations; Regions: AFRI (Africa), AMER (Americas), ASIA (Asia), EURO (Europe), and OCE (Oceania), subdivided in: E (Eastern), M (Middle), C (Central), N (Northern), S (Southern), SE (South-Eastern), and W (Western). Furthermore, CARIB (Caribbean) belongs to Americas, while Oceania is subdivided in 4 sub-regions, AUST (Australasia), MEL (Melanesia), MIC (Micronesia), and POLY (Polynesia). Note that in our classification scheme, Russia has been attributed a new sub-region by itself (Northern Asia) instead of including it among rest of the Eastern Europe. It reflects its geographical localization as well as due to the similarity of specific TB genotypes circulating in Russia (a majority of Beijing genotypes) with those prevalent in Central, Eastern and South-Eastern Asia.
** The 3 letter country codes are according to http://en.wikipedia.org/wiki/ISO_3166-1_alpha-3; countrywide distribution is only shown for SITs with ≥3% of a given SITs as compared to their total number in the SITVIT2 database.
Fig 1Distribution map of Mycobacterium tuberculosis lineages in several countries of Africa according to the SITVIT2 database.
Fig 2A minimum spanning tree (MST) illustrating evolutionary relationships between M. tuberculosis spoligotypes.
MST constructed on all isolates including the orphan patterns (n = 293) in function of various characteristics. (A)MST based on phylogenetical lineages; (B) MST based on cities of isolation; and (C) MST based on HIV serology. The phylogenetic tree connects each genotype based on degree of changes required to go from one allele to another. The structure of the tree is represented by branches (continuous vs. dashed and dotted lines) and circles representing each individual pattern. Note that the length of the branches represents the distance between patterns while the complexity of the lines (continuous, gray dashed and gray dotted) denotes the number of allele/spacer changes between two patterns: solid lines, 1 or 2 or more changes (thicker ones indicate a single change, while the thinner one indicate 2 changes); gray dashed lines represent 3 changes; and gray dotted lines represent 4 or more changes. The size of the circle is proportional to the total number of isolates in our study, illustrating unique isolates (smaller nodes) versus clustered isolates (bigger nodes). The color of the circles indicates the phylogenetic lineage to which the specific pattern belongs.
Fig 3(A) Zoomed part of Spoligoforest tree showing SIT126 tentatively relabeled "EAI3-TZA" and its spoligotype descendants, along with their binary spoligotyping descriptions highlighting the specific absence of spacers 37, 38 and 40; and (B) Geographic distribution maps/Intensity maps of SIT126 and SIT8 (by percentage in country) according to SITVIT2 database before entering this study.
Distribution of lineages/sublineages of isolates from Tanzania (n = 293).
| Lineage/Sublineage | Number | % |
|---|---|---|
| CAS1-Delhi | 19 | 6,48 |
| CAS1-Kili | 78 | 26,62 |
| CAS2 | 2 | 0,68 |
| CAS (or CAS_Like) | 6 | 2,05 |
| EAI1-SOM | 3 | 1,02 |
| EAI5 | 39 | 13,31 |
| EAI6-BGD1 | 2 | 0,68 |
| EAI8-MDG | 2 | 0,68 |
| EAI (or EAI_Like) | 3 | 1,02 |
| H1 | 2 | 0,68 |
| H3 | 5 | 1,71 |
| LAM6 | 2 | 0,68 |
| LAM9 | 9 | 3,07 |
| LAM11-ZWE | 32 | 10,92 |
| Manu_ancestor | 1 | 0,34 |
| Manu1 | 2 | 0,68 |
| Manu2 | 7 | 2,39 |
| T1 | 29 | 9,90 |
| T2-uganda | 19 | 6,48 |
| T3 | 4 | 1,37 |
| T3-ETH | 11 | 3,75 |
| T (or T_Like) | 1 | 0,34 |
| X2 | 1 | 0,34 |
Descriptive statistics on age of patients in this study.
| Count | 293 |
| Mean | 35,11 |
| Sample variance | 101,56 |
| Sample standard deviation | 10,08 |
| Minimum | 18 |
| Maximum | 75 |
| Range | 57 |
| 1st quartile | 27,00 |
| Median | 35,00 |
| 3rd quartile | 41,00 |
| Interquartile range | 14,00 |
| Mode | 39,00 |
No significant statistical differences were noted when comparing lineage distribution vs. gender/sex of the patients (p-value = 0.653).
Distribution of Spoligotype International Types (SIT) in this study.
| SIT | Spoligotype pattern (Octal format) | Number in study (%) | % in study vs. Database | Lineage | Unique or Clustered SIT |
|---|---|---|---|---|---|
| 1 | 000000000003771 | 12 (4.1) | 0.11 | Beijing | Clustered |
| 4 | 000000007760771 | 1 (0.34) | 0.26 | Unknown | Unique |
| 8 | 400037777413771 | 12 (4.1) | 6 | EAI5 | Clustered |
| 10 | 477777277413771 | 1 (0.34) | 1.12 | EAI8-MDG | Unique |
| 21 | 703377400001771 | 76 (25.94) | 15.97 | CAS1-Kili | Clustered |
| 22 | 703777400001771 | 1 (0.34) | 1.08 | CAS1-Delhi | Unique |
| 25 | 703777740003171 | 3 (1.02) | 0.45 | CAS1-Delhi | Clustered |
| 26 | 703777740003771 | 14 (4.78) | 0.8 | CAS1-Delhi | Clustered |
| 34 | 776377777760771 | 1 (0.34) | 0.11 | S | Unique |
| 36 | 777737777720771 | 1 (0.34) | 0.63 | H3 | Unique |
| 37 | 777737777760771 | 4 (1.37) | 0.73 | T3 | Clustered |
| 42 | 777777607760771 | 6 (2.05) | 0.17 | LAM9 | Clustered |
| 48 | 777777777413731 | 1 (0.34) | 0.22 | EAI1-SOM | Unique |
| 50 | 777777777720771 | 3 (1.02) | 0.07 | H3 | Clustered |
| 53 | 777777777760771 | 15 (5.12) | 0.23 | T1 | Clustered |
| 59 | 777777606060771 | 22 (7.51) | 4.54 | LAM11-ZWE | Clustered |
| 62 | 777777774020731 | 1 (0.34) | 0.17 | H1 | Unique |
| 64 | 777777607560771 | 2 (0.68) | 0.48 | LAM6 | Clustered |
| 73 | 777737777760731 | 1 (0.34) | 0.34 | T | Unique |
| 126 | 477777777413771 | 18 (6.14) | 12.08 | EAI5 | Clustered |
| 128 | 637775777760730 | 3 (1.02) | 8.57 | T2-uganda | Clustered |
| 135 | 777777777760730 | 4 (1.37) | 11.43 | T2-uganda | Clustered |
| 137 | 777776777760601 | 1 (0.34) | 0.1 | X2 | Unique |
| 149 | 777000377760771 | 4 (1.37) | 0.82 | T3-ETH | Clustered |
| 244 | 777777777760601 | 5 (1.71) | 4.03 | T1 | Clustered |
| 245 | 777777777760671 | 1 (0.34) | 4.55 | T1 | Unique |
| 281 | 777775777760771 | 1 (0.34) | 3.13 | T1 | Unique |
| 288 | 700377740003771 | 2 (0.68) | 1.27 | CAS2 | Clustered |
| 345 | 777000377760731 | 7 (2.39) | 33.33 | T3-ETH | Clustered |
| 420 | 637774777760730 | 6 (2.05) | 24 | T2-uganda | Clustered |
| 458 | 777777777403771 | 1 (0.34) | 2.7 | EAI5 | Unique |
| 486 | 703777740000371 | 1 (0.34) | 1.75 | CAS | Unique |
| 522 | 777777777760770 | 1 (0.34) | 6.25 | T1 | Unique |
| 523 | 777777777777771 | 1 (0.34) | 1.69 | Manu_ancestor | Unique |
| 702 | 700775747413771 | 1 (0.34) | 2.86 | EAI6-BGD1 | Unique |
| 727 | 777737774020731 | 1 (0.34) | 2.44 | H1 | Unique |
| 815 | 777777606060731 | 2 (0.68) | 1.34 | LAM11-ZWE | Clustered |
| 891 | 777777607660771 | 2 (0.68) | 6.67 | LAM9 | Clustered |
| 1090 | 077777777413771 | 2 (0.68) | 22.22 | EAI5 | Clustered |
| 1192 | 777777677763771 | 1 (0.34) | 10 | Manu2 | Unique |
| 1369 | 477777777413671 | 1 (0.34) | 12.5 | EAI5 | Unique |
| 1549 | 775777606060731 | 2 (0.68) | 18.18 | LAM11-ZWE | Clustered |
| 1765 | 703337400001771 | 1 (0.34) | 14.29 | CAS1-Kili | Unique |
| 1801 | 777777777413730 | 1 (0.34) | 25 | EAI1-SOM | Unique |
| 1824 | 760377777760701 | 2 (0.68) | 25 | T1 | Clustered |
| 1957 | 477777777013771 | 1 (0.34) | 6.67 | EAI5 | Unique |
| 2196 | 777775606060731 | 6 (2.05) | 30 | LAM11-ZWE | Clustered |
| 2267 | 701377400001771 | 1 (0.34) | 25 | CAS | Unique |
| 2269 | 703377400000771 | 1 (0.34) | 20 | CAS | Unique |
| 2364 | 703601740003771 | 1 (0.34) | 25 | CAS1-Delhi | Unique |
| 2391 | 703377400003771 | 1 (0.34) | 11.11 | CAS1-Kili | Unique |
| 2484 | 737377677760771 | 1 (0.34) | 25 | T1 | Unique |
| 2736 | 777777777760741 | 1 (0.34) | 25 | T1 | Unique |
| 3880 | 437774777760730 | 2 (0.68) | 50 | T2-uganda | Clustered |
| 4056 | 477777777413411 | 2 (0.68) | 100 | EAI5 | Clustered |
| 4057 | 777017777760771 | 1 (0.34) | 50 | T1 | Unique |
| 4058 | 477777777413371 | 1 (0.34) | 50 | EAI5 | Unique |
* A total of 54/57 SITs containing 264 isolates matched a preexisting shared-type in the database, whereas 3/57 SITs (n = 4 isolates) were newly created. A total of 27/57 SITs containing 238 isolates were clustered within this study (2 to 76 isolates per cluster) while 30/57 SITs containing 30 isolates were unique (for total unique strains, one should add to this number the 25 orphan strains, which brings the number of unclustered isolates in this study to 55/293 or 18.77%, and clustered isolates to 238/293 or 81.23%). Note that SITs followed by an asterisk indicates "newly created” SITs due to 2 or more strains belonging to an identical new pattern within this study or after a match with an orphan in the database; SIT designations followed by number of strains: 4056* this study n = 2; 4057* this study n = 1, SDN n = 1; 4058* this study n = 1, NLD n = 1.
** Lineage designations according to SITVIT2; “Unknown” designates patterns with signatures that do not belong to any of the major lineages described in the database.
*** Clustered strains correspond to a similar spoligotype pattern shared by 2 or more strains “within this study”; as opposed to unique strains harboring a spoligotype pattern that does not match with another strain from this study. Unique strains matching a preexisting pattern in the SITVIT2 database are classified as SITs, whereas in case of no match, they are designated as “orphan”.
Distribution of phylogenetical lineages vs. demographic characteristics of patients and cities of isolation.
| Lineage | Female | Male | Mean age [Min—Max] | SD | HIV– | HIV+ | Bunda | Dar es Salaam | Ngoron-goro | Serengeti |
|---|---|---|---|---|---|---|---|---|---|---|
| Beijing | 3 | 9 | 34.42 [19–51] | 9.64 | 7 | 1 | 3 | 4 | 1 | 4 |
| CAS | 49 | 56 | 36.62 [21–75] | 11.15 | 54 | 3 | 11 | 48 | 34 | 12 |
| EAI | 25 | 24 | 31.27 [19–54] | 8.85 | 32 | 2 | 20 | 15 | 10 | 4 |
| Haarlem | 3 | 4 | 38.14 [30–45] | 4.78 | 6 | 1 | 1 | 0 | 4 | 2 |
| LAM | 22 | 21 | 34.07 [18–53] | 8.63 | 32 | 6 | 14 | 5 | 18 | 6 |
| Manu | 4 | 6 | 34.00 [22–53] | 9.49 | 10 | 0 | 1 | 0 | 9 | 0 |
| S | 1 | 0 | 41.00 [41–41] | _ | 0 | 1 | 1 | 0 | 0 | 0 |
| T | 28 | 36 | 36.28 [20–63] | 10.20 | 50 | 7 | 26 | 7 | 15 | 16 |
| X | 1 | 0 | 37.00 [37–37] | _ | 1 | 0 | 0 | 0 | 1 | 0 |
| Unknown | 1 | 0 | 25.00 [25–25] | _ | 1 | 0 | 0 | 0 | 1 | 0 |
SD = Standard deviation