| Literature DB >> 26035295 |
Catarina Lopes Santos1, Hanna Nebenzahl-Guimaraes2, Marta Vaz Mendes3, Dick van Soolingen4, Margarida Correia-Neves1.
Abstract
Tuberculosis presents a myriad of symptoms, progression routes and propagation patterns not yet fully understood. Whereas for a long time research has focused solely on the patient immunity and overall susceptibility, it is nowadays widely accepted that the genetic diversity of its causative agent, Mycobacterium tuberculosis, plays a key role in this dynamic. This study focuses on a particular family of genes, the mclxs (Mycobacterium cyclase/LuxR-like genes), which codify for a particular and nearly mycobacterial-exclusive combination of protein domains. mclxs genes were found to be pseudogenized by frameshift-causing insertion(s)/deletion(s) in a considerable number of M. tuberculosis complex strains and clinical isolates. To discern the functional implications of the pseudogenization, we have analysed the pattern of frameshift-causing mutations in a group of M. tuberculosis isolates while taking into account their microbial-, patient- and disease-related traits. Our logistic regression-based analyses have revealed disparate effects associated with the transcriptional inactivation of two mclx genes. In fact, mclx2 (Rv1358) pseudogenization appears to be primarily driven by the microbial phylogenetic background, being mainly related to the Euro-American (EAm) lineage; on the other hand, mclx3 (Rv2488c) presents a higher tendency for pseudogenization among isolates from patients born on the Western Pacific area, and from isolates causing extra-pulmonary infections. These results contribute to the overall knowledge on the biology of M. tuberculosis infection, whereas at the same time launch the necessary basis for the functional assessment of these so far overlooked genes.Entities:
Mesh:
Year: 2015 PMID: 26035295 PMCID: PMC4452763 DOI: 10.1371/journal.pone.0128983
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Pairwise identity and indels in the mclx genes in a panel of 150 Mtb complex strains/clinical isolates .
| Organism | Lineage |
|
|
|
|---|---|---|---|---|
| Maf GM041182 | WA-2 |
| 99.90% | 99.90% |
| Maf K85 | WA-2 |
| 99.90% | 99.97% |
| Mbv 04–303 | - | 99.97% | 99.90% | 100.00% |
| Mbv AF2122/97 | - | 99.90% | 99.90% | 100.00% |
| Mbv AN5 | - | 100.00% | 99.90% | 100.00% |
| Mbv BCG ATCC 35733 | - | 100.00% | 99.90% | 99.97% |
| Mbv BCG ATCC 35740 | - | 99.90% | 99.90% | 99.97% |
| Mbv BCG China | - | 100.00% | 99.90% |
|
| Mbv BCG Frappier | - | 100.00% | 99.90% | 99.97% |
| Mbv BCG Korea 1168P | - | 100.00% | 99.90% | 99.97% |
| Mbv BCG Mexico | - | 100.00% | 99.90% | 99.97% |
| Mbv BCG Moreau | - | 100.00% | 99.90% | 99.97% |
| Mbv BCG Pasteur 1173P2 | - | 100.00% | 99.90% | 99.90% |
| Mbv BCG Tokyo 172 | - | 100.00% | 99.90% | 99.97% |
| Mcn CIPT 140010059 | - | 99.90% | 99.90% | 93.40%—ins—2519–2521 (CCA) |
| Mcn CIPT 140060008 | - | 99.80% | 99.60% | 93.50%—ins—2521–2523 (ACC) |
| Mcn CIPT 140070002 | - | 99.90% | 99.90% | 98.90% |
| Mcn CIPT 140070005 | - | 99.70% | 99.60% | 98.60% |
| Mcn CIPT 140070007 | - | 99.30% | 99.50% | 99.00% |
| Mcn CIPT 140070008 | - | 99.70% | 99.90% | 99.10% |
| Mcn CIPT 140070013 | - | 99.30% | 98.50% | 98.80% |
| Mcn CIPT 140070017 | - | 99.00% | 99.20% | 97.00%—ins—2531–2533 (GCC) |
| Mtb '98-R604 INH-RIF-EM' | EAm | 100.00% |
| 99.97%—del—2716 (A) |
| Mtb 02_1987 | EAs | 99.90% | 99.97% | 100.00% |
| Mtb 1034 | EAs | 99.97% | 99.97% | 100.00% |
| Mtb 210 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb 43–16836 | IO | 100.00% | 99.97% |
|
| Mtb 7199–99 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb BT1 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb BT2 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb BTB05-552 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb BTB05-559 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb C | EAm | 99.50%—ins—3195 (T); ins—3211 (G); ins—3218 (G) | 99.90% | 99.90%—ins—2919 (G); ins—2992 (G) |
| Mtb CAS/NITR204 | EAI |
| 99.20%- 13 del |
|
| Mtb CCDC5079 | EAs | 99.97% | 99.90%—del—2523–2525 (CGA) | 99.97% |
| Mtb CCDC5180 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb CDC1551 | EAm | 100.00% | 99.96% | 100.00% |
| Mtb CDC1551A | 100.00% | 99.97% | 100.00% | |
| Mtb CTRI-2 | EAm | 100.00% |
| 100.00% |
| Mtb EAI5 | 100.00% | 99.90% |
| |
| Mtb EAI5/NITR206 | IO | 99.90% | 99.97% | 100.00% |
| Mtb EAS054 | IO | 99.97% | 99.97% |
|
| Mtb F11 | EAm | 99.97% |
| 100.00% |
| Mtb FJ05194 | EAs | 100.00% | 99.90% | 100.00% |
| Mtb GuangZ0019 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb H37Ra | EAm | 100.00% | 100.00% | 100.00% |
| Mtb H37RvCO | EAm | 100.00% | 100.00% | 100.00% |
| Mtb HKBS1 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb HM | EAm | 99.97% |
| 99.00%—ins—1921–1953 (TG. . .TG) |
| Mtb HN878 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb INS_MDR | EAm | 100.00% | 99.97% | 100.00% |
| Mtb INS_SEN | EAm | 100.00% | 99.97% | 100.00% |
| Mtb INS_XDR | EAm | 100.00% | 99.97% | 100.00% |
| Mtb KZN 1435 | EAm | 100.00% |
| 99.97% |
| Mtb KZN 4207 | EAm | 100.00% |
| 99.97% |
| Mtb KZN 605 | EAm | 100.00% |
| 99.97% |
| Mtb KZN R506 | EAm | 100.00% |
| 99.97% |
| Mtb KZN V2475 | EAm | 100.00% |
| 99.97% |
| Mtb MTB-489 | ? | 99.97% | 100.00% | 99.97% |
| Mtb NA-A0008 | ? |
| 99.90% |
|
| Mtb NA-A0009 | ? |
| 99.97% |
|
| Mtb NCGM2209 | EAs | 99.97% | 99.97% | 100.00% |
| Mtb OSDD071 | EAI | 100.00% | 99.97% | 99.90% |
| Mtb OSDD105 | EAm | 100.00% | 99.90% |
|
| Mtb OSDD493 | EAm |
| 99.97% | 100.00% |
| Mtb PanR0201 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0202 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb PanR0205 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0206 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb PanR0207 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0208 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb PanR0209 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0304 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0305 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb PanR0306 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0307 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0308 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0309 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb PanR0313 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0314 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0315 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb PanR0316 | EAm | 100.00% | 99.97% | 99.97% |
| Mtb PanR0317 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0401 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0402 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0403 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0404 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0405 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0409 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0410 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0411 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0412 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0501 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0503 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0505 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0602 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0603 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0604 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0605 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb PanR0606 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb PanR0607 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0609 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0610 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0611 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0702 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0703 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0704 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0707 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0708 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0801 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb PanR0802 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0803 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0804 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0805 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0902 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb PanR0903 | EAm | 100.00% |
| 100.00% |
| Mtb PanR0904 | EAm | 100.00% |
| 99.97% |
| Mtb PanR0906 | EAm | 99.97% |
| 99.97% |
| Mtb PanR0907 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb PanR0909 | EAm | 100.00% |
| 99.97% |
| Mtb PanR1005 | EAm | 100.00% |
| 99.97% |
| Mtb PanR1006 | EAm | 100.00% |
| 99.97% |
| Mtb PanR1007 | EAm | 99.97% |
| 99.97% |
| Mtb PanR1101 | EAm | 100.00% |
| 100.00% |
| Mtb PR05 | ? | 100.00% | 99.97% |
|
| Mtb R1207 | EAs | 99.90% | 99.97% | 100.00% |
| Mtb RGTB327 | EAm |
|
|
|
| Mtb RGTB423 | IO | 99.97%—ins—2821 (C) |
|
|
| Mtb S96-129 | EAm | 100.00% | 99.97% | 100.00% |
| Mtb Beijing/NITR203 | EAs | 99.90% | 99.90% | 99.97% |
| Mtb Erdman = ATCC 35801 | EAm | 100.00% | 99.90% | 100.00% |
| Mtb Haarlem | EAm | 99.10%- 7 ins | 99.90% |
|
| Mtb OSDD515 | EAm | 100.00% | 99.97% |
|
| Mtb SUMu001 | ? |
| 100.00% | 100.00% |
| Mtb SUMu002 | ? |
| 99.97% | 99.97%—ins—2487 (A) |
| Mtb SUMu003 | ? | 100.00% | 99.97% | 100.00% |
| Mtb SUMu004 | ? | 100.00% | 99.97% | 100.00% |
| Mtb SUMu005 | ? | 100.00% | 99.97% | 100.00% |
| Mtb SUMu006 | ? | 100.00% | 99.97% | 100.00% |
| Mtb SUMu008 | ? |
| 99.97% | 100.00% |
| Mtb SUMu010 | ? | 100.00% | 100.00% | 100.00% |
| Mtb SUMu011 | ? | 100.00% | 100.00% | 100.00% |
| Mtb SUMu012 | ? |
| 100.00% | 100.00% |
| Mtb UM 1072388579 | ? | 100.00% | 99.90% | 100.00% |
| Mtb UT205 | EAm | 100.00% |
| 100.00% |
| Mtb W-148 | EAs | 99.97% | 99.90%—del—2523–2525 (CGA) | 99.97% |
| Mtb WX3 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb X122 | EAs | 99.97% | 99.90%—del—2523–2525 (CGA) | 99.97% |
| Mtb XDR1219 | EAs | 99.97% | 99.97% | 99.97% |
| Mtb XDR1221 | EAs | 99.90% | 99.97% | 100.00% |
1The pairwise identity refers to the % of conserved residues of each gene after aligning it with its orthologue from the reference strain M. tuberculosis H37Rv; the putative pseudogenes are highlighted in bold; indel ocurrences are described by their type (ins, insertion; del, deletion) and by their location (considering an alignment with the reference mclx genes from the Mtb H37Rv strain), except when the total number of isolated occurences exceeds 3, in which case only their frequency is indicated; Maf, Mycobacterium africanum; Mvb, Mycobacterium bovis; Mcn, Mycobacterium cannetti; Mtb, Mycobacterium tuberculosis; WA-2, West-African 2; EAm, Euro-American lineage; EAs, East-Asian lineage; IO, Indo-Oceanic; EAI, East-African Indian lineage
*, indicates that the information on the lineage was obtained by aligning the H37Rv RD with the genome/scaffolds of the respective organism.
Fig 1Relative frequency of the organisms harbouring pseudogenes in each of the mclx genes.
The percentages are relative to the total number of strains/organisms listed in Table 1, and are stratified according to the three main species: Mtb (Mycobacterium tuberculosis), Mbv (Mycobacterium bovis) and Maf (Mycobacterium africanum).
Univariate ORs for mclx2 and mclx3 pseudogenization (p values <0.2 are highlighted).
| univariate ORs (95% CI) | ||||
|---|---|---|---|---|
| independent variables |
|
|
| |
|
| ||||
|
| 126 |
| 1.008 (0.983–1.033); | |
|
|
|
| ||
| female | 47 | 1.255 (0.507–3.106) |
| |
| male | 79 | 1 (ref) |
| |
|
|
|
| ||
| Africa | 23 | 0.679 (0.216–2.137); |
| |
| The Americas | 12 | 0.385 (0.073–2.022); |
| |
| Eastern Mediterranean | 23 | 0.288 (0.072–1.154); |
| |
| Europe | 38 | 1 (ref) |
| |
| South East Asia | 17 | 0.000 (0.000-.); |
| |
| Western Pacific | 13 | 0.000 (0.000-.); |
| |
|
|
|
| ||
| native dutch | 28 |
| 0.626 (0.215–1.823) | |
| foreign-born | 97 |
| 1 (ref) | |
|
|
|
| ||
| rural | 81 | 1 (ref) |
| |
| urban | 45 | 0.878 (0.343–2.248) |
| |
|
|
|
| ||
| no | 27 | 1.306 (0.385–4.431) | 0.842 (0.291–2.433) | |
| yes | 39 | 1 (ref) | 1 (ref) | |
|
|
|
| ||
| no or unknown | 109 | 1 (ref) | 1 (ref) | |
| yes | 18 | 0.494 (0.106–2.312) | 0.579 (0.156–2.148) | |
|
|
|
| ||
| no or unknown | 116 | 1 (ref) | 1 (ref) | |
| yes | 11 | 0.949 (0.192–4.707) | 0.287 (0.035–2.334) | |
|
|
|
| ||
| no or unknown | 123 | 1 (ref) | 1 (ref) | |
| yes | 4 | 1.499 (0.144–14.573) | 1.033 (0.104–10.310) | |
|
| ||||
|
|
| - | ||
| EAI | 18 | 0.107 (0.013–0.860); | - | |
| EAm | 65 | 1 (ref) | - | |
| EAs | 10 | 0.000 (0.000-.); | - | |
| IO | 31 | 0.000 (0.000-.); | 1 | |
|
|
|
| ||
| none or unknown | 118 | 1 (ref) | 1 (ref) | |
| resistant | 8 | 1.455 (0.275–7.696) | 1.023 (0.196–5.349) | |
|
|
|
| ||
| no | 49 |
|
| |
| yes | 78 |
|
| |
|
| ||||
|
|
|
| ||
| pulmonary TB | 82 | 1 (ref) |
| |
| extra-pulmonary TB | 24 | 0.589 (0.156–2.222); |
| |
| pulmonary and extra-pulmonary TB | 20 | 1.375 (0.435–4.343); |
| |
Multivariate logistic regression model for mclx2 pseudogenization (p values < 0.05 are highlighted).
| multivariate ORs (95% CI) | ||
|---|---|---|
|
| Model 1 | |
|
| ||
|
|
| |
|
| native dutch |
|
| foreign-born |
| |
|
| ||
|
| no | 1.918 (0.594–6.193); |
| yes | 1 (ref) | |
|
| 20.039/ | |
|
| 0.149 | |
|
| 0.242 | |
|
| 5.568/ | |
|
| 124 | |
Multivariate logistic regression models for mclx3 pseudogenization (p values < 0.05 are highlighted).
| multivariate ORs (95% CI) | ||||
|---|---|---|---|---|
|
| Model 1 | Model 2 | Model 3 | |
|
| ||||
|
| female | - | - | 0.532(0.149–1.893); |
| male | - | - | 1 (ref) | |
|
| rural | - | - | 1 (ref) |
| urban | - | - | 0.433 (0.118–1.593); | |
|
| - | - |
| |
| Africa | - | - |
| |
| The Americas | - | - |
| |
| Eastern Mediterranean | - | - |
| |
| Europe | - | - |
| |
| South East Asia | - | - |
| |
| Western Pacific | - | - |
| |
|
| ||||
|
| no | - | 0.761 (0.280–2.070); | 2.953 (0.648–13.458); |
| yes | - | 1 (ref) | 1 (ref) | |
|
| ||||
|
|
|
|
| |
| pulmonary TB |
|
|
| |
| extra-pulmonary TB |
|
|
| |
| pulmonary and extra-pulmonary TB |
|
|
| |
|
| 12.555/ | 12.843/ | 56.253/ | |
|
| 0.095 | 0.097 | 0.360 | |
|
| 0.141 | 0.144 | 0.536 | |
|
| 0.000/ | 0.081/ | 6.225/ | |
|
| 126 | |||
Fig 2A parsimony-based phylogenetic tree depicting the distribution of pseudogenization events across the 3 mclx genes in both epidemiologically characterized clinical isolates (n = 100; denoted in black) and publicly available NCBI strains (n = 100; denoted in red).
LSP-defined MTBC lineages/sublineages are colour-coded and indicated in the outer arc (Purple – Mycobacterium africanum; Brown—Mycobacterium bovis; blue—Euro-American lineage; red—East-Asian lineage; orange—Indo-Oceanic; green—East-African Indian lineage).