| Literature DB >> 33364506 |
Pornpen Tantivitayakul1, Tada Juthayothin2, Wuthiwat Ruangchai3, Nat Smittipat2, Areeya Disratthakit4, Surakameth Mahasirimongkol4, Katsushi Tokunaga5, Prasit Palittapongarnpim3,2.
Abstract
One-third of the world population is infected by Mycobacterium tuberculosis, which may persist in the latent or dormant state. Bacteria can shift to dormancy when encountering harsh conditions such as low oxygen, nutrient starvation, high acidity and host immune defenses. Genes related to the dormancy survival regulator (DosR) regulon are responsible for the inhibition of aerobic respiration and replication, which is required to enter dormancy. Conversely, resuscitation-promoting factor (rpf) proteins participate in reactivation from dormancy and the development of active tuberculosis (TB). Many DosR regulon and rpf proteins are immunodominant T cell antigens that are highly expressed in latent TB infection. They could serve as TB vaccine candidates and be used for diagnostic development. We explored the genetic polymorphisms of 50 DosR-related genes and 5 rpf genes among 1,170 previously sequenced clinical M. tuberculosis genomes. Forty-three lineage- or sublineage-specific nonsynonymous single nucleotide polymorphisms (nsSNPs) were identified. Ten nsSNPs were specific to all Mtb isolates belonging to lineage 1 (L1). Two common sublineages, the Beijing family (L2.2) and EAI2 (L1.2.1), differed at as many as 26 lineage- or sublineage-specific SNPs. DosR regulon genes related to membrane proteins and the rpf family possessed mean dN/dS ratios greater than one, suggesting that they are under positive selection. Although the T cell epitope regions of DosR-related and rpf antigens were quite conserved, we found that the epitopes in L1 had higher rates of genetic polymorphisms than the other lineages. Some mutations in immunogenic epitopes of the antigens were specific to particular M. tuberculosis lineages. Therefore, the genetic diversity of the DosR regulon and rpf proteins might impact the adaptation of M. tuberculosis to the dormant state and the immunogenicity of latency antigens, which warrants further investigation.Entities:
Keywords: DosR-related proteins; Genetics; Infectious disease; Lineage-specific SNP; Microbiology; Molecular biology; Mycobacterium tuberculosis; T cell epitopes; rpf proteins
Year: 2020 PMID: 33364506 PMCID: PMC7753917 DOI: 10.1016/j.heliyon.2020.e05744
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1a) Ratios of the numbers of SNPs (nsSNPs and sSNPs) occurring in the 50 DosR-related and 5 rpf genes of each Mtb isolate per total number of SNPs present in the same genome. A total of 1,170 Mtb isolates were categorized as L1-L4. The asterisk indicates that the average ratio of Mtb L1 was significantly higher than that of the other lineages (Kruskal-Wallis test, p < 0.001), while b) the ratios of the nsSNPs of each isolate to the total SNPs in the same isolate of L1 and L4 were significantly higher than those of L2 and L3 (Kruskal-Wallis test, p < 0.001).
The distribution of total SNPs and LS-SNPs in 50 DosR-related genes and 5 rpf genes among 4 major Mtb lineages. The percentages of LS-SNPs were calculated by dividing the number of LS-SNPs by the total number of SNPs identified in each lineage.
| SNP types | Lineage (no. of Mtb isolates) | |||
|---|---|---|---|---|
| L1 (480) | L2 (521) | L3 (11) | L4 (158) | |
| - | 531 | 227 | 20 | 159 |
| - | 57 | 15 | 4 | 14 |
| - | 12 (2.3%) | 0 | 1 (5%) | 1 (0.6%) |
| Number of SNPs specific to major lineages or common sublineages in intergenic region | 3 | 0 | 1 | 1 |
| - | 45 (8.5%) | 15 (6.6%) | 3 (15%) | 13 (8.2%) |
| Number of LS-nonsynonymous SNPs/LS-synonymous SNPs | 29/16 | 7/8 | 1/2 | 6/7 |
| - | 21 (3.9%) | 7 | 3 | 5 (3.1%) |
| Number of LS-Nonsynonymous SNPs/LS-Synonymous SNPs | 12/9 | 2/5 | 1/2 | 3/2 |
SNPs residing within 500 bp upstream of the start codons from annotated genes were identified as intergenic SNPs.
10 SNPs were found in more than one Mtb lineages.
A synonymous SNP in a coding region was present in all isolates of both L2 and L3.
represents major lineages (L1, L2, L3, L4) and common Mtb sublineages which comprise more than 90 Mtb isolates including L1.1.1 (269), L1.2.1 (108) and L4.5 (93).
Lists of lineage-specific SNPs in intergenic regions and LS-nonsynonymous SNPs in coding regions.
| SNP position | REF | ALT | No. of isolates | Lineage | Changing nucleotide sequence at |
|---|---|---|---|---|---|
| 2238930 | A | C | 480 | L1 | Primary DosR binding site upstream |
| 2240062 | C | G | 480 | L1 | Primary DosR binding site upstream |
| 2056184 | G | A | 11 | L3 | 72 bp upstream to start codon of |
| 2953307 | C | A | 480 | L 1∗ | 314 bp upstream to start codon of |
REF = nucleotide sequence of Mtb reference H37Rv strain.
ALT = nucleotide sequence changing from Mtb reference H37Rv strain.
NT change = nucleotide sequence changing from Mtb reference H37Rv strain.
AA change = amino acid sequence changing from Mtb reference H37Rv strain.
Mutational effect on protein function was predicted by three programs, polyphen-1 (PP-1), SIFT, SNAP.
∗corresponds to SNPs specific to Ancestral Mtb strains including Mtb L1, M. africanum L5 and L6.
∗SNPs specific to Ancestral Mtb lineages including Mtb L1, M. africanum L5 and L6.
represents major lineages (L1, L2, L3, L4) and common Mtb sublineages which comprise more than 90 Mtb isolates including L1.1.1 (269), L1.2.1 (108) and L4.5 (93).
Figure 2A diagram showing the proposed evolutionary path of the M. tuberculosis complex (Comas et al., 2013). The ancestral Mtb lineages comprise L1 (Indo-Oceanic) and M. africanum L5 and L6. The modern lineages are characterized by the loss of the TbD1 segment, and they branched into L4 (Euro-American), L2 (Beijing family) and L3 (CAS). The asterisk represents the loss of the TbD1 segment and the presence of the 6 SNPs that were different between L2-L4 and L1, L5 and L6, while the circle represents the 10 truly L1-specific SNPs.
Figure 3A diagram of ctpF, which is composed of 905 amino acids with 10 transmembrane segments (TM), as shown in the black bar. The DosR binding site upstream of the start codon was mapped as a dotted line. The locations of LS-SNPs affecting the DosR binding site and the amino acid sequence of ctpF are shown. The asterisks indicate SNPs specific to all L1 isolates in the DosR binding site and the TM10 region. The triangles represent the sublineage-specific nsSNPs of L1, and the circles represent the sublineage-specific SNPs of L4.5.3.
Figure 4a) dN/dS ratios of 5 rpf genes and 50 DosR-related genes belonging to 10 functional categories (Singh et al., 2014) including i) host-pathogen interactions (HP, n = 2), ii) uncharacterized proteins (CHP, n = 11), iii) membrane proteins (MP, n = 4), iv) nitrogen metabolism (NM, n = 5), v) nucleotide metabolism and repair (NMR, n = 4), vi) protein synthesis and cell wall synthesis (PROT, n = 2), vii) redox balance metabolism and energy (RD, n = 11), viii) resuscitation-promoting factor (RF, n = 5), ix) sensor kinases and transcription regulators (TR, n = 4), and x) universal stress proteins (SP, n = 7). The bars show the average dN/dS ratios. b) Pairwise dN/dS ratios of 50 DosR-related genes and 5 rpf genes among 4 major Mtb lineages. ∗represents a significant difference with p < 0.001, as calculated by the nonparametric Kruskal-Wallis test.
Nonsynonymous LS-SNPs affecting experimentally-proven T cell epitopes.
| Gene (Rv) | Epitope | IEDB_ID | Epitope sequences | LS-SNP (No. isolates) | NT change | AA change |
|---|---|---|---|---|---|---|
| T | 177591 | AGTAV | All Lineages except L4.8 | G204T | Gln68His | |
| T | 154515 | EPEQLAAAHE | L3 (11) | C661T | Leu221Phe | |
| T | 155102 | |||||
| T | 38529 | LPIARPTIA | L4.2 (10) | T178G | Leu60Val | |
| T | 4520 | ASLEEGLACAIL | L1.2.2 (18) | G878T | Gly293Val | |
| T | 106585 | KVQSATIYQVTDR | L1 (480) | C176T | Ser59Leu | |
| T | 229352 | LPVANVVVTPAHEA | L2.2.1.1 (22) | G793A | Val265Met |
Figure 5Comparison of the dN/dS ratios of a) concatenated T cell epitope and b) concatenated non-T cell epitope regions in 27 dormancy-related and 3 rpf antigens among isolates belonging to 4 Mtb lineages. Statistical analysis of the dN/dS values of T cell epitopes and non-T cell epitopes was performed with the nonparametric Kruskal-Wallis test. Only in L1 were the dN/dS ratios of T cell epitopes significantly higher than those of non-T cell epitopes (Kruskal-Wallis test, p < 0.001).