| Literature DB >> 29312206 |
Xuekai Xiong1,2, Rui Wang3, Dachuan Deng1,2, Yingyu Chen1,2, Han Liu1,2, Tianqi Wang1,2, Jieru Wang1,2, Xiaojie Zhu1,2, Xifang Zhu1,2, Yongqiang Zhu3, Xinyan Lu3, Huanchun Chen1,2, Huajun Zheng3, Aizhen Guo1,2.
Abstract
The Mycobacterium tuberculosis complex causes tuberculosis (TB) in humans and other animal species, but Mycobacterium tuberculosis has a distinct host preference to humans. The present study aimed to determine whether a bovine M. tb strain 1458 has evolved some genetic properties in their genome that might be associated with their bovine adaptation. The genome of the M. tb strain 1458 was sequenced and subjected to an extensive comparative genomic analysis. A phylogenetic analysis showed that strain 1458 is most closely related to a Chinese M. tb strain, CCDC5079, of the same Beijing family. Compared with three human M. tb Beijing family strains, the strain 1458 has the fewest unique genes. However, there are most (21) IS6110 insertion sequences in the strain 1458 genome at either intragenic or intergenic sites, resulting in the interruption of 11 genes including three PPE family-encoding genes (PPE16, PPE38, and PPE59). Only the strain 1458 genome has the upstream insertion in esxS and phoP genes. PCR confirmed four upstream insertions and qPCR determined that transcription of esxS, phoP, dnaN, and ctpD genes differed significantly between M. tb strain 1458 and H37Rv or M. bovis. A Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis revealed that the genes affected by non-synonymous SNPs are enriched in RNA polymerase. Moreover, 127 of the 133 unique SNPs in strain 1458 are either different to those in the M. bovis genome. In conclusion, some critical genes responsible for bacterial virulence and immunogenicity were interrupted in the genome of bovine M. tb strain 1458 by IS insertions and non-synonymous SNPs, which might contribute to its bovine adaptation, and the modification of its virulence and immunogenicity in cattle.Entities:
Keywords: Mycobacterium tuberculosis; cattle; genome; sequencing; tuberculosis; zoonosis
Year: 2017 PMID: 29312206 PMCID: PMC5733104 DOI: 10.3389/fmicb.2017.02500
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
The PCR primer sequences in this study.
| esxS | F | GGTGCCAGACATCGACTGAT | 1,784/429 |
| R | CAGGCGTTTCATCAGGGAGA | ||
| phoP | F | GCATCACCCAACGCTTGTTT | 1,740/387 |
| R | ACCGACAGCAGTTCAACGAT | ||
| dnaN | F | CCAGTCACGACAGATTGCGA | 1,739/386 |
| R | TGACTGTGGGGTGTGTGTTG | ||
| ctpD | F | GTTCCTGCGTCCCTACACTC | 1,702/349 |
| R | CATCAAGTGCCTTGTTCCGC |
The qPCR primer sequences in this study.
| esxS | F | TGTTGGATGCCCATATTCC | 106 |
| R | GACATCGCCTGCTGCTC | ||
| phoP | F | CGGCGTTGTTCCTGAC | 154 |
| R | TCCTTGTTGCCCTTGC | ||
| dnaN | F | TGTCGTGGGTGGCTAAA | 162 |
| R | AAACGCTTCCAGGAGAAA | ||
| ctpD | F | TGAACGGATCGGGTGTATT | 154 |
| R | ATGCCCAGGGAGTAGCG |
Figure 1Chromosome Atlas of M. tb 1458. The scale is shown by the outer black circle. Moving inside, the 1st and 2nd circles illustrate predicted coding sequences on the plus and minus strands respectively, colored according to different functional categories. The 3rd circle displays IS6110 elements (red). The 4th circles tRNAs (blue) and ribosomal RNA genes (gray). The 5th and 6th (innermost) circles represent mean centered G+C content of the genome (red-above mean, blue-below mean) and GC skew (G−C)/(G+C), respectively, calculated by using a 1-kb window in steps of 500 bp.
Figure 2COG annotation M. tb 1458 genome.
Figure 3Phylogenetic tree of sequenced members of Mycobacterium tuberculosis complex (MTBC).
Eleven intragenic IS6110s distributed within ORFs in M. tb 1458.
| 1070104–1071458 | BTB1458_1064, BTB1458_1067 | lprp | CAGGC |
| 1259595–1260949 | BTB1458_1273 | PPE16 | AAGC |
| 1542095–1543449 | BTB1458_1535, BTB1458_1538 | Desaturase-related protein | GAGG |
| 1716764–1718118 | BTB1458_1711 | mmpl12 | – |
| 2247431–2248785 | BTB1458_2235, BTB1458_2238 | Hypothetical protein | TCAAGG |
| 2621110–2622464 | BTB1458_2606, BTB1458_2609 | PPE38 | – |
| 3102221–3103575 | BTB1458_3111 | Hypothetical protein | – |
| 3354543–3355897 | BTB1458_3335 | Hypothetical protein | GGC |
| 3472672–3474026 | BTB1458_3459 | Hypothetical protein | CGA |
| 3781730–3783084 | BTB1458_3751, BTB1458_3754 | idsB | AATC |
| 3834127–3835481 | BTB1458_1084 | PPE59 | – |
Ten intergenic IS6110s distributed between ORFs in M. tb 1458.
| 1595–2949 | + | BTB1458_0004 | dnaN | 460 bp upstream | ATT |
| 845802–847156 | – | BTB1458_0836 | phoP | 132 bp upstream | GTAT |
| 884675–886029 | + | BTB1458_0879 | Hypothetical protein | 39 bp upstream | CCA |
| 1656829–1658183 | – | BTB1458_1647 | ctpD | 61 bp upstream | CGTT |
| 1977846–1979200 | – | BTB1458_1950 | Hypothetical protein | 313 bp upstream | – |
| 2355774–2357128 | + | BTB1458_2341 | PE22 | 539 bp upstream | CACAT |
| 3356375–3357729 | + | BTB1458_3339 | esxS | 11 bp upstream | CAG |
| 3531885–3533239 | – | BTB1458_3520 | Alanine rich protein | 57 bp upstream | – |
| 3695205–3696559 | – | BTB1458_3692 | Hypothetical protein | 242 bp upstream | – |
| 3830213–3831567 | – | BTB1458_3805 | Conserved hypothetical protein | 172 bp upstream | – |
Figure 4IS6110 insertion in the upstream of esxS, phoP, dnaN, and ctpD genes of M. tb 1458 was confirmed with PCR. Lane 1, M. tb strain 1458; lane 2, H37Rv; lane 3, M. bovis AF2122_97.
Figure 5The copy numbers of esxS, phoP, dnaN, and ctpD transcripts in M. tb 1458, H37Rv and M. bovis AF2122_97. The copy numbers of four typical genes (esxS, phoP, dnaN, and ctpD) in M. tb strain 1458, H37Rv and M. bovis AF2122_97 were determined by absolute quantitative real-time PCR. The data shown are the mean ± SD of three replicates. * represents a significant difference (p < 0.05), and ** a very significant difference (p < 0.01).
Figure 6Orthologs of M. tb 1458, H37Rv and M. bovis AF2122_97. Numbers of orthologous genes of M. tb 1458, H37Rv and M. bovis AF2122_97 genomes were evaluated using the PGAP (http://pgap.sf.net). The core genome of these three strains consists of 3,562 genes, as M. tb 1458, H37Rv and M. bovis AF2122_97 common orthologs. Letters refer to Table S3 giving the list of orthologs.
The KEGG pathways enriched with non-synonymous SNPs affected genes in M. tb H37Rv genome.
| Carbohydrate metabolism | 48 | 210 | 181 | 1,084 | 0.00 | 0.06 |
| Biosynthesis of other secondary metabolites | 6 | 17 | 181 | 1,084 | 0.01 | 0.12 |
| Replication and repair | 13 | 52 | 181 | 1,084 | 0.04 | 0.21 |
| Transcription | 5 | 17 | 181 | 1,084 | 0.05 | 0.21 |
| Signal transduction | 12 | 53 | 181 | 1,084 | 0.09 | 0.30 |
| Membrane transport | 15 | 72 | 181 | 1,084 | 0.13 | 0.37 |
| Glycan biosynthesis and metabolism | 4 | 19 | 181 | 1,084 | 0.20 | 0.46 |
| Metabolism of terpenoids and polyketides | 25 | 134 | 181 | 1,084 | 0.22 | 0.46 |
| Metabolism of other amino acids | 14 | 80 | 181 | 1,084 | 0.35 | 0.66 |
| Lipid metabolism | 17 | 100 | 181 | 1,084 | 0.40 | 0.68 |
| Nucleotide metabolism | 14 | 85 | 181 | 1,084 | 0.45 | 0.70 |
| Amino acid metabolism | 41 | 259 | 181 | 1,084 | 0.63 | 0.79 |
| Energy metabolism | 20 | 135 | 181 | 1,084 | 0.69 | 0.79 |
| Folding, sorting and degradation | 6 | 47 | 181 | 1,084 | 0.69 | 0.79 |
| Xenobiotics biodegradation and metabolism | 29 | 181 | 181 | 1,084 | 0.56 | 0.79 |
| Metabolism of cofactors and vitamins | 17 | 131 | 181 | 1,084 | 0.86 | 0.92 |
| Translation | 7 | 129 | 181 | 1,084 | 1.00 | 1.00 |
A represents the number of genes with non-synonymous SNPs involved in individual pathways; B represents the number of genes included in individual pathways; C represents total amount of genes involved in KEGG pathway among the genes with non-synonymous SNPs; D represents the total amount of genes involved in KEGG pathway in M. tb H37Rv genome; Q-values is the adjusted P-value, usually Q-values < 0.05 is defined as KEGG pathway significantly enriched.
The KEGG pathways enriched with non-synonymous SNPs affected genes in M. bovis AF2122_97 genome.
| Carbohydrate metabolism | 60 | 189 | 246 | 949 | 0.02 | 0.15 |
| Transcription | 3 | 5 | 246 | 949 | 0.02 | 0.15 |
| Signal transduction | 19 | 55 | 246 | 949 | 0.05 | 0.29 |
| Biosynthesis of other secondary metabolites | 12 | 36 | 246 | 949 | 0.11 | 0.39 |
| Energy metabolism | 42 | 144 | 246 | 949 | 0.14 | 0.39 |
| Replication and repair | 16 | 51 | 246 | 949 | 0.14 | 0.39 |
| Membrane transport | 16 | 52 | 246 | 949 | 0.16 | 0.39 |
| Metabolism of other amino acids | 24 | 82 | 246 | 949 | 0.19 | 0.41 |
| Glycan biosynthesis and metabolism | 6 | 20 | 246 | 949 | 0.24 | 0.46 |
| Nucleotide metabolism | 24 | 87 | 246 | 949 | 0.30 | 0.52 |
| Metabolism of terpenoids and polyketides | 30 | 115 | 246 | 949 | 0.43 | 0.67 |
| Lipid metabolism | 26 | 108 | 246 | 949 | 0.63 | 0.83 |
| Folding, sorting and degradation | 11 | 50 | 246 | 949 | 0.68 | 0.83 |
| Amino acid metabolism | 52 | 215 | 246 | 949 | 0.71 | 0.83 |
| Xenobiotics biodegradation and metabolism | 24 | 105 | 246 | 949 | 0.74 | 0.83 |
| Metabolism of cofactors and vitamins | 31 | 138 | 246 | 949 | 0.81 | 0.87 |
| Translation | 14 | 84 | 246 | 949 | 0.97 | 0.97 |
A represents the number of genes with non-synonymous SNPs involved in individual pathways; B represents the number of genes included in individual pathways; C represents total amount of genes involved in KEGG pathway among the genes with non-synonymous SNPs; D represents the total amount of genes involved in KEGG pathway in M. bovis AF2122_97 genome; Q-values is the adjusted P-value, usually Q-values < 0.05 is defined as KEGG pathway significantly enriched.