Mahla Asadian, Seyed Mehdi Hassanzadeh, Azadeh Safarchi, Masoumeh Douraghi.
Abstract
BACKGROUND: Bacillus Calmette-Guérin (BCG) refers to a group of vaccine strains with unique genetic characteristics, and it is the only available vaccine for preventing tuberculosis (TB). Genetic and biochemical variation among BCG vaccine strains is considered one of the significant factors behind the vaccine's variable protective efficacy against pulmonary tuberculosis. To track genetic variations, two vaccine strains that are widely used according to the BCG World Atlas (Danish 1331 and Pasteur 1173P2) were compared against the Mycobacterium tuberculosis H37Rv, Mycobacterium bovis AF2122/97, and Mycobacterium tuberculosis variant bovis BCG str. Pasteur 1173P2 reference genomes. In addition, the presence or absence of experimentally verified human T cell epitopes was examined.
Keywords: BCG vaccine; Danish 1331; Genomic analysis; Pasteur 1173P2
Year: 2022 PMID: 35987561 PMCID: PMC9392950 DOI: 10.1186/s12864-022-08826-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1 General genomic features of BCG Pasteur 1173P2 and BCG Danish 1331. (a) Circular representation of the BCG Pasteur 1173P2 contigs generated with Proksee (https://proksee.ca). The scale is shown in megabases on the black central circle. Moving inward, the two outer violet circles show forward- and reverse-strand CDSs, respectively. Some genes are labeled on the outer violet circle using Proksee's default settings. The tRNAs (orange arrows), rRNAs (light blue arrows), tmRNA (red arrow) and two CRISPR sequences (light green arrows adjacent to each other) are shown in the CDS circles. The next circle shows GC content (dark blue), followed by the GC skew (dark green and pink). (b) Circular representation of the BCG Danish 1331 contigs generated with Proksee (https://proksee.ca). The scale is shown in megabases on the black central circle. Moving inward, the two outer dark blue circles show forward- and reverse-strand CDSs, respectively. Some genes are labeled on the outer dark blue circle using Proksee's default settings. The tRNAs (orange arrows), rRNAs (light blue arrows), tmRNA (red arrow) and two CRISPR sequences (light green arrows) are shown in the CDS circles. The next circle shows GC content (dark green), followed by the GC skew (violet and pink). Category 0 > Virulence, detoxification, adaptation. Category 1 > Lipid metabolism. Category 2 > Information pathways. Category 3 > Cell wall and cell processes. Category 5 > Insertion sequences and phages. Category 6 > PE/PPE. Category 7 > Intermediary metabolism and respiration. Category 8 > Unknown. Category 9 > Regulatory proteins. Category 10 > Conserved hypothetical proteins. Category 16 > Conserved hypothetical with an ortholog in M. bovis
Required genes for mycobacterial in vivo growth that contain non-synonymous SNPs in Pasteur 1173P2 and Danish 1331
| No | H37Rv locus tag | Gene name | Product | Non-synonymous amino acid substitution | Pasteur 1173P2 | Danish 1331 |
|---|---|---|---|---|---|---|
| 1 | Rv0101 | | Peptide synthetase Nrp | L1365M | + | + |
| 2 | Rv0169 | | Mce family protein Mce1A | S313A, P359S | + | + |
| 3 | Rv0170 | | Mce family protein Mce1B | I179T | + | + |
| 4 | Rv0171 | | Mce family protein Mce1C | E212D | + | + |
| 5 | Rv0176 | - | Mce associated transmembrane protein | N285S, S291A | + | + |
| 6 | Rv0218 | - | Transmembrane protein | C316R, D413N | + | + |
| 7 | Rv0490 | | Two component sensor histidine kinase SenX3 | F109S | + | + |
| 8 | Rv0636 | | (3R)-hydroxyacyl-ACP dehydratase subunit HadB | T54A | + | + |
| 9 | Rv0643c | | Methoxy mycolic acid synthase MmaA3 | G98D | + | + |
| 10 | Rv1028c | | Sensor protein KdpD | N776D, P368S, G295D, P83S | + | + |
| 11 | Rv1109c | - | Hypothetical protein | A147T | + | + |
| 12 | Rv1128c | - | Hypothetical protein | E270G | + | + |
| 13 | Rv1204c | - | Hypothetical protein | L484I | + | + |
| 14 | Rv1224 | | Sec-independent protein translocase protein TatB | W8G | + | + |
| 15 | Rv1244 | | Lipoprotein LpqZ | Q242K | + | + |
| 16 | Rv1338 | | Glutamate racemase | R154L | + | + |
| 17 | Rv1371 | - | Membrane protein | I368V | + | + |
| 18 | Rv1460 | - | Transcriptional regulator | I198F, A266V | + | + |
| 19 | Rv1640c | | Bifunctional lysine-tRNA ligase/phosphatidylglycerol lysyltransferase | D944G, D769E | + | + |
| 20 | Rv2048c | | Polyketide synthase | A4047S, P3095L, S2964R, H2147Q, G1865S | + | + |
| 21 | Rv2072c | | Precorrin-6Y C(5,15)-methyltransferase | L205P | + | + |
| 22 | Rv2275 | - | Cyclo(L-tyrosyl-L-tyrosyl) synthase | E261A | + | + |
| 23 | Rv2359 | | Zinc uptake regulation protein | H64R | + | + |
| 24 | Rv2374c | | Heat-inducible transcription repressor HrcA | R79Q | + | - |
| 25 | Rv2388c | | Oxygen-independent coproporphyrinogen III oxidase | A184T | + | + |
| 26 | Rv2483c | | Bifunctional L-3 phosphoserine phosphatase/1-acyl-sn-glycerol-3-phosphate acyltransferase | C189G | + | + |
| 27 | Rv2502c | | Acetyl/propionyl-CoA carboxylase subunit beta | F343L, G77S | + | + |
| 28 | Rv2692 | | TRK system potassium uptake protein CeoC | I133V | + | + |
| 29 | Rv2696c | - | Hypothetical protein | D164N | + | + |
| 30 | Rv2702 | | Polyphosphate glucokinase | I203T | + | + |
| 31 | Rv2813 | - | Hypothetical protein | I76V | + | + |
| 32 | Rv2845c | | Proline–tRNA ligase | A232T, H177R | + | + |
| 33 | Rv2936 | | Daunorubicin ABC transporter ATP-binding protein DrrA | H309D | + | + |
| 34 | Rv2981c | | D-alanine–D-alanine ligase | T365A | + | + |
| 35 | Rv3042c | | Phosphoserine phosphatase SerB | G116E, A70S | + | + |
| 36 | Rv3061c | | Acyl-CoA dehydrogenase FadE22 | S497C, K488E | + | + |
| 37 | Rv3087 | - | Diacylglycerol O-acyltransferase | L447V | + | + |
| 38 | Rv3114 | - | Hypothetical protein | S11P | + | + |
| 39 | Rv3277 | - | Transmembrane protein | S272L | + | + |
| 40 | Rv3335c | - | Integral membrane protein | A86V | + | + |
| 41 | Rv3371 | - | Diacylglycerol O-acyltransferase | R339G, I368V | + | + |
| 42 | Rv3497c | | Mce family protein Mce4C | T46P | + | + |
| 43 | Rv3551 | - | CoA-transferase subunit alpha | A7S | + | + |
| 44 | Rv3563 | | Acyl-CoA dehydrogenase FadE32 | Q105R, W275S | + | + |
| 45 | Rv3616c | | ESX-1 secretion-associated protein EspA | T192I, A4V | + | + |
| 46 | Rv3805c | | Terminal beta-(1->2)-arabinofuranosyltransferase | I327V | + | + |
| 47 | Rv3868 | | ESX-1 secretion system protein EccA1 | A243V | + | + |
| 48 | Rv3910 | - | Peptidoglycan biosynthesis protein | V480A | + | + |
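The substitution column uses the usual one-letter notation (reference residue, position in the protein, variant residue), and the last two columns mark presence (+) or absence (-) of the variant in each strain. As a minimal sketch of how such cells could be parsed and how strain-specific entries could be picked out, the Python below uses three rows copied from the table; the function names `parse_substitutions` and `strain_specific` are hypothetical helpers and are not part of the authors' pipeline.

```python
import re

# One-letter reference residue, position, one-letter variant residue (e.g. "L1365M").
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitutions(cell):
    """Split a cell such as "S313A, P359S" into (ref, position, alt) tuples."""
    parsed = []
    for token in cell.split(","):
        token = token.strip()
        match = SUBSTITUTION.match(token)
        if match is None:
            raise ValueError(f"unrecognised substitution: {token!r}")
        ref, position, alt = match.groups()
        parsed.append((ref, int(position), alt))
    return parsed


def strain_specific(rows):
    """Return locus tags whose variant is present in only one of the two strains."""
    return [locus for locus, _, pasteur, danish in rows if pasteur != danish]


# (locus tag, substitutions, present in Pasteur 1173P2, present in Danish 1331)
rows = [
    ("Rv0169", "S313A, P359S", True, True),
    ("Rv2374c", "R79Q", True, False),
    ("Rv3616c", "T192I, A4V", True, True),
]

for locus, cell, _, _ in rows:
    print(locus, parse_substitutions(cell))
print("strain-specific:", strain_specific(rows))
```

Of the three rows used here, only Rv2374c (HrcA, R79Q) differs between the strains, which matches the single "+ / -" row in the table.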
Fig. 2 (a) SNP rate by functional category of protein-coding genes in M. tuberculosis. (b) Indel rate by functional category of protein-coding genes in M. tuberculosis. Category 0 > Virulence, detoxification, adaptation. Category 1 > Lipid metabolism. Category 2 > Information pathways. Category 3 > Cell wall and cell processes. Category 5 > Insertion sequences and phages. Category 6 > PE/PPE. Category 7 > Intermediary metabolism and respiration. Category 9 > Regulatory proteins. Category 10 > Conserved hypothetical proteins. Category 16 > Conserved hypothetical with an ortholog in M. tuberculosis
Fig. 3 (a) SNP rate by functional category of protein-coding genes in M. bovis. (b) Indel rate by functional category of protein-coding genes in M. bovis
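The legends do not state how the per-category rate is normalised. As a rough sketch under one possible assumption (variants counted per category, divided by the number of genes in that category), the calculation could look like the Python below. The function name `variant_rate_by_category` and all numbers are placeholders for illustration and are not taken from the study.

```python
from collections import Counter

# Functional category codes as listed in the figure legends.
CATEGORY_NAMES = {
    0: "Virulence, detoxification, adaptation",
    1: "Lipid metabolism",
    2: "Information pathways",
    3: "Cell wall and cell processes",
    5: "Insertion sequences and phages",
    6: "PE/PPE",
    7: "Intermediary metabolism and respiration",
    9: "Regulatory proteins",
    10: "Conserved hypothetical proteins",
}


def variant_rate_by_category(variant_categories, genes_per_category):
    """Variants per gene for each category (an assumed definition of "rate")."""
    counts = Counter(variant_categories)
    return {
        category: counts[category] / n_genes
        for category, n_genes in genes_per_category.items()
        if n_genes > 0
    }


# Placeholder inputs: one category code per observed variant, and gene totals.
example_variants = [1, 1, 6, 6, 6, 3]
example_gene_totals = {1: 4, 3: 2, 6: 6, 7: 10}

rates = variant_rate_by_category(example_variants, example_gene_totals)
for category, rate in sorted(rates.items()):
    print(f"{category:>2} {CATEGORY_NAMES[category]}: {rate:.2f}")
```

Normalising by total coding length per category instead of gene count would be an equally plausible choice; only the denominator passed in would change.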
M. tuberculosis antigens containing variants in non-epitope sequences in two BCG strains
| Epitope ID | H37Rv locus tag | Gene name | Product | Genetic variation | Pasteur 1173P2 | Danish 1331 |
|---|---|---|---|---|---|---|
| 2190 | Rv0171 | | Mce family protein Mce1C | nSNP^a | + | + |
| 4002 | Rv3018c | PPE46 | PPE family protein PPE46 | nSNP | + | - |
| 4520 | Rv2627c | - | Hypothetical protein | nSNP | + | + |
| 9474 | Rv2608 | PPE42 | PPE family protein PPE42 | nSNP | + | + |
| 18059 | Rv1291c | - | Hypothetical protein | sSNP^b | + | + |
| 24566 | Rv1037c | | ESAT-6 like protein EsxI | nSNP | + | - |
| 32710 | Rv3467 | - | Hypothetical protein | nSNPs | + | + |
| 32860 | Rv0670 | | Endonuclease IV | sSNP | + | + |
| 39011 | Rv3804c | | Diacylglycerol acyltransferase/mycolyltransferase Ag85A | sSNP | + | + |
| 45757 | Rv0170 | | Mce family protein Mce1B | nSNP | + | + |
| 55156 | Rv1945 | - | Hypothetical protein | sSNPs & nSNPs | + | + |
| 55188 | Rv1641 | | Initiation factor IF-3 | nSNP | + | + |
| 55191 | Rv3689 | - | Transmembrane protein | nSNP | + | + |
| 55192 | Rv3378c | - | Diterpene synthase | sSNP & nSNP | + | + |
| 55315 | Rv2823c | - | CRISPR-associated protein Cas10/Csm1 | Insertion | + | + |
| 55334 | Rv2476c | | NAD-dependent glutamate dehydrogenase | nSNP & Deletion | + | + |
| 57680 | Rv0174 | | Mce family protein Mce1F | sSNPs | + | + |
| 60095 | Rv3714c | - | Hypothetical protein | sSNP | + | + |
| 64663 | Rv0169 | | Mce family protein Mce1A | nSNPs | + | + |
| 92817 | Rv1886c | | Diacylglycerol acyltransferase/mycolyltransferase Ag85B | nSNP | + | + |
| 93270 | Rv3839 | - | Hypothetical protein | nSNP | + | + |
| 99857 | Rv2770c | PPE44 | PPE family protein PPE44 | sSNP & nSNP | + | + |
| 99866 | Rv2770c | PPE44 | PPE family protein PPE44 | sSNP & nSNP | + | - |
| 118590 | Rv2600 | - | Integral membrane protein | Deletion | + | + |
| 120392 | Rv1866 | - | Hypothetical protein | sSNP & nSNP | + | + |
| 120408 | Rv3883c | | Membrane-anchored mycosin | sSNP & nSNP | + | + |
| 120481 | Rv0934 | | Phosphate ABC transporter substrate-binding lipoprotein PstS | nSNP | + | + |
| 120887 | Rv3736 | - | AraC/XylS family transcriptional regulator | sSNP & nSNP | + | + |
| 121059 | Rv0755c | PPE12 | PPE family protein PPE12 | sSNP & nSNP | + | + |
| 125165 | Rv1361c | PPE19 | PPE family protein PPE19 | sSNPs & nSNP | + | + |
| 126028 | Rv3296 | | ATP-dependent helicase | nSNPs | + | + |
| 126912 | Rv0024 | - | NLP/P60 family protein | Deletion | + | + |
| 140543 | Rv2006 | | Trehalose-6-phosphate phosphatase OtsB | sSNPs & nSNP | + | + |
| 140561 | Rv1997 | | Cation transporter ATPase F | sSNP & nSNP | + | + |
| 140576 | Rv2780 | | L-alanine dehydrogenase | Deletion | + | + |
| 140597 | Rv3499c | | Mce family protein Mce4A | sSNP | + | - |
| 140615 | Rv2531c | | Amino acid decarboxylase | sSNP | + | + |
| 140617 | Rv2813 | - | Hypothetical protein | nSNP | + | + |
| 144870 | Rv1769 | - | Hypothetical protein | sSNP | - | + |
| 161402 | Rv0787 | - | Hypothetical protein | nSNP | + | + |
| 168735 | Rv1789 | PPE26 | PPE family protein PPE26 | sSNP | + | + |
| 196087 | Rv3343c | PPE54 | PPE family protein PPE54 | sSNPs & nSNPs | + | + |
| 229047 | Rv1009 | | Resuscitation-promoting factor RpfB | nSNPs | + | + |
| 738104 | Rv3616c | | ESX-1 secretion-associated protein EspA | nSNPs | + | + |
| 851000 | Rv1626 | - | Two-component system transcriptional regulator | sSNP | + | + |
| 857468 | Rv3792 | | Arabinofuranosyltransferase | sSNP | + | + |
| 1081150 | Rv0442c | PPE10 | PPE family protein PPE10 | nSNPs | + | + |
^a Non-synonymous SNP
^b Synonymous SNP
M. tuberculosis antigens containing variants in epitope sequences in two BCG strains
| Epitope ID | H37Rv locus tag | Gene name | Product | Epitope sequence | Genetic variation | Amino acid substitution | Pasteur 1173P2 | Danish 1331 |
|---|---|---|---|---|---|---|---|---|
| 20707 | Rv3497c | | Mce family protein Mce4C | GK | nSNP^a | Thr > Pro | + | + |
| 106585 | Rv2628 | - | Hypothetical protein | KVQSATIYQVTDR | nSNP | Ser > Leu | + | + |
| 120511 | Rv0956 | | Phosphoribosylglycinamide formyltransferase PurN | ETLHERIKVTERRLLVAAVAALAT | sSNP^b | His > His | + | + |
| 153959 | Rv1733c | - | Transmembrane protein | AAAGTAV | nSNP | Gln > His | + | + |
| 155973 | Rv1733c | - | Transmembrane protein | TVSLLTIPFAAAAGTAV | nSNP | Gln > His | + | + |
| 434619 | Rv1733c | - | Transmembrane protein | IPFAAAAGTAV | nSNP | Gln > His | + | + |
^a Non-synonymous SNP
^b Synonymous SNP
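This table reports substitutions in three-letter form (for example Thr > Pro), whereas the table of genes required for in vivo growth uses one-letter notation (for example T46P for Rv3497c). The Python sketch below shows one way the two notations could be reconciled and how the nSNP/sSNP label follows from whether the residue changes. The helper names `to_three_letter` and `classify` are hypothetical and are not taken from the paper.

```python
# Standard one-letter to three-letter amino acid codes.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def to_three_letter(substitution):
    """Convert one-letter notation such as "T46P" into "Thr > Pro"."""
    ref, alt = substitution[0], substitution[-1]
    return f"{THREE_LETTER[ref]} > {THREE_LETTER[alt]}"


def classify(change):
    """Label a "Ref > Alt" change as synonymous (sSNP) or non-synonymous (nSNP)."""
    ref, alt = (part.strip() for part in change.split(">"))
    return "sSNP" if ref == alt else "nSNP"


# Rv3497c appears in both tables: T46P there, Thr > Pro here.
print(to_three_letter("T46P"))

# Changes copied from the epitope table.
for change in ("Thr > Pro", "Ser > Leu", "His > His", "Gln > His"):
    print(change, "->", classify(change))
```

Applied to the rows above, only the Rv0956 (PurN) change, His > His, is classified as synonymous, in agreement with the table.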