Literature DB >> 32401897

Newly sequenced genomes of four Bacillus Calmette Guerin vaccines.

Maria Carolina Sisco1, Marlei Gomés Silva1, Beatriz Lopez2, Claudia Arguelles3, Leila Mendonça-Lima4, Jacobus H de Waard5, Rafael Silva Duarte1, Philip Noel Suffys6.   

Abstract

Bacillus Calmette Guerin (BCG) vaccines comprise a family of related strains. Whole genome sequencing has allowed the better characterisation of the differences between many of the BCG vaccines. As sequencing technologies improve, updating of publicly available sequence data becomes common practice. We hereby announce the draft genome of four commonly used BCG vaccines in Brazil, Argentina and Venezuela.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32401897      PMCID: PMC7212995          DOI: 10.1590/0074-02760190401

Source DB:  PubMed          Journal:  Mem Inst Oswaldo Cruz        ISSN: 0074-0276            Impact factor:   2.743


Mycobacterium bovis Bacillus Calmette Guerin, commonly known as BCG, is the only vaccine against tuberculosis. The original BCG strain was obtained by serial passages of a M. bovis strain in potato-bile media. Deletion of the region of difference (RD) 1 was later confirmed as one of the reasons for the attenuation of its virulence. , After its first use in humans, the vaccine was sent to different laboratories worldwide where different culturing conditions originated strains with different genetic compositions. At present, there are more than 10 different vaccine strains being administered worldwide. In two countries in Latin-America, namely Venezuela and Argentina, the strains BCG Danish 1331 (Statens serum Institut, Denmark), BCG Pasteur 1173P2 (Instituto Nacional de Producción de Biológicos ― ANLIS Carlos G Malbrán, Argentina) and BCG Sofia SL222 (BB NCIPD Ltd, Bulgaria) are licensed for use. The vaccine BCG Pasteur produced in Argentina is a secondary seed lot of the French BCG Pasteur strain 1173P2 and is administered in the Province of Buenos Aires, while the rest of the country is vaccinated either with the Sofia or the Danish strain. In Brazil, BCG Moreau RDJ (Fundação Ataulpho de Paiva, Brazil) was used as a vaccine until 2017, when it was replaced by the Russian strain. Whole genome sequencing data of the strains Moreau, Pasteur and Danish are already available , , ) and obtained either by using shotgun sequencing and specific primers designed to close the gaps in the assembly (for Moreau and Pasteur strains) or a combination of Illumina and PacBio technology (for the Danish strain). BCG Sofia has so far only been subjected to whole genome analysis using microarrays. We sequenced the genome of these four vaccine strains with Illumina technology in an effort to update the sequencing data available and for BCG Sofia, we report the first sequence data obtained with newer technology. Genome sequencing of the four vaccine strains was performed using the Nextera XT DNA Library preparation kit on an Illumina HiSeq 2500 platform. De novo assembly was done using Unicycler and annotated with RAST. To determine intra-strain genomic variability of each vaccine, we compared the genomes with previous assemblies obtained from the NCBI , , using the software Artemis Comparison tool and Snippy. The strain BCG Sofia SL222 originated from the Russian vaccine BCG-1 and was chosen as a master seed at the BCG Bulgarian laboratory. Because there is no whole genome assembly available for BCG Sofia SL222, we decided to use the assembly of its parental strain BCG-1 Russia for the comparative studies. Among the four genomes, we obtained between 82 and 108 contigs, an average guanine-cytosine content (GC) of 65%, a size ranging between 4.2 and 4.3 Mb and the number of coding sequences (CDS) between 4205 and 4245 (Table I). The differences in the size of BCG strains genomes we noticed when compared to those available in public databases is probably due to variation in sequencing technologies and of assemblers used.
TABLE I

Assembly statistics for the four vaccine strains sequenced

Number of contigsMoreau RDJPasteur 1173P2Sofia SL222Danish 1331
8293102108
Genome size (bp)4288.2454.192.5454.201.8894.202.807
Coverage414X107X101X94X
% GC65.6265.4865.4565.47
N50197411844147069170718
CDS4232420542454227
tRNAs47474747

bp: base-pairs; %GC: guanine-cytosine content; CDS: coding sequences; tRNA: transfer RNA.

bp: base-pairs; %GC: guanine-cytosine content; CDS: coding sequences; tRNA: transfer RNA. The genome of BCG Moreau RDJ strain revealed 55 single nucleotide polymorphisms (SNPs) compared to that of the shotgun sequencing based genome of the same strain obtained in 2011, 28 of these SNPs are non-synonymous (ns) (Table II). We also detected five insertions and four deletions of 3-4 nucleotides (data not shown) and an inverted IS1608 transposase gene (position 3717335-3717826 bp).
TABLE II

Non-synonymous single nucleotide polymorphisms (SNPs) found in Bacillus Calmette Guerin (BCG) Moreau when using assembly NZ_AM412059 as a reference

PositionNZ_AM412059BCG MoreauAA changeGene
404956TGGlu713AspIron-sulphur-binding reductase
555536CTGly164GluFIG00821074: hypothetical protein
555569GAPro153LeuFIG00821074: hypothetical protein
570675TGLys91AsnAliphatic amidase AmiE
878380GTGly233ValProtease II (EC 3.4.21.83)
1217552TGHis65GlnPE family protein
1618404AGAsp284GlyAnaerobic dimethyl sulfoxide reductase chain A
1618472ACMet307LeuAnaerobic dimethyl sulfoxide reductase chain A
1618722GCArg390ProAnaerobic dimethyl sulfoxide reductase chain A
1618779TGVal409GlyAnaerobic dimethyl sulfoxide reductase chain A
1731072GAAla234ThrSorbitol-6-phosphate 2-dehydrogenase
1985896CGPro114AlaL-gulono-1,4-lactone oxidase
2400116AGLeu184ProCell division protein FtsL / proline rich membrane protein
2651260CGAla266GlyPE family protein
2701298GTPro413ThrRibonuclease E
2760281TCSer266GlyGTP-binding protein Obg
2760610GCAla156GlyGTP-binding protein Obg
2760682TCGlu132GlyGTP-binding protein Obg
3149570CGLeu224ValCoenzyme F420-dependent oxidoreductase
3273878AGVal602AlaATP-dependent DNA helicase RecG
3365033ACTrp93GlyTranscriptional regulator, TetR family
3809510CGGly67AlaFIG00820542: hypothetical protein
3879667AGAsn344AspGTP-binding protein Obg
3881120TGIle828SerGTP-binding protein Obg
3881141CAThr835AsnGTP-binding protein Obg
3891798TGAsp162AlaLong-chain fatty-acid-CoA ligase Mycobacterial subgroup FadD19
3963021CGVal222LeuTranscriptional regulator, LacI family
4172275TGMet67LeuMembrane proteins related to metalloendopeptidases

*: accession number for the assembly of BCG Moreau reported by Gomes et al.(6) A: adenine; G: guanine; C: cytosine; T: thymine; Glu: glutamic acid; Asp: aspartic acid; Gly: glycine; Pro: proline; Leu: leucine; Lys: lysine; Asn: asparagine; Val: valine; His: histidine; Gln: glutamine; Met: methionine; Arg: arginine; Thr: threonine; Ser: serine; Ala: alanine; Trp: tryptophan; Ile: isoleucine.

*: accession number for the assembly of BCG Moreau reported by Gomes et al.(6) A: adenine; G: guanine; C: cytosine; T: thymine; Glu: glutamic acid; Asp: aspartic acid; Gly: glycine; Pro: proline; Leu: leucine; Lys: lysine; Asn: asparagine; Val: valine; His: histidine; Gln: glutamine; Met: methionine; Arg: arginine; Thr: threonine; Ser: serine; Ala: alanine; Trp: tryptophan; Ile: isoleucine. Upon sequencing BCG Sofia SL222 and after comparison with the BCG-1 Russian strain, we observed one synonymous (s) SNP in the gene coding for an uridylyltransferase, in addition to three inverted regions of 42,965 bp, 17,778 bp and 6,765 bp in length. Furthermore, by mapping the reads obtained from the Sofia strain to the genome of the Danish vaccine strain, we confirmed the presence of the 1.6 kb deletion described by Stefanova et al. This deletion affects part of the gene coding for type II toxin-antitoxin system VapC family toxin, the gene for the antitoxin VapB48 and part of the glutamate - cysteine ligase gene. The genome of BCG Danish 1331 was the last to be assembled by using a combination of Illumina and PacBio reads. One advantage of performing PacBio sequencing is that it generates longer reads that improves detection of repeated regions and duplications. Upon sequencing, we observed five SNPs including four nsSNP and a stop codon (Table III). We also observed a deletion of five nucleotides in a SRPBCC family protein gene and two inversions of 26,170 bp and 7,565 bp.
TABLE III

Non-synonymous single nucleotide polymorphisms (SNPs) found in Bacillus Calmette Guerin (BCG) Danish when using assembly NZ_CP039850 as a reference

PositionNZ_CP039850BCG DanishAA changeGene
593769CTGln323**SDR family oxidoreductase
2076695CAAla142SerM56 family metallopeptidase
2500583TCHis260ArgSulfotransferase
3745609GTSer434TyrPPE family protein
3839864TGThr135ProIMP dehydrogenase

*: accession number for the sequencing of BCG Danish reported by Borgers et al.(7); **: indicates a stop codon; A: adenine; G: guanine; C: cytosine; T: thymine; Gln: glutamine; Ala: alanine; Ser: serine; Thr: threonine; Pro: proline; Tyr: tyrosine.

*: accession number for the sequencing of BCG Danish reported by Borgers et al.(7); **: indicates a stop codon; A: adenine; G: guanine; C: cytosine; T: thymine; Gln: glutamine; Ala: alanine; Ser: serine; Thr: threonine; Pro: proline; Tyr: tyrosine. Genome assembly of BCG Pasteur presented a nsSNP in the GTP-binding protein Obg gen (Asn599Asp) and two inframe insertions of three nucleotides each in the genes coding for NADPH epimerase/NADPH dehydratase and a probable cutinase. We also found one inverted region of 31,516 pb. De novo sequencing of genomes deposited in public databases becomes imperative as new sequencing technologies arise. Recently, Abdallah et al. reviewed the genomes and transcriptomes of fourteen BCG vaccine strains and together with the work of Borgers on the Danish vaccine comprise the most recent studies in BCG strains genealogy. We announce the initial draft genome of four of the most common BCG vaccines licensed worldwide in an effort to contribute to the update of publicly available data. The comparative analysis of BCG strains remains of crucial importance to trace their divergence in terms of genetic sequence, transcription and proteomic profile and, subsequently, to describe possible variation in the protective efficacy. Accession numbers - The reads of each genome have been deposited under SRA accession PRJNA575846, BioProject ID: PRJNA575846.
  13 in total

Review 1.  BCG--different strains, different vaccines?

Authors:  Marcel A Behr
Journal:  Lancet Infect Dis       Date:  2002-02       Impact factor: 25.071

2.  Genetic composition of Mycobacterium bovis BCG substrain Sofia.

Authors:  Tzvetelina Stefanova; Milliana Chouchkova; Jason Hinds; Philip D Butcher; Jacqueline Inwald; James Dale; Si Palmer; R Glyn Hewinson; Stephen V Gordon
Journal:  J Clin Microbiol       Date:  2003-11       Impact factor: 5.948

3.  ACT: the Artemis Comparison Tool.

Authors:  Tim J Carver; Kim M Rutherford; Matthew Berriman; Marie-Adele Rajandream; Barclay G Barrell; Julian Parkhill
Journal:  Bioinformatics       Date:  2005-06-23       Impact factor: 6.937

4.  Genome plasticity of BCG and impact on vaccine efficacy.

Authors:  Roland Brosch; Stephen V Gordon; Thierry Garnier; Karin Eiglmeier; Wafa Frigui; Philippe Valenti; Sandrine Dos Santos; Stéphanie Duthoy; Céline Lacroix; Carmen Garcia-Pelayo; Jacqueline K Inwald; Paul Golby; Javier Nuñez Garcia; R Glyn Hewinson; Marcel A Behr; Michael A Quail; Carol Churcher; Bart G Barrell; Julian Parkhill; Stewart T Cole
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-19       Impact factor: 11.205

5.  Genome sequence of Mycobacterium bovis BCG Moreau, the Brazilian vaccine strain against tuberculosis.

Authors:  Leonardo H F Gomes; Thomas D Otto; Erico A Vasconcellos; Patrícia M Ferrão; Renata M Maia; Aline S Moreira; Marcelo A Ferreira; Luiz R R Castello-Branco; Wim M Degrave; Leila Mendonça-Lima
Journal:  J Bacteriol       Date:  2011-10       Impact factor: 3.490

6.  Molecular analysis of genetic differences between Mycobacterium bovis BCG and virulent M. bovis.

Authors:  G G Mahairas; P J Sabo; M J Hickey; D C Singh; C K Stover
Journal:  J Bacteriol       Date:  1996-03       Impact factor: 3.490

7.  The BCG World Atlas: a database of global BCG vaccination policies and practices.

Authors:  Alice Zwerling; Marcel A Behr; Aman Verma; Timothy F Brewer; Dick Menzies; Madhukar Pai
Journal:  PLoS Med       Date:  2011-03-22       Impact factor: 11.069

8.  Genomic expression catalogue of a global collection of BCG vaccine strains show evidence for highly diverged metabolic and cell-wall adaptations.

Authors:  Abdallah M Abdallah; Grant A Hill-Cawthorne; Thomas D Otto; Francesc Coll; José Afonso Guerra-Assunção; Ge Gao; Raeece Naeem; Hifzur Ansari; Tareq B Malas; Sabir A Adroub; Theo Verboom; Roy Ummels; Huoming Zhang; Aswini Kumar Panigrahi; Ruth McNerney; Roland Brosch; Taane G Clark; Marcel A Behr; Wilbert Bitter; Arnab Pain
Journal:  Sci Rep       Date:  2015-10-21       Impact factor: 4.379

9.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

10.  Complete Genome Sequence of Mycobacterium bovis Strain BCG-1 (Russia).

Authors:  Evgeniya A Sotnikova; Egor A Shitikov; Maja V Malakhova; Elena S Kostryukova; Elena N Ilina; Alena V Atrasheuskaya; Georgy M Ignatyev; Nataliya V Vinokurova; Vyacheslav Y Gorbachyov
Journal:  Genome Announc       Date:  2016-03-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.