Literature DB >> 35142538

Draft Genome Sequences of 77 Endemic Multidrug-Resistant Mycobacterium tuberculosis Strains of SIT41 (TUR) Spoligotype from Bulgaria.

Stefan Panaiotov1, Vladimir Tolchkov1, Yordan Hodzhev1, Borislava Tsafarova1, Alberto Trovato2, Daniela Cirillo2.   

Abstract

Sequences of multidrug-resistant (MDR) Mycobacterium tuberculosis strains are of particular interest to study the molecular mechanisms of drug resistance evolution. Here, we report the draft genome sequences of 77 endemic multidrug-resistant Mycobacterium tuberculosis strains of SIT41 (TUR) spoligotype from Bulgaria. SIT41 spoligotype is dominant (>40%) among the MDR-TB strains in Bulgaria.

Entities:  

Year:  2022        PMID: 35142538      PMCID: PMC8830331          DOI: 10.1128/mra.01111-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The current control of tuberculosis mainly depends on multidrug resistance management. Multidrug-resistant M. tuberculosis (MDR-TB) strains of SIT41 (TUR) spoligotype are endemic to Bulgaria (1–4). SIT41 spoligotype is prevalent (>40%) among patients with MDR tuberculosis and a marker for MDR-TB in Bulgaria. Seventy-seven MDR-TB strains of SIT41 (TUR) spoligotype were collected in Bulgaria from 2009 to 2019. All strains originated from Bulgarian patients with pulmonary tuberculosis. One isolate per patient was tested. The Bactec MGIT automated mycobacterial detection system (BD, USA) was applied for isolation and drug susceptibility testing at the National Reference Laboratory for Tuberculosis at the National Center of Infectious and Parasitic Diseases, Sofia, Bulgaria. All strains were rifampicin and isoniazid resistant. Genotyping of the isolated MDR-TB strains was performed, applying spoligotyping (5) and 24-loci MIRU-VNTR typing (6). For DNA isolation, we used a fresh subculture of the isolated MDR-TB strains grown on Löwenstein-Jensen medium for 35 days at 37°C. The DNA was isolated according to the procedure described by van Soolingen (7) including incubations with lysozyme, 10% SDS, proteinase K, 10% cetyltrimethylammonium bromide (CTAB), protein/lipid extraction with phenol/chlorophorm, and DNA precipitation with isopropanol. Before sequencing, DNAs were quantified by SYBR Green assay (Thermo Fisher Scientific, USA). Quality was validated after agarose gel electrophoresis. The libraries were prepared by using the Nextera XT DNA Library Preparation Kit (Illumina, USA) following the manufacturer’s protocol to achieve equimolar pools of each library sample. DNAs were fragmented and ligated to Illumina adapters by “tagmentation.” After the amplification step, size selection was performed with AMPure XP Beads (Beckman Coulter, USA). The size distribution of fragments was checked using an Agilent DNA High Sensitivity chip (2100 Bionanalyzer, Agilent, USA). The libraries were sequenced on an Illumina HiSeq 2500 platform (Illumina, USA) run in paired-end reads in 2 × 100 bp using an Illumina Reagent Kit V2. The sequences were attributed to each index (8 bases) without considering any mismatches. Results demonstrated that the mean number of reads was 3.2 million and mean depth of coverage was 86×, as calculated on an estimated genome size of 4.5 Mb for M. tuberculosis. Quality assessment was performed with the use of the Prinseq-lite program (v.0.20.4) (http://prinseq.sourceforge.net) (8). Sequences from Illumina sequencing were joined using the FLASH program (v.1.2.11) (9). Illumina reads were loaded into Geneious Prime software (v.2021.2.2) (10) and trimmed with the BBDuk plugin (v.1.0, https://sourceforge.net/projects/bbmap/). Adapters on the right end and the low-quality ends (quality below 20%) were trimmed, while reads shorter than 200 bp were discarded. Then, the reads were subjected to preprocessing (https://www.geneious.com/tutorials/map-to-reference/). The genomes were assembled by using the “Map to reference” tool. As a reference genome, M. tuberculosis H37Rv (GenBank accession no. NC_000962.3) was used. A consensus sequence from aligned reads was extracted. We visually confirmed the circular genomes of the M. tuberculosis strains by assessing the reads spanning the junction between the two linearized ends and overlapping with them. Annotation was generated with the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (v.4.13) (11).

Data availability.

The Illumina raw sequencing reads and draft genomes of the 77 endemic multidrug-resistant M. tuberculosis strains of SIT41 (TUR) spoligotype from Bulgaria were deposited in GenBank under BioProject accession number PRJNA765918, SRA study number SRP339647, and BioSample accession numbers ranging between CP086331 to CP085542 and CP085617 (Table 1).
TABLE 1

Descriptions and GenBank accession numbers of sequenced MDR Mycobacterium tuberculosis SIT41 (TUR) spoligotype strains from Bulgaria

No.M. tuberculosis strain/yr of isolationGenome size (bp)GC content (%)No. of contigs (scaffolds)N50No. of CDSaNo. of readsCoverage (×)Accession no.
GenBankSRA
16/20094,418,12965.9133 (1)83,7413,9971,643,19037.6 CP085613 SRR16149118
28/20094,417,82065.5110 (1)70,5524,0281,260,83428.80 CP085611 SRR16149122
311/20094,417,01665.5107 (1)64,0463,9951,224,13028.0 CP085608 SRR16149124
418/20094,418,29065.0130 (1)96,4874,0211,241,93428.40 CP085604 SRR16149129
519/20094,418,35865.5118 (1)74,5424,0571,435,02632.8 CP085602 SRR16149131
620/20094,423,81665.8110 (1)89,9734,0502,706,86261.8 CP085600 SRR16149133
722/20094,419,04865.3115 (1)78,9544,0411,579,66836.1 CP085597 SRR16149137
834/20094,418,14765.5101 (1)78,8924,0491,548,46835.4 CP085593 SRR16149140
937/20094,417,30265.4102 (1)81,9773,9441,549,42835.4 CP085592 SRR16149141
101/20104,421,04465.1119 (1)66,6513,9971,785,09040.1 CP085614 SRR16149117
117/20104,417,10265.0120 (1)64,6474,0431,662,82430.0 CP085612 SRR16149119
128/20104,416,51465.0143 (1)78,9194,0361,585,74436.2 CP085610 SRR16149120
1310/20104,426,98365.1122 (1)83,8834,0693,698,32684.4 CP085609 SRR16149123
1412/20104,416,64265.1115 (1)63,8514,0471,554,69635.6 CP085607 SRR16149125
1513/20104,421,01165.6104 (1)72,7724,0332,524,53257.7 CP085606 SRR16149126
1614/20104,417,91164.9127 (1)78,9504,0051,949,45844.6 CP085605 SRR16149127
1718/20104,432,44865.4105 (1)83,7774,0054,320,52098.4 CP085603 SRR16149128
1819/20104,418,61364.9119 (1)88,2084,0632,235,07651.1 CP085601 SRR16149130
1920/20104,417,34365.3122 (1)70,6484,0722,072,36870.8 CP085599 SRR16149134
2021/20104,417,66665.0103 (1)74,4254,0571,931,00444.1 CP085598 SRR16149135
2122/20104,420,38365.0103 (1)82,3113,9732,266,19850.9 CP085596 SRR16149136
2223/20104,420,91865.1107 (1)82,3584,0532,646,31860.5 CP085595 SRR16149157
2324/20104,420,28865.0117 (1)89,8304,0292,693,38661.6 CP085594 SRR16149139
2438/20104,419,47865.1118 (1)82,3554,0412,480,45656.7 CP085591 SRR16149142
2539/20104,419,82165.0113 (1)70,5503,9962,645,60660.5 CP085590 SRR16149144
2641/20104,419,99665.2124 (1)81,9403,9692,478,72056.7 CP085589 SRR16149145
2745/20104,423,28164.9126 (1)83,8284,0233,100,23670.8 CP085588 SRR16149146
2849/20104,419,93065.1108 (1)86,6344,0251,673,08838.2 CP085587 SRR16149147
2952/20104,420,72264.7112 (1)97,5114,0311,889,69443.2 CP085586 SRR16149148
3060/20104,419,01265.0112 (1)74,5044,0261,640,69037.5 CP085585 SRR16149149
3162/20104,420,14764.9120 (1)75,2704,0262,078,86047.5 CP085584 SRR16149150
3272/20104,418,09865.0100 (1)79,0764,0131,681,11238.4 CP085583 SRR16149151
3332/20114,425,78864.9254 (1)58,8904,0721,354,13246.3 CP085582 SRR16149153
3478/20114,417,08265.4254 (1)58,8904,034604,65020.7 CP086331 SRR16149152
35169/20114,435,97064.7116 (1)122,2234,0417,912,402180.1 CP085573 SRR16149163
36397/20134,439,34064.7128 (1)140,9394,06710,194,418231.9 CP085572 SRR16149164
37728/20154,440,56164.9170 (1)45,9404,04011,277,128256.5 CP085581 SRR16149155
38729/20154,431,94865.1137 (1)122,2174,0637,806,876177.9 CP085580 SRR16149156
3923/20164,433,11865.1122 (1)101,6723,9838,481,276190.2 CP085579 SRR16149138
4025/20164,434,89665.0124 (1)88,9424,0678,583,572195.5 CP085578 SRR16149158
4156/20164,427,56065.0139 (1)98,6734,0667,818,980178.4 CP085577 SRR16149159
42129/20164,430,91165.1134 (1)103,0784,0628,574,282195.4 CP085576 SRR16149160
43539/20164,440,78064.8119 (1)105,7564,0539,853,192224.1 CP085571 SRR16149166
44541/20164,429,66065.0109 (1)128,8354,0569,203,682209.9 CP085575 SRR16149161
45734/20164,427,84564.9132 (1)107,1364,0487,669,828175.0 CP085574 SRR16149162
46800/20164,440,52764.5123 (1)114,3294,05911,146,772253.6 CP085570 SRR16149167
47275/20174,419,70064.9163 (1)59,0754,0333,362,504114.9 CP085564 SRR16149173
48296/20174,418,19965.0155 (1)59,8984,0482,407,59682.3 CP085566 SRR16149171
49297/20174,418,72564.9149 (1)59,3604,0182,221,23075.9 CP085563 SRR16149174
50520/20174,418,00665.0165 (1)52,1534,0632,215,62675.7 CP085569 SRR16149168
51525/20174,418,53664.9165 (1)57,2854,0702,542,82686.9 CP085565 SRR16149172
52715/20174,420,89565.1255 (1)59,2144,0592,583,46688.2 CP085547 SRR16149143
53731/20174,419,64565.2157 (1)53,8634,0582,354,20080.4 CP085548 SRR16149132
541207/20174,414,34865.1152 (1)47,5754,0671,472,35850.4 CP085542 SRR16149188
551361/20174,416,10265.0172 (1)56,6374,0532,488,51085.1 CP085549 SRR16149121
561400/20174,414,25465.2167 (1)53,2014,0501,770,31860.6 CP085550 SRR16149113
571408/20174,414,65365.1166 (1)57,7304,0611,759,17260.2 CP085567 SRR16149170
58300/20174,415,62865.0152 (1)52,1534,0641,202,30941.1 CP085568 SRR16149169
59392/20184,418,91464.9152 (1)74,4734,0542,777,06894.9 CP085557 SRR16149181
60405/20184,418,41564.9145 (1)59,4064,0702,331,79679.7 CP085551 SRR16149112
61416/20184,418,17164.9172 (1)55,9944,0751,995,06668.2 CP085559 SRR16149179
62425/20184,418,04664.9146 (1)66,9994,0582,593,18288.7 CP085558 SRR16149180
63431/20184,420,07864.9144 (1)57,1244,0572,570,99887.8 CP085546 SRR16149154
64525/20184,417,68865.0170 (1)51,8684,0522,274,84477.8 CP085562 SRR16149175
65537/20184,417,65764.8161 (1)57,3884,0802,128,11072.7 CP085556 SRR16149182
66543/20184,419,29464.8150 (1)56,1464,0802,194,51875.0 CP085554 SRR16149184
67545/20184,419,61564.8151 (1)64,2464,0822,528,72886.4 CP085552 SRR16149186
68816/20184,417,43665.0146 (1)61,0654,0672,191,71474.9 CP085561 SRR16149177
69993/20184,415,29965.0152 (1)58,1684,0422,494,24285.3 CP085560 SRR16149178
701072/20184,415,76265.0149 (1)56,0524,0652,358,97880.7 CP085555 SRR16149183
711168/20184,414,89465.0147 (1)62,5194,0391,863,45663.7 CP085553 SRR16149185
72253/20194,476,62265.2112 (1)98,6354,0563,753,884125.8 CP085615 SRR16149116
73270/20194,420,67665.1150 (1)56,6994,0502,865,29497.9 CP085545 SRR16149165
74293/20194,447,27666.0111 (1)88,1424,0603,180,676107.3 CP085616 SRR16149115
75546/20194,444,37766.096 (1)83,1904,0482,837,27695.8 CP085617 SRR16149114
76598/20194,420,90065.1147 (1)73,7604,0612,948,322100.7 CP085544 SRR16149176
77952/20194,420,79765.0148 (1)64,2304,0213,012,618102.9 CP085543 SRR16149187

CDS, coding DNA sequences.

Descriptions and GenBank accession numbers of sequenced MDR Mycobacterium tuberculosis SIT41 (TUR) spoligotype strains from Bulgaria CDS, coding DNA sequences.
  8 in total

1.  FLASH: fast length adjustment of short reads to improve genome assemblies.

Authors:  Tanja Magoč; Steven L Salzberg
Journal:  Bioinformatics       Date:  2011-09-07       Impact factor: 6.937

Review 2.  DNA fingerprinting of Mycobacterium tuberculosis.

Authors:  D van Soolingen; P E de Haas; P W Hermans; J D van Embden
Journal:  Methods Enzymol       Date:  1994       Impact factor: 1.600

3.  Simultaneous detection and strain differentiation of Mycobacterium tuberculosis for diagnosis and epidemiology.

Authors:  J Kamerbeek; L Schouls; A Kolk; M van Agterveld; D van Soolingen; S Kuijper; A Bunschoten; H Molhuizen; R Shaw; M Goyal; J van Embden
Journal:  J Clin Microbiol       Date:  1997-04       Impact factor: 5.948

4.  Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis.

Authors:  Philip Supply; Caroline Allix; Sarah Lesjean; Mara Cardoso-Oelemann; Sabine Rüsch-Gerdes; Eve Willery; Evgueni Savine; Petra de Haas; Henk van Deutekom; Solvig Roring; Pablo Bifani; Natalia Kurepina; Barry Kreiswirth; Christophe Sola; Nalin Rastogi; Vincent Vatin; Maria Cristina Gutierrez; Maryse Fauville; Stefan Niemann; Robin Skuce; Kristin Kremer; Camille Locht; Dick van Soolingen
Journal:  J Clin Microbiol       Date:  2006-09-27       Impact factor: 5.948

5.  Quality control and preprocessing of metagenomic datasets.

Authors:  Robert Schmieder; Robert Edwards
Journal:  Bioinformatics       Date:  2011-01-28       Impact factor: 6.937

6.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

7.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

8.  Biodiversity of Mycobacterium tuberculosis in Bulgaria Related to Human Migrations or Ecological Adaptation.

Authors:  Stefan Panaiotov; Dzheni Madzharov; Yordan Hodzhev
Journal:  Microorganisms       Date:  2022-01-11
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.