| Literature DB >> 35587637 |
Xenia Emilie Sinding Iversen1, Anders Norman1, Dorte Bek Folkvardsen1, Erik Svensson1, Erik Michael Rasmussen1, Troels Lillebaek1,2.
Abstract
In this study, 28 "historical" clinical freeze-dried nontuberculous mycobacterial isolates collected from 1948 to 1957, were analyzed by investigating their viability and performing whole genome sequencing (WGS) on DNA extracted (i) directly from freeze-dried cells versus (ii) after culturing, to determine cell properties and DNA quality after centuries of freeze-dried storage. The isolated DNA was sequenced on the Illumina MiSeq platform and data quality evaluated analyzing the per-base quality scores of paired-end sequencing reads as well as the overall contiguity of resulting de novo assemblies. After 72 years in storage, all freeze-dried isolates were viable, and showed no signs of cell damage and limited signs of contamination when reculturing. They were recultured without problems and identified through WGS with only four of 13 parameters showing statistical significance based on sequence data obtained directly from the freeze-dried cells versus after reculturing, indicating no DNA degradation. Thus, mycobacteria can be whole genome sequenced successfully directly from freeze-dried material without prior recultivation, saving laboratory time and resources, and emphasizing the value of freeze-drying for long-term storage. Our study lays the groundwork for further genomic investigations of freeze-dried bacterial isolates, and the approximately 4,000 historical isolates in our collection will provide a unique opportunity to investigate mycobacterial DNA from a variety of NTM species unexposed to antimicrobials, some maybe still undescribed species. IMPORTANCE The genus Mycobacterium was described more than a century ago and new species are continuously identified and described. There is an ongoing discussion about an increase in the incidence of disease caused by nontuberculous mycobacteria (NTM). How the different bacteria looked before exposure to antibiotics can only be investigated by looking at strains from before the antibiotic era. Strains from that era will be stored in different ways, for example by freeze-drying. The question is how to investigate these strains, and if they are still viable, whether they need to be cultured, and if that changes the DNA. Here, we test all these parameters on freeze-dried strains and show that NGS can be applied directly without culturing.Entities:
Keywords: 7 decades; NGS; ancient strains; freeze-dried; mycobateria; revivification
Mesh:
Year: 2022 PMID: 35587637 PMCID: PMC9241853 DOI: 10.1128/spectrum.00310-22
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Freeze-dried mycobacterial isolates are stored in glass ampoules packed in boxes and kept at 10°C. The International Reference Laboratory of Mycobacteriology at Statens Serum Institut preserve more than 4,000 freeze-dried isolates received from the 1940s to the 1990s, compromising a historical mycobacteria strain collection.
NTM species determination performed using multilocus sequence typing of extracted putative coding regions from de novo assemblies of all sequenced isolates
| Isolate | Protein homology to | LJ colony morphology | |
|---|---|---|---|
| Mu0049 | S, t | ||
|
|
|
| |
| Mu0051 | S, t | ||
|
|
|
| |
| Mu0054 | S, y | ||
| Mu0055 | S, t | ||
| Mu0056 | S, y | ||
| Mu0057 |
| ||
| Mu0058 | S, t | ||
| Mu0082 | S, t | ||
|
|
|
| |
| Mu0084 | S, w | ||
| Mu0086 | S, t | ||
| Mu0087 | S, t | ||
| Mu0088 | R, w | ||
| Mu0089 | S, y | ||
| Mu0090 | R, w | ||
| Mu0091 | S, t | ||
| Mu0093 | R, y | ||
| Mu0094 | - | ||
| Mu0100 | R, t | ||
| Mu0101 | S, t | ||
|
|
|
| |
| Mu0103 | S, t | ||
| Mu0132 | R, w | ||
| Mu0134 | S, t | ||
| Mu0152 | R, t | ||
| Mu0971 | R, w |
Protein blast hits of selected marker genes (secA, rpoB, rpoC, hsp65, ileS, alaS, and leuS) were used to select 25 genomes as phylogenetic reference points (Table S1 and S2). Colony morphology for isolates were observed from Lowenstein-Jensen cultures. Bold rows define potentially new and undiscovered species. -, no phenotypic characterization due to contamination.
Amino acid identity of tblastn hits against secA against reference genome assemblies downloaded from the NCBI Reference Sequence Database (RefSeq).
Morphology characteristics: surface: rough (R) or smooth (S); color: transparent (t), white (w), or yellow (y).
FIG 2Maximum likelihood phylogeny of 28 assembled historical nontypical mycobacterial isolates and 25 reference genomes downloaded from the NCBI Refseq database, using concatenated protein sequence alignments of 83 single copy core genes. Asterisk signifies isolates that differed by more than 5% from a known reference genome by calculating average nucleotide identities (ANI). Branch support values are ultrafast bootstap (UFBoot) and approximate likelihood ratio test (aLRT) values, respectively.
Statistical comparison of the Illumina MiSeq run data-output parameters for uncultured and cultured samples, obtained from samtools analysis
| Illumina run parameters | Median | Median | T-test |
|---|---|---|---|
| No. of sequence reads (forward and reverse combined) | 1,332,293 | 1,638,481 | 0.60 |
| Total no. of bases sequenced | 161,258,608 | 201,940,270 | 0.62 |
| Avg. coverage of mapped bases over contigs | 27 | 31 | 0.90 |
| Percentage of sequenced bases mapped to contigs | 71% | 70% | 0.71 |
| Percentage of mismatches per mapped base (error rate) |
|
|
|
| Percentage of sequenced bases with a quality score ≥ Q30 on forward reads |
|
|
|
| Percentage of sequenced bases with a quality score ≥ Q30 on reverse reads |
|
|
|
| Mode of the distribution of insert sizes mapped reads | 89 | 110 | 0.07 |
| Median of insert sizes of mapped reads | 259 | 233 | 0.51 |
| Lower Quartile (Q1) value of insert sizes | 166 | 155 | 0.80 |
| Upper Quartile (Q3) of insert sizes | 392 | 335 | 0.07 |
| Lower 5% limit of insert sizes | 81 | 83 | 0.40 |
| Upper 95% limit of insert sizes |
|
|
|
The median as well as a paired two-tailed T-test was calculated for the data-output parameters using Excel (v15.32), where parameter data from uncultured sample runs and cultured sample runs were chosen as vector 1 and vector 2, respectively. Significant difference between the two methods (P = ≤0.01) is defined in bold.
Significant P-value < 0.05.
FIG 3Principal component analysis (PCA) and observation chart based on 13 mapping parameters obtained from samtools was made using XLSTAT (v22.2.2) to compare sequencing of uncultured and cultured isolates. The chart is created based on the eigenvalues of factor F1 (63.41%) and F2 (24.21%) of the initial data set, presenting 87.62% of the initial data set.