| Literature DB >> 35171093 |
David Pinzauti1, Francesco Iannelli1, Gianni Pozzi1, Francesco Santoro1.
Abstract
Streptococcus mitis is a Gram-positive bacterium, member of the oral commensal microbiota, which can occasionally be the etiologic agent of diseases such as infective endocarditis, bacteraemia and septicaemia. The highly recombinogenic and repetitive nature of the S. mitis genome impairs the assembly of a complete genome relying only on short sequencing reads. Oxford Nanopore sequencing can overcome this limitation by generating long reads, enabling the resolution of genomic repeated regions and the assembly of a complete genome sequence. Since the output of a Nanopore sequencing run is strongly influenced by genomic DNA quality and molecular weight, the DNA isolation is the crucial step for an optimal sequencing run. In the present work, we have set up and compared three DNA isolation methods on two S. mitis strains, evaluating their capability of preserving genomic DNA integrity and purity. Sequencing of DNA isolated with a mechanical lysis-based method, despite being cheaper and quicker, did not generate ultra-long reads (maximum read length of 59516 bases) and did not allow the assembly of a circular complete genome. Two methods based on enzymatic lysis of the bacterial cell wall, followed by either (i) a modified CTAB DNA isolation procedure, or (ii) a DNA purification after osmotic lysis of the protoplasts allowed the sequencing of ultra-long reads up to 107294 and 181199 bases in length, respectively. The reconstruction of a circular complete genome was possible sequencing DNAs isolated using the enzymatic lysis-based methods.Entities:
Keywords: Gram positive; Oxford Nanopore sequencing; Streptococcus mitis; genomic DNA extraction; high molecular weight DNA
Mesh:
Substances:
Year: 2022 PMID: 35171093 PMCID: PMC8942023 DOI: 10.1099/mgen.0.000764
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Schematic representation of the DNA isolation methods.
Quantitative and qualitative analysis of genomic DNAs
|
Method |
DNA concentration* (ng μl−1) |
DNA absorbance* | ||||
|---|---|---|---|---|---|---|
|
|
|
|
| |||
|
|
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
|
CTAB |
177.13±80.1 |
355±221.73 |
1.96±0.07 |
1.79±0.17 |
2.13±0.04 |
1.97±0.10 |
|
Raffinose |
158.4±43.4 |
196.66±42.72 |
1.92±0.05 |
1.94±0.07 |
2.04±0.07 |
2.09±0.02 |
|
TissueLyser |
48.08±45.74 |
25.74±16.29 |
1.96±0.13 |
2.03±0.13 |
2.00±0.13 |
2.11±0.05 |
*Values represent the average of at least three independent DNA isolation experiments or measurements. For Oxford Nanopore sequencing acceptable DNA absorbance ratios are as follows: A 260/A 280=1.8–2.0 and A 260/A 230=2.0–2.2.
Fig. 2.Agarose gel electrophoretic analysis of genomic DNAs. (a) Representative DNAs extracted from S022-V3-A4 strain, (b) representative DNAs extracted from S022-V7-A3 strain. Lane 1, Lambda (λ) DNA/HindIII (100 ng). Genomic DNA purified with the CTAB method: lane 2, (1 µl); lane 3, (1/10 µl); lane 4, (1/20 µl). Genomic DNA purified with the Raffinose method: lane 5, (1 µl); lane 6, (1/10 µl); lane 7, (1/20 µl). Genomic DNA purified with the TissueLyser method: lane 8, (1 µl); lane 9, (1/10 µl); lane 10, (1/20 µl); lane 11, λ DNA (100 ng); lane 12, λ DNA (30 ng); lane 13, GeneRuler 1 kb Plus DNA Ladder (100 ng).
genome sequencing readout
|
Sequencing output |
DNA extraction method | |||||
|---|---|---|---|---|---|---|
|
CTAB |
Raffinose |
TissueLyser | ||||
|
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 | |
|
Mean read length |
4476.0 |
4216.3 |
5338.5 |
4270.9 |
5121.9 |
4128.9 |
|
Median read length |
2552 |
2171 |
2519 |
2047 |
4695 |
3455 |
|
Mean read quality |
11.3 |
11.2 |
11.3 |
11.2 |
11.2 |
11.1 |
|
Number of reads |
18485 |
17778 |
18920 |
24227 |
19109 |
57825 |
|
Read length N50 |
7608 |
7700 |
10610 |
8371 |
6651 |
5317 |
|
Total bases |
82739476 |
74956919 |
101004905 |
103470132 |
97875297 |
238752790 |
|
Longest reads: |
| |||||
|
1 |
82241 |
107,294 |
181199 |
124694 |
34966 |
59516 |
|
2 |
73834 |
93,505 |
130,614 |
105568 |
31951 |
52114 |
|
3 |
73597 |
84226 |
118691 |
103501 |
31676 |
47645 |
|
4 |
70310 |
82270 |
116448 |
97398 |
31098 |
38005 |
|
5 |
68235 |
80707 |
114758 |
92372 |
27268 |
36981 |
Fig. 3.Read-length distribution of genome sequencing experiments. (a) S022-V3-A4 strain, (b) S022-V7-A3 strain. Read-length values are plotted on the X-axis, while the Y-axis reports the number of reads. Reads obtained sequencing the DNAs isolated using the three methods are represented with different colours: blue for CTAB, red for Raffinose and green for TissueLyser.
genomes' assembly and data availability
|
DNA extraction method |
Length* |
BUSCO† |
Sequence read archive no. |
Genome GenBank no.‡ | ||||
|---|---|---|---|---|---|---|---|---|
|
|
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
S022-V3-A4 |
S022-V7-A3 |
|
CTAB |
2094027 bp |
2033697 bp |
C:370 (92%), F:25 (6.2%), M:7 (1.8%) |
C:356 (88.6%), F:36 (9%), M:10 (2.4%) |
SRX13372482 |
SRX13372479 |
CP047883.1 |
CP067992.1 |
|
Raffinose |
2087264 bp |
2033595 bp |
C:377 (93.8%), F:15 (3.7%), M:10 (2.5%) |
C:372 (92.5%), F:23 (5.7%), M:7 (1.8%) |
SRX13372483 |
SRX13372480 |
|
|
|
TissueLyser |
2083754 bp |
2032458 bp |
C:381 (94.8%), F:19 (4.7%), M:2 (0.5%) |
C:389 (96.8%), F:9 (2.2%), M:4 (1%) |
SRX13372484 |
SRX13372481 |
|
|
*Five linear contigs were obtained assembling reads obtained sequencing DNAs extracted with the CTAB and the TissueLyser method for S. mitis S002-V3-A4 strain, while DNAs extracted with the TissueLyser method generate a single, linear contig for S022-V7-A3 strain.
†Assembly completeness was evaluated with BUSCO v. 5.2.2 using the lactobacillales_odb10 database. A total of 402 BUSCO genes were identified, the number of complete and single-copy (C) fragmented (F), and missing (M) genes in each assembly is reported with their relative abundance in brackets.
‡The deposited genome sequences were assembled combining Illumina reads and Nanopore reads, obtained using DNAs isolated with the three different methods.