| Literature DB >> 35584001 |
Marcela A Johnson1,2, Haijie Liu1, Elizabeth Bush1, Parul Sharma1,2, Shu Yang1, Reza Mazloom3, Lenwood S Heath3, Mizuho Nita1,4, Song Li1, Boris A Vinatzer1.
Abstract
Early disease detection is a prerequisite for enacting effective interventions for disease control. Strains of the bacterial plant pathogen Xylella fastidiosa have recurrently spread to new crops in new countries causing devastating outbreaks. So far, investigation of outbreak strains and highly resolved phylogenetic reconstruction have required whole-genome sequencing of pure bacterial cultures, which are challenging to obtain due to the fastidious nature of X. fastidiosa. Here, we show that culture-independent metagenomic sequencing, using the Oxford Nanopore Technologies MinION long-read sequencer, can sensitively and specifically detect the causative agent of Pierce's disease of grapevine, X. fastidiosa subspecies fastidiosa. Using a DNA sample from a grapevine in Virginia, USA, it was possible to obtain a metagenome-assembled genome (MAG) of sufficient quality for phylogenetic reconstruction with SNP resolution. The analysis placed the MAG in a clade with isolates from Georgia, USA, suggesting introduction of X. fastidiosa subspecies fastidiosa to Virginia from the south-eastern USA. This proof of concept study, thus, revealed that metagenomic sequencing can replace culture-dependent genome sequencing for reconstructing transmission routes of bacterial plant pathogens.Entities:
Keywords: metagenomics; nanopore sequencing; pandemic prevention; plant biosecurity; plant disease diagnostics
Mesh:
Year: 2022 PMID: 35584001 PMCID: PMC9465077 DOI: 10.1099/mgen.0.000822
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Sample metadata, qPCR results and summary of sequencing results
|
Characteristic |
11 |
368 |
669 |
228 |
255 |
382 |
|---|---|---|---|---|---|---|
|
Symptoms |
No |
Yes |
Yes |
Yes |
Yes |
Yes |
|
Grape cultivar |
Chardonnay |
Petit Verdot |
Cabernet Franc |
Viognier |
Vidal Blanc |
Cabernet Franc |
|
Location of collection |
Winchester, VA |
Middleburg, VA |
Smithfield, VA |
Charlottesville, VA |
Fairlawn, VA |
Faber, VA |
|
Year of collection |
2020 |
2013 |
2020 |
2013 |
2013 |
2013 |
|
DNA concentration (ng μl−1) |
13.9 |
22.7 |
24.7 |
22.8 |
23.9 |
28.7 |
|
DNA quality (260/280) |
1.56 |
1.74 |
1.74 |
1.72 |
1.69 |
1.73 |
|
qPCR |
20.68 |
17.3 |
19.7 |
18.86 |
19.06 |
16.99 |
|
qPCR |
Undetermined |
17.94 |
20.4 |
34.03 |
35.87 |
18.73 |
|
Samples run per flow cell |
2 |
2 |
1* |
3 |
3 |
3 |
|
Total number of reads |
489 136 |
55 695 |
1 271 428 |
619 578 |
563 451 |
232 722 |
|
Total number of reads (ha1) |
518 818 |
59 693 |
1 305 206 |
735 768 |
649 927 |
271 524 |
|
|
0 |
640 |
26 643 |
17 |
8 |
2245 |
|
|
0 |
1.208 |
2.327 |
0.003 |
0.002 |
1.024 |
|
|
0 |
801 |
29 878 |
11 |
13 |
2906 |
|
|
0 |
1.342 |
2.289 |
0.001 |
0.002 |
1.070 |
|
Total length of reads (Mb) |
399.41 |
96.29 |
3962.02 |
1129.76 |
1247.70 |
520.69 |
|
Mean read length (bp) |
840 |
1728 |
3116 |
1823 |
2214 |
2237 |
|
Longest read length (bp) |
30 160 |
43 841 |
264 283 |
364 246 |
502 142 |
160 166 |
Xf, X. fastidiosa.
*After high-accuracy basecalling.
Fig. 1.Workflow for culture-independent detection and identification of . (a) DNA is extracted from grapevine petioles and used for DNA sequencing library preparation and, as a control, for qPCR. (b) The prepared DNA is then sequenced with the ONT MinION system and basecalled. Basecalled reads are either used directly for detection and identification of at the species and subspecies rank, or assembled for phylogenetic reconstruction for outbreak investigation. Created with BioRender.com.
Fig. 2.Detection sensitivity of metagenomic sequencing with the ONT MinION based on wimp [28] analysis for samples 368, 382 and 669. The x-axis shows the total number of reads that were randomly subsampled. The y-axis reports the corresponding mean number of (Xf) reads based on ten subsamples taken for each subsample size. The error bars show the standard deviation.
Fig. 3.Identification of at the subspecies rank using (a) wimp [28] and (b) sourmash [31, 42]. The x-axes represents the sample IDs (described in Table 1). The y-axes shows the relative abundance of subspecies as a percentage of all sequences assigned to at the species rank. Note that sample 11 was the negative control with no reads identified by either wimp or sourmash, and sourmash did not identify in samples 228 and 255. Xf, ; Xff, X. fastidiosa subsp. ; Xfm, X. fastidiosa subsp. ; Xfp, 'X. fastidiosa. subsp. pauca'; Xfs, 'X. fastidiosa subsp. sandyi'.
Fig. 4.Comparison of the MAG 669 sequence with other subsp. genomes. (a) Alignment of the 669 MAG with the genome of subsp. Temecula 1 using ANIm implemented in JSpeciesWS [35] (99.78 % ANI, 99.03 % nucleotides aligned) and visualized using Ribbon [50]; (b) Result page of LINbase [25] when using MAG 699 as the query with the ‘identify using a genome sequence’ function. The most similar genome to the query (the ‘best match’) is reported together with its LIN. The ANI value between the query and the best match is shown above the LIN. Below the best match, the taxa to which the query belongs are listed as well.
Comparison of the sequence quality of MAG 669 with the subsp. reference genome
Temecula 1 and the genome of the closely related subsp. isolate XF51 CCPM1 were both obtained from pure cultures.
|
Characteristic |
MAG 669 |
Temecula 1 |
XF51 CCPM1 |
|---|---|---|---|
|
Length (Mb) |
2.52 |
2.52 |
2.8 |
|
Contigs |
2 |
2 |
115 |
|
Scaffolds |
1 |
2 |
115 |
|
N50 (contigs) (bp) |
2 522 580 |
2 519 802 |
68 803 |
|
N50 (scaffolds) (bp) |
2 523 846 |
2 519 802 |
68 803 |
|
Completeness |
96.61 |
99.64 |
97.76 |
|
Contamination |
0.12 |
0 |
6.51 |
|
Ambiguous bases |
100 |
0 |
0 |
Fig. 5.Phylogenetic trees of MAG 669, all high-quality publicly available genomes of subsp. isolates from the USA, a selection of international subsp. isolates and ' subsp. sandyi' Ann 1 as an outgroup. (a) Core-genome maximum-likelihood tree using 1138 core genes; (b) whole-genome maximum-likelihood SNP tree based on split k-mer analysis. Genomes are coloured according to isolation location. Percentage bootstrap support values >96 % are shown.