| Literature DB >> 36033837 |
Bianca Rodrigues Jardim1,2, Lucy T T Tran-Nguyen3, Cherie Gambley4, Brendan Rodoni1,2, Fiona E Constable1,2.
Abstract
Obtaining complete phytoplasma genomes is difficult due to the lack of a culture system for these bacteria. To improve genome assembly, a non-ionic, low- and iso-osmotic iodixanol (Optiprep™) density gradient centrifugation method was developed to enrich for phytoplasma cells and deplete plant host tissues prior to deoxyribonucleic acid (DNA) extraction and high-throughput sequencing (HTS). After density gradient enrichment, potato infected with a 'Candidatus Phytoplasma australasia'-related strain showed a ∼14-fold increase in phytoplasma HTS reads, with a ∼1.7-fold decrease in host genomic reads compared to the DNA extracted from the same sample without density gradient centrifugation enrichment. Additionally, phytoplasma genome assemblies from libraries equalized to 5 million reads were, on average, ∼15,000 bp larger and more contiguous (N50 ∼14,800 bp larger) than assemblies from the DNA extracted from the infected potato without enrichment. The method was repeated on capsicum infected with Sweet Potato Little Leaf phytoplasma ('Ca. Phytoplasma australasia'-related strain) with a lower phytoplasma titer than the potato. In capsicum, ∼threefold more phytoplasma reads and ∼twofold less host genomic reads were obtained, with the genome assembly size and N50 values from libraries equalized to 3.4 million reads ∼137,000 and ∼4,000 bp larger, respectively, compared to the DNA extracted from infected capsicum without enrichment. Phytoplasmas from potato and capsicum were both enriched at a density of 1.049-1.058 g/ml. Finally, we present two highly contiguous 'Ca. Phytoplasma australasia' phytoplasma reference genomes sequenced from naturally infected Solanaceae hosts in Australia. Obtaining high-quality phytoplasma genomes from naturally infected hosts will improve insights into phytoplasma taxonomy, which will improve their detection and disease management.Entities:
Keywords: 16SrII phytoplasma; OptiprepTM; density gradient centrifugation; high-throughput sequencing; host DNA contamination; natural host; phytopathogen; unculturable bacteria
Year: 2022 PMID: 36033837 PMCID: PMC9411968 DOI: 10.3389/fmicb.2022.937648
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Sample name, host information, collection location in Australia of samples positive for phytoplasma used in the study as well as the phytoplasma identity based on the top BLASTn hit of the R16F2n/m23sr amplicon and species assignment.
| Sample name | Host common name (species name) | Host family | Sampling location in Australia | Highest percent identity (accession number) | BLASTn top hit description |
| o4P | Potato ( | Solanaceae | Melbourne, VIC | 99.94% ( | ‘ |
| o7C | Capsicum ( | Solanaceae | Stanthorpe, QLD | 99.87% ( | Sweet Potato Little Leaf phytoplasma |
Assembly details and statistics for phytoplasma genomes used as references in this study, including GenBank accession numbers, closest phytoplasma relative, sequencing details where available (tissue sampled, enrichment method, sequencing platforms, and sequencing depths), genome properties, plasmid accession number/name.
| Sample name (accession number) | NCHU2014 ( | PR08 ( | o4P ( | o7C ( |
| Putative phytoplasma species | ‘ | ‘ | ‘ | SPLL ‘ |
| Host tissue sampled | ||||
| Enrichment method | Transmission to periwinkle by dodder, and immunoprecipitation | NA | Differential centrifugation and iodixanol density gradient centrifugation, iodixanol Fraction 2 from replicate 2 | Differential centrifugation and iodixanol density gradient centrifugation, iodixanol Fraction 7 from replicate 1 |
| Sequencing platform(s) | Illumina and Oxford Nanopore Technology | Illumina and Oxford Nanopore Technology | Illumina | Illumina |
| Amount of HTS data used (bp) | Approx. 0.67 × 109 (Oxford Nanopore Technology) with approx. 17.16 × 109 (Illumina) | NA | 5.94 × 109 | 19.14 × 109 |
| No. of contigs | 1 | 1 | 30 | 29 |
| Est. genome size (bp) | 635,584 | 588,746 | 555,927 | 555,321 |
| N50 value (bp) | 635,584 | 588,746 | 41,668 | 41,719 |
| Est. genome coverage (×) | 2117 | 2757.98 | 1372 | 117 |
| GC (%) | 24.5 | 24.3 | 23.9 | 23.7 |
| No. protein coding genes | 471 | 468 | 463 | 458 |
| No. tRNAs | 24 | 27 | 25 | 25 |
| No. 16S rRNAs | 2 | 2 | 1 | 1 |
| Plasmid recovered? (accession number) | Yes ( | No | Putative plasmid: po4P16SrIID | Putative plasmid: po7C16SrIID |
| References | NA | This study | This study |
NA, not available.
and tufB gene regions extracted from the genome assemblies of strain o4 and strain o7 share 100% nucleotide sequence identity to those of ‘Ca. Phytoplasma australasia’ (GenBank accession numbers EU168728 and JQ824250, respectively) and only 91–94% with the genes of ‘Ca. Phytoplasma aurantifolia’ (GenBank accession numbers EU168731 and JQ824276, respectively). This places both strains as ‘Ca. Phytoplasma australasia’-related species based on recommendations by Bertaccini et al. (2022).
Summary of iodixanol fraction densities after gradient centrifugation of phytoplasma infected potato (o4P) and capsicum (o7C) tissues, based on three (x) technical replicates of density gradient centrifugation for each sample; DNA concentrations of all samples; mean qPCR standard curve-based estimates of phytoplasma titer for all samples; HTS read mapping results after read down-sampling; and phytoplasma genome assembly quality metrics of down-sampled libraries.
| Sample | Mean fraction density (g/ml) | DNA con | qPCR | Total reads retained in down-sampling (Million reads) | Mean phytoplasma reads mapped (%) | Mean host gDNA reads mapped (%) | Mean host cpDNA reads | Mean host mtDNA reads | Mean unmapped reads (%) | Phytoplasma genome assembly size (bp) | Largest phytoplasma contig mean (bp) | Phytoplasma genome mean N50 (bp) | Mean reference genome fold coverage |
| o4P-Homogenate | NA | 34.000 | 16.65 | 5.0 | 1.67 | 85.44 | 6.48 | 1.46 | 4.95 | 565,143 | 92,921 | 37,285 | 29 |
| o4P-TSE pellet | NA | 1.630 | 12.30 | 5.0 | 9.39 | 69.37 | 7.57 | 4.58 | 9.10 | 569,323 | 71,194 | 28,898 | 132 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| o4Px-Fraction3 | 1.063 | <0.064 | 16.09 | 5.0 | 12.36 | 61.31 | 9.05 | 7.92 | 9.37 | 565,250 | 107,678 | 37,285 | 200 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| o4Px-Fraction5 | 1.095 | <0.094 | 17.73 | 5.0 | 5.00 | 72.66 | 5.68 | 5.87 | 10.79 | 566,373 | 69,878 | 26,752 | 92 |
| o4Px-Fraction6 | 1.104 | <0.005 | 19.07 | ND | 5.74 | 50.51 | 2.26 | 1.29 | 40.20 | 565,998 | 71,194 | 33,168 | 60 |
| o4Px-Fraction7 | 1.133 | <0.005 | 19.35 | ND | 4.06 | 69.12 | 2.68 | 1.44 | 22.70 | 565,113 | 107,877 | 37,285 | 57 |
| o4Px-Fraction8 | 1.196 | <0.005 | 21.66 | ND | 4.24 | 51.38 | 2.78 | 1.90 | 39.71 | 566,077 | 71,194 | 38,094 | 54 |
| o7C-Homogenate | NA | 61.900 | 23.80 | 3.4 | 0.27 | 79.33 | 2.60 | 2.75 | 15.05 | 385,814 | 3,547 | 0,936 | 6 |
| o7C-TSE pellet | NA | 17.300 | 16.58 | 3.4 | 0.21 | 62.42 | 3.27 | 3.53 | 30.57 | 279,439 | 3,420 | 0,723 | 5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| o7Cx-Fraction5 | 1.108 | <0.198 | 22.79 | 3.4 | 0.24 | 14.58 | 1.44 | 6.36 | 77.38 | 341,085 | 4,040 | 0,932 | 7 |
| o7Cx-Fraction6 | 1.109 | 0.057 | 24.70 | 3.4 | 0.41 | 17.16 | 1.81 | 8.39 | 72.23 | 463,744 | 5,106 | 1,224 | 9 |
| o7Cx-Fraction7 | 1.119 | <0.060 | 23.31 | 3.4 | 0.61 | 30.16 | 2.03 | 14.23 | 52.98 | 545,833 | 13,829 | 2,542 | 12 |
| o7Cx-Fraction8 | 1.157 | <0.082 | 26.23 | 3.4 | 0.37 | 23.31 | 1.95 | 8.83 | 65.54 | 397,225 | 5,341 | 1,008 | 9 |
N50 refers to the contig length such that contigs of equal or greater lengths produce 50% of the genome’s length. Note that not all technical replicates were sequenced for all fractions derived from density gradient centrifugation (see Section “Results”).
aSample naming convention: the suffix “-Homogenate” refers to the unprocessed tissue homogenate, “-TSE pellet” refers to the pellet, resuspended in TSE buffer that results after two rounds of differential centrifugation and is loaded to the top of the iodixanol density gradient, “-F1 to -F8” indicates the iodixanol fraction sampled per host.
Bold indicates averages taken from replicates submitted for HTS, values not in bold font indicates a single replicate submitted for HTS.
NA, not available; ND, not done; UD, undetermined.
FIGURE 1Reciprocal mean average nucleotide identities (ANI, %) and proportion of aligned genomic segments (AF, %) for the two phytoplasma reference genomes assembled in this study and two publicly available phytoplasma reference genomes (strain NCHU2014 and strain PR08). The phytoplasma genome used as a reference in each case is indicated above the graph in black font, with an asterisk indicating a complete genome. The dashed line indicates the proposed 95% ANI species delimitation threshold value.
FIGURE 2Average nucleotide identity (ANI, %) and proportion of aligned genomic segments (AF,%) for down-sampled assemblies compared to the phytoplasma reference genomes assembled in this study and two publicly available phytoplasma reference genomes for (A) the infected potato host and (B) the infected capsicum host. The samples analyzed for each host included the unprocessed tissue homogenate (suffix “-IB”), differentially centrifuged pellet (suffix “-TSE”), and the iodixanol fraction with the highest percent of phytoplasma reads (suffix “-F1”). The phytoplasma genome used as the reference in each case is indicated above the graph in black font, with an asterisk indicating a complete genome. The dashed line indicates the proposed 95% ANI species delimitation threshold value.