| Literature DB >> 36101334 |
Christophe Debonneville1, Léa Mandelli1, Justine Brodard1, Raphaël Groux1, David Roquis2, Olivier Schumpp1.
Abstract
Members of the genus 'Candidatus Phytoplasma' are obligate intracellular bacteria restricted to phloem sieve elements and are able to colonize several tissues and the hemolymph in their insect vectors. The current unfeasibility of axenic culture and the low complexity of genomic sequences are obstacles in assembling complete chromosomes. Here, a method combining pathogen DNA enrichment from infected insects and dual deep-sequencing technologies was used to obtain the complete genome of a phytoplasma causing Grapevine Flavescence dorée. The de novo assembly generated a circular chromosome of 654,223 bp containing 506 protein-coding genes. Quality assessment of the draft showed a high degree of completeness. Comparative analysis with other phytoplasmas revealed the absence of potential mobile units and a reduced amount of putative phage-derived segments, suggesting a low genome plasticity. Phylogenetic analyses identified Candidatus Phytoplasma ziziphi as the closest fully sequenced relative. The "Flavescence dorée" phytoplasma strain CH genome also encoded for several putative effector proteins potentially playing a role in pathogen virulence. The availability of this genome provides the basis for the study of the pathogenicity mechanisms and evolution of the Flavescence dorée phytoplasma.Entities:
Keywords: effectors; genome sequencing; “Flavescence dorée” phytoplasma
Year: 2022 PMID: 36101334 PMCID: PMC9312162 DOI: 10.3390/biology11070953
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Genome assembly and quality assessment. (A) Workflow proposed as the successful approach for de novo genome assembly. (B) BUSCO analysis to assess completeness of the assembled chromosome.
Figure 2Genome map of the circular chromosome of FDp strain CH. Rings from outside in: (1 and 2) coding sequences on the forward and reverse strand, respectively (color-coded by functional categories); (3) G/C content (above average: green; below average: red); (4) G/C skew (above average: green; below average: red); (5) scale marks. (PE): putative effectors; (PMU): genes belonging to the potential mobile unit core set. The grey line indicates the positions of the 10 kb duplicated region. The G/C content and G/C skew are displayed with a sliding window of 1 kb and a step size of 0.5 kb.
General characteristics of FDp strain CH and comparison to representative complete phytoplasma genomes.
| Strain (Accession) | 16S rRNA Group | Genome Size (bp) | G/C Content (%) | No. of CDS | Protein-Coding Regions (%) | No. of tRNA Genes | No. of rRNA Operons |
|---|---|---|---|---|---|---|---|
| FDp strain CH (CP097583) | V | 654,223 | 21.7 | 506 | 77 | 32 | 2 |
| ‘ | V | 750,803 | 23.2 | 671 | 78 | 32 | 2 |
| ‘ | I | 706,569 | 26.9 | 636 | 73.9 | 32 | 2 |
| ‘ | I | 853,092 | 27.8 | 752 | 72.8 | 32 | 2 |
| ‘ | VIII | 769,143 | 23.3 | 725 | 80.1 | 31 | 2 |
| ‘ | XII | 959,779 | 27.2 | 928 | 75.5 | 35 | 2 |
| ‘ | X | 601,943 | 21.4 | 500 | 78.2 | 32 | 2 |
| ‘ | II | 635,584 | 24.5 | 471 | 66.1 | 24 | 2 |
Figure 3Gene Ontology categories associated with the CDSs from FDp strain CH genome. (A) Bar chart showing the distribution among the three main GO domains. One protein can have several molecular functions. (B) Bar chart showing the five most represented GO categories in each domain.
Figure 4Bar chart representing the KEGG protein families associated with the CDSs from FDp strain CH genome.
Putative effectors encoded by the FDp genome.
| Protein ID | Function | SignalP5.0 | Phobius | Accession # of Orthologues in Other Phytoplasmas (Max. 2) |
|---|---|---|---|---|
| FlDop_00023 | Variable membrane protein B | SP | SP and 1 TM | VIO49504 (Alder yellows P.) |
| FlDop_00041 | Hypothetical protein (htmp2) | no prediction | TM | WP_225696264 ( |
| FlDop_00048 * | Hypothetical protein | SP | SP and 1 TM | none |
| FlDop_00049 * | Hypothetical protein | SP | SP and 1 TM | none |
| FlDop_00090 | Hypothetical protein | SP | SP | WP_225696128 ( |
| FlDop_00101 | Hypothetical protein | SP | TM | WP_238055118 ( |
| FlDop_00112 | Hypothetical protein | SP | SP | AYJ01330 ( |
| FlDop_00153 | Hypothetical protein (htmp5) | no prediction | TM | WP_121464226 ( |
| FlDop_00158 * | Hypothetical protein | SP | SP | none |
| FlDop_00183 | Hypothetical protein | SP | SP | WP_026072021 (Poinsettia branch-inducing P.) |
| FlDop_00185 | Hypothetical protein (SVM family) | SP | SP | WP_034172411 (Chrysanthemum yellows P.) |
| FlDop_00190 | Hypothetical protein | SP | SP | PQP79517 ( |
| FlDop_00246 * | Hypothetical protein | no prediction | SP | none |
| FlDop_00265 | Hypothetical protein | SP | TM | WP_121464113 ( |
| FlDop_00266 | Hypothetical protein | no prediction | SP | WP_121464114 ( |
| FlDop_00278 * | Hypothetical protein | SP | SP | none |
| FlDop_00279 * | Hypothetical protein (htmp1) | no prediction | TM | none |
| FlDop_00298 | Hypothetical protein | SP | TM | WP_012504569 ( |
| FlDop_00404 | Hypothetical protein (htmp3) | no prediction | TM | WP_225696004 ( |
| FlDop_00420 * | Hypothetical protein (htmp4) | SP | SP | none |
| FlDop_00531 | SAP21-like protein | SP | SP | QKX95099 (Rapeseed phyllody P.) |
SP: signal peptide; TM: transmembrane domain. * singletons (proteins with no homologue in other analyzed phytoplasma genomes).
Figure 5Maximum likelihood phylogeny based on nucleotide sequences of (A) groEL and (B) smpB genes. The numbers on branches indicate the level of bootstrap support (500 replicates). Support values above 65% are labeled. The scale bar shows the number of substitutions per site.
Figure 6Maximum likelihood phylogeny inferred using 203 single-copy orthologue genes; the concatenated alignment contains 70,619 aligned amino acid sites. The numbers on branches indicate the level of bootstrap support (1000 replicates). Support values above 80% are labeled. The scale bar shows the number of substitutions per site.
Pairwise genome comparisons.
| FDp Strain CH | ‘ | ‘ | ‘ | ‘ | ‘ | ‘ | ‘ | |
|---|---|---|---|---|---|---|---|---|
| ‘ | 306 | 335 | 332 | 337 | 302 | 301 | 305 | 290 |
| ‘ | 295 | 275 | 282 | 291 | 293 | 295 | 309 | |
| ‘ | 321 | 300 | 301 | 301 | 322 | 307 | ||
| ‘ | 300 | 299 | 303 | 301 | 302 | |||
| ‘ | 359 | 296 | 307 | 303 | ||||
| ‘ | 299 | 329 | 384 | |||||
| ‘ | 302 | 343 | ||||||
| ‘ | 304 |
The numbers indicate the orthologue clusters between any pair of genomes.