| Literature DB >> 32380694 |
Parichate Tangkanchanapas1,2, Annelies Haegeman1, Tom Ruttink1, Monica Höfte2, Kris De Jonghe1.
Abstract
Columnea latent viroid (CLVd) is one of the most serious tomato diseases. In general, viroids have high mutation rates. This generates a population of variants (so-called quasi-species) that co-exist in their host and exhibit a huge level of genetic diversity. To study the population of CLVd in individual host plants, we used amplicon sequencing using specific CLVd primers linked with a sample-specific index sequence to amplify libraries. An infectious clone of a CLVd isolate Chaipayon-1 was inoculated on different solanaceous host plants. Six replicates of the amplicon sequencing results showed very high reproducibility. On average, we obtained 133,449 CLVd reads per PCR-replicate and 79 to 561 viroid sequence variants, depending on the plant species. We identified 19 major variants (>1.0% mean relative abundance) in which a total of 16 single-nucleotide polymorphisms (SNPs) and two single nucleotide insertions were observed. All major variants contained a combination of 4 to 6 SNPs. Secondary structure prediction clustered all major variants into a tomato/bolo maka group with four loops (I, II, IV and V), and a chili pepper group with four loops (I, III, IV and V) at the terminal right domain, compared to the CLVd Chaipayon-1 which consists of five loops (I, II, III, IV and V).Entities:
Keywords: Columnea latent viroid; amplicon sequencing; high-throughput sequencing; mutations; population study; quasi-species; variant analysis; viroid secondary structure; viroid–host interaction
Mesh:
Year: 2020 PMID: 32380694 PMCID: PMC7246631 DOI: 10.3390/ijms21093262
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Columnea latent viroid (CLVd) variant spectra in tomato cultivars Insaf and Rutgers, bolo maka, chili pepper and Thai round eggplant. (A) Summary statistics of the number of variants per plant. Reproducibility across PCR replicates separates false variants (non-reproducible) from true variants (reproducible in all PCR replicates). Number of dominant variants (>1% relative frequency) per plant. Variant abundance is expressed as % relative frequency based on read depth per sample. (B) Relative frequency per variant per PCR replicate. The class labeled swarm variants contains all true variants (present in all PCR replicates per plant) with low frequency in the quasi-species population (<1% frequency per plant). The class labeled non-reproducible contains all false variants (reads derived from variants observed in less than 0.01% of at least one of the PCR replicates per plant). The order of plants is the same as in panel A) and all individual PCR replicates are shown per plant. (C) Relative frequency per dominant variant per plant. Values are average variant frequencies across PCR replicates per plant. Bi-colored heat map displays variants in the frequency range 0–5% as white-to-grey false color scale, and variants in the frequency range 5–100% as shades of red. § indicates the variants (Variant3 and Variant4) are present in these samples, but at non-reproducible variant frequencies. * indicates that for Rutgers plant3 all values are calculated across 5 PCR replicates.
Figure 2Host specific CLVd major variant analysis. (A) Phylogenetic tree showing the sequence similarity between major variants (>1% average relative frequency in at least one plant). Bootstrap values are indicated at the nodes. Colored circles next to the phylogenetic tree indicate abundance across plant species (black >5%; dark grey, 1–5%; light grey, 0–1%; white, 0%). (B) Chaipayon-1 sequence and major variant sequences at specific positions where they differ from each other. “position” indicates the position in the Chaipayon-1 reference sequence. “substitution” shows the mutation that occurred. - indicates that the sequence was the same as the Chaipayon-1 sequence. * indicates insertions. (C) Predicted secondary structure of each major variant. Variant3, Variant16, Variant4 and Variant7 are shaded gray because their loops are positioned at other locations compared to the other major variants.
Figure 3Structural consequences of major single-nucleotide polymorphisms (SNPs) of dominant variants and mutation frequency (in percentage) per individual plant. The table shows the positions and domains where mutations occurred with mutation frequency in percentage and the secondary structures. The color scale represents a mutation frequency ranging from grey (0%) over green and brown to red (100%).