| Literature DB >> 23940565 |
Alexandra Olarte1, Nitin Mantri, Gregory Nugent, Edwin C K Pang.
Abstract
Echinacea, native to the Canadian prairies and the prairie states of the United States, has a long tradition as a folk medicine for the Native Americans. Currently, Echinacea are among the top 10 selling herbal medicines in the U.S. and Europe, due to increasing popularity for the treatment of common cold and ability to stimulate the immune system. However, the genetic relationship within the species of this genus is unclear, making the authentication of the species used for the medicinal industry more difficult. We report the construction of a novel Subtracted Diversity Array (SDA) for Echinacea species and demonstrate the potential of this array for isolating highly polymorphic sequences. In order to selectively isolate Echinacea-specific sequences, a Suppression Subtractive Hybridization (SSH) was performed between a pool of twenty-four Echinacea genotypes and a pool of other angiosperms and non-angiosperms. A total of 283 subtracted genomic DNA (gDNA) fragments were amplified and arrayed. Twenty-seven Echinacea genotypes including four that were not used in the array construction could be successfully discriminated. Interestingly, unknown samples of E. paradoxa and E. purpurea could be unambiguously identified from the cluster analysis. Furthermore, this Echinacea-specific SDA was also able to isolate highly polymorphic retrotransposon sequences. Five out of the eleven most discriminatory features matched to known retrotransposons. This is the first time retrotransposon sequences have been used to fingerprint Echinacea, highlighting the potential of retrotransposons as based molecular markers useful for fingerprinting and studying diversity patterns in Echinacea.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23940565 PMCID: PMC3734018 DOI: 10.1371/journal.pone.0070347
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Description of the Echinacea species used for DNA extraction and development of genome representations.
| Taxon (McGregor, 1968) | Accession number | Abbreviation |
|
| PI631267 (OK) | ang 267 |
|
| PI631272 (OK) | ang-ang 272 |
| PI631285 (IA) | ang-ang 285 | |
| PI631318 (KS) | ang-ang 318 | |
|
| PI631266 (OK) | ang-str 266 |
| PI631320 (OK) | ang-str 320 | |
|
| PI631275 (OK) | pal 275 |
| PI631290 (IA) | pal 290 | |
| PI631293 (AR) | pal 293 | |
| PI631296 (MO) | pal 296 | |
| PI631315 (NC) | pal 315 | |
| “Hula dancer” | N/A | |
|
| PI631263 (OK) | px-neg 263 |
| PI631264 (OK) | px-neg 264 | |
| PI631265 (OK) | px-neg 265 | |
|
| PI631301 (MO) | px-px 301 |
| PI631321 (MO) | px-px 321 | |
|
| PI631307 (MO) | pur 307 |
| PI631313 (NC) | pur 313 | |
| PI633669 (LA) | pur 669 | |
| “Double Decker” | N/A | |
| “White purpurea” | N/A | |
| “purpurea” | N/A | |
|
|
| N/A |
|
| PI631294 (AR) | hyb 294 |
|
| Plot 9 | ang plot 9 |
|
| Plot 5 | pal plot 5 |
|
| Plot 10009 | pur plot 10009 |
Notes: AR, Arkansas; IA, Iowa; KS, Kansas; LA, Louisian a; MO, Missouri; NC, North Carolina; OK, Oklahoma; OR, Oregon; SC, South Carolina; TN, Tennessee; VA, Virginia.
Echinacea not included in the SDA development.
Echinacea with PI accessions numbers were obtained from the germplasm collection in the U.S. National Plant Germplasm System.
Echinacea verified specimens obtained from a specialized plant nursery (The Diggers Club. Dromana VIC).
Echinacea obtained from the Botanical Resources Australia (Tasmania).
The abbreviated names are used to refer to the accessions in the figures.
Figure 1Dissimilarity dendrogram for the SDA hybridization patterns of 27 Echinacea genotypes using the 283 features.
The steps of the dendrogram (Squared Euclidian distance, between groups linkage) show the combined clusters and the values of the distance coefficients at each step; the values have been rescaled to numbers between 0 and 25, preserving the ratio of the distances between the steps. The equivalents of the abbreviated names used for each of the genotypes are shown in Table 1.
Figure 2Principal component analysis plot for the 283 features.
The first principal component accounts for 94.7% of variation and the second component explained only 2.2% of variation. The squares represent features that account for most of the variability across the genotypes.
Significant correlations among the signal of each of the 283 features and the relative abundance of 43 lipophilic metabolites.
| Compound | H9 | L2 | M8 | I18 | F15 |
|
| 0.82 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.65 | 0.81 | |||
| Sig. (2-tailed) | 0.00 | 0.00 | |||
|
| 0.87 | 0.74 | |||
| Sig. (2-tailed) | 0.00 | 0.00 | |||
|
| 0.54 | ||||
| Sig. (2-tailed) | 0.02 | ||||
|
| 0.86 | 0.79 | 0.75 | ||
| Sig. (2-tailed) | 0.00 | 0.00 | 0.00 | ||
|
| 0.57 | ||||
| Sig. (2-tailed) | 0.02 | ||||
|
| 0.59 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.48 | ||||
| Sig. (2-tailed) | 0.04 | ||||
|
| 0.49 | ||||
| Sig. (2-tailed) | 0.03 | ||||
|
| 0.59 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.71 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.79 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.75 | ||||
| Sig. (2-tailed) | 0.00 | ||||
|
| 0.59 | ||||
| Sig. (2-tailed) | 0.00 | ||||
| Chen alkamide | 0.92 | 0.71 | 0.70 | ||
| Sig. (2-tailed) | 0.00 | 0.00 | 0.00 | ||
|
| −0.56 | ||||
| Sig. (2-tailed) | 0.02 | ||||
|
| −0.61 | 0.56 | |||
| Sig. (2-tailed) | 0.00 | 0.02 |
2,4-diene alkamides.
Monoene alkamides.
Correlation is significant at the 0.01 level.
Correlation is significant at the 0.05 level.
Figure 3Correlation among the signal strength of feature H9 and the relative content of alkamides.
Significant positive correlations are shown for signal strength of feature H9 and the content of chen alkamide and amide 7.
Figure 4Correlation among the signal strength of feature I18 and the relative content of monoene alkamides.
Significant positive correlations are shown for signal strength of feature H9 and the content of amide 14 and 16.
Predicted locus/function of the 14 sequenced SDA features using blastN program through National Centre of Biotechnology Information (www.ncbi.nlm.nih.gov).
| Feature ID | Length (bp) | Matching database entry | Putative identity | E Value |
| A8 | 323 | No hits | NA | |
| B15 | 252 | No hits | NA | |
| B17 | 344 | EL419699.1 |
| 2e-11 |
| C2 | 341 | FJ791047.1 |
| 2e-18 |
| F15 | 744 | EU362851.1 | Ambrosia asymptomatic virus 2 UKM-2007 isolate05TGP00321.Bad4 ORF1–2 gene, partial cds | 6e-54 |
| G16 | 328 | FJ791047.1 |
| 4e-21 |
| H9 | 249 | No hits | NA | |
| I9 | 550 | D85597.1 |
| 7e-08 |
| I18 | 447 | No hits | NA | |
| J8 | 300 | FJ791047.1 |
| 2e-11 |
| L2 | 643 | JN021935.1 |
| 4e-43 |
| M2 | 829 | GQ367282.1 |
| 1e-95 |
| M8 | 454 | No hits | NA | |
| O2 | 360 | No hits | NA |
The best match is shown as the putative identity for each sequence. E-value was regarded as significant if <1e-10. NA indicates the absence of significant data.
Features that were chosen by PCA.
Features that were found to have low signal strength for E. purpurea.
Features whose signal strength correlated significantly with the content of lipophilic metabolites.
Figure 5Dissimilarity dendrogram for the 27 genotypes using only the eleven most discriminatory features.
The steps of the dendrogram (Squared Euclidian distance, between groups linkage) show the combined clusters and the values of the distance coefficients at each step; the values have been rescaled to numbers between 0 and 25, preserving the ratio of the distances between the steps. The equivalents of the abbreviated names used for each of the genotypes are shown in Table 1.