| Literature DB >> 28164123 |
Kang Liu1, Azian Azamimi Abdullah1, Ming Huang1, Takaaki Nishioka1, Md Altaf-Ul-Amin1, Shigehiko Kanaya1.
Abstract
Secondary metabolites are bioactive substances with diverse chemical structures. Depending on the ecological environment within which they are living, higher plants use different combinations of secondary metabolites for adaptation (e.g., defense against attacks by herbivores or pathogenic microbes). This suggests that the similarity in metabolite content is applicable to assess phylogenic similarity of higher plants. However, such a chemical taxonomic approach has limitations of incomplete metabolomics data. We propose an approach for successfully classifying 216 plants based on their known incomplete metabolite content. Structurally similar metabolites have been clustered using the network clustering algorithm DPClus. Plants have been represented as binary vectors, implying relations with structurally similar metabolite groups, and classified using Ward's method of hierarchical clustering. Despite incomplete data, the resulting plant clusters are consistent with the known evolutional relations of plants. This finding reveals the significance of metabolite content as a taxonomic marker. We also discuss the predictive power of metabolite content in exploring nutritional and medicinal properties in plants. As a byproduct of our analysis, we could predict some currently unknown species-metabolite relations.Entities:
Mesh:
Year: 2017 PMID: 28164123 PMCID: PMC5253511 DOI: 10.1155/2017/5296729
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1(a) Bipartite graph of plant-metabolite relations. Molecular structures of metabolites are described by 166-bit atom pair fingerprints, which are used to calculate Tanimoto structure similarity score for each metabolite pair. (b) Bipartite graph of plant versus metabolite-group relations. Each plant has been associated with metabolite groups instead of single metabolites to reduce effect of incomplete data.
Figure 2(a) Structural-similarity-based network of metabolites (plotted using network analysis tool Cytoscape v3.3.0). This network is composed of many isolated components, and each component contains different number of nodes. (b) Degree distribution of the network in log scale.
Figure 3Frequency of metabolite groups with respect to group size. x-axes represent number of metabolites belonging to one metabolite group, and y-axes represent frequency of such metabolite groups. Frequency of metabolite groups in log scale is shown in inset figure.
Taxonomic and use information of 216 plants. Group ID, plant names, taxonomic ranks (family), and economic uses are mentioned in consecutive columns. Economic uses of plants are represented as the following abbreviations: E (edible), M (medicinal), L (landscaping,), T (timber), P (poisonous), and W (wild plant). Some plants are both edible and medicinal and are annotated as E/M.
| Group | Plant | Family | Use |
|---|---|---|---|
| 1 |
| Rutaceae | E |
|
| Rutaceae | E/M | |
|
| Rutaceae | E | |
|
| Rutaceae | E | |
|
| Rutaceae | E | |
|
| Rutaceae | E | |
|
| |||
| 2 |
| Saururaceae | E/M |
|
| Saururaceae | W | |
|
| Crassulaceae | M | |
|
| |||
| 3 |
| Asteraceae | M |
|
| Asteraceae | M | |
|
| Asteraceae | W | |
|
| Solanaceae | E | |
|
| |||
| 4 |
| Asteraceae | W |
|
| Asteraceae | M | |
|
| Asteraceae | W | |
|
| Caprifoliaceae | M | |
|
| Polygonaceae | M | |
|
| Lamiaceae | M | |
|
| Apiaceae | W | |
|
| |||
| 5 |
| Zingiberaceae | E/M |
|
| Zingiberaceae | E/M | |
|
| Lamiaceae | M | |
|
| Cistaceae | W | |
|
| Pinaceae | L | |
|
| |||
| 6 |
| Myrtaceae | M |
|
| Myrtaceae | M | |
|
| Asteraceae | W | |
|
| |||
| 7 |
| Zingiberaceae | E/M |
|
| Zingiberaceae | W | |
|
| Cistaceae | W | |
|
| Myrtaceae | M | |
|
| Piperaceae | W | |
|
| Piperaceae | W | |
|
| Pinaceae | L | |
|
| Cyperaceae | M | |
|
| |||
| 8 |
| Pinaceae | T |
|
| Pinaceae | T | |
|
| Pinaceae | T | |
|
| Rutaceae | E | |
|
| |||
| 9 |
| Rosaceae | E |
|
| Rosaceae | E | |
|
| Rosaceae | E | |
|
| |||
| 10 |
| Fabaceae | E |
|
| Fabaceae | L | |
|
| Amaryllidaceae | E | |
|
| |||
| 11 |
| Linaceae | T |
|
| Fabaceae | E | |
|
| Asteraceae | M | |
|
| |||
| 12 |
| Fabaceae | E |
|
| Fabaceae | E | |
|
| Fabaceae | E | |
|
| |||
| 13 |
| Poaceae | E |
|
| Poaceae | E | |
|
|
| E | |
|
| |||
| 14 |
| Brassicaceae | E |
|
| Brassicaceae | P | |
|
| Rosaceae | E | |
|
| |||
| 15 |
| Poaceae | E |
|
| Poaceae | E | |
|
| Cucurbitaceae | E | |
|
| Fabaceae | E | |
|
| Asteraceae | E | |
|
| |||
| 16 |
| Rosaceae | E |
|
| Fabaceae | M | |
|
| Hippocastanaceae | P | |
|
| Theaceae | E | |
|
| Polygonaceae | W | |
|
| |||
| 17 |
| Fabaceae | L |
|
| Fabaceae | T | |
|
| Fabaceae | W | |
|
| |||
| 18 |
| Crassulaceae | M |
|
| Crassulaceae | M | |
|
| Crassulaceae | M | |
|
| Phyllanthaceae | E/M | |
|
| Myrtaceae | E | |
|
| Rutaceae | M | |
|
| Berberidaceae | M | |
|
| |||
| 19 |
| Solanaceae | E |
|
| Solanaceae | E | |
|
| Solanaceae | M | |
|
| |||
| 20 |
| Solanaceae | E |
|
| Solanaceae | L | |
|
| Apiaceae | W | |
|
| Apocynaceae | L | |
|
| Cannabaceae | M | |
|
| Cyperaceae | M | |
|
| |||
| 21 |
| Fabaceae | M |
|
| Fabaceae | W | |
|
| Fabaceae | E/M | |
|
| Fabaceae | M | |
|
| |||
| 22 |
| Fabaceae | W |
|
| Fabaceae | E | |
|
| Fabaceae | W | |
|
| Fabaceae | L | |
|
| Fabaceae | M | |
|
| |||
| 23 |
| Fabaceae | W |
|
| Fabaceae | W | |
|
| Fabaceae | M | |
|
| Fabaceae | L | |
|
| Fabaceae | W | |
|
| Daphniphyllaceae | M | |
|
| |||
| 24 |
| Fabaceae | E |
|
| Fabaceae | E | |
|
| Fabaceae | M | |
|
| Fabaceae | T | |
|
| Fabaceae | W | |
|
| Fabaceae | W | |
|
| Fabaceae | W | |
|
| Fabaceae | T | |
|
| |||
| 25 |
| Papaveraceae | W |
|
| Papaveraceae | M | |
|
| Papaveraceae | W | |
|
| Menispermaceae | W | |
|
| Menispermaceae | W | |
|
| Menispermaceae | W | |
|
| Menispermaceae | W | |
|
| Annonaceae | E | |
|
| Annonaceae | W | |
|
| |||
| 26 |
| Brassicaceae | E |
|
| Brassicaceae | E | |
|
| Brassicaceae | E | |
|
| Brassicaceae | L | |
|
| |||
| 27 |
| Apocynaceae | T |
|
| Apocynaceae | M | |
|
| Apocynaceae | M | |
|
| |||
| 28 |
| Fabaceae | L |
|
| Fabaceae | L | |
|
| Fabaceae | W | |
|
| Fabaceae | P | |
|
| Fabaceae | L | |
|
| Fabaceae | T | |
|
| |||
| 29 |
| Orchidaceae | M |
|
| Orchidaceae | M | |
|
| |||
| 30 |
| Cupressaceae | T |
|
| Cupressaceae | T | |
|
| Cupressaceae | T | |
|
| |||
| 31 |
| Asteraceae | P |
|
| Asteraceae | M | |
|
| Asteraceae | W | |
|
| Rutaceae | E | |
|
| Rutaceae | M | |
|
| Rutaceae | M | |
|
| Cannabaceae | M | |
|
| Iridaceae | M | |
|
| |||
| 32 |
| Apocynaceae | W |
|
| Apocynaceae | W | |
|
| Apocynaceae | M | |
|
| Apocynaceae | W | |
|
| |||
| 33 |
| Caprifoliaceae | W |
|
| Asteraceae | W | |
|
| Lamiaceae | W | |
|
| Apocynaceae | W | |
|
| Fabaceae | T | |
|
| Rutaceae | W | |
|
| Zingiberaceae | E | |
|
| |||
| 34 |
| Clusiaceae | E/M |
|
| Clusiaceae | W | |
|
| |||
| 35 |
| Rutaceae | W |
|
| Rutaceae | E/M | |
|
| Rutaceae | W | |
|
| Apiaceae | E/M | |
|
| |||
| 36 |
| Acanthaceae | M |
|
| Lamiaceae | M | |
|
| |||
| 37 |
| Rutaceae | M |
|
| Rutaceae | W | |
|
| |||
| 38 |
| Magnoliaceae | M |
|
| Magnoliaceae | M | |
|
| Gesneriaceae | W | |
|
| |||
| 39 |
| Moraceae | E |
|
| Moraceae | E/M | |
|
| Moraceae | E | |
|
| |||
| 40 |
| Brassicaceae | E |
|
| Fabaceae | E | |
|
| |||
| 41 |
| Solanaceae | M |
|
| Solanaceae | M | |
|
| Apiaceae | M | |
|
| |||
| 42 |
| Fabaceae | M |
|
| Calophyllaceae | T | |
|
| Cupressaceae | W | |
|
| |||
| 43 |
| Taxaceae | P |
|
| Taxaceae | M | |
|
| Taxaceae | M | |
|
| Taxaceae | M | |
|
| Taxaceae | M | |
|
| Taxaceae | M | |
|
| Taxaceae | M | |
|
| |||
| 44 |
| Araliaceae | M |
|
| Araliaceae | M | |
|
| Araliaceae | M | |
|
| Araliaceae | M | |
|
| Apiaceae | M | |
|
| Amaranthaceae | E | |
|
| Asteraceae | E/M | |
|
| |||
| 45 |
| Meliaceae | W |
|
| Rosaceae | W | |
|
| Malvaceae | W | |
|
| Begoniaceae | W | |
|
| Zingiberaceae | W | |
|
| Asteraceae | W | |
|
| |||
| 46 |
| Aristolochiaceae | L |
|
| Aristolochiaceae | M | |
|
| |||
| 47 |
| Annonaceae | W |
|
| Annonaceae | E | |
|
| Rubiaceae | M | |
|
| Solanaceae | M | |
|
| |||
| 48 |
| Lamiaceae | E/M |
|
| Lamiaceae | W | |
|
| Plantaginaceae | M | |
|
| Rehmanniaceae | M | |
|
| Oleaceae | E/M | |
|
| Caprifoliaceae | M | |
|
| Araliaceae | M | |
|
| Ebenaceae | E | |
|
| Lythraceae | E | |
|
| Zingiberaceae | E/M | |
Reported plant-metabolite relations of 6 plants of genus Citrus with a given metabolite group (including 2 metabolites: Limonene and Cyclohexane). 1/0 indicates presence/absence of a metabolite in a plant.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Limonene | 1 | 1 | 1 | 1 | 1 | 1 |
| Cyclohexane | 0 | 1 | 1 | 1 | 1 | 0 |
Predicted unrecorded metabolites for 6 Citrus plants, encompassing 38 plant-metabolite relations.
| Species | Predicted unrecorded metabolites |
|---|---|
|
| Gibberellin A4; methyl salicylate; cyclohexane; |
|
| Methyl salicylate; citral; benzeneacetaldehyde; |
|
| Rhoifolin; isopropanol; methyl salicylate; citral; benzeneacetaldehyde; |
|
| Isoscutellarein 7,8-dimethyl ether; isoscutellarein 7,8,4′-trimethyl ether; |
|
| Gibberellin A81; gibberellin A9; isopropanol; citral; 6-demethoxytangeritin; tetramethylscutellarein |
|
| Apigenin 7-rutinoside; methyl salicylate; salvigenin; cyclohexane; benzeneacetaldehyde; |
Resulting confusion matrix from support vector machine (SVM) algorithm. 162 plants are labeled as edible (E), medicinal (M), timber (T), landscaping (L), and poisonous (P), and SVM model was constructed to classify them.
| M | E | T | L | P | Recognition rate [%] | |
|---|---|---|---|---|---|---|
| M | 81 | 0 | 0 | 0 | 0 | 100 |
| E | 1 | 47 | 0 | 0 | 0 | 97.9 |
| T | 6 | 0 | 8 | 0 | 0 | 57.2 |
| L | 8 | 1 | 0 | 5 | 0 | 35.7 |
| P | 4 | 1 | 0 | 0 | 0 | 0 |
Total: 162 plants. Accuracy: 87.0%.