| Literature DB >> 24688691 |
Farit M Afendi1, Naoaki Ono2, Yukiko Nakamura2, Kensuke Nakamura3, Latifah K Darusman4, Nelson Kibinge2, Aki Hirai Morita2, Ken Tanaka5, Hisayuki Horai6, Md Altaf-Ul-Amin2, Shigehiko Kanaya2.
Abstract
Molecular biological data has rapidly increased with the recent progress of the Omics fields, e.g., genomics, transcriptomics, proteomics and metabolomics that necessitates the development of databases and methods for efficient storage, retrieval, integration and analysis of massive data. The present study reviews the usage of KNApSAcK Family DB in metabolomics and related area, discusses several statistical methods for handling multivariate data and shows their application on Indonesian blended herbal medicines (Jamu) as a case study. Exploration using Biplot reveals many plants are rarely utilized while some plants are highly utilized toward specific efficacy. Furthermore, the ingredients of Jamu formulas are modeled using Partial Least Squares Discriminant Analysis (PLS-DA) in order to predict their efficacy. The plants used in each Jamu medicine served as the predictors, whereas the efficacy of each Jamu provided the responses. This model produces 71.6% correct classification in predicting efficacy. Permutation test then is used to determine plants that serve as main ingredients in Jamu formula by evaluating the significance of the PLS-DA coefficients. Next, in order to explain the role of plants that serve as main ingredients in Jamu medicines, information of pharmacological activity of the plants is added to the predictor block. Then N-PLS-DA model, multiway version of PLS-DA, is utilized to handle the three-dimensional array of the predictor block. The resulting N-PLS-DA model reveals that the effects of some pharmacological activities are specific for certain efficacy and the other activities are diverse toward many efficacies. Mathematical modeling introduced in the present study can be utilized in global analysis of big data targeting to reveal the underlying biology.Entities:
Year: 2013 PMID: 24688691 PMCID: PMC3962233 DOI: 10.5936/csbj.201301010
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Figure 1Integrated platform of knowledge of medicinal plants and plant and human –omics and KNApSacK Family databases. (A) The relations of attributes among individual DBs. (B) Main window of KNApSAcK Family DB, indexes from a to i in panel A correspond to those in panel B.
Studies that cite KNApSAcK Core DB.
| Article type | The purpose of study [References] |
|---|---|
| < 2006-2008 > | |
|
| Bridge between Chemistry and Biology [ |
|
| Metabolite accumulation caused by herbicidal enzyme inhibitors [ |
|
| Metabolome platform DrDMASS in FT-ICR-MS [ |
|
| Chemical biology [ |
| < 2009 > | |
|
| Integrated omics [ |
|
| Metabolic profiling in cold-temperature [ |
|
| Annotation of metabolite information to MS [ |
|
| Embedded string-search commands on MediaWiki [ |
| < 2010 > | |
|
| MS data processing [ |
|
| Metabolite composition [ |
|
| Chemical similarity search and substructure matching of compounds [ |
|
| MassBank, MS DB [ |
| < 2011 > | |
|
| Pesticide research [ |
|
| Hepatotoxicity [ |
|
| QTL informatics [ |
|
| Food phytochemicals [ |
| < 2012-13 > | |
|
| Plant responses to abiotic stress [ |
|
| Camptothecin biosynthesis [ |
|
| Repository for metabolomics studies [ |
|
| Metabolite annotation [ |
Figure 2Schematic diagram of the decomposition of both predictor and response blocks for: (a) PLS and (b) N-PLS model.
Figure 3Illustration of matricizing three-dimensional array (I x J x K) into matrix X (I x JK).
Figure 4A typical network illustrating connections between efficacy, herbal medicine, plant, and pharmacological activity of plant.
Distribution of Jamu and plant utilized in Jamu for each efficacy.
| Efficacy | Number of Jamu | Number of plants utilized in Jamu formulas |
|---|---|---|
| Urinary-related problems (URI) | 72 | 80 |
| Disorders of appetite (DOA) | 249 | 148 |
| Disorders of mood and behavior (DMB) | 22 | 47 |
| Gastrointestinal disorders (GST) | 980 | 290 |
| Female reproductive organ problems (FML) | 398 | 182 |
| Musculoskeletal and connective tissue disorders (MSC) | 840 | 270 |
| Pain and inflammation (PIN) | 311 | 183 |
| Respiratory diseases (RSP) | 107 | 105 |
| Wounds and skin infection (WND) | 159 | 120 |
Figure 5Biplot configuration based on PCA analysis of Jamu data. Plants and Jamu efficacies are represented as red points and blue lines, respectively.
Figure 6Clustergram of pharmacological activity against Jamu efficacy. The red and black cells indicate that the pharmacological activity is significant or non-significant, respectively, to the corresponding efficacy.
Confusion matrix of the prediction of Jamu efficacy using the PLS-DA model.
| Observed efficacy | Predicted efficacy | Total | % Correct | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| URI | DOA | DMB | GST | FML | MSC | PIN | RSP | WND | |||
| URI | 39 | 0 | 0 | 21 | 2 | 10 | 0 | 0 | 0 | 72 | 54.2 |
| DOA | 0 | 164 | 0 | 29 | 36 | 18 | 0 | 0 | 2 | 249 | 65.9 |
| DMB | 0 | 1 | 5 | 10 | 0 | 3 | 1 | 2 | 0 | 22 | 22.7 |
| GST | 3 | 17 | 0 | 880 | 12 | 46 | 9 | 6 | 7 | 980 | 89.8 |
| FML | 0 | 13 | 0 | 61 | 266 | 50 | 5 | 1 | 2 | 398 | 66.8 |
| MSC | 6 | 6 | 1 | 127 | 41 | 638 | 16 | 0 | 5 | 840 | 76 |
| PIN | 1 | 0 | 0 | 90 | 4 | 77 | 133 | 4 | 2 | 311 | 42.8 |
| RSP | 3 | 0 | 0 | 21 | 4 | 23 | 3 | 52 | 1 | 107 | 48.6 |
| WND | 2 | 3 | 0 | 57 | 11 | 11 | 4 | 0 | 71 | 159 | 44.7 |
| Total | 54 | 204 | 6 | 1296 | 376 | 876 | 171 | 65 | 90 | 3138 | 71.6 |
Number of significant plants for each efficacy.
| Efficacy | Total | Support from scientific paper | |
|---|---|---|---|
| URI | 20 | 15 | −75.00% |
| DOA | 21 | 20 | −95.20% |
| DMB | 12 | 6 | −50.00% |
| GST | 26 | 23 | −88.50% |
| FML | 40 | 30 | −75.00% |
| MSC | 40 | 39 | −97.50% |
| PIN | 39 | 37 | −94.90% |
| RSP | 36 | 33 | −91.70% |
| WND | 43 | 38 | −88.40% |