Literature DB >> 31765369

Predicting Meridian in Chinese traditional medicine using machine learning approaches.

Yinyin Wang1, Mohieddin Jafari1, Yun Tang2, Jing Tang1,3.   

Abstract

Plant-derived nature products, known as herb formulas, have been commonly used in Traditional Chinese Medicine (TCM) for disease prevention and treatment. The herbs have been traditionally classified into different categories according to the TCM Organ systems known as Meridians. Despite the increasing knowledge on the active components of the herbs, the rationale of Meridian classification remains poorly understood. In this study, we took a machine learning approach to explore the classification of Meridian. We determined the molecule features for 646 herbs and their active components including structure-based fingerprints and ADME properties (absorption, distribution, metabolism and excretion), and found that the Meridian can be predicted by machine learning approaches with a top accuracy of 0.83. We also identified the top compound features that were important for the Meridian prediction. To the best of our knowledge, this is the first time that molecular properties of the herb compounds are associated with the TCM Meridians. Taken together, the machine learning approach may provide novel insights for the understanding of molecular evidence of Meridians in TCM.

Entities:  

Mesh:

Year:  2019        PMID: 31765369      PMCID: PMC6876772          DOI: 10.1371/journal.pcbi.1007249

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


Introduction

Single-agent drug discovery has often experienced low success rates which can be largely attributed to the lack of efficacy as well as unsatisfactory safety, especially when treating complex diseases such as cancer [1] and diabetes [2]. Recently, polypharmacology that involves multi-drug combinations acting on distinct targets has been proposed as a paradigm shift of drug discovery [3]. However, without a systems-level understanding of disease and drug interactions, it maintains a challenge to develop a valid strategy for the rational selection of drug combinations. In East Asia, plant-derived natural products, known as herb formulas, have been commonly used in Chinese Traditional Medicine (TCM) for disease prevention and treatment. Herb formulas often involve multiple bioactive components to produce synergistic effects in a personalized medicine manner, aiming for maximal therapeutic efficacy as well as minimal side effects [4]. For example, the Fufang Danshen Diwan (Dantonic pill), a botanical drug consisting of extracts of Danshen (Radix Salviae Miltiorrhizae) and Sanqi (Radix Notoginseng) is currently approved in 26 countries outside the USA for the treatment and prevention of chronic stable angina pectoris and other cardiovascular disease related conditions [5]. In this regard, understanding the bioactive components and their mechanisms of action for herb formulas might provide important insights on the rational design of multi-drug combinations for complex diseases [6, 7]. The prescription of herb formulas in TCM has been based on a holistic principle to model the human body as a miniature system that resemble the universe, which is composed of five interacting Elements (metal, wood, water, fire and earth) [8]. Similar to other schools of systems medicine, the cause of diseases or symptoms can be perceived as the loss of balance between these Five Elements [9, 10]. Treating a given disease is therefore equivalent to restoring the balance in the system [11], which can be achieved by either acupuncture [12, 13] or herb formulas that tune specifically certain inner channels of the body, known as Meridians [14]. There are 12 principal Meridians, each of which is linked to a specific TCM Organ and can be further attributed to one of the Five Elements (). The concept of Organ in TCM is fundamentally different from that of modern anatomic perspective, as the Organs in TCM represent certain distinct states of the human body, rather than a morphological structure. Similarly, although the Meridian system has been established as a fundamental basis of TCM several thousand years ago, it is not coincided to the known patterns of blood vessels or central nervous system [15]. More recently, fascia networks [16] and perivascular space [17] have been proposed to explain Meridian, but neither of them have been experimentally confirmed.

The Meridians and their example herbs.

Each Meridian is linked to a particular Organ which is characterized by its Elements and Quality of Yin or Yang. TCM considers a disease a result of loss of balance in the Yin and Yang, which can be restored using herbs that target particular Meridians. While the anatomical and physiological evidence of Meridians are yet to be determined, the narrative of TCM allows for the classification of herb formulas based on their targeting Meridians [18-20]. The rationale of Meridian has been investigated for a few TCM herbs. For example, Jie Geng (Platycodi Radix) has been considered as a Lung Meridian herb, and it was discovered recently that an active ingredient in Jie Geng called saponin can affect the lung and respiratory systems by the inhibition of lipid peroxidation [21]. Another example is Danshen, the dried root of Salvia miltiorrhiza burge, which has been used for treating cardiovascular diseases and hepatitis as a Heart and Liver Meridian herb [22]. Recent studies have shown that its lipophilic ingredients such as tanshinones and hydrophilic ingredients such as salvianic acids may play a synergistic role to achieve its therapeutic efficacy [23]. With the increasing knowledge about the biochemical and pharmacological properties of the bioactive ingredients from the TCM herbs, it is now possible to carry out a larger-scale analysis to investigate the molecular basis of Meridians and other concepts in TCM [24]. To leverage the complex biochemical and pharmacological datasets, systems biology approaches involving machine learning techniques have been utilized to the study of herb formulas [25]. For example, Cheng et al. proposed a network-based methodology that can identify clinically efficacious drug combinations for specific diseases, which might be potentially used to explain also the pharmacology of TCM herbs [26]. Fang et al. summarized various chemo-informatics, bioinformatics and systems biology resources for reconstructing drug–target networks of natural products [27]. Fu et al. developed a data clustering method using a collection of 2,012 compounds associated with TCM herbs and discovered that the hot or cold nature of the herbs can be correlated with the physicochemical and target pathways of their ingredient compounds [28]. Wang et al. collected 5,464 compounds for 115 herbs and applied an unsupervised clustering method called Self-organizing map (SOM) to establish a classifier of cold and hot herbs based on the chemical structural fingerprints of the compounds [29]. However, these machine learning studies focused only on the hot/cold classification of TCM herbs, while it remains unknown whether the Meridian classification that involves 12 major classes can be also predicted from the chemical structure and physiochemical features of ingredient compounds. In this study, we collected the Meridian information of herbs as well as the chemical structures of their ingredient compounds (). These two datasets were utilized to determine the molecular features including structure-based fingerprints and ADME properties. With the feature matrices determined at both the herb level and the compound level, we further developed a machine learning framework to predict the Meridians of the herbs and their ingredient compounds. We tested multiple machine learning methods and showed that the classification of Meridians can be predicted especially at the compound level. These results suggested that Meridians indeed are associated with the molecular properties of herb compounds. We expected that our data integration approach may represent a novel perspective for the understanding of Meridian, which may ultimately lead to a more systematic exploration of the mechanisms of TCM.

Workflow of the study.

Herb-compound network shows the associations between herbs (green rectangles) and their active compounds (purple circles), which were used to determine the Herb-Feature and the Compound-Meridian matrices from the Herb-Meridian and Compound-Feature matrices. The features of herbs and compounds were determined from the chemical fingerprints and ADME properties. Machine learning methods were utilized to predict the Meridian classes for herbs and compounds respectively, by parameter optimization, model selection and feature selection.

Results

Distribution of Meridians at the herb level and the compound level

In total, 646 herbs including 10,053 ingredient components with Meridian and chemical structure information were obtained from the TCMID database (). The Meridian distribution at the herb and the compound levels can be seen in . At the herb level, altogether 333 herbs target the Liver Meridian, followed by Lung (n = 237), Stomach (n = 235), Spleen (n = 213), Kidney (n = 181), Heart (n = 155) and Large Intestine (n = 111) (). In contrast, much less herbs are found for the other five Meridians including Bladder (n = 57), Gallbladder (n = 33), Small Intestine (n = 24), Cardiovascular (n = 4) and Three End (n = 4). To avoid the over-interpretation of machine learning models on unbalanced datasets, we focused on the top seven abundant Meridians including Liver, Lung, Spleen, Stomach, Kidney, Heart and Large Intestine ().

Herb-Meridian and Compound-Meridian distributions.

(A-B) The color bars at the bottom left represent the frequency of herbs or compounds for each of the seven major Meridians, which can be further collapsed into subclasses depending on whether an herb or a compound is shared by one or several Meridians. The vertical bars show the frequency of herbs or compounds for a particular subclass of Meridian combination, as indicated by the connected lines below the x-axis between the Meridians. (C-D) The Jaccard coefficients between the Meridian pairs at the herb and the compound levels. The size of blue circles on the upper diagonal shows the degree of the similarity. As expected, the majority of herbs (n = 580; 89.8%) target more than one Meridian, however, there is a varying degree of overlap between them. It can be seen that Kidney and Liver has the biggest number of shared herbs (n = 51), followed by 36 herbs that are common between Liver and Heart, and then 30 herbs between Liver and Stomach. The overlap between the Meridians illustrates the multi-target characteristics of TCM herbs. For example, Huo Xiang (Agastache rugose) belongs to Lung, Spleen and Stomach simultaneously [30], as this herb is known to relieve the symptoms of Lung, Spleen and Stomach diseases [31]. On the other hand, there are relatively fewer herbs that target only one Meridian. For example, 42 of the 384 (11%) Liver herbs are classified exclusively as Liver herbs and 26 of all the 260 (10%) Lung herbs do not target other Meridians. In contrast, all the herbs that belong to Stomach, Spleen and Large Intestine also target other Meridians. At the compound level, similar patterns was observed, where the Liver Stomach and Lung are again the top abundant Meridians (). In order to quantify the overall similarity between these seven major Meridians, we calculated the Jaccard coefficients using the R package ‘Corrplot’ [32, 33]. The Jaccard coefficient, also known as Jaccard index, is a measure of overlap between two sets, with a value of zero for complete non-overlap while a value of one for identical sets [34, 35]. As shown in , the Jaccard coefficients between the Meridians are generally low, with the lowest score found between Heart and Large Intestine (0.04 at the herb level and 0.14 at the compound level), and the highest score found between Spleen and Stomach (0.31 at the herb level and 0.42 at the compound level). The average pairwise Jacaard coefficients are 0.15 and 0.26 for the herb level and for the compound level respectively, indicating that there are weak correlations between Meridians in term of the herb and compound distributions. Therefore, we considered the prediction of each Meridian separately in the following machine learning tasks. Ultimately, for a given new herb or a compound, its Meridians can be predicted using the best machine learning models.

Prediction accuracy of Meridians using machine learning approaches

We carried out the prediction of the seven major Meridians at two data levels including herb level and compound level, for which their features were determined based on structure-based fingerprints and ADME properties. At the herb level, the ADME properties were also utilized to filter out those compounds with low water solubility or low gastrointestinal absorption (see Materials and Methods for more details). As a result, only 583 herbs remained after the filtering, covering 4,922 compounds. We evaluated the prediction performance under scenarios of different machine learning methods, feature types and data levels. More specifically, for each one of the seven Meridians, 84 machine learning-based models were constructed including all possible combinations from the four machine learning methods (SVM, DT, RF and kNN), seven feature configurations (Ext, PubChem, Sub, MACCS, ADME, Ext + ADME and All fingerprints + ADME) and three data levels (compound level, herb levels with or without ADME filtering). The model was trained by a five-fold cross validation using 70% data and then tested for its prediction accuracy using the remaining 30% data (see Materials and Methods for more details). To benchmark the model performance for each Meridian, we permutated the Meridian labels while keeping the ratio of positive and negative cases unchanged. The model performance for the permutated data was considered as the baseline. As shown in , all the major Meridians achieved the top Balanced accuracy close to 0.65. Note that we pooled all the 84 machine learning models that differ in their feature combinations and machine learning methods, some of which were sub-optimal and therefore led to poorer prediction results. Still, these machine learning models performed significantly better than the baseline prediction of permutated models, in terms of Balanced accuracy and Matthews coefficient (, p-value < 0.0001, Wilcoxon rank-sum test). These results supported the general feasibility of using machine learning approaches to relate chemical information of herbs and compounds to explain Meridians ().

Evaluation of the machine learning model predictions.

(A) The overall Balanced accuracy for the seven Meridians. Dashed line indicates the level of 0.65. (B) The Balanced accuracy at the three data levels (compound-level, herb-level before and after ADME filtering). (C) The balanced accuracy for the four machine learning methods at the compound level. (D) The balanced accuracy for the ADME and fingerprint feature types at the compound level. Wilcox rank sum test. *: p < 0.05; **: p < 0.01; ***: p < 0.001; ****: p < 0.0001. Furthermore, using the Balanced accuracy metric, we found that the compound-level prediction performed significantly better than the herb-level predictions (, p-value < 0.001, Wilcoxon rank-sum test). The same trend has been observed by the AUROC and AUPRC metric ( and , respectively). At the herb level, filtering out compounds with poor ADME properties improved the prediction significantly in Heart and Stomach (p-value < 0.05, Wilcoxon rank-sum test), while for Kidney, Lung and Spleen only the top machine learning models achieved higher prediction accuracy. In contrast, the ADME filtering seemed not helping the prediction of Large Intestine and Liver Meridians. In order to determine the chemical fingerprint features for an herb, we took the average of its compound features, based on the assumption that all the ingredient compounds are equally contributing to the pharmacology of the herb. This was likely an oversimplification of the actual mechanisms of action for a majority of herbs. However, the biological roles about the ingredient compounds were largely missing from TCMID and other resources, suggesting that the actual contributions of these ingredient compounds have not been thoroughly resolved. In contrast, the compound-level data was more reliable, as each compound was treated independently when determining its molecular features and Meridians. This may explain the superior performance of compound-level predictions compared to the herb-level predictions. We anticipated that the herb-level prediction may be further improved when the actual composition and bioactivity of the compounds can be determined using modern high-throughput techniques e.g. mass spectrometry or HPLC (High performance liquid chromatography) [36]. As the compound-level prediction showed better performance than the herb-level prediction, we further compared the prediction accuracy between different machine learning methods at the compound level. As shown in , top models of RF performed better than kNN, DT and SVM across all the seven Meridians, suggesting that RF was able to detect the predictive features due to the use of ensemble learning techniques. We also evaluated the prediction accuracy of the machine learning methods using different feature types. As shown in , models with the different fingerprint types resulted in similar performance, while Ext and PubChem fingerprints achieved the top Balanced Accuracy (0.67 and 0.66, respectively). This result was expected as the Ext fingerprint and PubChem fingerprint contains 1024 bits and 881 bits, respectively, which are the longer than MACCS (166 bits) and Sub (307 bits) fingerprint types. Furthermore, models using all the fingerprint types combined with ADME achieved higher top accuracies, compared to the use of ADME alone (). Taken together, we concluded that the combination of all fingerprints with ADME features may carry the most comprehensive information to predict the Meridians at the compound level, for which the RF method achieved the best balanced accuracy compared to other machine learning methods ().

Important fingerprint and ADME features to explain Meridian at the compound level

After determining RF as the best model, we determined the feature importance score according to its contribution to the change of model prediction accuracy at the compound level: if the removal of a feature resulted in a much worse prediction by the model, then the feature will be given a higher importance score. We selected the top 30 most important features for each Meridian, resulting in 59 unique features in total, including 27 ADME properties and 32 fingerprints. We confirmed that the 59 important features were significantly more predictive than the other features across all the seven Meridians (p < 0.0001, Wilcoxon rank-sum test), with the median importance score for these 59 top features ranging from 2.77 for Large Intestine to 6.4 for Spleen ().

Important features determined at the compound-level prediction of Meridian.

(A) The distribution of importance scores for the top 59 features as compared to all features. (B-C) The bi-clustering of the importance scores for the 27 ADME features and 32 fingerprints. To evaluate the top features across the Meridians, we generated the bi-clustering heatmaps for the top ADME and fingerprint features separately. As shown in , lipophilicity features including iLOGP, WLOGP, MLOGP are among the top ADME features across all the seven Meridians, with the mean Z-score of feature importance of 1.66, 0.74 and 0.67, separately. This suggested that lipophilicity plays important roles for the Meridian classification of compounds. Molar refractivity (MR), a measure of the total polarizability of a substance, was identified as another important feature (mean Z-score 0.96). In addition, Solubility features predicted by the multiple methods using SwissADME have also shown relatively higher importance, with mean Z-scores ranging from 0.92 to 1.14. Lipophilicity is known to affect pharmacokinetic properties and the overall suitability of drug candidates [37]. Molar refractivity and Solubility are known to play important roles for the absorption and subsequent bioavailability of a drug in vivo. Our results suggest the rationale of including the ADME evaluation for understanding the pharmacology and pharmacokinetics of ingredient compounds in herb medicine. We also evaluated the importance scores of the chemical fingerprints. As shown in , the fingerprint features from the same types tend to cluster together, with a Rand Index of 0.66 when comparing the similarity between the clustering by cutting the hierarchical tree at 1.5 and their actual feature types [38]. For example, the most important fingerprint features for Stomach Meridian formed a cluster (Cluster I in ), which consisted of mainly Ext fingerprint features (Ext169, Ext483, Ext157 and Ext1016); The most important fingerprint features for Kidney are PubChem fingerprint features (PubChem228, PubChem189, PubChem839 and PubChem860) (Cluster II). Similar patterns were also found for Spleen (Cluster III as an Ext fingerprint dominant cluster) and for Lung (Cluster IV as a MACCS fingerprint dominant cluster). In general, the importance scores for the Ext fingerprints were higher among all the four fingerprint types (), which is also consistent with the better machine learning performances of Ext fingerprints described earlier in section 3.2 (). Finally, we determined the important substructure fragments based on the top fingerprints. As shown in , the representative fragments for each Meridian are quite different from each other, which is in line with the limited overlap of herbs between the Meridians (). This result indicates that there might be enrichment of basic chemical structures that differs between Meridians, which can be further explored using pharmacophore modeling approaches [39].

Discussion

Traditional Chinese Medicine (TCM) has gained increasing popularity in the drug discovery field, as shown by a few successful examples including the discovery of artemisinin for treating malaria and arsenic trioxide for treating acute promyelocytic leukemia [40]. Currently, there are around 1000 clinical trials on TCM herb medicine registered in the Clinicaltrials.gov [41] (retrieved in January, 2019), suggesting that the therapeutic potential of TCM has been actively researched through more rigorous scientific investigation. While the TCM theory is largely self-consistent as a philosophical narrative, the scientific rationale of why and how it is working remains elusive. For example, the interpretation of five elements and qi is rather metaphysical than physical, which makes many of the TCM concepts difficult to be translated into modern physiological and medical entities [9]. Furthermore, TCMs usually involve many active compounds that modulate various biological targets, where little is known about how these interactions lead to therapeutic relevance under a specific disease context. With the development of molecular profiling technologies, the extraction and characterization of the herb constituents is now possible and is expected to provide a comprehensive source of pharmacology data. Therefore, there have been strong needs for data integration to deconvolute the mechanisms of action of herb medicine in relation to the disease biology, so that a formal framework for testing and understanding of TCM can be established [42]. In this study, we built a computational framework to study the concept of Meridians, which has been long established for the classification of TCM herbs and thus constitutes the fundamental basis of treatment strategy in TCM. We collected the Meridian information for major TCM herbs and determined their features based on the chemical fingerprints and ADME properties. We found that an herb is commonly classified into multiple Meridians and that the correlations between them were generally low (). Therefore, we decided to apply the one-vs-the-rest strategy to build classifiers for each meridian separately. Using supervised classification methods including Random Forests, Support Vector Machines, Decision Trees and K-Nearest Neighbor algorithms, we showed that the Meridians can be accurately predicted especially at the compound level, with a top balanced accuracy of 0.67 (; ). Therefore, we concluded that molecular features of the compounds can be considered as the essential information for an herb to be classified as a particular Meridian. In particular, we showed that the ADME properties improved the prediction accuracy, suggesting the relevance and reliability of the in-silico predicted ADME properties for the understanding of Meridians. For example, we found that Random Forests utilizing ADME features alone produced an AUPRC ratio of 2.29 for Large Intestine, topping the other Meridians, suggesting that indeed there is an evidence that ADME properties tend to be more predictive for this Large Intestine (). Ideally, experimentally-validated ADME properties for the ingredient compounds would be needed to confirm the prediction results. Furthermore, we considered 36 ADME features that were determined by SwissADME, assuming that TCM herb compounds become active when absorbed in the bloodstream. However, the therapeutic efficacy of herb medicine may be induced on gut microbiota, which do not necessarily interact with the bloodstream [43]. More relevant factors that may affect the ADME of herb medicine are expected to enhance the model prediction results. For example, another popular tool called admetSAR has been recently updated, which can provide 47 models for a more comprehensive evaluation of ADME [44, 45]. On the other hand, we evaluated four major structure-based fingerprint types, and found that their performances were similar. Despite that certain fingerprint types contain more bits than the others (e.g. 1024 bits for Ext fingerprint as compared to 166 bits for MACCS fingerprint), it seemed that all of them captured the essential structural information of TCM herbs and compounds. We found that the compound-level prediction is in general more accurate than the herb-level prediction. There might be three reasons for that. Firstly, the exact compound composition for a given herb might not be accurate, as the extraction and detection of active components from herb medicine remains a challenge [46]. Secondly, even though certain compounds can be detected from a given herb, they may not be absorbable due to their poor ADME properties. As a result, the features that were determined for these compounds may play no therapeutic roles and thus do not affect the Meridian of the herbs. Thirdly, although the same compounds can be found from different herbs, their actual abundance may differ. In our construction of binary herb-feature matrix, there is lack of information to differentiate the different levels of compound abundance and their bioactivity. We expected the prediction accuracy at the herb level can be improved, providing that more accurate compound composition and activity data become available. In our modeling framework, the extraction of key features at the herb level can be done easily by first extracting the key features at the Compound level, and then combining them for a particular herb, using the Compound-Feature matrix and Herb-Compound matrix. With this framework, we may predict not only the Meridian for new herbs, but also for approved synthetic compounds for which their disease indications are already known. The link between Meridian and disease indications may provide more physiological understanding of Meridian. We identified that Random Forest (RF) as the best classification method, corroborating the superior performance of RF in similar machine learning tasks [47]. As an Ensemble Learning method, RF averaged the predictions from multiple decision trees and thus lowered the risk of overfitting. In the future, more advanced machine learning methods such as Deep Learning may be worth trying [48]. To make sense of TCM, the ultimate objective is not only a predictive model but also an interpretable model that can help understand the underlying mechanisms of action. Here, we identified the predictive features that may provide initial evidence for the molecular basis of Meridians, which may facilitate the discovery of novel active compounds from TCM herbs. As the main focus of our work is to provide the first evidence that machine learning approaches are feasible for interrogating the concepts of Meridians, we have not evaluated other more advanced methods including artificial neural networks. By further improving the knowledge of active ingredients for TCM herbs and the accuracy of machine learning algorithms, we expected that the machine learning framework can be greatly expanded towards a more systematic understanding of Meridians as well as other concepts in TCM. TCMID is currently the largest database of TCM that collects over 49,000 prescriptions including 8,159 herbs and 25,210 ingredients. However, the majority of these herbs are lack of appropriate annotation on their Meridian information, highlighting the limited understanding of the topic. We extracted a subset of herbs from TCMID (n = 646) with known Meridian information and then included their ingredient compounds with known chemical structures (n = 10,053), with which the most predictive machine learning models and features were determined. To be able utilize our machine learning framework to predict the unknown Meridian for a given herb, the structural information of its ingredient compounds need to be provided as input data. With the structural information, it is then possible to determine the fingerprint and ADME features. In the future, we envisage that more comprehensive structural information about the active ingredients in herbs can be determined, so that the Meridian annotation of herbs can be done more systematically and more accurately. The advanced machine learning approaches that are tailored for analyzing such complex datasets may hold the key to the understanding of TCM rationale, which may ultimately provide novel insights for drug discovery and disease treatment [39].

Materials and methods

The entire workflow of the present study was illustrated in . First, herbs and their ingredient compounds were extracted from public databases. Molecular fingerprints and ADME properties were determined based on the chemical structures of the ingredient compounds, and were used to construct an Herb-feature matrix and a Compound-Meridian matrix. After obtaining all the features and Meridian classification for the herbs and the compounds, the prediction of Meridians at the herb and compound levels was implemented using four machine learning methods, including Support vector machine (SVM) [49], Decision tree (DT) [50], Random forests (RF) [51, 52] and K-nearest neighbor (kNN) [53]. The predication performance was further evaluated by cross-validation, based on which we identified the best models and feature types to predict the Meridians. The most predictive fingerprint features and ADME properties were identified for each Meridian separately.

Data collection

Meridian and ingredient compound information for TCM herbs

We extracted the information of TCM herbs including the Meridian and the chemical components from the newly published database called TCMID [54], which is the largest database of TCM with over 49,000 prescriptions including 8,159 herbs and 25,210 ingredients. However, not all the herbs were included in our data analysis. As the aim of the study was to predict the Meridians based on the structural fingerprints of the herb ingredients, we focused on the herbs with known Meridian information from TCMID. Furthermore, for each herb we included only those ingredient compounds with known SMILES information, such that their structural fingerprints and ADME properties can be determined. The herbs with missing Meridian as well as missing chemical structure information of their ingredient compounds were discarded in this study. The curated dataset contained 18,140 herb-compound pairs including 646 herbs and 10,053 ingredient compounds.

Chemical structural fingerprints for the ingredient compounds

The canonical SMILES representations for the compound structures were determined using Open Babel [55]. We used the PaDEL-Descriptor software [56] to encode SMILES into a list of binary fingerprint features that indicate whether a particular substructure is present or absent in the compound. We considered four common fingerprint types including PubChem [57], MACCS (Molecular ACCess System) [58], Substructure (Sub) [59] and Extended fingerprint (Ext) [60]. PubChem fingerprint was extracted from the PubChem database (n = 881 bits) while MACCS fingerprint was originated from the cheminformatics system provided by the MDL company (n = 166 bits). Substructure fingerprint was used to represent the specific substructures based on SMARTS Patterns for Functional Group Classification (n = 307 bits) [59, 61]. Extended fingerprint complements the Substructure fingerprint with additional bits describing circular topological features (n = 1024).

ADME properties for the ingredient compounds

ADME properties play important roles to determine the pharmacokinetics of a compound, constituting the key factors that determine the hit and lead optimization processes in drug discovery. ADME properties describe how a compound deposits inside the human body in terms of the processes of absorption, distribution, metabolism and excretion. For instance, water solubility, usually measured as the decimal logarithm of solubility (log S) in the units of mol/l or mg/ml, indicates the maximum dissolvable concentration of a compound in water. After oral administration, a drug reaches the initial portion of the gastrointestinal tract, where the level of gastrointestinal absorption affects the fraction of the drug dose that enters the bloodstream. Lipophilicity, on the other hand, represents the affinity of a compound in a lipophilic environment and thus determines how easily the compound can pass through the lipid membrane of cells. For the TCM herbs, the ADME properties for their ingredient compounds have been largely uncharacterized. Therefore, we resorted to computational methods as an alternative, which have been shown previously to be able to reliably and efficiently determine ADME. For example, the Lipinski’s Rule-of-five has been long used for evaluating the bioavailability based on the structure information of compounds [62]. Classical QSAR (Quantitative Structure-Activity Relationship) approaches also rely heavily on computational prediction of bioactivity properties based on the compound structures [56]. We determined the ADME properties of the ingredient compounds using an online tool SwissADME [63]. In the original publication, the authors of SwissADME showed that the prediction of Lipophilicity achieved an accuracy of r (correlation) = 0.72, MAE (Mean absolute error) = 0.89 and RMSE (root mean square error) = 1.14 against experimental data for 11,993 compounds. SwissADME also showed superior performance on the water solubility prediction with R2 (coefficient of determination) of 0.75, 0.69 and 0.81 based on three different models including the FILTER-IT model [63], the ESOL model [64] and the Ali model [65]. Notably, SwissADME has been recently applied to the study of plant-derived compounds including anticancer polyphenols from Syzygium alternifolium [66], PTPN1 (protein tyrosine phosphatase non-receptor type 1) inhibitors from several plant extracts [67] and a TCM called Zhi-zhu Wan [68]. Therefore, we considered the use of SwissADME as a reliable method to probe the ADME properties for TCM herb compounds. The SMILES of each compound was loaded as input to SwissADME, and the result consisted of 36 ADME features including 6 drug likeness features, 5 lipophilicity features, 4 medicinal chemistry features, 9 pharmacokinetics features, 9 physicochemical properties and 3 water solubility properties ().

Construction of Compound-feature matrix and Herb-feature matrix

In this study, the features of a compound were considered as the combination of its fingerprint and ADME features, including 2378 fingerprint features (1024 Ext bits, 881 PubChem bits, 307 Sub bits and 166 MACCS bits) and 36 ADME property features. The four fingerprint types (Ext, PubChem, Sub and MACCS) were first evaluated separately in the machine learning models to determine the best fingerprint type. Then, we combined this best fingerprint type with the ADME features to check whether model performance can be further improved. The resulting Compound-feature matrix XC contained 10,053 rows of compounds and 2,414 columns of features. Based on a previous study, a drug combination’s molecular features can be represented by merging the features of its component drugs [69]. We considered also an herb as a mixture of different ingredient compounds, and determined the herb features as below: Let C = (c1,c2,…,c) denote the set of ingredient compounds for herb j, where k is the number of compounds. For each compound, its compound feature vector is denoted as Fcompound = (f1,f2,…,f), where n is the number of features. We modelled the herb feature Fherb = (g1,g2,…,g) as the average of its compound features, i.e. We collected 646 herbs and determined 2414 features including 2378 fingerprints and 36 ADME properties for their ingredient compounds. The Herb-feature matrix (HF) thus was size of 646x2414: Furthermore, to evaluate whether filtering out the compounds with poor ADME properties affects the model prediction, we removed compounds that were predicted with logS lower than -6 by all the three water solubility models (the FILTER-IT model [63], the ESOL model [64] and the Ali model [65]) as well as low gastrointestinal absorption below 30%, which was a commonly accepted threshold to separate well-absorbed from poorly- absorbed compounds. After the filtering, 583 herbs and 4922 compounds were retained. We compared the model prediction accuracies before and after the ADME filtering.

Construction of Herb-Meridian matrix and Compound-Meridian matrix

TCM herbs can be assigned to one or more of the 12 Meridians as shown in . For each herb, its Meridian vector is denoted as Mherb = (m1,m2,…,m12). From the 646 herbs that we collected from TCMID, the Meridian classification for the herbs was represented as a binary Herb-Meridian matrix (HM) for the 12 Meridians as below: We denoted that H = (h1,h2,…,h) is a set of p herbs that contain the compound j. The Meridian vector for this compound Mcompound = (l1,l2,…,l12) was determined as the union of the Meridians of the herbs in H, i.e. where I(∙) is an indicator function. The full Compound-Meridian (CM) matrix was constructed accordingly for the 10,053 compounds on the 12 Meridians:

Training the machine learning models

We set up the machine learning framework for each Meridian with binary response variables. Four supervised classification methods including SVM, DT, RF and kNN [70] were employed to predict the Meridians. These methods were implemented using the R package caret [71], with the default parameters listed in . SVM is an algorithm which can determine a hyper plane to maximize the separation between the classes with minimal error. DT constructs a decision tree by representing an observation as a branch node and its classification result by a leave node. kNN is a distance-based learning algorithm where an object is classified according to a majority vote of its neighbors. RF is a decision tree-based ensemble learning approach where each tree votes for its preferred classification and the majority vote classification returns as the final prediction. We used five-fold cross validation to avoid overfitting when evaluating the model performance. Initially the data was split randomly to the training (70%) and testing (30%) sets. A five-fold cross-validation was applied to split the training data randomly into five equally sized folds. At each iteration, one unique fold was hold out while the remaining four folds were used to train a machine learning model. The model performance was then evaluated on the hold-out fold. Such a process was repeated five times, after which the model that produced the highest accuracy was selected as the best model to predict the testing set, which comprise 30% of the total data. The model performance on the independent testing set was reported. The R scripts and input data for the machine learning framework are publically accessible at https://github.com/herb-medicne/meridian-prediction.

Evaluating the prediction accuracy

We obtained a confusion matrix to evaluate the prediction accuracy for the test data. To avoid the inflated overall accuracy for imbalanced data, Balanced accuracy was also used to evaluate the performance of models, which is the average of sensitivity and specificity: True positive (TP) is the number of positive samples (i.e. herbs or compounds) which are correctly identified for a given Meridian; False positive (FP) is the number of positive samples which are not correctly identified. True negative (TN) is the number of negative samples which are correctly identified and false negative (FN) is the number of negative samples which are not correctly identified. Furthermore, Matthews correlation coefficient (MCC) and the Area Under the Receiver Operating Characteristic curve (AUROC) were also utilized for the model evaluation, defined separately as: and The true positive rate (TPR) and false positive rate (FDR) were defined as and for a given classification threshold t, where f1(x) and f0(x) are the probability density functions for the predicted score for an instance if it belongs to positive and negative class, separately. Similarly, we evaluated the Area Under the Precision Recall curve (AUPRC) to focus on the prediction accuracy of positive cases.

Identification of key features for the prediction of Meridians at the compound level

To find the most important features which play important roles for the Meridian classification, we used the varImp package [72] to estimate the variable importance based on the best models. Furthermore, the SARpy [73] tool was employed to detect key substructures (fragments) that emerge the most frequently as important features when predicting a specific Meridian. SARpy evaluates the significance of each substructure based on the likelihood ratio: , where TP and FP stand for the number of compounds which contain the substructure and belong, or do not belong to the Meridian, respectively. We selected the top ten important substructures ranked by the likelihood ratio score for each Meridian. These substructures can be therefore considered as the most frequent fragments among the compounds of a specific Meridian. Balanced Accuracy (A) and Matthews correlation (B) for all the machine learning methods on the real data as compared to permutated data at the compound and herb levels. ****: p-value < 0.0001. (TIF) Click here for additional data file.

Evaluation of the machine learning model predictions by AUROC (The area under the receiver operating characteristic curve).

(A) The overall AUROC for the seven Meridians. (B) The AUROC at the three data levels (compound-level, herb-level before and after ADME filtering). (C) The AUROC for the five machine learning methods at the compound level. (D) The AUROC for the ADME and fingerprint feature types at the compound level. Wilcox rank sum test. *: p < 0.05; **: p < 0.01; ***: p < 0.001; ****: p < 0.0001 (TIF) Click here for additional data file.

Evaluation of the machine learning model predictions by AUPRC ratio, defined as the actual AUPRC divided by the baseline of random prediction.

(A) The overall AUPRC ratio for the seven Meridians. (B) The AUPRC ratio at the three data levels (compound-level, herb-level before and after ADME filtering). (C) The AUPRC ratio for the five machine learning methods at the compound level. (D) The AUPRC ratio for the ADME and fingerprint feature types at the compound level. Wilcox rank sum test. *: p < 0.05; **: p < 0.01; ***: p < 0.001; ****: p < 0.0001. (TIF) Click here for additional data file.

The importance scores grouped by the feature types according to Random Forest predictions for the seven Meridians at the compound level.

(TIF) Click here for additional data file.

The Meridians and other TCM annotations for the 646 herbs.

(XLSX) Click here for additional data file.

The numbers of positive and negative samples for each Meridian at the herb and the compound levels

(XLSX) Click here for additional data file.

The prediction performances for the combinations of data levels, feature types and machine learning methods.

(XLSX) Click here for additional data file.

Top 30 important ADME features, fingerprint bits and important substructure fragments for each Meridian determined at the compound level.

(XLSX) Click here for additional data file.

The 36 ADME properties based on the chemical structure of compounds.

(XLSX) Click here for additional data file.

Parameters of the machine learning models.

(XLSX) Click here for additional data file. 12 Aug 2019 Dear Dr Tang, Thank you very much for submitting your manuscript 'Predicting Meridian in Chinese Traditional Medicine Using Machine Learning Approaches' for review by PLOS Computational Biology. Your manuscript has been fully evaluated by the PLOS Computational Biology editorial team and in this case also by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the manuscript as it currently stands. While your manuscript cannot be accepted in its present form, we are willing to consider a revised version in which the issues raised by the reviewers have been adequately addressed. We cannot, of course, promise publication at that time. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Your revisions should address the specific points made by each reviewer. Please return the revised version within the next 60 days. If you anticipate any delay in its return, we ask that you let us know the expected resubmission date by email at ploscompbiol@plos.org. Revised manuscripts received beyond 60 days may require evaluation and peer review similar to that applied to newly submitted manuscripts. In addition, when you are ready to resubmit, please be prepared to provide the following: (1) A detailed list of your responses to the review comments and the changes you have made in the manuscript. We require a file of this nature before your manuscript is passed back to the editors. (2) A copy of your manuscript with the changes highlighted (encouraged). We encourage authors, if possible to show clearly where changes have been made to their manuscript e.g. by highlighting text. (3) A striking still image to accompany your article (optional). If the image is judged to be suitable by the editors, it may be featured on our website and might be chosen as the issue image for that month. These square, high-quality images should be accompanied by a short caption. Please note as well that there should be no copyright restrictions on the use of the image, so that it can be published under the Open-Access license and be subject only to appropriate attribution. Before you resubmit your manuscript, please consult our Submission Checklist to ensure your manuscript is formatted correctly for PLOS Computational Biology: http://www.ploscompbiol.org/static/checklist.action. Some key points to remember are: - Figures uploaded separately as TIFF or EPS files (if you wish, your figures may remain in your main manuscript file in addition). - Supporting Information uploaded as separate files, titled Dataset, Figure, Table, Text, Protocol, Audio, or Video. - Funding information in the 'Financial Disclosure' box in the online system. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see here. We are sorry that we cannot be more positive about your manuscript at this stage, but if you have any concerns or questions, please do not hesitate to contact us. Sincerely, Alexander MacKerell Associate Editor PLOS Computational Biology Weixiong Zhang Deputy Editor PLOS Computational Biology A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately: [LINK] Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: This research presents a very interesting idea: using traditional machine learning algorithms and fingerprints of chemical compounds in herbs to predict herbs' Meridians. The manuscript is well organized and clearly written, and a large amount of computation is performed. However, I do see several major statistical issues. First, since an herb could belong to multiple Meridians as clearly stated by authors, this is a typical multiclass classification problem. However, this is totally ignored by the authors. From the method descriptions, they seem to use one-vs-rest approach. The authors need to explain why one-vs-rest is a good approach for this multiclass classification problem. Second, relevant to the first issue, neural network models could be used for this multiclass classification problem without such one-vs-rest approach. Have the authors tried any NN models? Did NN models perform poorly in comparison with these traditional algorithms? Third, apparently, the data is quite unbalanced. Machine learning models are generally very sensitive to "unbalancedness" of the training data. Authors did not discuss this at all. Correspondingly, numbers of positive and negative samples should be added in "Supplementary Table 3.xlsx". Fourth, for unbalanced data, AUC-ROC (the area under the receiver operating characteristic curve) is a widely used metrics; I highly recommend AUC-ROC numbers are calculated. Reviewer #2: Wang and colleagues proposed a comprehensive machine learning-based study to predict Meridian in Chinese Traditional Medicine. They integrated multiple types of molecular fingerprints and ADME properties of active compounds in herbs. They then evaluated four different machine learning algorithms by combing different types fingerprints and ADME properties, which is quite a novel and comprehensive insight. Some machine learning models reveal good accuracy in predicting herb-Meridian associations in cross validation. This is an impressive study which offers powerful machine learning-based approaches for evaluations of Meridian by Chinese Traditional Medicine. The main findings are well presented and the manuscript is well written. Several specific comments may help improve the manuscript further. 1. The reviewers appreciated that the authors collected large-scale herbs with specific ingredients from database. However, each active ingredient has different concentration across herbs. The authors use equal weight (concentration) for each ingredient for calculation of molecular fingerprints and ADME properties to build machine learning models. This limitation has to be well explained or discussed in the revised manuscript. 2. The authors only evaluated accuracy only for machine learning models. Several comprehensive indexes, such as AUC (area under ROC), and precision-recall curves should be added. 3. The authors integrated both molecular fingerprints and ADME properties for building models. However, the reviewer cannot find how they integrated ADME properties in Figure 1. 4. The authors systematically evaluated four different machine learning algorithms in this study. More details of parameters of machine learning models are suggested to provided. For example, which k used for kNN, which function (kernel or linear) used for SVM, how many trees used for Random forest, etc. The authors may get better performance if they optimize tree number in random forest models. 5. The authors calculated ADME properties using public tools. One popular tool, admetSAR should be discussed. 6. It is impressive that the authors found that RF model shows the best performance for large intestine as several key ADME properties are highly correlated with large intestine. Could the authors evaluate the performance of RF models on large intestine using ADME properties only. 7. Several key refs related to polypharmacy (10.1038/s41467-019-09186-x) and polypharmacology of natural products (doi: 10.1093/bib/bbx045) should be discussed. Overall, this is an interesting study, which offer powerful computational tools and models for systematic evaluation of Meridian-herb associations, an important, complex biomedical research question in traditional Medicine. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: yuhong wang Reviewer #2: No 26 Sep 2019 Submitted filename: PLoS_R1_response_letter_final_v2.docx Click here for additional data file. 20 Oct 2019 Dear Dr Tang, We are pleased to inform you that your manuscript 'Predicting Meridian in Chinese Traditional Medicine Using Machine Learning Approaches' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Once you have received these formatting requests, please note that your manuscript will not be scheduled for publication until you have made the required changes. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pcompbiol/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. One of the goals of PLOS is to make science accessible to educators and the public. PLOS staff issue occasional press releases and make early versions of PLOS Computational Biology articles available to science writers and journalists. PLOS staff also collaborate with Communication and Public Information Offices and would be happy to work with the relevant people at your institution or funding agency. If your institution or funding agency is interested in promoting your findings, please ask them to coordinate their releases with PLOS (contact ploscompbiol@plos.org). Thank you again for supporting Open Access publishing. We look forward to publishing your paper in PLOS Computational Biology. Sincerely, Alexander MacKerell Associate Editor PLOS Computational Biology Weixiong Zhang Deputy Editor PLOS Computational Biology Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The revision addressed my concerns and suggestions. The accuracy and ROC numbers remain low using the common machine learning standards. However, the relationship between the meridians and compounds is expected to be very complex, and these numbers may still be meaningful. Similar problems will likely occur more frequently as people try machine learning models to more complex biological phenomena. The thinkings behind traditional Chinese medicine are distinct from those in Western medicine, but in my opinion they complement each other well. Machine learning, in particular deep learning which could deal with more complex relationship, could be a powerful method studying complex biological systems such as herbal formulas. I have two suggestions which may be helpful for authors' future researches. First, to further test the significance of the model, you could use bootstrap. Basically, randomly assign meridians for the used samples, perform the same procedure, calculate the same accuracy numbers, and then compute confidence intervals. Such confidence numbers could be more convincing supports for these models. Second, as I said above, the relationship between meridians and compound structures is obviously very complex. From my experiences, deep learning models, if well constructed, could help even for the sample size of this study. Congratulations for this interesting work. Reviewer #2: The authors has addressed my concerns. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No 6 Nov 2019 PCOMPBIOL-D-19-01126R1 Predicting Meridian in Chinese Traditional Medicine Using Machine Learning Approaches Dear Dr Tang, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Matt Lyles PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol
Table 1

The Meridians and their example herbs.

Each Meridian is linked to a particular Organ which is characterized by its Elements and Quality of Yin or Yang. TCM considers a disease a result of loss of balance in the Yin and Yang, which can be restored using herbs that target particular Meridians.

Meridian nameQuality of Yin or YangMain OrganExample herb
Taiyin Lung Channel of HandGreater Yin (taiyin)LungRhizoma Pinelliae
Shaoyin Heart Channel of HandLesser Yin (shaoyin)HeartSalvia miltiorrhiza
Jueyin Cardiovascular Channel of HandFaint Yin (jueyin)CardiovascularMotherwort Herb
Hand's Minor Yang Three EndLesser Yang (shaoyang)Three EndCape jasmine fruit
Taiyang Small Intestine Channel of HandGreater Yang (taiyang)Small IntestineAdsuki Bean
Yangming Large Intestine Channel of HandYang Bright (yangming)Large IntestineRadix et rhizoma rhei
Taiyin Spleen Channel of FootGreater Yin (taiyin)SpleenPueraria Root
Shaoyin Kidney Channel of FootLesser Yin (shaoyin)KidneyRadix Angelicae Biseratae
Jueyin Liver Channel of FootFaint Yin (jueyin)LiverBupleurum chinense DC
Shaoyang GallbladderLesser Yang (shaoyang)Gall BladderSpica Prunellae
Taiyang Bladder Channel of FootGreater Yang (taiyang)Urinary bladderCommon Andrographis Herb
Yangming Stomach Channel of FootYang Bright (yangming)StomachRhizoma Cyperi
Table 2

The balanced accuracy that was achieved for each Meridian at the compound level by Random Forest using all the available features.

MeridianFeatureMethodBalanced Accuracy
HeartADME + All fingerprintRF0.65
KidneyADME + All fingerprintRF0.65
Large intestineADME + All fingerprintRF0.62
LiverADME + All fingerprintRF0.65
LungADME + All fingerprintRF0.64
SpleenADME + All fingerprintRF0.67
StomachADME + All fingerprintRF0.65
Table 3

The AUPRC ratio that was achieved for each Meridian at the compound level by Random Forest using ADME features only.

MeridianMethodAUPRC ratio
Large intestineRF2.29
HeartRF1.67
SpleenRF1.51
KidneyRF1.68
StomachRF1.40
LungRF1.45
LiverRF1.29
  54 in total

Review 1.  Meridian studies in China: a systematic review.

Authors:  Guang-Jun Wang; M Hossein Ayati; Wei-Bo Zhang
Journal:  J Acupunct Meridian Stud       Date:  2010-03

2.  Extended-connectivity fingerprints.

Authors:  David Rogers; Mathew Hahn
Journal:  J Chem Inf Model       Date:  2010-05-24       Impact factor: 4.956

3.  Proteomics and traditional medicine: new aspect in explanation of temperaments.

Authors:  Mohieddin Jafari; Hassan Rezadoost; Mehrdad Karimi; Mehdi Mirzaie; Mostafa Rezaie-Tavirani; Mahvash Khodabandeh; Gholamreza Kordafshari; Nafiseh Abbasian; Payman Nickchi; Kambiz Gilany; Alireza Ghassempour
Journal:  Forsch Komplementmed       Date:  2014-08-16

4.  A study on the antioxidant activity and tissues selective inhibition of lipid peroxidation by saponins from the roots of Platycodon grandiflorum.

Authors:  Xian-Jun Fu; Hong-Bing Liu; Peng Wang; Hua-Shi Guan
Journal:  Am J Chin Med       Date:  2009       Impact factor: 4.667

Review 5.  Salvia miltiorrhizaBurge (Danshen): a golden herbal medicine in cardiovascular therapeutics.

Authors:  Zhuo-Ming Li; Suo-Wen Xu; Pei-Qing Liu
Journal:  Acta Pharmacol Sin       Date:  2018-04-26       Impact factor: 6.150

6.  Herb network construction and co-module analysis for uncovering the combination rule of traditional Chinese herbal formulae.

Authors:  Shao Li; Bo Zhang; Duo Jiang; Yingying Wei; Ningbo Zhang
Journal:  BMC Bioinformatics       Date:  2010-12-14       Impact factor: 3.169

7.  Review of evidence suggesting that the fascia network could be the anatomical basis for acupoints and meridians in the human body.

Authors:  Yu Bai; Jun Wang; Jin-Peng Wu; Jing-Xing Dai; Ou Sha; David Tai Wai Yew; Lin Yuan; Qiu-Ni Liang
Journal:  Evid Based Complement Alternat Med       Date:  2011-04-26       Impact factor: 2.629

8.  Interlog protein network: an evolutionary benchmark of protein interaction networks for the evaluation of clustering algorithms.

Authors:  Mohieddin Jafari; Mehdi Mirzaie; Mehdi Sadeghi
Journal:  BMC Bioinformatics       Date:  2015-10-05       Impact factor: 3.169

9.  Herb-target interaction network analysis helps to disclose molecular mechanism of traditional Chinese medicine.

Authors:  Hao Liang; Hao Ruan; Qi Ouyang; Luhua Lai
Journal:  Sci Rep       Date:  2016-11-11       Impact factor: 4.379

10.  Gut microbiota-involved mechanisms in enhancing systemic exposure of ginsenosides by coexisting polysaccharides in ginseng decoction.

Authors:  Shan-Shan Zhou; Jun Xu; He Zhu; Jie Wu; Jin-Di Xu; Ru Yan; Xiu-Yang Li; Huan-Huan Liu; Su-Min Duan; Zhuo Wang; Hu-Biao Chen; Hong Shen; Song-Lin Li
Journal:  Sci Rep       Date:  2016-03-02       Impact factor: 4.379

View more
  9 in total

1.  To Explore the Mechanism and Equivalent Molecular Group of Fuxin Mixture in Treating Heart Failure Based on Network Pharmacology.

Authors:  Yi-Ding Yu; Yi-Ping Xiu; Yang-Fan Li; Yi-Tao Xue
Journal:  Evid Based Complement Alternat Med       Date:  2020-11-21       Impact factor: 2.629

2.  Research of insomnia on traditional Chinese medicine diagnosis and treatment based on machine learning.

Authors:  Yuqi Tang; Zechen Li; Dongdong Yang; Yu Fang; Shanshan Gao; Shan Liang; Tao Liu
Journal:  Chin Med       Date:  2021-01-06       Impact factor: 5.455

3.  IrGO: Iranian traditional medicine General Ontology and knowledge base.

Authors:  Ayeh Naghizadeh; Mahdi Salamat; Donya Hamzeian; Shaghayegh Akbari; Hossein Rezaeizadeh; Mahdi Alizadeh Vaghasloo; Reza Karbalaei; Mehdi Mirzaie; Mehrdad Karimi; Mohieddin Jafari
Journal:  J Biomed Semantics       Date:  2021-04-16

Review 4.  Practical Implementation of Artificial Intelligence-Based Deep Learning and Cloud Computing on the Application of Traditional Medicine and Western Medicine in the Diagnosis and Treatment of Rheumatoid Arthritis.

Authors:  Shaohui Wang; Ya Hou; Xuanhao Li; Xianli Meng; Yi Zhang; Xiaobo Wang
Journal:  Front Pharmacol       Date:  2021-12-23       Impact factor: 5.810

5.  Predicting the Associations between Meridians and Chinese Traditional Medicine Using a Cost-Sensitive Graph Convolutional Neural Network.

Authors:  Hsiang-Yuan Yeh; Chia-Ter Chao; Yi-Pei Lai; Huei-Wen Chen
Journal:  Int J Environ Res Public Health       Date:  2020-01-23       Impact factor: 3.390

Review 6.  Anticancer Plants: A Review of the Active Phytochemicals, Applications in Animal Models, and Regulatory Aspects.

Authors:  Tariq Khan; Muhammad Ali; Ajmal Khan; Parveen Nisar; Sohail Ahmad Jan; Shakeeb Afridi; Zabta Khan Shinwari
Journal:  Biomolecules       Date:  2019-12-27

7.  Assigning the Origin of Microbial Natural Products by Chemical Space Map and Machine Learning.

Authors:  Alice Capecchi; Jean-Louis Reymond
Journal:  Biomolecules       Date:  2020-09-28

8.  Research on the potential mechanism of Chuanxiong Rhizoma on treating Diabetic Nephropathy based on network pharmacology.

Authors:  Shanshan Hu; Siteng Chen; Zhilei Li; Yuhang Wang; Yong Wang
Journal:  Int J Med Sci       Date:  2020-08-21       Impact factor: 3.738

Review 9.  Machine Learning Applications in Drug Repurposing.

Authors:  Fan Yang; Qi Zhang; Xiaokang Ji; Yanchun Zhang; Wentao Li; Shaoliang Peng; Fuzhong Xue
Journal:  Interdiscip Sci       Date:  2022-01-23       Impact factor: 3.492

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.