Literature DB >> 30365030

ETCM: an encyclopaedia of traditional Chinese medicine.

Hai-Yu Xu1,2, Yan-Qiong Zhang1, Zhen-Ming Liu3, Tong Chen2, Chuan-Yu Lv3, Shi-Huan Tang1, Xiao-Bo Zhang2, Wei Zhang1, Zhi-Yong Li4, Rong-Rong Zhou4, Hong-Jun Yang1, Xiu-Jie Wang5, Lu-Qi Huang2.   

Abstract

Traditional Chinese medicine (TCM) is not only an effective solution for primary health care, but also a great resource for drug innovation and discovery. To meet the increasing needs for TCM-related data resources, we developed ETCM, an Encyclopedia of Traditional Chinese Medicine. ETCM includes comprehensive and standardized information for the commonly used herbs and formulas of TCM, as well as their ingredients. The herb basic property and quality control standard, formula composition, ingredient drug-likeness, as well as many other information provided by ETCM can serve as a convenient resource for users to obtain thorough information about a herb or a formula. To facilitate functional and mechanistic studies of TCM, ETCM provides predicted target genes of TCM ingredients, herbs, and formulas, according to the chemical fingerprint similarity between TCM ingredients and known drugs. A systematic analysis function is also developed in ETCM, which allows users to explore the relationships or build networks among TCM herbs, formulas,ingredients, gene targets, and related pathways or diseases. ETCM is freely accessible at http://www.nrc.ac.cn:9090/ETCM/. We expect ETCM to develop into a major data warehouse for TCM and to promote TCM related researches and drug development in the future.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30365030      PMCID: PMC6323948          DOI: 10.1093/nar/gky987

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Traditional Chinese medicine (TCM) holds great potentials for health improvement as well as prevention and treatment of various diseases, especially complex diseases such as autoimmune disorders, cardiovascular diseases and cancers (1–3). TCM is also a great resource for modern drug research and development. Many TCM-derived drugs have shown remarkable effects in curing diseases, such as artemisinin, digitoxin, quinine and celastrol. The most recognized effect of TCM is the use of artemisinin-based remedies to treat malaria, which was awarded the Nobel Prize in Physiology and Medicine in 2015 (4). After that, growing attention has been attracted to TCM, which also brought the increasing needs for TCM related data resources. Herbs are the most commonly used substances in TCM. Over 11 000 herb plants have been recorded in various TCM related pharmacopeia, and the commonly used ones are ∼700 species. As TCM usually combines multiple herbs as formulas in disease treatments, thousands of herbal formulas have been generated and widely applied in clinics. In theory, TCM herbal formulas contain multiple effective ingredients, thus can simultaneously regulate many targets within cells, therefore to reestablish balanced physiological regulatory networks of patients and to cure diseases (5,6). Yet the molecular targets of most TCM formulas and ingredients still remain elusive, which becomes one of the biggest hurdles in the application of TCM and TCM-based drug discovery. The recent development of several TCM-related databases, such as HIT (7), TCMGeneDIT (8), TCM-MESH (9), TCM-ID (10), TCMSP (11), TCMID (12,13), have provided useful data and tools for TCM-based research and drug discovery. Yet most databases only focus on herbs and their components, thus the relationships between formulas and herbs/components are missing. None of these databases provide the habitat and quality control information of herbs, which is considered as a major factor for the effectiveness of TCM. In addition, the design of some databases are obsolete, with limited information and are difficult to use. Here, we present ETCM, an Encyclopedia of Traditional Chinese Medicine (http://www.nrc.ac.cn:9090/ETCM/), which includes multiple aspects of clinical and functional essential information on 403 TCM herb species, 3962 TCM formulas, 7274 herbal ingredients, 2266 validated or predicted drug targets, as well as 3027 related diseases. All these information is comprehensively linked to each other in the database and displayed with user friendly interfaces, which can serve as a valuable resource for TCM-related research and drug discovery.

DATA RESOURCES AND DATABASE CONTENTS

ETCM combines data from multiple sources and provides comprehensive information on the commonly used TCM herbs, herbal formulas, ingredients, as well as the predicted drug targets of ingredients, and their related diseases. The major contents of ETCM are summarized in Table 1.
Table 1.

Content statistics of ETCM

Data typeNumberResources
Herbs403The Fourth National Survey on Chinese Materia Medica Resources (http://www.zyzypc.com.cn/)
Herbs with quantitative information of marker ingredients 263 Pharmacopoeia of the People's Republic of China (2015 version)
TCM formulas3962China Food and Drug Administration (http://eng.sfda.gov.cn/WS03/CL0755/)
Formulas with quantitative information of marker ingredients 478 Pharmacopoeia of the People's Republic of China (2015 version)
Ingredients7274Manual literature retrievals and Pubchem (https://pubchem.ncbi.nlm.nih.gov/)
Ingredients with drug-likeness evaluation 7269 A quantitative estimate model of drug-likeness reported byBickerton group [Nat Chem. 2012 Jan 24;4(2):90-8.]
Drug target genes2266MedChem Studio (version 3.0; Simulations Plus, Inc., Lancaster, CA, USA, 2012)
TCM-related diseases3027Human Phenotype Ontology (HPO, http://human-phenotype-ontology.github.io/)
Online Mendelian Inheritance in Man (OMIM, http://www.omim.org/)
Database of gene-disease associations (DisGeNET, http://www.disgenet.org/web/DisGeNET/menu;jsessionid=c807m1cvnhyn1uc0dr111ox8c, v5.0)
ORPHANET (https://www.orpha.net/consor/cgi-bin/Disease.php?lng=EN)
Content statistics of ETCM

The herb information

The herb section contains the general and drug effectiveness-related information of 403 commonly used Chinese herbs, including the name, type, species, collection time, property, flavor, meridian tropism, indication and specification of herbs. These information was collected from the Pharmacopoeia of the People's Republic of China (2015 version). The pictures and habitat map of herbs were obtained from the Fourth National Survey on Chinese Materia Medica Resources (http://www.zyzypc.com.cn/), which was initiated by the State Administration of Traditional Chinese Medicine in China and carried out a large-scale investigation on Chinese medicinal resources in 1345 counties of China. The colored provinces on the habitat maps are locations where the herbs can grow in China. The ingredients of herbs were mainly extracted from published papers, as well as from ChEMBL (https://www.ebi.ac.uk/chembl/) and PubChem (https://pubchem.ncbi.nlm.nih.gov/).

The formula information

A total of 3962 TCM prescriptions approved by China Food and Drug Administration (CFDA) were collected and stored in the formula section of ETCM. These prescriptions are the commonly used TCM formulas for treating various diseases or maintaining healthy body conditions in clinical practice. The information of formula name, type, dosage form, composition, applicable indications and syndromes, as well as route of administration were obtained from the Pharmacopoeia of the People's Republic of China (2015 version) and Drug Standard of Ministry of Public Health of the People's Republic of China (1993 version). The external links to CFDA are provided to search more detailed drug information by clicking the formula name.

Quality standard of herbs and TCM prescriptions

To facilitate quality evaluation of TCM, the quantitative standards of marker ingredients of herbs and formulas are provided in the herb and formula sections of ETCM. The quantitative standards are set according to the Pharmacopoeia of the People's Republic of China (2015 version), which are the official TCM quality evaluation standards in China.

The ingredient information

The ingredients of herbs and formulas reported in ETCM were collected manually according to the Pharmacopoeia of the People's Republic of China (2015 version) and other literatures. For each ingredient, its name, molecular formula, molecular weight, 2D structure, partition coefficient (ALogP), distribution coefficient (LogD), molecular solubility, molecular volume, surface area and polar surface area, as well as the number of rotatable bonds, H acceptors and H donors are included. These physico-chemical properties were calculated using the Pipeline Pilot software (version 7.5). External links to Public database sources, such as ChEMBL (https://www.ebi.ac.uk/chembl/) and PubChem (https://pubchem.ncbi.nlm.nih.gov/), are also provided for each ingredient. To estimate drug-likeness of each ingredient, pharmacokinetic parameters were calculated based on models in Pipeline Pilot ADMET collection, such as aqueous solubility, blood brain barrier penetration, CYP450 2D6 inhibition, hepatotoxicity, human intestinal absorption and plasma protein binding. A quantitative metrics, known as the quantitative estimate of drug-likeness (QED), was used for assessing drug-likeness, and the estimated values range from 0 (all properties are unfavorable) to 1 (all properties are favorable). The reported mean QED values for attractive and unattractive components in drug development are 0.67 and 0.49, respectively (14–16). Thus, we classified all 7274 ingredients collected in ETCM into three groups according to their QED scores, good (QED > 0.67), moderate (0.49 ≤ QED ≤ 0.67) and weak (QED < 0.49).

The target information

We predicted potential targets for herbal ingredients in ETCM using MedChem Studio (version 3.0), an efficient drug similarity search tool to identify known drugs with high structural similarity (Tanimoto score > 0.8) to herbal ingredients. The Tanimoto Score is in the range of [0,1], where ‘0’ denotes completely different structures between ingredients and known drugs, and ‘1’ denotes identical structures of two components. The therapeutic targets in the DrugBank database (17) of known drugs are considered as the candidate targets of the herbal ingredients with Tanimoto Scores >0.8 to the known drugs. The biological functions and participated pathways of the candidate drug targets are investigated according to the Gene Ontology (18) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database (19). Enrichment of Gene Ontology terms and KEGG pathways by target genes of each formula or herb is analyzed using in house python scripts with hypergeometric test.

The disease information

The disease section records the detailed information of diseases and their related genes, which were collected from Human Phenotype Ontology (HPO, March 2018 release), Online Mendelian Inheritance in Man (OMIM, April 2018 release), Database of gene–disease associations (DisGeNET, v5.0) and ORPHANET database. The inconsistent gene or protein IDs of different resources were manually inspected and converted into Official Gene Symbols and UniProt Accession Numbers. The diseases are linked to herbs and formulas according to the overlap of disease-causing genes and the putative target genes of herbs/formulas.

Network analysis

ETCM uses the network module of a dynamic browser based visualization library vis.js (v4.21.0, http://visjs.org/index.html) to construct user-friendly networks for multi-level interactions among herbs/formulas, targets and diseases. The gene-gene interaction data were collected from five existing molecular interaction databases, including Reactome (https://reactome.org/, version 65), Human Protein Reference Database (HPRD, http://www.hprd.org/, Release 9), the Molecular INTeraction Database (MINT, https://mint.bio.uniroma2.it/, updated in August 2011), IntAct Molecular Interaction Database (https://www.ebi.ac.uk/intact/, version: 4.2.12) and Database of Interacting Proteins (DIP, https://dip.doe-mbi.ucla.edu/dip/, updated in Feb 13, 2017). Connections among herbs, formulas, pathways and diseases are established by the ingredients of herbs/formulas and the putative targets of each ingredient.

DATABASE FEATURES AND WEB INTERFACE

The ETCM database provides an easy-to-use platform for users to browse, search, and analyze TCM related information from various aspects.The major functional schema of ETCM is summarized in Figure 1. To facilitate TCM related basic researches, clinical applications and drug development, the ETCM database not only includes detailed medicinal properties of TCM herbs and formulas, but also provides potential links between TCM with target genes and modern diseases.
Figure 1.

The schema and objectives of ETCM. Boxes with yellow highlights refer to the novel information and functions in ETCM compared with other available TCM-related data resources.

The schema and objectives of ETCM. Boxes with yellow highlights refer to the novel information and functions in ETCM compared with other available TCM-related data resources.

TCM-specific functional features

TCM understands human bodies and diseases from a special systematic point of view, which is quite different from modern anatomy and medicines. For example, in TCM, herbs are classified according to their flavours (sour, salty, sweet, bitter and pungent), the properties of herbs are characterized according to their effects (cold, hot, warm, cool, and even), and the target organs of herbs are defined by meridian tropisms (lung meridian, liver meridian, etc.). All these information are presented in the ‘Herbs’ section of ETCM (Figure 2A). By clicking on the pie chart of each above mentioned category, users can obtain a full list of all herbs belonging to each category (Figure 2B).
Figure 2.

Major function illustration of ETCM. (A) Function summary of ETCM. (B) Summary of herbs included in ETCM. Clicking on any proportion of the pie charts will bring out corresponding herbs in the below list. (C) The picture and habitat map of herb DA HUANG. (D) Quality control standard of herb DA HUANG. (E) Enrichment and list of GO terms of the putative target genes of herb DA HUANG. (F) Pathways enriched by the putative target genes of herb DA HUANG.

Major function illustration of ETCM. (A) Function summary of ETCM. (B) Summary of herbs included in ETCM. Clicking on any proportion of the pie charts will bring out corresponding herbs in the below list. (C) The picture and habitat map of herb DA HUANG. (D) Quality control standard of herb DA HUANG. (E) Enrichment and list of GO terms of the putative target genes of herb DA HUANG. (F) Pathways enriched by the putative target genes of herb DA HUANG. Detailed information of each herb can be retrieved by clicking on its Chinese or Pinyin name, including the habitat, best collection time, property, flavor, meridian tropism, indications, and chemical components of the herb. The picture of each herb as well as its habitat map in China and quality control standard are provided (Figure 2C and D). Formulas containing each herb are also provided in the herb information page, clicking each formula name can link to the information page of the formula. It is worth to note that the herb indication information provided in the ‘Herbs’ section is described according to the records in the Pharmacopoeia of the People's Republic of China (2015 version), these indications are different from modern diseases, therefore we tried to use genes related to both TCM ingredients and modern diseases to build the links between TCM herb indications and modern diseases. Similarly, formulas collected in the ETCM database are also classified according to TCM syndromes, such as heart-clearing, blood-regulation, etc.

Links to molecular biology and modern diseases

To facilitate the understanding of TCM functions from the modern science point of view, we performed target prediction for ingredients of the collected herbs and formulas according to the 2D MACCS fingerprint similarity between TCM ingredients and known drugs in the DrugBank database, and considered the target genes of known drugs as the putative targets of TCM ingredients passed the fingerprint similarity test of the known drugs. The collective targets of all ingredients of a herb or formula are considered as the putative targets of the herb or formula. Diseases associated with those target genes are also considered as diseases that may be cured by the herbs or formulas. Gene Ontology terms or pathways enriched by genes targeted by certain ingredients, herbs, formulas or associated with certain diseases, are also included in ETCM (Figure 2E and F). To better illustrate the relationships among ingredients, herbs, formulas, target genes, gene-involved pathways and diseases, ETCM provides a systematic analysis function, which allows users to build networks among two or more above mentioned items. By entering a query item and selecting one or more categories, users are capable to build herb-ingredient-target, ingredient-gene-pathway-disease, as well as many other networks (Figure 3). They can also mark or modify nodes and edges of a network to facilitate further researchers. All database contents and analysis results are available for download, and a clear user-manual is also provided.
Figure 3.

Examples of various networks constructed by the systematic analysis function of ETCM.

Examples of various networks constructed by the systematic analysis function of ETCM.

DISCUSSION

TCM regards human body as a whole system and gives specific prescriptions to each individual patient according to their disease conditions, these concepts are in good accordance with the goals of precision medicine. In recent years, the values of TCM are more and more appreciated. With the hundreds and thousands of historically practiced herbs and formulas, TCM can also serve as a treasure house for modern drug development. Yet TCM-related data resources are still very limited. To meet the increasing needs, we presented ETCM as a novel TCM-related database, which includes various aspects of essential information related to TCM herbs and formulas. Compared with the limited available TCM-related data resources (7–13), ETCM is more comprehensive (e.g. with the habitat and quality control information of herbs, and drug-likeness information of ingredients) and more functional versatile (e.g. allows users to carry out cross-sectional analysis and build networks), as well as includes more items in each category. To facilitate mechanistic studies on TCM herbs and formulas, we predicted target genes of TCM ingredients according to the structural and chemical similarity of ingredients with known drugs. Although the prediction method we used has been evaluated as one of the best performed methods in similarity based drug discovery (20,21), there may still be many false positives in the prediction results. Thus, the target prediction results could only serve as a mechanism indication of TCM ingredients, herbs and formulas, and await to be investigated in the future. Multi-drug combination therapeutics has been considered as a rational and efficient therapeutic approach to control complex diseases by regulating various targets and exerting multiple pharmacological effects simultaneously (22). As an empirical system of multi-drug combination therapeutics, TCM prescriptions often include two or more Chinese herbs, the combinational effect of which may be greater than the sum of the individual effects (23). Growing lines of evidence have shown that TCM patent prescriptions with multi-herb combination can increase the possibility of conquering complex diseases with reduced side effects and less adaptive resistance (24). Therefore, understanding the synergistic mechanisms of herbs contained in TCM prescriptions may be of great significance to optimize and discover novel drug combinations. The ETCM database not only includes detailed information of 3962 CFDA approved TCM prescriptions which have been extensively used in clinics with verified therapeutic effects, but also provides a systematic analysis tool for users to investigate putative mechanisms underlying the synergistic effects of ingredients or herbs in a TCM prescription. In conclusion, ETCM is a comprehensive data resource for aiding the mechanistic investigation, new drug discovery, and clinical application of TCM. It is free for academic use and the data can be conveniently exported. The database will be continually updated and expanded to include new data and functions in the future.

DATA AVAILABILITY

All data of ETCM is available at http://www.nrc.ac.cn:9090/ETCM/.
  22 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  TCM: Made in China.

Authors:  Felix Cheung
Journal:  Nature       Date:  2011-12-21       Impact factor: 49.962

3.  Quantifying the chemical beauty of drugs.

Authors:  G Richard Bickerton; Gaia V Paolini; Jérémy Besnard; Sorel Muresan; Andrew L Hopkins
Journal:  Nat Chem       Date:  2012-01-24       Impact factor: 24.427

4.  Database of traditional Chinese medicine and its application to studies of mechanism and to prescription validation.

Authors:  X Chen; H Zhou; Y B Liu; J F Wang; H Li; C Y Ung; L Y Han; Z W Cao; Y Z Chen
Journal:  Br J Pharmacol       Date:  2006-11-06       Impact factor: 8.739

Review 5.  The application of Chinese medicine to novel drug discovery.

Authors:  Nikolaus J Sucher
Journal:  Expert Opin Drug Discov       Date:  2012-11-21       Impact factor: 6.098

Review 6.  Traditional Chinese medicine and immune regulation.

Authors:  Hong-Di Ma; Yan-Ru Deng; Zhigang Tian; Zhe-Xiong Lian
Journal:  Clin Rev Allergy Immunol       Date:  2013-06       Impact factor: 8.667

7.  HIT: linking herbal active ingredients to targets.

Authors:  Hao Ye; Li Ye; Hong Kang; Duanfeng Zhang; Lin Tao; Kailin Tang; Xueping Liu; Ruixin Zhu; Qi Liu; Y Z Chen; Yixue Li; Zhiwei Cao
Journal:  Nucleic Acids Res       Date:  2010-11-21       Impact factor: 16.971

8.  Synergistic drug combinations tend to improve therapeutically relevant selectivity.

Authors:  Joseph Lehár; Andrew S Krueger; William Avery; Adrian M Heilbut; Lisa M Johansen; E Roydon Price; Richard J Rickles; Glenn F Short; Jane E Staunton; Xiaowei Jin; Margaret S Lee; Grant R Zimmermann; Alexis A Borisy
Journal:  Nat Biotechnol       Date:  2009-07-05       Impact factor: 54.908

9.  TCMID: Traditional Chinese Medicine integrative database for herb molecular mechanism analysis.

Authors:  Ruichao Xue; Zhao Fang; Meixia Zhang; Zhenghui Yi; Chengping Wen; Tieliu Shi
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

10.  TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining.

Authors:  Yu-Ching Fang; Hsuan-Cheng Huang; Hsin-Hsi Chen; Hsueh-Fen Juan
Journal:  BMC Complement Altern Med       Date:  2008-10-14       Impact factor: 3.659

View more
  143 in total

1.  An Integrative Pharmacology-Based Analysis of Refined Qingkailing Injection Against Cerebral Ischemic Stroke: A Novel Combination of Baicalin, Geniposide, Cholic Acid, and Hyodeoxycholic Acid.

Authors:  Chongyang Ma; Xueqian Wang; Tian Xu; Shuang Zhang; Shuling Liu; Changming Zhai; Zisong Wang; Jie Mu; Changxiang Li; Fafeng Cheng; Qingguo Wang
Journal:  Front Pharmacol       Date:  2020-05-08       Impact factor: 5.810

2.  Data Mining and Systems Pharmacology to Elucidate Effectiveness and Mechanisms of Chinese Medicine in Treating Primary Liver Cancer.

Authors:  Zhen Zhang; Jun-Wei Li; Pu-Hua Zeng; Wen-Hui Gao; Xue-Fei Tian
Journal:  Chin J Integr Med       Date:  2021-08-25       Impact factor: 1.978

3.  Prediction of Targets of Curculigoside A in Osteoporosis and Rheumatoid Arthritis Using Network Pharmacology and Experimental Verification.

Authors:  Jiawen Han; Minjie Wan; Zhanchuan Ma; Cong Hu; Huanfa Yi
Journal:  Drug Des Devel Ther       Date:  2020-11-26       Impact factor: 4.162

4.  Elucidating direct kinase targets of compound Danshen dropping pills employing archived data and prediction models.

Authors:  Tongxing Wang; Lu Liang; Chunlai Zhao; Jia Sun; Hairong Wang; Wenjia Wang; Jianping Lin; Yunhui Hu
Journal:  Sci Rep       Date:  2021-05-05       Impact factor: 4.379

5.  Structural analogues in herbal medicine ginseng hit a shared target to achieve cumulative bioactivity.

Authors:  Wei Zhang; Wei-Wei Tao; Jing Zhou; Cheng-Ying Wu; Fang Long; Hong Shen; He Zhu; Qian Mao; Jun Xu; Song-Lin Li; Qi-Nan Wu
Journal:  Commun Biol       Date:  2021-05-10

6.  Network pharmacology integrated with experimental validation revealed the anti-inflammatory effects of Andrographis paniculata.

Authors:  Naiqiang Zhu; Jingyi Hou; Ning Yang
Journal:  Sci Rep       Date:  2021-05-07       Impact factor: 4.379

7.  Investigation of the Mechanism of Traditional Chinese Medicines in Angiogenesis through Network Pharmacology and Data Mining.

Authors:  Wingyan Yun; Wenchao Dan; Jinlei Liu; Xinyuan Guo; Min Li; Qingyong He
Journal:  Evid Based Complement Alternat Med       Date:  2021-04-28       Impact factor: 2.629

8.  Pharmacological Mechanisms Underlying the Therapeutic Effects of Danhong Injection on Cerebral Ischemia.

Authors:  Yifei Qi; Yihuai Zou; Lin Chen; Jun Liu; Yingying Zhang; Zhong Wang
Journal:  Evid Based Complement Alternat Med       Date:  2021-05-21       Impact factor: 2.629

9.  Exploring the Mechanism of Zhibai Dihuang Decoction in the Treatment of Ureaplasma Urealyticum-Induced Orchitis Based on Integrated Pharmacology.

Authors:  Dong-Hua Bin; Shi-Ying Zhang; Min Zhan; Ling Li; Ying-Qiu Li; Xing Zhou; Fang-Guo Lu; Qing Zhou; Qing-Hu He
Journal:  Front Pharmacol       Date:  2021-05-10       Impact factor: 5.810

10.  Network Pharmacology-Based Systematic Analysis of Molecular Mechanisms of Dingji Fumai Decoction for Ventricular Arrhythmia.

Authors:  Yi Liang; Bo Liang; Xin-Rui Wu; Wen Chen; Li-Zhi Zhao
Journal:  Evid Based Complement Alternat Med       Date:  2021-05-08       Impact factor: 2.629

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.