| Literature DB >> 34986599 |
Deyu Yan1, Genhui Zheng1, Caicui Wang1, Zikun Chen1, Tiantian Mao1, Jian Gao2,3, Yu Yan1, Xiangyi Chen1, Xuejie Ji1, Jinyu Yu1, Saifeng Mo1, Haonan Wen1, Wenhao Han1, Mengdi Zhou1, Yuan Wang1, Jun Wang1, Kailin Tang1, Zhiwei Cao4.
Abstract
Literature-described targets of herbal ingredients have been explored to facilitate the mechanistic study of herbs, as well as the new drug discovery. Though several databases provided similar information, the majority of them are limited to literatures before 2010 and need to be updated urgently. HIT 2.0 was here constructed as the latest curated dataset focusing on Herbal Ingredients' Targets covering PubMed literatures 2000-2020. Currently, HIT 2.0 hosts 10 031 compound-target activity pairs with quality indicators between 2208 targets and 1237 ingredients from more than 1250 reputable herbs. The molecular targets cover those genes/proteins being directly/indirectly activated/inhibited, protein binders, and enzymes substrates or products. Also included are those genes regulated under the treatment of individual ingredient. Crosslinks were made to databases of TTD, DrugBank, KEGG, PDB, UniProt, Pfam, NCBI, TCM-ID and others. More importantly, HIT enables automatic Target-mining and My-target curation from daily released PubMed literatures. Thus, users can retrieve and download the latest abstracts containing potential targets for interested compounds, even for those not yet covered in HIT. Further, users can log into 'My-target' system, to curate personal target-profiling on line based on retrieved abstracts. HIT can be accessible at http://hit2.badd-cao.net.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34986599 PMCID: PMC8728248 DOI: 10.1093/nar/gkab1011
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Workflow of HIT 2.0. (1, 2) Retrieve PubMed using different names of herbal ingredients. (3) Mine PubMed abstracts to identify gene/protein entities. (4) Detect whether ‘compound’, ‘gene’ and ‘keyword’ are in a directional dependency tree path. 5&6. Manual check and complete the information.
Overview of the literature-described targets from peering databases
| Published year | Literature-described targets | Herbal ingredients | Herbal ingredient-target activity pairs | Sourcing literatures | |
|---|---|---|---|---|---|
| HIT 2.0 | 2208 | 1237 | 10 031 | 7100 PubMed abstracts | |
| HIT | 2011 | 1301 | 586 | 5208 | 3250 PubMed abstracts |
| HERB | 2020 | 1241 | 370 | 4815 | 1966 PubMed abstracts |
| TCMID | 2013 | 680 | / | / | 4500 Chinese Literatures |
| NPASS | 2017 | 464 | 719 | 1936 | 1288 PubMed abstracts |
| TCMSP | 2014 |
|
|
| / |
Figure 2.Searching and resulting pages in HIT 2.0. (A) Database structure and data statistics. (B) Herbs can be searched via keywords such as Chinese Pinyin, Chinese characters and Latin names. (C) Herbal ingredients can be searched via structure similarity or keywords of name, CID and CAS number. (D) Targets can be searched via keywords of gene/protein name, gene symbol and Uniprot ID. (E) Detailed information of the targets. (F) Additional targets of the compound. (G) ‘ Literature evidence ’ provides the key descriptions parsed from sourcing literatures.
Figure 3.Target-mining and My-target curation system. (A) The interface of Target-mining function. Compound name, MeSH ID and Pubchem ID can be submitted to retrieve potential targets. (B) PubMed abstract retrieved by Target-mining. (C) The interface of My-target curation system.