| Literature DB >> 28938573 |
Xiang Cheng1,2, Shu-Guang Zhao1, Xuan Xiao2,3, Kuo-Chen Chou3,4,5.
Abstract
Recommended by the World Health Organization (WHO), drug compounds have been classified into 14 main ATC (Anatomical Therapeutic Chemical) classes according to their therapeutic and chemical characteristics. Given an uncharacterized compound, can we develop a computational method to fast identify which ATC class or classes it belongs to? The information thus obtained will timely help adjusting our focus and selection, significantly speeding up the drug development process. But this problem is by no means an easy one since some drug compounds may belong to two or more than two ATC classes. To address this problem, using the DO (Drug Ontology) approach based on the ChEBI (Chemical Entities of Biological Interest) database, we developed a predictor called iATC-mDO. Subsequently, hybridizing it with an existing drug ATC classifier, we constructed a predictor called iATC-mHyb. It has been demonstrated by the rigorous cross-validation and from five different measuring angles that iATC-mHyb is remarkably superior to the best existing predictor in identifying the ATC classes for drug compounds. To convenience most experimental scientists, a user-friendly web-server for iATC-mHyd has been established at http://www.jci-bioinfo.cn/iATC-mHyb, by which users can easily get their desired results without the need to go through the complicated mathematical equations involved.Entities:
Keywords: ATC classification; Chou's five intuitive metrics; drug ontology; multi-label system
Year: 2017 PMID: 28938573 PMCID: PMC5601669 DOI: 10.18632/oncotarget.17028
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
The jackknife success rates achieved by iATC-mHyb and ATC-mISF on the benchmark dataset of Eq.1 (cf. Supporting Information S1)
| Predictor | Five metrics for multi-label systema | ||||
|---|---|---|---|---|---|
| Aiming↑b | Coverage↑b | Accuracy↑b | Absolute true↑b | Absolute false↓c | |
| iATC-mISFd | 67.83% | 67.10% | 66.41% | 60.98% | 5.85% |
| iATC-mHybe | 71.91% | 71.46% | 71.32% | 66.75% | 2.43% |
aSee Eq.12 for the definitions of the five metrics used to measure the prediction quality for multi-label systems [3].
bThe upper arrow means that the larger the rate the better the prediction quality is.
cThe down arrow means that the smaller the rate the better the prediction quality is.
dThe predictor proposed in [4].
eThe predictor proposed in the current paper.
Figure 1The semi-screenshot for the top page of the iATC-mHyb web-server, which is located at http://www.jci-bioinfo.cn/iATC-mHyb
Figure 2The semi-screenshot for the output generated by the Step 3 of users’ guide in the Results and Discussion section
Breakdown of the 3,883 drug compounds in the benchmark dataset according to the 14 ATC classes (cf. Eq.1)
| Subset | Name | Number of drugs |
|---|---|---|
| Alimentary tract and metabolism | 540 | |
| Blood and blood forming organs | 133 | |
| Cardiovascular system | 591 | |
| Dermatologicals | 421 | |
| Genito-urinary system and sex hormones | 248 | |
| Systemic hormonal preparations, excluding sex hormones and insulins | 126 | |
| Antiinfectives for systemic use | 521 | |
| Antineoplastic and immunomodulating agents | 232 | |
| Musculo-skeletal system | 208 | |
| Nervous system | 737 | |
| Antiparasitic products, insecticides and repellents | 127 | |
| Respiratory system | 427 | |
| Sensory organs | 390 | |
| Various | 211 | |
| Number of total virtual drugs | 4,912a | |
| Number of total structural different drugs | 3,883b | |
a The number of virtual drugs is counted as follows: for a structurally same drug, its contribution to the total number of virtual drugs is 2 if it occurs in two different ATC classes; that is 3 if it occurs in three different ATC classes; and so forth.
b Of the 3,883 structural different drugs, 3,295 belong to one class, 370 to two classes, 110 to three classes, 37 to four classes, 27 to five classes, and 44 to six classes. See Supporting Information S1 for the detailed drug codes listed in each of 14 ATC-classes.
Figure 3A plot to show the process of finding the optimal θ value in Eq.9
See the main text for further explanation.