| Literature DB >> 22514724 |
Lei Chen1, Wei-Ming Zeng, Yu-Dong Cai, Kai-Yan Feng, Kuo-Chen Chou.
Abstract
The Anatomical Therapeutic Chemical (ATC) classification system, recommended by the World Health Organization, categories drugs into different classes according to their therapeutic and chemical characteristics. For a set of query compounds, how can we identify which ATC-class (or classes) they belong to? It is an important and challenging problem because the information thus obtained would be quite useful for drug development and utilization. By hybridizing the informations of chemical-chemical interactions and chemical-chemical similarities, a novel method was developed for such purpose. It was observed by the jackknife test on a benchmark dataset of 3,883 drug compounds that the overall success rate achieved by the prediction method was about 73% in identifying the drugs among the following 14 main ATC-classes: (1) alimentary tract and metabolism; (2) blood and blood forming organs; (3) cardiovascular system; (4) dermatologicals; (5) genitourinary system and sex hormones; (6) systemic hormonal preparations, excluding sex hormones and insulins; (7) anti-infectives for systemic use; (8) antineoplastic and immunomodulating agents; (9) musculoskeletal system; (10) nervous system; (11) antiparasitic products, insecticides and repellents; (12) respiratory system; (13) sensory organs; (14) various. Such a success rate is substantially higher than 7% by the random guess. It has not escaped our notice that the current method can be straightforwardly extended to identify the drugs for their 2(nd)-level, 3(rd)-level, 4(th)-level, and 5(th)-level ATC-classifications once the statistically significant benchmark data are available for these lower levels.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22514724 PMCID: PMC3325992 DOI: 10.1371/journal.pone.0035254
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Breakdown of the benchmark dataset according to the 14 main ATC classes.
| Tag | The 1st-level ATC class | Number of drugs |
|
| Alimentary tract and metabolism | 540 |
|
| Blood and blood forming organs | 133 |
|
| Cardiovascular system | 591 |
|
| Dermatologicals | 421 |
|
| Genito-urinary system and sex hormones | 248 |
|
| Systemic hormonal preparations, excluding sex hormones and insulins | 126 |
|
| Antiinfectives for systemic use | 521 |
|
| Antineoplastic and immunomodulating agents | 232 |
|
| Musculo-skeletal system | 208 |
|
| Nervous system | 737 |
|
| Antiparasitic products, insecticides and repellents | 127 |
|
| Respiratory system | 427 |
|
| Sensory organs | 390 |
|
| Various | 211 |
| Number of total virtual drugs | 4,912 | |
| Number of total structural different drugs | 3,883 | |
See Eqs.2–3 for the definition about the number of virtual drugs, and its relation with the number of structural different drugs.
Of the 3,883 structural different drugs, 3,295 belong to one class, 370 to two classes, 110 to three classes, 37 to four classes, 27 to five classes, and 44 to six classes. See Supporting Information S1 for the detailed drug codes listed in each of 14 ATC-classes.
Figure 1An illustration to show the distribution about the numbers of ATC-classes a same drug may belong to.
For the 3,883 drugs in , 3,295 belong to one class, 370 to two classes, 110 to three classes, 37 to four classes, 27 to five classes, 44 to six classes, and none to seven or more classes.
The jackknife success rates by three different methods in identifying the drugs among the 14 main ATC-classes.
| Prediction order | Interaction-based | Similarity-based | Integrated |
| 1 | 67.72% | 78.49% | 72.55% |
| 2 | 21.13% | 18.86% | 20.11% |
| 3 | 13.43% | 8.63% | 11.28% |
| 4 | 7.18% | 5.23% | 6.31% |
| 5 | 4.76% | 2.88% | 3.91% |
| 6 | 3.54% | 1.73% | 2.73% |
| 7 | 1.63% | 0.12% | 0.95% |
| 8 | 0.75% | 0.35% | 0.57% |
| 9 | 0.75% | 0.12% | 0.46% |
| 10 | 0.56% | 0.06% | 0.33% |
| 11 | 0.09% | 0.00% | 0.05% |
| 12 | 0.28% | 0.00% | 0.15% |
| 13 | 0.09% | 0.00% | 0.05% |
| 14 | 0.05% | 0.00% | 0.03% |
Using Eq.6 on the 2,144 drugs in the benchmark dataset that had the chemical-chemical interaction information.
Using Eq.10 on the drugs in the benchmark dataset that had no chemical-chemical interaction information.
Using the integrated method by hybridizing Eq.6 and Eq.10 on the 3,883 drugs in the benchmark dataset as given in Supporting Information S1.
A comparison between the similarity-based method (Eq.10) and the interaction-based method (Eq.6) in identifying the 2,138 drugs in the dataset (cf. Supporting Information S2).
| Prediction order | Similarity-based | Interaction-based | Difference |
| 1 | 40.36% | 67.40% | 27.04% |
| 2 | 13.89% | 21.09% | 7.20% |
| 3 | 9.17% | 13.47% | 4.30% |
| 4 | 5.99% | 7.16% | 1.17% |
| 5 | 3.32% | 4.91% | 1.59% |
| 6 | 2.76% | 3.46% | 0.70% |
| 7 | 0.65% | 1.54% | 0.89% |
| 8 | 0.23% | 0.75% | 0.52% |
| 9 | 0.09% | 0.75% | 0.66% |
| 10 | 0.05% | 0.56% | 0.51% |
| 11 | 0.05% | 0.09% | 0.04% |
| 12 | 0.00% | 0.33% | 0.33% |
| 13 | 0.09% | 0.09% | 0.00% |
| 14 | 0.05% | 0.05% | 0.05% |