| Literature DB >> 20011464 |
Abstract
Two problems now threaten the future of anticancer drug development: (i) the information explosion has made research into new target-specific drugs more duplication-prone, and hence less cost-efficient; and (ii) high-throughput genomic technologies have failed to deliver the anticipated early windfall of novel first-in-class drugs. Here it is argued that the resulting crisis of blockbuster drug development may be remedied in part by innovative exploitation of informatic power. Using scenarios relating to oncology, it is shown that rapid data-mining of the scientific literature can refine therapeutic hypotheses and thus reduce empirical reliance on preclinical model development and early-phase clinical trials. Moreover, as personalised medicine evolves, this approach may inform biomarker-guided phase III trial strategies for noncytotoxic (antimetastatic) drugs that prolong patient survival without necessarily inducing tumor shrinkage. Though not replacing conventional gold standards, these findings suggest that this computational research approach could reduce costly 'blue skies' R&D investment and time to market for new biological drugs, thereby helping to reverse unsustainable drug price inflation.Entities:
Keywords: bibliometrics; clinical trials; drug development; medical informatics; oncology
Year: 2009 PMID: 20011464 PMCID: PMC2791493 DOI: 10.4137/cin.s2666
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Schematic illustration of a text-mined association. The extent to which a phenotype such as finger clubbing is associated with a specific lung cancer histology may be estimated by association (see Supplement S2), as visualized here.
Figure 2Phenotypic distinction of ‘adenocarcinoma’ vs. ‘squamous carcinoma’ using text-mining; for each phenotype shown, the PubMed database was exclusively interrogated (e.g. adenocarcinoma NOT squamous cell carcinoma, and vice versa). The figures on the horizontal axis represent the multiplier (M) by which each phenotype is preferentially associated with the histology labelled on the adjacent vertical axis.
Text-mined associative phenotyping of drug action. TKIs are more strongly associated with {growth and replication} than MPIs, whereas the reverse appears true of {invasion and metastasis} (χ2 = 2099.6, df = 1, p < 0.01).
| AND (growth OR replication) | AND (invasion OR metastasis) | |
|---|---|---|
| Tyrosine kinase inhibitor ( | 10741 | 1010 |
| Metalloprotease inhibitor ( | 3063 | 1877 |
Association of tumor subtypes with a putative surrogate biomarker for the mTOR pathway, Akt, showing that the strongest association is with prostate cancer. As expected, the much stronger correlation of the mTOR inhibitor temsirolimus, which is already licensed for use in renal cell cancer, is with the latter malignancy. P, primary set size; C, intersecting set size.
| Breast cancer | Renal cancer | Colorectal cancer | Prostate cancer | Lung cancer | Endometrial cancer | |
|---|---|---|---|---|---|---|
| P | 196883 | 75450 | 117408 | 78484 | 166465 | 20194 |
| AND | 1208 | 152 | 349 | 719 | 638 | 117 |
| (100 × C) | 0.61 | 0.20 | 0.30 | 0.92 | 0.38 | 0.58 |
| AND | 26 | 129 | 3 | 8 | 3 | 3 |
| (100 × C) | 0.01 | 0.17 | 0.003 | 0.01 | 0.002 | 0.015 |
Figure 3Text-mined comparison of survival vs. cost data for different cancer treatment modes, suggesting that adjuvant therapies are most strongly linked to survival benefit (open columns) but least to cost-effectiveness (solid columns), whereas preventive treatments have the opposite associations. Palliative and maintenance treatments appear intermediate for both secondary endpoints.
| AND incidence | AND mortality | ||
|---|---|---|---|
| Breast cancer | (197552) | 39175 | 19733 |
| Lung cancer | (166982) | 33320 (39318) | 24871 (29348) |
| M | 1.18 (NS) | 1.49 |
Text associations suggesting that lung cancer is a significantly more lethal disease than is breast cancer (χ2 = 1059.99, df = 1, p < 0.01).
| AND | AND | M | |||
|---|---|---|---|---|---|
| Lung cancer | Squamous cell carcinoma | 16493 | Finger clubbing | 54 | 3.27 |
| Small-cell carcinoma | 34255 | 42 | 1.23 | ||
| Adenocarcinoma | 28885 | 101 | 3.50 |
Text-mining data showing that {small-cell lung cancer} is significantly less often associated with {finger clubbing) than are the other two histologic labels (χ2 = 37.96, df = 2, p < 0.01).
| AND | AND | ||
|---|---|---|---|
| 473 | 2037 | ||
| AND | 19 | 1 | |
| M1 | 4.02 | 0.05 | |
| M2 | 80.4 |
Data set showing a significantly stronger association of {brain metastases} in {breast cancer} associated with the term {HER2-positive} than with {ER-positive) (χ2 = 73.461, df = 1, p < 0.01).
| AND | AND | |
|---|---|---|
| 13310 | 4649 | |
| AND | 28 | 30 |
| 13310 | 4649 | |
| AND | 243 | 209 |
| 243 | 209 | |
| AND | 28 | 30 |
χ2 = 18.75, df = 1, p < 0.01.
χ2 = 92.98, df = 1, p < 0.01.
χ2 = 0.42, df = 1, p = 0.5169.
Data showing stronger association of {invasive lobular cancer} with {peritoneal metastases} than with {invasive ductal cancer}, while implicating {E-cadherin} as a molecular co-variable of this clinical relationship (see text).
| AND | AND | |
|---|---|---|
| No p53 | 4794 | 772 |
| +p53 | 1568 | 429 |
Text-mined data showing a significantly stronger association of {p53} with {ARF} than with {INK4A}. (χ2 = 63.19, df = 1, p < 0.01).
| Insulin-like growth factor-1 | Hepatocyte growth factor | |
|---|---|---|
| Drug resistance | 319 | 100 |
| Metastasis | 100 | 815 |
Data showing significantly stronger association of {metastasis} with {HGF} than with {IGF1}, but stronger association of {drug resistance} with IGF1 than with HGF (χ2 = 564.15, df = 1, p < 0.01).
| AND (growth OR replication OR mitosis) | AND (migration OR motility OR metastasis) | |
|---|---|---|
| Kinase | 147941 | 23748 |
| Protease | 44028 | 15732 |
Quantification of text-mined molecular phenotype: there is a greater than 16-fold difference in the association of ‘kinase’ with pro-mitogenic vs. pro-metastatic key words when compared with the corresponding associations of ‘protease’ (χ2 = 4889.64, df = 1, p < 0.01).