| Literature DB >> 29264408 |
Samet Keserci1, Eric Livingston2, Lingtian Wan1, Alexander R Pico3, George Chacko1.
Abstract
Drug discovery and subsequent availability of a new breakthrough therapeutic or 'cure' is a compelling example of societal benefit from research advances. These advances are invariably collaborative, involving the contributions of many scientists to a discovery network in which theory and experiment are built upon. To document and understand such scientific advances, data mining of public and commercial data sources coupled with network analysis can be used as a digital methodology to assemble and analyze component events in the history of a therapeutic. This methodology is extensible beyond the history of therapeutics and its use more generally supports (i) efficiency in exploring the scientific history of a research advance (ii) documenting and understanding collaboration (iii) portfolio analysis, planning and optimization (iv) communication of the societal value of research. Building upon prior art, we have conducted a case study of five anti-cancer therapeutics to identify the collaborations that resulted in the successful development of these therapeutics both within and across their respective networks. We have linked the work of over 235,000 authors in roughly 106,000 scientific publications that capture the research crucial for the development of these five therapeutics. Applying retrospective citation discovery, we have identified a core set of publications cited in the networks of all five therapeutics and additional intersections in combinations of networks. We have enriched the content of these networks by annotating them with information on research awards from the US National Institutes of Health (NIH). Lastly, we have mapped these awards to their cognate peer review panels, identifying another layer of collaborative scientific activity that influenced the research represented in these networks.Entities:
Keywords: Cancer research; Information science
Year: 2017 PMID: 29264408 PMCID: PMC5727381 DOI: 10.1016/j.heliyon.2017.e00442
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Case studies of five anti-cancer agents. Five anti-cancer therapeutics, with FDA approval dates ranging from 2001 to 2014, were selected as case studies. The unique identifier for each therapeutic is an FDA assigned NDA or BLA number. While multiple patents are typically associated with a drug or biological, the single US patent number displayed represents the primary invention that preceded approval of the therapeutic. The publication date for each patent is listed in the last column.
| Therapeutic | FDA approval date | Unique identifier | US patent | Publication date |
|---|---|---|---|---|
| May 2001 | BLA: 103948 | US5846534 | Dec 1998 | |
| May 2001 | NDA: 021335 | US5521184 | May 1996 | |
| Oct 2005 | NDA: 021877 | US5424295 | Jun 1995 | |
| Apr 2014 | BLA: 125477 | US7498414 | Mar 2009 | |
| Jan 2006 | NDA: 021938 | US6573293 | Jun 2003 |
Citation counts and mapping between bibliographic databases. Five anti-cancer therapeutics were selected as case studies. A foundational set of references (citing_pmid) was assembled for each therapeutic from patents, clinical trials, regulatory documents, and the scientific literature (Materials and Methods). Citing_pmids were mapped to Scopus identifiers (citing_sid), which were used, in turn, to retrieve cited publications (cited_sid). Cited_sids were mapped back to PubMed identifiers (cited_pmid). The number of identifiers at each stage of the mapping process is shown along with percentage loss (in parentheses) when mapping across PubMed and Scopus or due to null values in the cited_sid field.
| Therapeutic | citing_pmid count | citing_sid count | cited_sid count | cited_pmid count |
|---|---|---|---|---|
| 599 | 587 (1%) | 8840 (2%) | 7071 (20%) | |
| 1380 | 1373 (1%) | 27326 (1%) | 23340 (17%) | |
| 104 | 104 (0%) | 2476 (1%) | 1990 (20%) | |
| 1820 | 1804 (1%) | 48587 (0%) | 40973 (19%) | |
| 1512 | 1509 (0%) | 33895 (0%) | 28661 (15%) |
Intersection of five networks. Publications at the intersection of all five networks are listed above. All 14 publications are found in the second generation of references (cited_sid, Figure 1 right panel).
| SourceYear | SourceName | Author(s) |
|---|---|---|
| 1958 | J. Am. Stat. Assoc. | Kaplan ER, Meier P. |
| 1963 | Science | Jerne, NK and Nordin, AA. |
| 1972 | J R Stat Soc | Cox DR. |
| 1976 | Anal. Biochem. | Bradford MM. |
| 1977 | Br J Cancer | R. Peto, M.C. Pike, and P. Armitage |
| 1977 | Proc. Natl. Acad. Sci. | Sanger FS., Nicklen S, Coulson AR. |
| 1983 | J Immunol Methods | Mosmann T. |
| 1984 | Adv Enzyme Regul | Chou TC, Talalay P. |
| 1989 | Molecular Cloning: A Laboratory Manual | Sambrook, J., Fritsch, E. and Maniatis, T. |
| 1994 | Acta Crystallogr D | Collaborative Computational Project 4 |
| 1994 | Acta Crystallogr. A | Navaza J. |
| 1997 | Cell | Levine AJ. |
| 1997 | Am. J. Pathol. | Perez-Atayde AR, Sallan SE, Tedrow U, Connors S, Allred E, Folkman J. |
| 1998 | CA: A Cancer Journal for Clinicians | Landis SH, Murray T, Bolden S, Wingo PA. |
Figure 1Intersecting publications in five networks. Intersections were calculated across all five networks for the first generation of references (citing_pmids) and as well as for the second generation of references (cited_sids) and displayed as Venn diagrams. Left panel. No first generation publications are observed common to all five networks. A single publication is cited in four of five networks. Right panel. 14 publications are common to all five networks. Abbreviations: alem (Alemtuzumab), imat (Imatinib), nela (Nelarabine), ramu (Ramucirumab), suni (Sunitinib).
Figure 2Core publications in networks. The outer arcs of blue nodes identify first generation publications (citing_sid) for each therapeutic. Nodes in the inner ring are sized by a gradient proportion to total degree count with an upper limit of 30 and are colored by a gradient proportional to the number of drug connections (2 to 5). 14 publications are common to all five networks (Table 3) and are colored red. The remaining nodes in the inner ring connect to between 2 and 4 drugs each and are labeled accordingly. Abbreviations: alem (Alemtuzumab), imat (Imatinib), nela (Nelarabine), ramu (Ramucirumab), suni (Sunitinib).
Figure 3NIH research support. Grant and contract support for publications from NIH in the five networks was identified using ExPORTER data (Materials and Methods). 19,104 unique project numbers were identified as sources of support for publications in all five networks. Of these, 112 projects were common to all five networks. Projects were grouped by mechanism (i) P – Research Program Projects and Centers (ii) R – Research Projects (iii) M – General Clinical Research Centers Programs (iv) N – Research and Development-Related Contracts (v) U – Cooperative Agreements (vi) T – Training Programs (vii) Z – Intramural Research. For each mechanism, the number of projects in the intersection of all five networks was plotted against the number in the union of all five networks (both expressed as percentages of their respective totals). A higher proportion of Research Program Projects and Centers awards is found in the intersection group.
Elite performers. Researchers with the highest nPIR scores in the network of 5 anti-cancer therapeutics are listed. Also shown for each researcher is their PIRpartitionRatio (PPR). The nPIR indicates influence across all five networks and the PPR provides an estimate of how this influence is partitioned across each of the five networks (Materials and Methods). This list should be considered in the context of the data being analyzed and not interpreted as an absolute ordering of research excellence in the field.
| Name | nPIR | PPR |
|---|---|---|
| Ferrara N. | 46693 | 1.12 |
| Folkman J. | 23660 | 1.15 |
| Ullrich A. | 23034 | 1.46 |
| Jain R. | 15267 | 1.13 |
| Heldin C. | 15148 | 1.21 |
| Druker B. | 15088 | 1.16 |
| Schlessinger J. | 14996 | 1.48 |
| Dvorak H. | 14230 | 1.13 |
| Alitalo K. | 13812 | 1.24 |
| Slamon D. | 13587 | 1.46 |
| Baselga J. | 12908 | 1.36 |
| Kantarjian H. | 12029 | 1.17 |
| Hicklin D. | 11775 | 1.17 |
| Witte O. | 11449 | 1.08 |
| Hanahan D. | 11072 | 1.19 |
| Buchdunger E. | 11032 | 1.22 |
| Risau W. | 10950 | 1.24 |
| Talpaz M. | 10713 | 1.13 |
| Mendelsohn J. | 10534 | 1.54 |
| Lydon N. | 9988 | 1.18 |
| Goldman J. | 9927 | 1.11 |
| Shibuya M. | 9639 | 1.21 |
| Kitamura Y. | 9486 | 1.24 |
| Waldmann H. | 9363 | 1.04 |
| Kerbel R. | 9266 | 1.16 |