Literature DB >> 32577559

Evaluating "startup readiness" for researchers: case studies of research-based startups with biopharmaceutical research topics.

Tomotaka Goji1, Yuki Hayashi1, Ichiro Sakata2.   

Abstract

For research domains such as life sciences, which pursue fundamental scientific understanding and applications intended for immediate use, academic entrepreneurship has played a pivotal role in commercialization. This paper presents an evaluation method of researchers related to user-inspired fundamental research, using global databases of startup finances and academic research papers of "startup readiness." Case studies of startups related to biopharmaceutical research topics suggest that the biopharmaceutical field has rich opportunities stemming from scientific research, commercialization, and entrepreneurship. This evaluation method sorts specific industry segments by which financing activities are active, and by which related growing research topics attract increased academic attention. We constructed networks of author citation and co-authorship from paper citation networks related to research topics in industry segments in the biopharmaceutical domain. Results obtained across all research topics we surveyed demonstrated that authors in the top 10% of degree centrality ranking in both networks are far more likely to be startup participants than other authors. Our computational approach might provide convenient, dynamic, global, and real-time understanding of the "startup readiness" of researchers working with research topics for which academic attention is emerging in actively financed biopharmaceutical fields.
© 2020 The Author(s).

Entities:  

Keywords:  Academic entrepreneurship; Biopharmaceuticals; Biotechnology; Business; Co-authorship networks; Computer science; Paper citation networks; Pharmaceutical science; Research-based startups; Startup finances; Startup readiness; Technological innovation; User-inspired fundamental research; Venture capital

Year:  2020        PMID: 32577559      PMCID: PMC7300125          DOI: 10.1016/j.heliyon.2020.e04160

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

Conventionally, research examining factors contributing to creation of scientific research-based startups and academic entrepreneurship has specifically examined factors other than scientific research itself. One example of a generally accepted notion about commercialization from advanced technology research is the technology readiness level (TRL) concept. In the mid-1970s, NASA introduced TRL: a criterion to evaluate the maturity of technologies derived from science based on a scale of 1–9, with 1 being the most basic technology and 9 being the most mature technology. The criterion has been used to explain why some new technologies engender industrial transitions. However, it has been used for project management at stages of performance assessment to schedule to budget without addressing the emergence of the research itself [1]. Actually, TRL has been expected to facilitate transdisciplinary expertise between academia and practitioners by supporting the analysis and design of an industry's transition [2]. Patents, which represent practical applications of technology concepts that are classified as TRL 2 according to the TRL concept, have been regarded as an important antecedent for academic entrepreneurship in life sciences [1, 3]. University scientists who appear as inventors on patents presumably have more opportunities to enter the private sector than other scientists. Such might be the case also for biotechnology startups. By contrast, a research paper itself has not been regarded as a precursor for academic entrepreneurship because the purpose of a research paper has been regarded as communicating research findings to the relevant scientific community and to the general public. Moreover, patents are legal documents used to prevent others from commercializing what research papers describe, thereby facilitating commercialization by patent holders of their intended applications. The “startup readiness” concept developed in our earlier study [4] was proposed as a criterion to be applied earlier than TRL, focusing on individual scientists. Whereas TRL is used to assess the maturity of a technology, startup readiness is the state at which a researcher is ready to participate in a startup. Given the growing interest in academic startups among increasing stakeholders, academic startups have recently been gaining easier access to venture capital and managerial talent. Long before TRL matures, research topics and researchers show startup readiness, which might later beget important firms by leveraging their scientific strengths. These research topics and researchers are investment opportunities for venture capitalists. They represent career opportunities for managerial talent as well. An earlier report of the literature has described that biotechnology research paper publications linked to patents are cited more than publications without a patent link [5], suggesting that research papers citations signal commercial value. We hypothesized that “startup readiness” of research topics and researchers, in contrast to “technology readiness,” can be signaled by research papers. This study presumes the resource-based view of firms (Barney (1991); Kogut and Zander (1992); Conner and Prahalad (1996); Grant (1996)) and its extended literature related to academic startups (Landry et al. (2006); Knockaert et al. (2010); Rasmussen and Borch, (2010); Huynh et al. (2017); Corsi et al. (2019)) to assume that, as with entrepreneurs, startup readiness by academic researchers will increase when either the resources or their coordination will be appropriate or sufficient [6, 7, 8, 9, 10, 11, 12, 13, 14]. Specifically for factors contributing to new academic firm creation, researchers have assessed individual and non-individual determinants of academic entrepreneurship using surveys of scientists classified according to their research protocols. For instance, Rothaermel et al. (2007) report that university policy, faculty, technology transfer offices, investors, founding teams, networks in which a firm is embedded, and other external conditions affect new firm creation [15]. Bercovitz and Feldman (2008) examine faculty members' backgrounds and work environments and their subsequent academic entrepreneurship activities. They find that participation of researchers, if they had accepted the new initiative and had been active in technology transfer, is more likely at institutions where they trained [16]. Jain et al. (2009) investigate the sense-making process accompanying university scientist participation in academic entrepreneurship and potential modification of their role identity. They suggest that scientists participate to preserve their academic role identity [17]. Clarysse et al. (2011) examine how academic professionals’ opportunity recognition capacity and their prior entrepreneurial experience affect the likelihood of their involvement in starting up a new venture and shape the social environment and roles of university technology transfer offices [18]. Abreu and Grinevich (2013) analyze determinants of academic engagement such as demographic factors (seniority and gender), research type, entrepreneurial experience and training, and institutional support [19]. Aldridge et al. (2014) examine motivations for scientists to start companies, specifically examining roles of scientist characteristics including academic rank, experience, networks and industry ties, access to human and financial resources, and supportive university conditions [20]. The earlier literature of resource-based theory describes the commercialization of academic research: resources that enable startup creation include knowledge assets, intellectual property assets, financial assets, social capital assets, personal assets, and organizational assets as discussed above. Except for organizational assets that are environmental factors, these assets are individual factors. However, for the biopharmaceutical domain, which is a particularly user-inspired, intense science-based technology commercialization field, we assume that the scientific prominence of researchers is more important than their other attributes. That point has not been explored in earlier studies of this field. This report describes our study assessing the notion that researchers’ standing in the biopharmaceutical academic community, which signals their scientific prominence, is a factor indicating their “startup readiness,” while supporting the basic view of resource-based theory. Furthermore, earlier studies examining factors leading to the creation of startups commercializing scientists' research make little or no reference to any of the following: (1) collection of real-time data related to intended entrepreneurial researchers’ academic and startup activities in broad disciplines in a scalable manner (although earlier studies survey past data of scientists in specified academic organizations or regions); (2) selection of specific research topics that entrepreneurial researchers are pursuing (although Abreu and Grinevich (2013) introduce life sciences as research fields with greater commercialization activity) [19]; and (3) bibliometric analyses of those researchers specifically in terms of their research topics in their research communities (although Rothaermel et al. (2007) and Aldridge et al. (2014) assess academic titles such as professor) [15, 20]. The industry segments presented herein highlight case studies in life sciences, specifically the biopharmaceutical domain. The classic definition of “biopharmaceutical,” both in science and industry, is pharmaceuticals (medicinal products, therapeutics, prophylactics, and in vivo diagnostics) with active agents that are inherently biological in nature and which are manufactured using biotechnology (products manufactured by or from living organisms, usually involving bioprocessing). In addition, “drug” is defined as a pharmaceutical that is inherently chemical (not biological) in nature and which is manufactured using chemical methods. Biopharmaceuticals are distinct from drugs, most of which are composed of small molecules or other synthetic chemical substances. The inherent differences between these two classes include product and active agent sources, identity, structure, composition, manufacturing methods and equipment, intellectual property, formulation, handling, dosing, regulation, and marketing [21]. The biopharmaceutical domain is selected for our case studies here for several reasons. (a) Much of life sciences such as biopharmaceutical research can be characterized as situated in Pasteur's quadrant, a classification of scientific research projects that is aimed at fundamental understanding of scientific problems and at providing immediate benefits for society. Many studies associated with that quadrant have revealed evidence of greater commercialization activities through academic entrepreneurship [22, 23, 24, 25, 26]. In addition, (b) biopharmaceutical science has expanded considerably in terms of both commercialization and entrepreneurship since the beginning of the 21st century. Of the top 10 selling pharmaceutical products worldwide in 2017, 9 were biopharmaceutical; also, 6 had origins in startup companies (Table 1). By contrast, 2001 had only one biopharmaceutical drug without a startup origin [27].
Table 1

Top 10 pharmaceutical products by Global sales in 2017 (million U.S. dollars).

RankProductTherapeutic SubcategoryVendor CompanyOriginator2017 Sales ($m)
1HumiraOther anti-rheumaticsAbbVie, EisaiKnoll18,923
2EnbrelOther anti-rheumaticsAmgen, Pfizer, TakedaImmunex, acquired by Amgen8,234
3RevlimidOther cytostaticsCelgene, BeiGeneCelgene8,211
4RituxanAnti-neoplastic MAbsRocheIDEC Pharmaceuticals, merged with Biogen7,528
5RemicadeOther anti-rheumaticsJohnson & Johnson, Merck & Co, Mitsubishi Tanabe PharmaCentocor, renamed Janssen Pharmaceutical7,172
6HerceptinAnti-neoplastic MAbsRocheGenentech7,126
7AvastinAnti-neoplastic MAbsRocheGenentech6,795
8EyleaEye/Ophthalmic preparationsRegeneron Pharmaceuticals, Bayer, Santen PharmaceuticalRegeneron Pharmaceuticals6,291
9OpdivoAnti-neoplastic MAbsBristol-Myers Squibb, Ono PharmaceuticalOno Pharmaceutical5,761
10Prevnar 13VaccinesPfizerWyeth5,693
Top 10 pharmaceutical products by Global sales in 2017 (million U.S. dollars). This study explores the proposition that, in user-inspired fundamental research such as that in the biopharmaceutical domain, specific knowledge regarding academic entrepreneurship can be attained as follows: (1) Global databases of startup finances and academic research papers can yield real-time data that are useful to analyze and predict academic entrepreneurship in broad disciplines on a global scale. (2) Data of startup finances and keyword analyses both of the data of startup finances and academic research papers can elucidate trends indicating which scientific topics are becoming active areas of research that enhance “startup-readiness.” (3) Bibliometric analyses of research papers related to the scientific topics above can reveal potentially entrepreneurial scientists with high startup readiness in industry segments related to the scientific topics. Results of this study contribute to a rich, convenient, and dynamic understanding of academic entrepreneurship. The computational approach used herein can be useful to assess the startup readiness of researchers in emerging research topics because names of research topics and authoring researchers are identifiable easily in real time. Better scalability and adaptability are obtainable using these names instead of published paper titles, personal interviews or field projects. Consequently, our method can be useful even for practitioners including venture capitalists and managerial entrepreneurs who are unfamiliar with science fields, but who aspire to engage in research-based startups. Our approach is also unique because it directly bridges authors in citation networks to a startup financing database, which enables us to relate citation network analysis to the startup and investment contexts. described above. Concretely, our bibliometric analyses first require construction of author citation networks derived from biopharmaceutical research paper citation networks. Then we refer the author data to the startup database to analyze whether the authors are biopharmaceutical startup participants such as a founder, chief scientific officer, or director. Herein, we present this method of evaluating researchers' startup readiness of biopharmaceutical startups. Then we measure the degree centrality of such authors’ nodes in author citation networks and co-authorship networks. Centrality represents the degree to which an author is central in terms of the position in the author citation network. Degree centrality can be ascertained using several methods: often, network analyses are conducted among actors (i.e. network nodes), as some earlier studies have done [28, 29, 30, 31, 32]. Our similar approach measured a centrality index with respect to targeted nodes and others to explain causes of their performance or activities. For an earlier study that analyzed author citation networks, Ding applied PageRank and weighted PageRank algorithms in author citation networks to assess the popularity and prestige of scholars [33]. Our approach differs: Ding analyzed only first authors, whereas we analyzed all co-authors comprehensively in their research field. An earlier study by Cainelli et al. (2015) analyzed co-authorship of economists and co-authorship effects using social network analysis and economic analysis to assess co-authorship structures and to explain researcher productivity in terms of variables including cooperation and two centrality indexes: betweenness and closeness [34].

Methodology

This section presents the methodology proposed for this research. The analytical scheme is depicted in Figure 1.
Figure 1

Methodology proposed herein.

Methodology proposed herein. Using the VentureSource global database of startups, we analyzed all financing deals struck between January 1, 2017 and December 31, 2017 to construct a list of industry segments that were most active in venture capital finance in 2017 (A-1). We narrowed this analysis to 2017 while analyzing the years 2014–2017 to identify the most rapidly growing keyword use in research papers and to create author citation networks and co-authorship networks associated with those keywords (A-3 and A-4) because we were interested in how “hot” each industry segment in VentureSource in the latest year. VentureSource, compiled by Dow Jones & Company, is a comprehensive global database of companies backed by venture capital and private equity in every region, industry, and stage of development. The database includes daily data of startup investment deals according to a specific industry code/subcode. Information related to financing amounts, financing rounds, company overview keywords, startup participants, etc. is available. Using VentureSource, based on the sum of the ranks of both the average financing size and the number of financing rounds (A-1), we compiled a ranked list of the top 30 most active financing industry codes/subcodes among all 281 VentureSource industry codes/subcodes in 2017. Then we extracted industry codes/subcodes for the VentureSource “Biopharmaceutical” industry segment. For startups in the target codes/subcodes belonging to the biopharmaceutical segment on VentureSource, we surveyed keywords that appeared multiple times (A-2). We analyzed the keyword frequency in research papers of 2014–2017 by searching the Web of Science Core Collection database compiled by Clarivate Analytics. We identified the keywords showing the most growth in use during that period for each target code/subcode, as emerging research topics (A-3). Then, we created author citation networks and co-authorship networks based on the research papers for each selected keyword (A-4). Simultaneously, we detected authors in both networks who were participating in startups related to those keywords (A-5). Finally, after analyzing the distribution of the startup participant authors in the author citation networks and in the co-authorship networks to assess their degree centrality (B-1), we tested the top 10% degree centrality authors of both networks to infer their “startup readiness” (B-2).

Constructing a list of industry fields most actively financed based on average financing size and number of financing rounds in 2017

Using the VentureSource database, we constructed lists of the most active financing industry fields based on the average financing size and the number of financing deals per field during the 365 days of 2017. We analyzed 2017 to ascertain how financially active each industry segment was during the latest year among the years 2014–2017, throughout which we conducted analyses for A-3 and A-4, as described later. According to VentureSource, 17,681 financing deals were concluded in 2017. We sorted all of their 281 industry codes/sub-codes in descending order by the average financing deal size and by the number of financing rounds. We then constructed our top 30 (approximately top 10%) ranking of most actively financing industry fields, arranged based on the sum of the orders of both the average financing size and the number of financing rounds. We examined the number of financing rounds in addition to the average financing size because this analysis primarily addresses venture capital financing where the number of rounds of financing in seed or early stage companies is important. Private equity investment deals typically have a few large investment rounds in middle or later stage companies (Table 2).
Table 2

Top 30 most actively financing industry fields among 281 VentureSource industry codes/subcodes based on 17,681 financing deals during January 1, 2017 through December 31, 2017.

RankIndustry SegmentIndustry Code/SubcodeAverage Finance SizeOrder# of RoundsOrder
1Travel and LeisureTransportation Services148.25422418
2Financial Institutions & ServicesLending46.31253595
3Consumer Information ServicesShopping Facilitators34.23369211
4Business Support ServicesFacilities/Operations Management32.354327213
5Consumer Information ServicesEmail/Messaging58.61169752
6Financial Institutions & ServicesInsurance33.503813336
7Financial Institutions & ServicesRetail Investment Services/Brokerages63.95158160
8Wholesale Trade and ShippingLogistics/Delivery Services29.725116629
9BiopharmaceuticalsPharmaceuticals32.284412938
10Financial Institutions & ServicesPayment/Transactional Processing20.608631310
11Business Support ServicesData Management Services18.50943396
12BiopharmaceuticalsBiotechnology Therapeutics23.877215432
13Financial Institutions & ServicesReal Estate23.467314035
14Electronics & Computer HardwareConsumer Electronics17.71993169
15BiopharmaceuticalsImmunotherapy/Vaccines29.05549355
16Medical Devices & EquipmentMedical Lab Instruments/Test Kits34.51356574
17Machinery & Industrial GoodsGeneral Industrial Goods35.91346077
18Vehicles and PartsAutomotive Parts50.62194992
19RetailersFood/Drug Retailers22.737611841
20BiopharmaceuticalsSmall Molecule Therapeutics25.726710451
21BiopharmaceuticalsGene Therapy30.58476873
22SoftwareSecurity15.121153367
23RetailersVehicle Parts Retailers/Vehicle Dealers49.572345100
24Travel and LeisureTravel Arrangement/Tourism17.5310017925
25Consumer Information ServicesEntertainment15.8511026416
26Financial Institutions & ServicesInstitutional Investment Services19.799112937
27Vehicles & PartsAutomobiles157.79230130
28Business Support ServicesProcurement/Supply Chain16.1510816030
29Media and ContentBroadcasting39.562837112
30Electronics & Computer HardwareElectronic Components/Devices15.0811719423

Note: Rank here is based on the sum of both orders.

Top 30 most actively financing industry fields among 281 VentureSource industry codes/subcodes based on 17,681 financing deals during January 1, 2017 through December 31, 2017. Note: Rank here is based on the sum of both orders. From the top 30 industry fields above, we extracted industry codes/subcodes of the “Biopharmaceuticals” industry segment as case studies for exploration. Particularly, we selected five industry codes/subcodes for an additional survey presented below: “Pharmaceuticals,” “Biotechnology Therapeutics,” “Immunotherapy/Vaccines,” “Small Molecule Therapeutics” and “Gene Therapy.”

Extraction of keywords related to active financing of biopharmaceutical industry fields

Keywords representing company overviews are available in the VentureSource database. From VentureSource, we extracted keywords representing company overviews of the startups with the five industry codes/subcodes described in A-1, by gleaning those keywords from company overviews using the “webdriver” module of Python's “selenium” library. Subsequently, we constructed keyword lists for each industry code/subcode. Keywords appearing twice or more on each list are presented in Table 3, with calculations of each keyword's appearance frequency using Python's “pandas” library.
Table 3

VentureSource keywords appearing twice or more for startups related to actively financing biopharmaceutical industry fields in 2017.

Appearance FrequencyKeywords (Appearance Frequency)
Pharmaceuticals

5 or Greaterdrug∗' (20), 'medic∗' (12), 'cancer∗' (12), 'therap∗' (8), 'pharma∗' (8), 'health∗' (8), 'pharm∗' (8), 'nan' (7), 'pain∗' (5), 'vitamin∗' (5), 'tumor∗' (5), 'marijuana∗' (5), 'cannabis∗' (5), 'disease∗' (5), 'pharmaceutic∗' (5), 'protein∗' (5)
3–4vaccine∗' (4), 'supplement∗' (4), 'skin∗' (4), 'oncolog∗' (4), 'antibiotic∗' (4), 'nutrition∗' (4), 'capsule∗' (4), 'intermediate∗' (4), 'biotech∗' (3), 'rare disease∗' (3), 'bacteria∗' (3), 'cardi∗' (3), 'diabetes' (3), 'plant∗' (3), 'treatment∗' (3), 'oncology∗' (3), 'therapeutic∗' (3), 'child∗' (3), 'cannabis' (3), 'medicine∗' (3), 'treat∗' (3), 'API∗' (3), 'enzyme∗' (3), 'immun∗' (3), 'blood∗' (3), 'tablet∗' (3)
2osteo∗', 'OA', 'herpes', 'probiotic∗', 'intestin∗', 'digest∗', 'molecule∗', 'small molecul∗', 'chemother∗', 'immunoth∗', 'opioid∗', 'analges∗', 'neuro∗', 'molecul∗', 'brain∗', '∗gum∗', '∗candy∗', '∗infect∗', 'gynecolog∗', 'clinic∗', 'antimicrob∗', 'bacteri∗', 'broad-spectrum', 'cardio∗', 'develop∗', 'allerg∗', 'neurolog∗', 'cancer', 'respiratory∗', 'asthma∗', 'immune∗', 'antibod∗', 'obes∗', 'compound∗', 'antibiot∗', 'bacteria', 'super∗bug∗', 'injection∗', 'prescription', 'health', 'autoimmune∗', 'Chinese medicine', 'herb∗', 'diagnos∗', 'exosome∗', 'cosme∗', 'tissue∗', 'wrinkle∗', 'API', 'glaucoma∗', 'infect∗', 'nutraceutic∗'

Small Molecule Therapeutics

5 or Greaterdrug∗' (23), 'cancer∗' (22), 'therap∗' (20), 'molecule∗' (15), 'disease∗' (9), 'immun∗' (8), 'neuro∗' (8), 'health∗' (8), 'cancer' (8), 'small molecule∗' (7), 'tumor∗' (6), 'treat∗' (6), 'protein∗' (6), 'antibod∗' (6), 'cell∗' (5), 'infect∗' (5), 'molecul∗' (5), 'inflam∗' (5), 'treatment∗' (5), 'cardio∗' (5), 'small molecule' (5), 'oncolog∗' (5)
3–4therapeutic∗' (4), 'medic∗' (4), 'pathogen∗' (4), 'chronic∗' (4), 'Alzheimer∗' (4), 'biotech∗' (3), 'immuno∗' (3), 'food' (3), 'ion channel' (3), 'inhibit∗' (3), 'nano∗' (3), 'immune∗' (3), 'C5a′ (3), 'terminal∗' (3), '∗infect∗' (3), 'therapeut∗' (3), 'bio∗' (3), 'kinase∗' (3), 'tumor' (3), '∗onco∗' (3), 'diagnos∗' (3), 'PSVT' (3), 'calcium∗' (3), '∗ventricul∗' (3), 'brain∗' (3), 'neurodegenerat∗' (3)
2biopharma∗', 'affinity purification', 'bioprocess∗', 'onco∗', 'AI', 'substitut∗', 'plant∗', 'milk', 'cheese∗', 'mayonnaise∗', 'yogurt∗', 'emul∗', 'mitochondrial', 'eosinophil', 'leukocyte', '∗immun∗', 'skin∗', 'spray∗', 'resist∗', 'bacteria∗', 'antibiotic', 'AF', 'heart∗', 'peptide∗', '∗cell∗', 'derma∗', 'nerv∗', 'molecule', 'oncolo∗', 'rare disease∗', 'ossific∗', 'fibrodysplasia', ' (RAR)', 'integrin', 'antibiotic∗', 'breast∗', '∗cancer', 'treatment', 'neurolog∗', 'protein', 'lung', 'apoptosis', 'respiratory∗', 'therapeutic', '∗cancer∗', 'fibrosis', 'GSK-3ß′, 'Glycogen Synthase Kinase', 'bipolar disorder', 'diabetes'

Biotechnology Therapeutics

5 or Greatercancer∗' (28), 'therap∗' (20), 'drug∗' (16), 'disease∗' (12), 'biotech∗' (11), 'cell∗' (9), 'health∗' (8), 'diseas∗' (7), 'disorder∗' (7), 'diagnos∗' (7), 'gene∗' (7), 'tumor∗' (7), 'medic∗' (6), 'bacteria∗' (6), 'neuro∗' (6), 'DNA' (6), 'oncolog∗' (6), 'Alzheimer∗' (5), 'cleantech' (5), 'treat∗' (5), 'cancer' (5), 'tissue∗' (5)
3–4molecule∗' (4), 'immune∗' (4), 'therapeut∗' (4), 'antibod∗' (4), 'therapeutic∗' (4), 'industry focused products and services' (4), 'infect∗' (4), 'regenerat∗' (4), 'protein∗' (4), 'microbiome' (4), 'treatment∗' (4), 'immun∗' (4), 'neurolog∗' (3), 'Alzheimer's∗' (3), 'spinal cord∗' (3), 'injur∗' (3), '∗amyloid∗' (3), 'receptor' (3), 'central nervous system∗' (3), 'CNS' (3), 'antibiotic∗' (3), 'compound∗' (3), 'onco∗' (3), 'CMBC' (3), 'pharma∗' (3), 'skin∗' (3), 'molecule' (3), 'vaccine∗' (3), 'inhibitor∗' (3), 'inflam∗' (3), 'respirat∗' (3), '∗cancer∗' (3), 'bone∗' (3), 'pharm∗' (3)
2blood∗', 'platelet∗', 'manufact∗', 'contract∗', 'life scienc∗,', 'metaboli∗', 'inflammat∗', 'microbe∗', 'inhal∗', 'cardiovascular', 'PAF', 'biotech∗ atria∗', 'arrhythmia', 'paediatr∗', 'nephrolog∗', 'renal∗', 'neurologic∗', 'orphan∗', 'anaesthes∗', 'tubulo∗', 'oil', 'protein', 'insect∗', 'agriculture', 'lysom∗', 'stor∗', 'HSP', 'misfold∗', 'degenerat∗', 'hear∗', 'cochlear∗', 'ear∗', 'noise', 'restor∗', 'cardiovascular∗', 'myelofibrosis', 'MF', 'JAK2′, 'ocular∗', 'probiotic∗', 'research∗', 'clinical trial∗', 'osteoporos∗', 'hypoparathy∗', 'hypoparathyroidism∗', 'biopharm∗', 'drug discover∗', 'genomic∗', '∗microbiome∗', 'th + F5eepeutic∗', 'urea cycle disorder∗', 'pathogenic∗', '∗bacteria∗', '∗health∗', 'fish∗', '∗inflamation + F23∗', 'detect∗', 'vir∗', 'T-cell∗', '∗immune∗', 'dermatolog∗', 'aesthetics', 'plasmotic', 'acne', 'hair removal', 'topical', 'vascular', 'COPD', 'addict∗', 'opiate∗', 'alcohol', 'abuse∗', 'silk∗', 'aesthetic∗', 'defect∗', 'diabetes', 'obes∗', 'molecular∗', 'rare disease∗', '∗skelet∗', 'drug', 'molecul∗', 'metabolic', 'intestin∗', 'antibody'

Gene Therapy

5 or Greatergene∗' (29), 'therap∗' (19), 'cancer∗' (16), 'DNA' (10), 'drug∗' (7), 'genomic∗' (5), 'cancer' (5), 'health∗' (5), 'oncolog∗' (5), 'cell∗' (5)
3–4DNA∗' (4), 'genom∗' (4), 'patient∗' (4), 'immun∗' (4), 'CRISPR' (4), 'gene therap∗' (4), 'cure' (4), 'therapy' (4), 'gen∗' (4), 'disease∗' (4), 'biotech∗' (4), 'virus' (3), 'gene' (3), 'AAV' (3), 'virus∗' (3), 'diseas∗' (3), 'immune∗' (3), 'brain' (3)
2Parkinson∗', 'DTC genetic test∗', 'therapeutics∗', 'genetic research∗', 'medic∗', 'adeno∗', 'treat∗', 'Cas9′, 'duchenne', 'dystroph∗', '8muscular disease∗', 'viral∗', 'tumor∗', 'research∗', 'animal∗', 'livestock', 'agricultur∗', 'breed∗', '∗medic∗', 'retina∗', 'dystrop∗', 'choroid∗', 'degenerat∗', 'CHM', 'REP-1′, 'protein∗', 'rare disorder∗', 'treatment∗', 'adeno', 'medical', 'treatment', 'health', 'stem cell∗', 'CAR-T′, 'HIV', '∗gene∗', 'chimer∗', 'receptor∗', '∗cell∗', 'CAR', 'life scien∗', 'genetic engineer∗', 'personal∗ medic∗', 'molec∗ bio∗', 'HCP', 'cell protein', 'genome∗', 'glioblastoma'

Immunotherapy/Vaccines

5 or Greater'cancer∗' (31), 'immun∗' (22), 'tumor∗' (15), 'therap∗' (14), 'antibod∗' (13), 'disease∗' (13), 'vaccine∗' (12), 'infect∗' (12), 'cancer' (11), 'drug∗' (11), 'vaccin∗' (10), 'virus∗' (8), '∗immun∗' (8), 'antigen∗' (7), 'T cell∗' (6), 'cell∗' (6), '∗therap∗' (5), 'protein∗' (5)
3–4'oncolog∗' (4), 'allerg∗' (4), '∗cancer∗' (4), 'immuno∗' (4), 'disease' (3), 'biotech∗' (3), 'inflammat∗' (3), 'viral∗' (3), '∗tumor∗' (3), 'antibody∗' (3), 'ADC' (3), 'medicine∗' (3), 'ag∗' (3), 'immunosenescence' (3), '∗infect∗' (3), 'antibiotic∗' (3), 'inflam∗' (3), 'medic∗' (3), 'prevent∗' (3), 'RSV' (3), 'target∗' (3), 'T-cell' (3)
2'therapeutic∗', 'oncology', 'RNA', 'DNA', 'HIV', 'TME', 'patient', 'pathogen∗', 'bacteria∗', 'monoclonal', 'complement system', 'COPD', 'AMD', 'PNH', 'virus', 'treatment', 'patho∗', '∗oncolog∗', 'respirat∗', 'licens∗', 'rhinovirus', 'cold∗', 'asthma∗', '∗vir∗', 'mosquito∗', 'Zika', 'Dengue∗', '∗fever∗', 'Hepati∗', 'nanomedicine∗', 'colorectal∗', 'purif∗', 'oncol∗', 'viral', 'dendritic', 'biopharma∗', 'diseas∗', 'oncobio∗', 'microb∗', 'tumor', 'antigen'
VentureSource keywords appearing twice or more for startups related to actively financing biopharmaceutical industry fields in 2017.

Identifying research paper keywords with most usage growth for emerging research topics of actively financing biopharmaceutical industry fields of 2014–2017

Then, we conducted queries of all keywords in Table 3 into the Web of Science Core Collection database to analyze growth in the frequency of those keywords that appeared in the title, keywords, or Keyword Plus of the research papers during 2014–2017. These consecutive years were observed to elucidate the growth of specific research topics throughout what we deemed as an appropriately certain length of time, four years, as determined by reference to leading cases in CRISPR and Cas9, with the maximum years to founding of a startup because its initial associated research paper took four years [4]. Rankings of keywords showing the most growth in incidence for each industry code/subcode were constructed based on growth multiples (Table 4).
Table 4

Keyword frequency growth multiple in web of science core collection related to active financing in biopharmaceutical industry fields during 2014–2017.

RankKeywordStart Year CountEnd Year CountGrowth (×)
Pharmaceuticals

1therap166.00
2cardi273.50
3super∗bug9293.22
4exosome4008802.20
5cosme482.00
6immun7142.00
7allerg122.00
8antibod122.00
9obes23441.91
10marijuana99314311.44

Small Molecule Therapeutics

1therap166.00
2inflam133.00
3immun7142.00
4ventricul122.00
5PSVT9141.56
6mayonnaise41601.46
7yogurt3965761.45
8ossific7101.43
9onco1261781.41
10rare disease8249114291.39

Biotechnology Therapeutics

1therap166.00
2inflam133.00
3microbiome173244032.54
4immun7142.00
5injur122.00
6inflammat122.00
7diseas482.00
8obes23441.91
9restor23351.52
10paediatr8121.50

Gene Therapy

1therap166.00
2Cas944823845.32
3CAR-T974855.00
4CRISPR69431384.52
5DTC genetic test13292.23
6immun7142.00
7diseas482.00
8CHM1041521.46
9AAV5668131.44
10molec∗ bio133518291.37

Immunotherapy/Vaccines

1Zika26231088.85
2therap166.00
3inflam133.00
4allerg122.00
5immun7142.00
6antibod122.00
7diseas482.00
8inflammat122.00
9TME1923651.90
10Dengue177527291.54
Keyword frequency growth multiple in web of science core collection related to active financing in biopharmaceutical industry fields during 2014–2017. From the rankings presented above, we extracted keywords with incidences that more than doubled between 2014 (Start Year) and 2017 (End Year) and which appeared more than 100 times in 2017 (End Year) for an additional survey. Five keywords met these criteria from each industry code/subcode. All are emerging research topics that have attracted academic attention to a rapidly increasing degree.

Pharmaceuticals: exosome

Exosomes are small microvesicles released from late endosomal compartments of cultured cells in many and eukaryotic fluids, including blood and urine [35, 36]. They are either released from the cell when multivesicular bodies fuse with the plasma membrane, or are released directly from the plasma membrane https://en.wikipedia.org/wiki/Exosome_(vesicle) [37]. Moreover, they have specialized functions such as coagulation, intercellular signaling, and waste management [35]. Consequently, growing interest has arisen in their clinical applications. Exosomes might be used for therapy and prognosis, or as biomarkers for health and disease.

Biotechnology Therapeutics: Microbiome

Microbiome refers to ecological communities of commensal, symbiotic and pathogenic microorganisms https://en.wikipedia.org/wiki/Microbiota [38, 39] living in and on all multicellular organisms from plants to animals. A microbiome denotes either the collective genomes of microorganisms occupying an environmental niche or the microorganisms themselves https://en.wikipedia.org/wiki/Microbiota [40, 41, 42]. By influencing adaptive and innate immune functions, a microbiome can promote or disrupt human health [43].

Gene Therapy: CRISPR, Cas9, and CAR-T

In the technology designated as “clustered, regularly interspaced, short palindromic repeats” (CRISPR) and the CRISPR-associated protein 9 (Cas9), the Cas9 enzyme functions as a fundamental part of a larger construct in which an RNA molecule guides the targeting of any possible matching DNA sequence. It is actually used to specify the critical site of cleavage. Because CRISPR–Cas has emerged as a highly flexible research tool for genome editing with the potential to enable researchers to manipulate the genome precisely, including the medical use of the system for directly treating genetic disorders, it has been publicized widely over fundamental parts of the CRISPR–Cas9 system [44, 45, 46]. The combination of chimeric antigen receptors (CARs) and artificial T cell receptors, CAR-T, is engineered receptors which graft an arbitrary specificity onto an immune effector cell (T cell). Typically, these receptors are used to graft the specificity of a monoclonal antibody onto a T cell, with transfer of their coding sequence facilitated by retroviral vectors. These receptors are chimeric because they comprise parts from different sources. The general premise of CAR T-Cells is rapid generation of T-Cells targeted to specific tumor cells. Once the T-Cell has been engineered to become a CAR T-Cell, it acquires supraphysiologic properties and develops the capability to act as a ‘Living Drug’ [47, 48, 49].

Immunotherapy and vaccines: Zika

Zika fever, also known as Zika virus disease or simply Zika, is an infectious disease caused by the Zika virus https://en.wikipedia.org/wiki/Zika_fever [50]. Symptoms include red eyes, joint pain, headache, fever, and a maculopapular rash [51, 52]. Although it has caused no associated fatalities https://en.wikipedia.org/wiki/Zika_fever [53], mother-to-child transmission during pregnancy can cause microcephaly and other brain malformations in babies https://en.wikipedia.org/wiki/Zika_fever [54]. An outbreak that started in Brazil in 2015 spread to the Americas, Pacific, Asia, and Africa. This eventuality led to the World Health Organization's declaration of Zika as a Public Health Emergency of International Concern in February 2016 [55]. Zika virus was little studied until the major outbreak. No specific antiviral treatment is available today [56].

Creation of author citation networks and Co-authorship networks related to keyword incidence in research papers relative to active financing in biopharmaceutical industry fields

Papers published during 2013–2017 including the previously described highest-growth keywords relative to actively financed biopharmaceutical industry fields of “exosome," “microbiome,” “CRISPR,” “Cas9,” “CAR-T,” or “Zika" in the title, abstract, or keywords were extracted from the Web of Science. We targeted those papers for compilation of datasets to extract names of all authors and paper citation-related information to create author citation networks and co-authorship networks. First, from the extracted data, we created lists of pairs (edge lists) of cited-and-citing papers. Using author lists for the respective papers, we constructed edge lists of cited-and-citing authors including all co-authors, irrespective of order, for all pairs of cited-and-citing papers. For instance, in a case in which paper A with five authors is cited by paper B with three authors, we created 15 (5 × 3) pairs as edge lists to build up a whole new author citation network based on them. For duplicate authors both in cited and citing papers, we eliminated pairs of duplicates to produce pairs comprising different authors only. The method described above is applied to all pairs of cited-and-citing papers. Subsequently, we introduced author networks of twokinds in an unweighted and undirected manner as described below. To observe how central the startup participant authors are and how they are distributed in the citation networks, we constructed new author citation networks with authors as nodes and with citation relations as edges (links) of these networks. The networks were based on edge lists of cited-and-citing authors with respect to each targeted keyword. At the same time, we constructed co-authorship networks from the papers above. The co-authorship network is a social network in which the authors, through participation in one or more publications through an indirect path, have been mutually linked, whereas author citation networks are based on citation relations among the authors. Therefore, we inferred that we might observe different characteristics related to how central the startup participant authors are and how they are distributed, between author citation networks and co-authorship networks (Table 5).
Table 5

Comparison among research paper citation networks, author citation networks, and Co-authorship networks relative to growing keywords in actively financed biopharmaceutical industry fields in 2014–2017.

Paper Citation NetworkAuthor Citation NetworkCo-authorship Network
Exosome

Node Count1,94111,08411,059
Edge Count7,625379,18057,697

Microbiome

Node Count8,81437,11636,877
Edge Count38,1341,694,176233,184

CRISPR

Node Count5,45125,41125,251
Edge Count52,9451,742,614171,281

Cas9

Node Count3,97419,89319,808
Edge Count38,8561,415,583133,925

CAR-T

Node Count6853,3023,281
Edge Count4,377277,37725,106

Zika

Node Count2,98713,13712,943
Edge Count29,1961,864,665102,358
Comparison among research paper citation networks, author citation networks, and Co-authorship networks relative to growing keywords in actively financed biopharmaceutical industry fields in 2014–2017.

Detecting startup participants in networks relative to emerging research topics in research papers related to actively financed biopharmaceutical industry fields

Using the VentureSource database again, we queried all names of nodes (authors) in the author citation networks and co-authorship networks to detect startup participants relative to emerging research topics represented by highest-growth keywords. We assumed that virtually any researcher with high centrality in those networks who participated in their startups were present in VentureSource because biopharmaceutical startups require large amounts of professional capital to conduct substantive research and development activities. Dow Jones, the compiler of this database, had covered such industry-focused investors intensively and comprehensively. The rankings of the startup participant authors were inferred from their degree centralities both in their author citation networks and co-authorship networks for each keyword showing increasing frequency, based on the sum of both regularized orders squared. Rankings of startup participant authors for each most growing keyword are shown in the Appendix.

Results

Constructing startup participant authors’ distribution regarding degree centrality rankings in author citation networks and Co-authorship networks relative to growing keywords

For each growing keyword above, we mapped a scatter diagram of the distribution of startup participant authors in terms of their rankings of degree centralities both in the author citation networks and the co-authorship networks (Figure 2). We presented top authors alongside their degree centrality for the author citation networks and the co-authorship networks for each keyword (Table 6).
Figure 2

Scatter Diagram, Distribution of Startup Participant Authors' Degree Centrality in Author Citation Networks & Co-authorship Networks based on Ratio from Top to Bottom, in Emerging Research Topics in Actively Financed Biopharmaceutical Industry Fields in 2014–2017

Table 6

Top authors’ degree centrality in author citation networks & Co-authorship networks in emerging research topics in actively financed biopharmaceutical industry fields for 2014–2017.

Author Citation Network
Co-Authorship Network
RankAuthorDegCentRankAuthorDegCent
[Exosome]

1Ochiya, Takahiro0.11151Jensen, Torben Heick0.0109
2Minn, Andy J.0.09922Ochiya, Takahiro0.0088
3Ghajar, Cyrus M.0.08093Liu, Yutao0.0077
4Mallya, Kavita0.08094Minn, Andy J.0.0065
5Zhang, Tuo0.08095Lotvall, Jan0.0061
6Zhang, Haiying0.08096Sandelin, Albin0.0060
ePantel, Klaus0.08097Chen, Yun0.0059
8Muller, Volkmar0.08098Helwa, Inas0.0057
9Fodstad, Oystein0.08099Gho, Yong Song0.0056
10Hernandez, Jonathan0.080910Dismuke, W. Michael0.0055

[Microbiome]

1Knight, Rob0.23431Knight, Rob0.0207
2Huttenhower, Curtis0.15312Huttenhower, Curtis0.0134
3Gevers, Dirk0.12483Wang, Jun0.0096
4Xavier, Ramnik J.0.12424Gilbert, Jack A.0.0089
5Fischbach, Michael A.0.11215Kristiansen, Karsten0.0085
eTurnbaugh, Peter J.0.10476Bork, Peer0.0082
7Wang, Jun0.10287Xavier, Ramnik J.0.0082
8Gootenberg, David B.0.10078Raes, Jeroen0.0077
9Gonzalez, Antonio0.10059Dore, Joel0.0072
10David, Lawrence A.0.099910Li, Junhua0.0069

[CRISPR]

1Zhang, Feng0.45561Zhang, Feng0.0146
eDoudna, Jennifer A.0.28342Katsanis, Nicholas0.0122
eLander, Eric S.0.26493Kim, Jin-Soo0.0097
4Scott, David A.0.25384Zhang, Wei0.0082
eJoung, J. Keith0.25255Huang, Xingxu0.0076
6Hsu, Patrick D.0.24556Davis, Erica E.0.0076
7Shalem, Ophir0.24227Li, Wei0.0069
8Kim, Jin-Soo0.21798Root, David E.0.0069
9Sanjana, Neville E.0.20379Doudna, Jennifer A.0.0067
10Charpentier, Emmanuelle0.202410Wang, Yan0.0064

[Cas9]

1Zhang, Feng0.47281Zhang, Feng0.0163
eDoudna, Jennifer A.0.29422Katsanis, Nicholas0.0126
3Scott, David A.0.29223Kim, Jin-Soo0.0111
4Hsu, Patrick D.0.28514Huang, Xingxu0.0098
5Joung, J. Keith0.27865Davis, Erica E.0.0097
6Lander, Eric S.0.27626Zhang, Wei0.0091
7Kim, Jin-Soo0.24957Wang, Yan0.0081
8Shalem, Ophir0.24648Cho, Megan T.0.0081
9Sharp, Phillip A.0.22519Hildebrandt, Friedhelm0.0079
10Root, David E.0.221410Liu, Wei0.0074

[CAR-T]

1June, Carl H.0.63191June, Carl H.0.0790
2Kochenderfer, James N.0.57802Lacey, Simon F.0.0473
3Sadelain, Michel0.55323Dotti, Gianpietro0.0430
4Stetler-Stevenson, Maryalice0.51534Cooper, Laurence J. N.0.0418
5Rosenberg, Steven A.0.50895Jensen, Michael C.0.0415
6Mackall, Crystal L.0.49986Levine, Bruce L.0.0381
7Fry, Terry J.0.49957Sadelain, Michel0.0372
8Davila, Marco L.0.46478Brenner, Malcolm K.0.0348
eRiviere, Isabelle0.45539Kochenderfer, James N.0.0341
10Feldman, Steven A.0.455310Maus, Marcela V.0.0323

[Zika]

1Musso, Didier0.52211Honein, Margaret A.0.0240
2Cao-Lormeau, Van-Mai0.47692Jamieson, Denise J.0.0237
3Jamieson, Denise J.0.41333Meaney-Delman, Dana0.0226
4Honein, Margaret A.0.40244Shi, Pei-Yong0.0172
5Teissier, Anita0.38355Kraemer, Moritz U. G.0.0158
6Roche, Claudine0.37956Diamond, Michael S.0.0155
7Petersen, Lyle R.0.37577Khan, Kamran0.0153
8Mallet, Henri-Pierre0.34428Moore, Cynthia A.0.0152
9Diamond, Michael S.0.34099Weaver, Scott C.0.0151
10Vasilakis, Nikos0.326410Oduyebo, Titilope0.0149
Scatter Diagram, Distribution of Startup Participant Authors' Degree Centrality in Author Citation Networks & Co-authorship Networks based on Ratio from Top to Bottom, in Emerging Research Topics in Actively Financed Biopharmaceutical Industry Fields in 2014–2017 Top authors’ degree centrality in author citation networks & Co-authorship networks in emerging research topics in actively financed biopharmaceutical industry fields for 2014–2017. For startup activities in which these startup participant authors are engaged, one can extract information from the VentureSource database related to the role of participants, company overview, financing to date, and so on. Although we did not conduct thorough case studies across all startup companies engaged by all participants for the analyses described in this paper, we listed the 18 top-degree centrality startup participants with their 15 startups in each emerging research topic in Table 7 to verify the collectiveness and relevance of the startup participant author pool and the significance of the selected names by exemplifying several top startup participant authors for each emerging research topic. These startups have been successful either at raising venture capital, achieving an IPO, or being acquired by big pharmaceutical companies according to the VentureSource database. We can access up-to-date information for each relevant startup using VentureSource.
Table 7

Startup Activities for which Top Startup Participants Are Engaged Relative to Each Emerging Research Topic.

Startup ParticipantRoleStartup CompanyCompany OverviewBrief DescriptionMost Recent Financing (MM)
Exosome
Zhang, BinVPCisen Pharmaceutical Co. Ltd.Manufacturer of chemical pharmaceutical agents and related products such as non- polyvinyl chloride (PVC) soft bag infusions, plastic bottle infusions, lyophilized powder injections, tablets, ointments, eye drops, and capsules09/29/2017IPO1166.00RMB (Chinese Yuan)
Chen, WeiBoard Member, OutsiderImmunophotonics, Inc.Developer of a cancer treatment09/04/2014VC 1st2.49 USD
Xu, BinEVPGrandhope Biotech Co. Ltd.Provider of medical materials and devices for the treatment of damaged tissue and organs07/06/2011IPO278.4 RMB (Chinese Yuan)
Johansson, Henrik J.Unknown ExecutiveHalo Genomics ABDeveloper of targeted re-sequencing technology for DNA sequencingAcquired by Agilent

Microbiome

Xavier, Ramnik J.CofounderJnana TherapeuticsDeveloper of drugsthat target cellular proteins12/14/2017VC 1st50.00 USD
de Vos, Willem M.Chairman, Scientific Advisory BoardMicroDish BVDeveloper of micro-engineered culture chips and nanoscale reagents aiming to improve microbial culture03/31/2011VC 1stN.A.
Mazmanian, SarkisDirectorAxialBiotherapeutics Inc.Developer of biotherapeuticsthat target neurological diseases and disorders06/22/2017VC 1st 19.20 USD

CRISPR

Zhang, FengCofounderEditas MedicineDeveloper of human therapeutics based on genome editing technologies02/03/2016IPO94.40 USD
Doudna, Jennifer A.CofounderEditas MedicineSame as aboveSame as above
CofounderIntellia Therapeutics, Inc.Provider of CRISPR-Cas9 focused biotechnology05/06/2016IPO108.00USD
Joung, J. KeithCofounderEditas MedicineDeveloper of human therapeutics based on genome editing technologiesSame as above

Cas9

Zhang, FengCofounderEditas MedicineDeveloper of human therapeutics based on genome editing technologiesSame as above
Doudna, Jennifer A.CofounderEditas MedicineSame as aboveSame as above
CofounderIntellia Therapeutics, Inc.Provider of CRISPR-Cas9 focused biotechnologySame as above
Joung, J. KeithCofounderEditas MedicineDeveloper of human therapeutics based on genome editing technologiesSame as above

CAR-T

June, Carl H.CofounderTmunity TherapeuticsDeveloper of T-cell immunotherapies01/23/2018VC 2nd100.00USD
Sadelain, MichelCofounderJuno Therapeutics Inc.Developer of medicines to treat cancer12/19/2014IPO264.55USD
Riviere, IsabelleCofounderJuno Therapeutics Inc.Same as aboveSame as above

Zika

Osorio, Jorge E.Chief Scientific OfficerInviragen Inc.Developer of vaccines to protect against infectious diseases06/07/2013Acquired by Takeda
Schinazi, Raymond F.FounderPharmasset Inc.Developer of drugs to treat viral infections04/27/2007IPO45.00 USD
Board Member, OutsiderGliknik Inc.Developer of cancer, autoimmune, and inflammatory disease therapies09/30/2013Corporate Partnership25.00
Board Member, OutsiderReViral Ltd.Developer of antiviral drugs focused on diseases caused by the respiratory syncytial virus (RSV)09/08/2015VC 1st21.00 USD
Startup Activities for which Top Startup Participants Are Engaged Relative to Each Emerging Research Topic. Results demonstrated that the startup participants are concentrated near the top of both networks, especially around the top 10%, as presented in Figure 2. Correlation between both the ranks of the author citation networks and those of the co-authorship networks to the same startup participants was to some degree: their correlation coefficients were 0.337 (exosome), 0.464 (microbiome), 0.371 (CRISPR), 0.337 (Cas9), 0.505 (CAR-T), and 0.528 (Zika).

Hypothesis testing of top 10% degree centrality authors in author citation networks and Co-authorship networks related to “startup readiness” for each emerging research topic

From observations presented in B-1 regarding the six emerging research topics in the biopharmaceutical domain, we hypothesized that the proportion of startup participants is higher among authors of the top 10% degree centrality in both networks (designated hereinafter as “Dual Top 10% Authors”) than it is among authors who have no such high centrality. To conduct testing of our hypothesis, we used Fisher's exact test to infer significance of differences in the observed proportions. Fisher's exact test, a test of statistical significance used for analysis of contingency tables, assesses significance of deviation from a null hypothesis, or P-value, calculated exactly as long as the contingency tables' row and column totals are fixed, rather than relying on an approximation, as does a chi-square approximation [57, 58, 59]. We calculated the probability P that the number of startup participants is equal to or exceeds the observed number among “Dual Top 10% Authors,” with the null hypothesis that startup participants are equally likely to be distributed among authors in both networks irrespective of their degree centralities. Additionally, we calculated how much higher the odds of being a startup participant is among “Dual Top 10% Authors” compared with other authors, i.e., odds ratio [60, 61] too. Following are findings for the six emerging research topics in Table 8. We infer that the results we observed in their odds ratios were statistically significant.
Table 8

Contingency tables related to the number of startup participants and non-participants for dual top 10% authors and others with P-value and odds ratio for each research topic.

Startup ParticipantNon-Participant
Exosome

Dual Top 10% Authors37262
Other Authors4399,845
P-value: 1.875e-08Odds ratio: 3.166

Microbiome

Dual Top 10% Authors1071605
Other Authors99533045
P-value: 4.41e-12Odds ratio: 2.214

CRISPR

Dual Top 10% Authors142611
Other Authors101823480
P-value: 2.2e-16Odds ratio: 5.360

Cas9

Dual Top 10% Authors118402
Other Authors87017430
P-value: 2.2e-16Odds ratio: 5.880

CAR-T

Dual Top 10% Authors23123
Other Authors2062700
P-value: 0.000519Odds ratio: 2.450

Zika

Dual Top 10% Authors20550
Other Authors25711839
P-value: 0.0382Odds ratio: 1.675
The P-values were 1.875e-08 (exosome), 4.41e-12 (microbiome), 2.2e-16 (CRISPR), 2.2e-16 (Cas9), 0.000519 (CAR-T), and 0.0382 (Zika), all of which were equal to or less than three decimals places except for Zika's P-value, which remained less than 0.05, the number used as the cutoff in most statistical hypothesis testing. Odds ratios across all the emerging research topics were 3.166 (exosome), 2.214 (microbiome), 5.360 (CRISPR), 5.880 (Cas9), 2.450 (CAR-T) and 1.675 (Zika), all of which observed a higher startup participant ratio in “Dual Top 10% Authors” than in other authors. Contingency tables related to the number of startup participants and non-participants for dual top 10% authors and others with P-value and odds ratio for each research topic.

Discussion

Our hypothesis was that researchers who are academically central in both author citation networks and co-authorship networks tend to have higher startup readiness in such research topics that appearance frequency is emerging while belonging to actively financed biopharmaceutical industrial segments. Compared to those who are not central, they tend to participate in startups, transforming their research outcomes into commercial applications in emerging fundamental research fields, particularly in fields such as life sciences. Life sciences are user-inspired. They tend to transform scientific research outcomes into practical usage. They attract venture capital financing, as shown in Table 2. Earlier studies conducted for detection of emerging research fields have used bibliometric approaches, such as the work of Shibata et al. (2008), who used a topological clustering method to divide citation networks into clusters and who then tracked the positions of papers in each cluster [62]. Shibata et al. (2011) calculated network centralities, so-called betweenness centralities, of papers with respect to regenerative medicine [63]. Sasaki et al. (2016) calculated nine kinds of network centrality for papers related to photovoltaic solar cells: degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, network constraint, clustering coefficient, Page rank, hub score, and authority score [64]. Although these studies did not address each researcher's preparedness to create startups, to facilitate decision-making processes they proposed prediction models to identify emerging promising studies that might attract many citations. The biopharmaceutical domain is characterized by intense scientific activity and entrepreneurship, as described in the Introduction. Therefore, we strove to emphasize the application of such network centrality and to develop paper-related individual factors to assess academic researchers' startup readiness [4]. Results suggest that scientists with top degree centrality in author citation networks and co-authorship networks have a higher startup participant ratio than other scientists in these specifically selected biopharmaceutical domains (Table 8). As presented in Figure 2, which portrays startup participant authors' degree centrality positions in the author citation networks and in co-authorship networks related to scientific topics emerging in the biopharmaceutical domain, we observed that both networks have a common tendency by which higher degree centrality is associated with higher density of startup participant authors: higher “startup readiness.” Table 9 presents correlation coefficients between the degree centrality of the author citation networks and that of the co-authorship networks to the same authors: 0.434 (exosome), 0.628 (microbiome), 0.349 (CRISPR), 0.322 (Cas9), 0.619 (CAR-T), and 0.556 (Zika). All of those except for CRISPR and Cas9 were higher than those of the degree centralities' ranks, as discussed in B-1. It might also be of additional value to discuss the potential use of patents as features to assess startup readiness as well. As discussed earlier in the Introduction, because it is commonly observed that in biotechnology papers linked to a patent receive more citations than those without a patent link [5], we can assume that authors' patenting activities are correlated with such authors' standing in the networks described previously. In fact, as Table 9 shows, CRISPR and Cas9 had higher correlation coefficients between authors' degree centrality in the author citation networks and the number of patents such authors invented: 0.413 (CRISPR) and 0.399 (Cas9), than those between authors' degree centrality of the author citation networks and that of the co-authorship networks: 0.349 (CRISPR) and 0.322 (Cas9). In other research topics, however, correlation coefficients between authors' degree centrality in the author citation networks and the number of patents such authors invented (0.069 (exosome), 0.019 (microbiome), 0.138 (CAR-T), and 0.078 (Zika)), were much lower than those between the authors’ degree centrality of the author citation networks and that of the co-authorship networks: 0.434 (exosome), 0.628 (microbiome), 0.619 (CAR-T), and 0.556 (Zika).
Table 9

Correlation coefficients of authors’ degree centrality in author citation networks, degree centrality in Co-authorship networks & number of patents they invented, in emerging research topics in actively financed biopharmaceutical industry fields in 2014–2017.

Citation Deg. Cent.Coauth. Deg. Cent.Number of Patents
[Exosome]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.4341
Number of Patents0.0690.0571

[Microbiome]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.6281
Number of Patents0.0190.0141

[CRISPR]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.3491
Number of Patents0.4130.2131

[Cas9]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.3221
Number of Patents0.3990.2041

[CAR-T]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.6191
Number of Patents0.1380.2571

[Zika]

Citation Deg. Cent.1
Coauth. Deg. Cent.0.5561
Number of Patents0.0780.0641
Correlation coefficients of authors’ degree centrality in author citation networks, degree centrality in Co-authorship networks & number of patents they invented, in emerging research topics in actively financed biopharmaceutical industry fields in 2014–2017. Given such stark difference in correlation coefficients between CRISPR, Cas9, and the other research topics, we eventually did not implement patent-related factors across them. As shown in Table 8, CRISPR and Cas9 particularly were the top two research topics related to startup participant ratio in “Dual Top 10% Authors” compared to other authors. However, results also show that correlation between the ranks of author citation networks and those of co-authorship networks to the same startup participants were not strong. This result might derive from the fundamental difference between author citation networks that we constructed from paper citation networks and co-authorship networks. Because the author citation networks were created based on citations among papers, linkages among the authors in the networks were open among authors who have mutual interest, irrespective of their human relations or organizational connections. By contrast, the co-authorship networks were based mainly on human relations and organizations to which the authors belong. They are expected to be more limited and closed networks than the author citation networks. These results suggest that both networks are useful complementarily when we seek authors in emerging research fields who might be startup participants. This complementarity apparently contributed to our Fisher's exact test results and to calculation of the odds ratio, which distinguished “Dual Top 10% Authors” from authors who were not so centrally important. Results of our Fisher's exact test for each emerging research topic, or each keyword with rapidly growing usage in the biopharmaceutical domain support our hypothesis that “Dual Top 10% Authors” have higher “startup-readiness.” We infer that all results are significant. On a side note, with respect to “Zika,” whose P-value 0.0382 was way larger than those of the other emerging research topics, all of which were equal to or less than three decimal places. We presume that “Zika,” as the name of a specific viral disease, differs from the other keywords “exosome,” “microbiome,” “CRISPR,” “Cas9,” and “CAR-T,” all of which can be useful for therapy or well-being. We infer that the P-value of “Zika” is high because of the great number of research projects examining “Zika” that might not fit any immediate use for society. In addition to the points presented above, we acknowledge that variables other than centrality in networks of academic author citation and co-authorship might be used. Moreover, they might affect startup readiness in general. However, our approach can work well specifically for startups in commercialization-oriented yet fundamentally scientific research domains such as the biopharmaceutical domain, as exemplified herein. This method can extract the latest names of research topics and researchers comprehensively, but easily in a scalable computational manner. By contrast, earlier methods have collected data using conventional methods such as personal interviews, reading published papers, or conducting field projects in person, offering only limited instantaneity, comprehensiveness, and scalability. We proposed a computational method that uses openly available databases of startup finances and academic research papers combined, to explore the “startup readiness” of emerging research topics and researchers in user-inspired fundamental scientific research such as life sciences. It is suitable to assess aspects of academic entrepreneurship enables us to boost the collection of real-time data related to relevant science and industry segments and researcher pools, both in greater detail and on a larger scale.

Conclusions

To seek emerging scientific research topics for which startup activities are active, we presented a method based on keyword analyses and academic databases to sort scientific keywords associated with active financing activities and academic research activities. First, we constructed cited-and-citing author networks and co-authorship networks based on paper citation networks relative to the biopharmaceutical domain. Then we analyzed the degree centralities of nodes of the biopharmaceutical-related startup participant authors in their author citation networks as well as co-authorship networks to detect signals of startup readiness in emerging research domains. Results demonstrated that the startup participant authors exhibited high centrality for author citation networks and co-authorship networks related to biopharmaceuticals, especially within the top 10% of centrality for both. Furthermore, P-values obtained from our Fisher's exact test supported our hypothesis: the proportion of startup participants is higher for authors in the top 10% of centrality of both networks than for other authors. Therefore, to detect researchers with “startup readiness” in an emerging research topic, one can assess their combined degree-centrality in author citation networks and co-authorship networks. Because correlation between both the ranks of the author citation networks and those of the co-authorship networks is not high with respect to degree centrality to the same startup participants, complementarity between these two networks can strengthen the collectiveness and relevancy of the author pool as startup participant candidates. This analytical scheme is unique: Our citation network constructs author citation networks derived from paper citation networks. Moreover, our analyses specifically examine authors, not papers. The method calculates all authors' degree centralities exhaustively and can thereby reveal startup readiness. Using author names rather than published papers can make this method scalable into various fields and make it adaptable to situations in which the publication cycle is short and the publications are numerous. In contrast to other variables representing the maturity/level of technology such as TRL and researcher reputation, which have been tacit, difficult to attain or difficult to measure, we use variables such as financing activities, research topics, and researchers’ degree centralities. For that reason, and because our approach is based on publicly available digital data, people with little or no knowledge related to specific disciplines, industries, and regions can use this method to detect research topics and researchers with startup readiness. Furthermore, our computational approach might update necessary information in real time, except for the time lag until publication, although such a time lag is decreasing by virtue of the increasing popularity of open journals. Our method, which is useful for researchers and practitioners participating in research-based startups, can contribute to a rich, convenient, and dynamic understanding of academic entrepreneurship. Because our case study specifically examined biopharmaceutical scientists in certain research topics, the findings here might not be generalizable across all biopharmaceutical research topics, let alone across all disciplines. However, we selected the scientists by objective evaluation based upon ranking, regarding the degree to which financing activities are active in their industry segments, the degree to which academic attention is growing in their research topics, and how high their degree centrality is in the respective networks of author citation and co-authorship from their paper citation networks. Our approach might be generalizable or highly suggestive to other similarly user-inspired fundamental scientific research fields with abundant potential for commercialization and academic entrepreneurship. Moreover, even though this paper examined scientists with specifically selected research topics, our method could be applicable to other biopharmaceutical research topics as well as those whose academic attention is similarly emerging while belonging to actively invested biopharmaceutical fields. Potential disparities among paper author numbers across research topics and our selection of industry segments demand further study because they can influence results. The collectiveness and the relevancy of the author pool and the significance of the selected names requires further research from perspectives such as their startups' completeness, suitability and importance. Strengthening the legitimacy of our methods requires further research to address circumstances under which papers have numerous or limited authors, to widen our methods’ applicability to other industries or research topics, to ascertain the effects of selection and influential factors, and to survey the whole pools of relevant authors and startups at levels ranging from network structure to commercial, financial viability/success.

Declarations

Author contribution statement

T. Goji: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. Y. Hayashi: Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. I. Sakata: Conceived and designed the experiments; Analyzed and interpreted the data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Competing interest statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.
  21 in total

1.  Zika Virus: Rapid Spread in the Western Hemisphere.

Authors:  Lin H Chen; Davidson H Hamer
Journal:  Ann Intern Med       Date:  2016-02-02       Impact factor: 25.391

2.  When do scientists become entrepreneurs? The social structural antecedents of commercial activity in the academic life sciences.

Authors:  Toby E Stuart; Waverly W Ding
Journal:  AJS       Date:  2006-07

Review 3.  Ecological and evolutionary forces shaping microbial diversity in the human intestine.

Authors:  Ruth E Ley; Daniel A Peterson; Jeffrey I Gordon
Journal:  Cell       Date:  2006-02-24       Impact factor: 41.582

4.  (Re)defining biopharmaceutical.

Authors:  Ronald A Rader
Journal:  Nat Biotechnol       Date:  2008-07       Impact factor: 54.908

5.  The emerging patent landscape of CRISPR-Cas gene editing technology.

Authors:  Knut J Egelie; Gregory D Graff; Sabina P Strand; Berit Johansen
Journal:  Nat Biotechnol       Date:  2016-10-11       Impact factor: 54.908

Review 6.  Engineering CAR-T cells: Design concepts.

Authors:  Shivani Srivastava; Stanley R Riddell
Journal:  Trends Immunol       Date:  2015-07-11       Impact factor: 16.687

7.  Exosomes and HIV Gag bud from endosome-like domains of the T cell plasma membrane.

Authors:  Amy M Booth; Yi Fang; Jonathan K Fallon; Jr-Ming Yang; James E K Hildreth; Stephen J Gould
Journal:  J Cell Biol       Date:  2006-03-13       Impact factor: 10.539

8.  The Emergence of Zika Virus as a Global Health Security Threat: A Review and a Consensus Statement of the INDUSEM Joint working Group (JWG).

Authors:  Veronica Sikka; Vijay Kumar Chattu; Raaj K Popli; Sagar C Galwankar; Dhanashree Kelkar; Stanley G Sawicki; Stanislaw P Stawicki; Thomas J Papadimos
Journal:  J Glob Infect Dis       Date:  2016 Jan-Mar

9.  Antimicrobials from human skin commensal bacteria protect against Staphylococcus aureus and are deficient in atopic dermatitis.

Authors:  Teruaki Nakatsuji; Tiffany H Chen; Saisindhu Narala; Kimberly A Chun; Aimee M Two; Tong Yun; Faiza Shafiq; Paul F Kotol; Amina Bouslimani; Alexey V Melnik; Haythem Latif; Ji-Nu Kim; Alexandre Lockhart; Keli Artis; Gloria David; Patricia Taylor; Joanne Streib; Pieter C Dorrestein; Alex Grier; Steven R Gill; Karsten Zengler; Tissa R Hata; Donald Y M Leung; Richard L Gallo
Journal:  Sci Transl Med       Date:  2017-02-22       Impact factor: 17.956

Review 10.  Clinical development of CAR T cells-challenges and opportunities in translating innovative treatment concepts.

Authors:  Jessica Hartmann; Martina Schüßler-Lenz; Attilio Bondanza; Christian J Buchholz
Journal:  EMBO Mol Med       Date:  2017-09       Impact factor: 12.137

View more
  1 in total

Review 1.  Ubiquitination, Biotech Startups, and the Future of TRIM Family Proteins: A TRIM-Endous Opportunity.

Authors:  Utsa Bhaduri; Giuseppe Merla
Journal:  Cells       Date:  2021-04-25       Impact factor: 6.600

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.