Literature DB >> 28560354

Citation analysis of scientific categories.

Gregory S Patience1, Christian A Patience2, Bruno Blais1, Francois Bertrand1.   

Abstract

Databases catalogue the corpus of research literature into scientific categories and report classes of bibliometric data such as the number of citations to articles, the number of authors, journals, funding agencies, institutes, references, etc. The number of articles and citations in a category are gauges of productivity and scientific impact but a quantitative basis to compare researchers between categories is limited. Here, we compile a list of bibliometric indicators for 236 science categories and citation rates of the 500 most cited articles of each category. The number of citations per paper vary by several orders of magnitude and are highest in multidisciplinary sciences, general internal medicine, and biochemistry and lowest in literature, poetry, and dance. A regression model demonstrates that citation rates to the top articles in each category increase with the square root of the number of articles in a category and decrease proportionately with the age of the references: articles in categories that cite recent research are also cited more frequently. The citation rate correlates positively with the number of funding agencies that finance the research. The category h-index correlates with the average number of cites to the top 500 ranked articles of each category ([Formula: see text]). Furthermore, only a few journals publish the top 500 cited articles in each category: four journals publish 60% ([Formula: see text]) of these and ten publish 81% ([Formula: see text]).

Entities:  

Keywords:  Information Science

Year:  2017        PMID: 28560354      PMCID: PMC5446096          DOI: 10.1016/j.heliyon.2017.e00300

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

Bibliometric indicators contribute to ranking universities (Kinney, 2007, Moed, 2017), researchers (Hirsch, 2005, Verma, 2015), and journals (Garfield, 2006), and funding decisions for institutes and governments (Bornmann et al., 2008, Bornmann and Daniel, 2008). An individual's citation count and h-index, and the impact factor of the journal's that publishes their work provide input to awards and promotion committees. However, when these committees examine diverse dossiers and compare prestige and productivity between categories, they have little quantitative metrics to substantiate their decisions. Ranking criteria include alumni, awards, highly cited individuals, the number of articles in the Science Citation Index-Expanded and Social Science Index, and articles published in Science and Nature (ARWU, 2016). Citation counts are the basis of several bibliometric indicators—h-index, impact factor (), eigen factor, and g-index. The h-index equals the rank of an article (ordered from the most cited article to the least cited), h, for which it has been cited at least that often (Hirsch, 2005). But these indicators are unhelpful when comparing an engineer versus a scientist or a poet and a cinematographer. Furthermore, because of the disproportionate weighting of the as a means to measure the quality of an article, the San Francisco Declaration on Research Assessment recommended that it not be used for hiring, promotions or funding decisions (Cagan, 2013). Many journals accept their recommendations and now report the (SNIP), SCImago Journal Rank (SJR) and a five-year impact factor () together with . The SNIP considers a three-year window and corrects for a fields average number of references in papers (Moed, 2010, Leydesdorff and Opthof, 2010). Google Scholar groups journals into scientific categories and then ranks them according to an -index: the number of articles in the previous five years with that number of citations (Braun et al., 2006). The most common ranking system is the Journal Impact Factor () that represents the ratio of the number of citations in years and to the number of articles the journal publishes in year x. The number of citations is a proxy to an article's quality (Ebadi and Schiffauerova, 2016); however, since citations practices differ widely across scientific categories, many researchers question their validity as an evaluation metric (Bornmann and Daniel, 2008, Adler et al., 2009, MacRoberts and MacRoberts, 1996). Indeed, comparing productivity and prestige across scientific fields is dubious without criteria that represent substantial contribution. Still, national research evaluation agencies base their judgment criteria on the number of citations (Radicchi and Castellano, 2012, Abbott, 2009, Gilbert, 2009). Normalizing citations corrects for differences in citation rates between categories (Radicchi and Castellano, 2012, Colliander and Ahlgren, 2011, Waltman and van Eck, 2013, Kaur et al., 2013). Fractional citation counting apportions credit based on the number of authors of an article and is one method to account for differences in researchers citation counts between scientific categories (Garfield, 1979, Moed, 2010). Combining fractional counting with percentile ranks (Leydesdorff and Opthof, 2010) may be a superior indicator of a researchers. Relative impact indicators for mean citations compare journal papers between fields (Schubert and Braun, 1986, Vinkler, 2003, Vinkler, 2013). Here we compare the citation practices of the scientific categories in Web of ScienceTM (WoS). First we describe the database, then demonstrate how the number of citations, , varies as a function of bibliometric factors—number of articles per category, number of authors per article (for the 500 most cited), age of the references in these articles, number of institutions financing the research and factors related to journals that publish the research. We demonstrate that the number of articles and the age of the references explain more variance in the citation rates of the 500 most cited articles in each category than do the number of references and the number of authors. This premise compares elite articles from each of the categories and implies that the 500 most cited articles of each have the same quality, which exaggerates the differences between a category that has 300 000 articles and one that has 5000. It attributes scientific advances to research that is cited most. Rather than the top 500, future work will compare 500 articles from each category starting from the 10% or the 25%.

Methods

From 2010 to 2014, Web of Science Core Collection (WoS) (Web of Science) indexed 11.9 million documents into 251 scientific categories. Researchers in the pure sciences, engineering and medicine publish more work indexed by WoS compared with the humanities, social sciences and fine arts. Within these broad scientific fields and subfields, publication and citation rates vary widely, which complicates comparing the researchers, category, journal or institutes, productivity and impact (Waltman and van Eck, 2013). Since the citation patterns vary with document type (Radicchi and Castellano, 2012), we only consider the 6.5 million publications that WoS classifies as articles and ignore all other types (reviews, papers in proceedings, meeting abstracts, etc.). In the beginning of January, 2016, we downloaded the WoS 500 most cited articles from each of the 251 categories. Following Crespo et al. (2010), we consider that citations represent intrinsic scientific value and the culture of the scientific field. Since the database has the top 500 articles in each category, we consider that differences in citation rates within these categories are due entirely to bibliometric factors and not quality or scientific impact. We restricted the work to the Web of Science Core Collection. In the Basic Search category of the WoS, we entered “:” as the criteria and highlighted the field category Topic. We added a second field category Year Published and set the years to 2010–2014. In the following Search page, we restricted the study to Articles (under the Document Types tab). For each of the 251 Web of Science Categories, we sorted the articles from most cited to least cited, then saved the first 500 articles from the Save to Other File Format tab, included the Full Record and Cited References, and set the File Format to Tab-Delimited. Each category file contains more than 40 columns of data including: article category, author's full name, title, journal, abstract, date, scientific field, affiliations, funding agencies, etc. Some errors remain in the database particularly related to the formatting of the references. We checked all references that were older than 500 years and corrected erroneous entries. The WoS citation index compiles data from 12 000 journals. It assigns many of these articles to more than one category such that the sum of the total number of articles is 11.3 million (although the overall total number of articles is only 6.5 million): 304 journals have articles that are duplicated in two categories, 68 are assigned to 3 categories, 16 to 4 categories and 4 journals to 5 categories. Articles from Advanced Materials and Nano Letters appear in 6 categories. Nine of the top 10 articles in mathematical computational biology, computer science interdisciplinary applications, and probability and statistics are identical. Equally, biochemical research methods, biophysics and crystallography share all of the top 6 articles except for one. The article with the most cites (18103) is listed in three categories: biochemistry molecular biology, genetics heredity, and evolutionary biology. WoS assigns 337 000 articles to multidisciplinary materials and only 800 to poetry and African, Australian, and Canadian literature. Because of this large disparity in the number of articles per category, we combined similar categories to ensure that each had at least 4000 papers. For example, we added medical ethics to the ethics category and put folklore, and 6 literature categories to literary theory criticism. The mean number of papers in each category of the truncated dataset (236 categories) was = 43000 papers. The following list describes the bibliometric field indicators. We correlated the number of times papers are cited with bibliometric indicators. For each indicator, we developed a power law expression and calculated the . We then developed power law correlations with multiple factors and retained the expression that gave the highest . We rank the categories from 1 to 236 based on the number of articles that WoS assigns to each. Multidisciplinary materials science, multidisciplinary chemistry, applied physics and chemical physics have the most articles (>240000) while demography, industrial labor relations and logic have the least (<4500). The first several dozen articles in as many as 20 categories have uncharacteristically high citations. The paper with 18103 citations inflates the mean category average of biochemistry molecular biology, genetics heredity, and evolutionary biology by 36. To reduce the variability introduced by these highly cited articles, we set equal (Redner, 1998) to the average number of citations to papers ranked from 31 to 500. The category h-index considers a five-year period (2010 to 2014) and equals the number of articles in a category, h, that have been cited at least h times: Multidisciplinary materials science, multidisciplinary chemistry, multidisciplinary sciences and general internal medicine all have at least 300 articles that have been cited more than 300 times ( > 300); literary reviews, romance literature and classics have less than 10 papers that have been cited 10 times ( < 10). The mean weighted average of the (2014) of the 10 journals that publish the most cited papers in each category: where is the number of articles the ith journal publishes (). Total number of articles that WoS assigns to each category. Mean number of agencies that funded the research as reported in WoS funding agencies listed in the WoS. The total number of references in the bibliography of all 500 articles in a category. The Weibull distribution characterizes the relationship between the cumulative number of references, , and their age, t, the difference between the year the journal published the article and when the reference was published (Patience et al., 2015): where β is the scale parameter: 63% of the references are younger than β. As many as 85 categories cite at least one article older than 300 y and 8 categories cite more than 100 articles older than that (the number of reference articles older than 300 y are in parentheses): classics (687), romance literature (414), literary theory criticism (397), history (344), theatre (283), multidisciplinary humanities (141), philosophy of science history (102) and art (101). We excluded all references older than 100 y in calculating β and only consider references written after 1916. Astronomy astrophysics averages 116 co-authors per article, while particles fields physics averages 169 and nuclear physics average 290. The number of co-authors per article exceeds 15 in 15 categories. To avoid these anomalously high values, in our model, we fit the number of authors per article (in each category) to a Weibull function then assign the number of authors per paper, equal to . The fraction of 500 articles that the top 4 journals publish. In agricultural engineering, Bioresource Technology published 422 of the top 500 articles; Neuroimage published 80% of the top 500 of the neuroimaging category; and, Science and Nature published 454 of the 500 most cited in multidisciplinary sciences. Although 10% of the journals indexed by WoS have at least one paper among the 500 most cited, only 10 journals account for 60% of the 118000 articles of this study. (Supplementary file:Top 10 journals per category.xlsx)

Results

The h-index links productivity of individuals with the citation history of their published articles. It prejudices young researchers and individuals who publish in categories with low citation rates. Modifications to improve the h-index include fractional counting (Leydesdorff and Opthof, 2010), normalizing citations, correcting for the dimensionality of the h-index with a conversion factor (Dienes, 2015). An minimizes the pitfalls associated with the individual h-index; it is an aggregate value that applies to all researchers for the same 5-year period. It is a measure of productivity and correlates with the number of researchers in a field, which reflects the priority that society attributes to specific scientific categories. For instance, the (2010–2014) of multidisciplinary sciences is 367: 367 articles between 2010 and 2014 were cited at least 367 times as of January 2016. It was only 3 for Slavic Literature. The average of the number of citations to the top 500 papers per categories, , correlates with (Figure 1): Iglesias and Pecharromán (2007) derived a theoretical relationship that they apply to individuals that takes into account both the category productivity and an individual's productivity based on
Figure 1

The hcat correlates with the average number of citations of the most highly ranked articles. A power law model fits the average citation rates of articles ranked from 31–100 better than those ranked from 1–30. The correlation coefficient is R2 = 0.997 for the articles ranked from 31–500.

The hcat correlates with the average number of citations of the most highly ranked articles. A power law model fits the average citation rates of articles ranked from 31–100 better than those ranked from 1–30. The correlation coefficient is R2 = 0.997 for the articles ranked from 31–500. Eq. (3) specifically applies to the article rank from 31 to 500. For the articles ranked from 31 to 100, the data are displaced to the right slightly (the coefficient increases) but the slope of the line is the same and slightly higher than the Eq. (4). In fact, considering any series of articles with the same rank—100 to 200, 200 to 300, 300 to 500—only the coefficient changes but the exponent is essentially constant and . However, for the most highly cited papers, ranked 1 to 30, for example, many categories deviate substantially from the regression line (circles to the right). Such articles in these categories represent the substantial fluctuations (Redner, 1998) characteristic of the extremes of the bibliometric citation data. Coincidentally, they share the most highly cited papers. Whereas the increases to the power 0.71 with respect to , how does the category average impact factor, , vary with ? The impact factor for a given year, i is: Recall that the category impact factor is the weighted average of the top 10 journals that publish the 500 most cited articles. Finardi (2013) reported that are poorly correlated with but differences among scientific areas exist. By restricting our analysis to the most highly cited papers, we evaluate the differences between areas and find that increases linearly with citations, but more precisely (Figure 2): is greater than 25 for general internal medicine (50), multidisciplinary sciences (36), and cell biology (26) (Appendix). It is below 0.5 for literary theory criticism, romance literature, classics, theater, and Asian studies. The categories that deviate substantially from the regression line include electrical engineering, applied mathematics, astronomy/astrophysics and nuclear science technology.
Figure 2

The weighted of the journals that publish the top 500 articles for each category is proportional to the average of the number of citations per category.

The weighted of the journals that publish the top 500 articles for each category is proportional to the average of the number of citations per category.

Discussion

Both the and correlate with citation rates and are useful metrics to compare categories quantitatively. But what factors contribute to the citation frequency of an article? Tahamtan et al. (2016) categorize the factors that contribute to how often an article is cited: (1) paper related—research quality, novelty, how well the authors present their results, accessibility, the number of references, and age, β. (Vieira and Gomes, 2010); (2) journal related—, language; and, (3) author related—, authors reputation, collaborations, race, gender, age etc. Yu and Yu (2014) included research field as an addition factor that contributes to citation frequency, which would include and . Here we examine all four factors but assume that since we populate the database solely with the most cited articles, the research quality is equivalent across all categories. The article related aspects we consider are , β and the number of articles WoS assigns to a category, . The only journal related factor we consider is , as expressed by the parameter η. We compared the percentage of women graduating from 141 scientific disciplines with the average number of citations in those disciplines and found no positive correlation, which agrees with other work (National Science Foundation, 2014, Rørstad and Aksnes, 2015). Other author related factors we examined include and . The number of articles in a category, , is the single most important factor that correlates with (SCImago Journal Rank, Zitt and Small, 2010). It decays exponentially with respect to the rank (Figure 3). Articles in categories that cite proportionately more often than the number of articles that WoS assigns to the category lie above the black line in Figure 3 (biological sciences—general internal medicine, peripheral vascular disease, cell tissue engineering, allergy and evolutionary biology). Cell tissue engineering, andrology and mathematical psychology are cited 3 times more than there are papers ( > 3 ). Mathematics, nursing, religion, history, humanities and literature are among the categories that cite proportionately less often than the number of papers that they publish and fall below the line: History and literature reviews have 10 times more papers than citations ( < 10 ). Assuming that the number of citations is directly proportional to the number of papers explains 64% of the variance: = 0.73 ().
Figure 3

The total citation count to the top papers in a category (30 < R ≤ 500), , versus the rank of the number of papers assigned to this category (black line). The number of citations to the most cited papers for each category follows a similar trend (magenta hexagons). Several categories related to biology/medicine cite more frequently than the number of papers in these categories whereas the social sciences, the arts and some categories related to mathematics cite less frequently.

The total citation count to the top papers in a category (30 < R ≤ 500), , versus the rank of the number of papers assigned to this category (black line). The number of citations to the most cited papers for each category follows a similar trend (magenta hexagons). Several categories related to biology/medicine cite more frequently than the number of papers in these categories whereas the social sciences, the arts and some categories related to mathematics cite less frequently. The deviation between the highest number of citations and the lowest for a given is about 3. Biological sciences and medicine related categories lie near the upper bound while humanities lie below the lower bound. Equally important as to explain the variance in the category data is the average age of the references in the articles' bibliography, β. The Weibull distribution accounts for more than 99.5% of the variance in the age distribution. It varies from 4 y (nanoscience nanotechnology and multidisciplinary materials science) to more than 20 y (classics, history of social sciences and romance literature), and averages 9 y over all categories. Categories with a lower β will necessarily have journals with a higher since researchers cite recent articles. As many as 44% of the papers that researchers publish in multidisciplinary materials science are two years old or less while it is only 5% in classics. An inverse cubed relation accounts for 66% of the variance in the data: Besides the number of papers in a category and β, the number of citations increases with the number of authors, (Figure 4) (Abramo and D'Angelo, 2015, Glänzel and Thijs, 2004). Authorship attributes credit to those that contribute to research. Through authorship, people accrue a reputation (Cronin, 2001). Researchers in biosciences cite more often than architects and these varying citation practices render comparisons across scientific fields problematic (Crespo et al., 2010). Articles in multidisciplinary physics, astronomy/astrophysics, particles fields physics and nuclear physics can have several hundred and even more than three thousand authors—hyperauthorship (Birnholtz, 2006, Li et al., 2013)—whereas literature, poetry, and history tend to have a single author. Ten categories exceed 15 authors per paper, which is indicative of hyperauthorship (Boffito et al., 2016). Excluding hyperauthorship, papers average less than 5 authors per paper. Citations increase with the square of the number of authors per paper, with a 10-fold spread:
Figure 4

The average number of citations to the top ranked articles in a category 30 < R ≤ 500, , increases proportionately with the square of the number of authors and are bounded by two extremes and . Ten categories average more than 15 authors per category, which corresponds to hyperauthorship (red filled triangles) (Boffito et al., 2016).

The average number of citations to the top ranked articles in a category 30 < R ≤ 500, , increases proportionately with the square of the number of authors and are bounded by two extremes and . Ten categories average more than 15 authors per category, which corresponds to hyperauthorship (red filled triangles) (Boffito et al., 2016).

Model

Principal component analysis shows that no linear combination of all possible parameters accounts for the majority of the variance. However, a power law model including the prime factors accounts for 86% of the variance: . Excluding 12 categories related to psychology, business and management (Iglesias and Pecharromán, 2007), the following expression accounts for 95% of the variance: The number of papers in a category and the age of the references in these papers account for most of the variance in . The first term variable in the parenthesis, η, represents the fraction of articles of the 500 most cited articles that the top 4 journals publish (Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8). This factor exceeds 0.97 for agricultural engineering, multidisciplinary sciences, neuroimaging and material sciences coatings, and is lower than 0.26 for literary criticism, classics, and management.
Table 1

Bibliometric indicators (2010–2014): Category rank from 1 to 30.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
1Multidisciplinary mat. sci.38130118.83318656.64.50.633.4
2Multidisciplinary chemistry38530614.22487026.24.70.683.2
3Applied physics31126019.52469956.850.793.3
4Physical chemistry35829016.72458406.450.713.3
5Biochemistry mol. biology35425520.723548712.55.40.584.4
6Electrical engineering1651664.22278233.46.40.291.9
7Multidisciplinary sciences51936735.918440416.25.20.985.9
8Environmental sciences15916012.91705995.96.20.62.7
9Neurosciences18718111.41590338.27.70.53.9
10Surgery1041215.71552218.670.31.4
11Pharmacology, pharmacy10511710.11496096.87.10.322.7
12Oncology26021822.714757015.45.50.534
13Condensed matter physics29625018.21318026.84.80.873.4
14Chemical eng.13314114.21272974.860.742.3
15Nanoscience nanotechnology33727415.41264626.44.40.813.3
16Biotechnology, microbiology22018819.81235737.75.60.663.1
17Optics12513614.01229556.16.40.782.5
18Mathematics46681.81192432.112.30.391.6
19Public occupational health1101245.81183336.57.30.292
20Applied mathematics70861.91175492.490.371.6
21Clinical neurology14614910.911108611.77.60.455.8
22Multidisciplinary physics21519712.81094066.48.60.935.4
23Cell biology27223825.910866212.85.30.674.8
24Energy, fuels15415714.01014045.25.60.692.3
25Organic chemistry1001135.41010074.45.80.672.4
26Analytical chemistry991135.21001614.75.50.692.3
27Plant sciences931067.4954097.87.70.632.9
28Food science technology65813.2952364.68.10.521.4
29Multidisciplinary geosciences981156.6930706.49.50.472.4
30Immunology17017114.69201810.85.90.634.7
Table 2

Bibliometric indicators (2010–2014): Category rank from 31 to 60.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
31General internal medicine46031649.79106319.05.80.978.8
32Astronomy, astrophysics1971775.79090120.87.10.618.1
33Heredity genetics22920021.39028317.75.10.557
34Microbiology1251337.9893769.36.10.473.6
35Radiology, nuclear medicine1151285.9873587.66.80.572.3
36Economics79974.2858082.39.10.290.3
37Polymer science981095.0856834.76.30.682.1
38Mechanics67863.2827322.99.30.341.4
39Cardiac cardiovascular systems20818214.38218114.57.50.813.8
40Experimental medicine16216718.18083013.46.40.844.5
41Ecology1241307.9797135.78.60.282.6
42Physics–atomic mol. chem.1271385.1796993.97.30.772.5
43Psychiatry11712911.1770007.87.50.525
44Metallurgy, metallurgical eng.65823.6763484.39.40.81.7
45Mechanical engineering58763.1757363.39.40.431.5
46Biochemical research methods20716214.0733526.05.80.63.1
47Endocrinology, metabolism1271349.7719589.46.90.543.6
48Veterinary sciences40562.2704665.78.80.371.3
49Pediatrics74915.0692467.17.80.622.3
50Civil engineering67853.9682004.07.30.791.7
51Applied chemistry69853.9643754.77.40.591.7
52Inorganic, nuclear chemistry78954.7641575.07.40.892.4
53Instruments, instrumentation74904.8635944.36.20.752.3
54Medicinal chemistry73894.3626516.57.60.651.8
55Electrochemistry1011175.1622865.05.60.732
56Interdisciplinary computer sci.1341264.1604693.87.70.611.8
57Infectious diseases1101219.15915110.36.10.594.1
58Telecommunications831053.3589373.75.30.471.4
59Water resources69884.2577844.78.20.71.9
60Biophysics1071138.3573916.36.80.43.1
Table 3

Bibliometric indicators (2010–2014): Category rank from 61 to 90.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
61Particles fields physics1261345.4570414.510.80.88.1
62Meteorology, atmospheric sci.1041196.5558719.570.522.9
63Zoology45623.1554374.415.60.382.6
64Computer sci. info. systems841013.7531463.27.50.451.5
65Computer sci. AI1111254.7530763.17.60.412.1
66Gastroenterology, hepatology13314213.25210110.56.50.733.6
67Environmental engineering1051195.3518624.86.20.792
68Obstetrics, gynecology65844.5517566.77.30.541.9
69Biomedical engineering981117.4507196.66.30.842.6
70Multidisc. engineering53742.6505803.38.80.541.6
71Orthopedics64833.8502465.78.10.511.4
72Mathematical physics61782.4493142.89.20.72.2
73Marine, freshwater biology55732.8491315.210.10.312.3
74Biology851006.6490576.77.70.633.1
75Thermodynamics64843.6482453.47.90.611.1
76Toxicology76945.5474476.67.90.432
77Physiology66794.3472066.48.30.362.5
78Hematology1271369.44667312.96.40.783.9
79Urology, nephrology931098.4463509.26.10.692.3
80Educational research44662.7462032.79.60.370.1
81Nutrition, dietetics831014.9455116.77.20.562.3
82Nuclear sci. technology38571.4453076.89.90.621.9
83Geochemistry, geophysics73904.1450914.7110.452
84Peripheral vascular disease15715211.44433212.17.90.833.9
85Statistics, probability1091114.2441553.28.30.781.9
86Fluids, plasmas physics55722.5431183.68.70.71.7
87Management75944.9421112.511.90.250.2
88Interdisciplinary math appl.55732.6420832.890.461.6
89Crystallography78844.2415705.76.40.942.5
90Spectroscopy53722.5412605.38.40.392.5
Table 4

Bibliometric indicators (2010–2014): Category rank from 91 to 120.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
91Dentistry, oral surgery46643.7410755.68.40.451
92Agronomy47663.2403095.59.10.531.9
93Ophthalmology68854.6395006.58.80.662.8
94Operations res. management55722.6386682.69.90.471.2
95Multidisciplinary agriculture45622.8383865.37.80.831.6
96Sport science61783.8380735.58.40.571.3
97Automation control systems911084.5376073.16.70.672.1
98Computer sci. software eng.47662.1375963.47.50.281.4
99Health care science services71893.9371556.07.20.441.3
100Respiratory system981159.13691510.57.20.74.1
101Pathology75956.3362519.46.60.542.7
102Multidisciplinary psychology81997.8353262.810.80.50.4
103Computer science theory59813.0336013.27.60.332
104Rehabilitation44632.8334035.18.70.341.1
105History10211.1333121.419.60.280
106Dairy, animal science37582.3325485.59.60.71.8
107Nursing28431.9320823.98.10.260.9
108Clinical psychology68864.8317435.19.30.331.6
109Virology911097.03164910.16.20.853.4
110Nuclear Physics821004.7315714.712.20.8510.9
111Experimental psychology67853.5313213.310.10.340.9
112Coatings, films52692.8307834.87.40.981.7
113Dermatology48655.3304747.28.30.622.1
114Political science39573.1298811.710.10.30
115Psychology73928.1296284.110.30.41.7
116Environmental studies70906.2296253.37.20.631.5
117Computational biology1221214.5293294.26.50.812.3
118Entomology34492.4292664.610.90.31.9
119Oceanography50682.8292594.8100.42.4
120Business67875.1279462.511.90.290
Table 5

Bibliometric indicators (2010–2014): Category rank from 121 to 150.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
121Building technology42593.0276653.39.20.681.3
122Biomaterials951097.6275836.760.932.7
123Behavioral science56724.9274014.39.90.41.8
124Philosophy14280.9267311.314.30.290.1
125Evolutionary biology971067.8259714.78.80.682.5
126Parasitology77957.9255699.56.90.913.4
127Otorhinolaryngology33492.4255215.49.70.490.9
128Sociology42622.7244441.810.90.310
129Manufacturing eng.39562.7242733.28.70.61.1
130Health policy services53744.0237585.26.80.531.3
131Ceramics materials sci.35522.5236124.610.10.881.6
132Physical geography59774.2234895.09.70.452.3
133Fisheries34492.5233355.19.50.52.1
134Forestry37523.1227064.69.80.672.2
135Transplantation62804.52228810.36.50.672.6
136Interdisciplinary social sci.34512.2222162.810.10.50.2
137Law25413.5221481.910.90.230
138Linguistics28442.7220572.511.90.380.5
139Computer science hardware48713.3217563.46.40.51.9
140Critical care medicine971119.02148310.480.84.4
141Biodiversity, conservation62846.0209925.38.10.682.9
142Acoustics41592.8208574.28.70.491.4
143Reproductive biology54734.3207696.47.80.71.8
144Geriatrics, gerontology58775.0206957.67.80.573
145Rheumatology80967.92056510.07.40.845.1
146Industrial engineering43622.3204732.98.90.581
147Developmental psychology59764.8204174.59.70.451.3
148Language, linguistics16311.2201021.912.60.340
149Soil science45663.1200774.99.60.641.9
150Business finance44663.4196882.410.20.540
Table 6

Bibliometric indicators (2010–2014): Category rank from 151 to 180.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
151Developmental biology851028.9176697.66.70.863.5
152Social psychology50683.8175533.1120.470
153Information, library sci.41622.9174882.89.10.510.7
154Multidisciplinary humanities8211.7174551.712.90.510
155Geography45673.7173812.37.90.460.6
156Agricultural engineering68854.3171774.67.30.991.8
157Education scientific disc.34532.5170683.68.30.610.7
158Anthropology33502.9169223.211.80.51.5
159Anesthesiology55724.8168896.57.50.752.1
160Horticulture30452.6168576.39.30.731.9
161Remote sensing56774.6163064.38.40.81.7
162Literature6201.0162351.414.10.480
163Transportation sci.38542.6161113.07.70.641.5
164Applied psychology46684.0160342.911.50.40.1
165Religion9221.0158761.912.40.470
166International relations27472.6155661.78.60.420
167Composites materials sci.40563.2154113.78.10.861.3
168Emergency medicine33533.3152896.88.20.811.1
169Tropical medicine40583.5152868.98.30.912.4
170Substance abuse43643.7150714.88.50.561.9
171Photographic technology59814.6150074.38.40.851.8
172Integrative medicine28412.9140956.28.90.841.5
173Medical laboratory technol.41634.9138607.27.10.762
174Ethics21372.0137682.210.20.640.4
175Transportation31472.5136882.89.10.610.7
176Communication28472.1133272.19.80.360
177Biomedical social science34523.0131874.98.20.710.7
178Aaerospace engineering19341.2130773.011.30.711.1
179Literary theory criticism3100.0129701.118.50.250
180Planning development36572.7128022.09.30.490
Table 7

Bibliometric indicators (2010–2014): Category rank from 181 to 210.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
181Paleontology28442.5124634.713.90.612.5
182Geological engineering24381.9123173.212.60.41.6
183Mining, mineral processing24391.7122604.811.90.891.4
184Medical informatics38582.7120514.77.30.61.6
185Art6171.3120012.614.50.660.5
186Gerontology41604.2118656.490.772.3
187Characterization, testing20351.8117823.89.80.71.3
188Neuroimaging82986.0117596.87.10.973
189Geology36553.9115424.911.80.712.2
190Archaeology20342.0113833.913.50.731.5
191Mineralogy36533.4111944.611.50.712.1
192History philosophy of sci.15301.4110471.814.30.420.6
193Social sci. math. methods32532.7106032.311.30.490.9
194Area studies12241.0105331.57.60.30
195Family studies27461.8103313.39.50.440
196Audiology speech pathology27412.2101793.811.60.681.4
197Cell tissue engineering9311114.9100459.45.70.833.9
198Literary reviews190.597901.115.40.650
199Textiles materials sci.26463.497364.97.60.952
200Educational psychology38593.496923.211.20.570
201Social work20331.896702.99.60.510
202Limnology38563.295784.310.40.862.3
203Criminology, penology23382.295722.711.10.340
204Mycology33524.094025.610.90.52.1
205Anatomy morphology24412.593435.111.10.542
206Hospitality, leisure, sport27442.393282.410.40.610.1
207Allergy63869.692019.57.10.916
208Architecture4140.589932.0150.480.3
209Urban studies25422.289652.490.550.5
210Petroleum engineering12271.188743.513.70.771.2
Table 8

Bibliometric indicators (2010–2014): Category rank from 211 to 236.

RankCategoryN¯cithcatN¯IFNartN¯auβηN¯fund
211Romance literature260.187951.119.90.330
212Cultural studies16331.284811.610.10.540.1
213Public administration21381.984531.89.60.350
214Music7201.583872.013.50.580.2
215Paper, wood material sci.22402.583594.110.20.851.6
216Legal medicine24422.483345.18.70.780.9
217Primary health care23382.882395.57.80.481.4
218Social issues19352.378342.19.20.440.3
219Robotics27462.276723.48.10.621.5
220Ocean engineering18321.776603.411.10.771.8
221Classics390.076181.122.80.240
222Biological psychology36564.074803.710.60.61.5
223Women's studies19321.873123.09.60.60.4
224Special education24422.068134.29.90.630
225Ergonomics22381.863403.1100.690.4
226Cybernetics computer sci.31564.062953.38.30.72.1
227Theater2100.360091.215.20.350
228Ornithology18342.457574.970.591.5
229Asian studies3110.455611.218.50.350
230Film, radio, television9280.754001.410.30.520
231Mathematical psychology21422.753402.512.30.770.2
232Microscopy24432.349045.19.80.711.8
233History of social sci.7170.848671.721.40.420
234Demography19361.844472.010.10.520
235Industrial relations labor13281.544032.110.80.440
236Logic7190.643811.815.10.381
Bibliometric indicators (2010–2014): Category rank from 1 to 30. Bibliometric indicators (2010–2014): Category rank from 31 to 60. Bibliometric indicators (2010–2014): Category rank from 61 to 90. Bibliometric indicators (2010–2014): Category rank from 91 to 120. Bibliometric indicators (2010–2014): Category rank from 121 to 150. Bibliometric indicators (2010–2014): Category rank from 151 to 180. Bibliometric indicators (2010–2014): Category rank from 181 to 210. Bibliometric indicators (2010–2014): Category rank from 211 to 236. The second term variable, , accounts for the number of funding agencies that finance the research, which correlates with the number of authors—the correlation coefficient was lower with versus . Considering that the SNIP (Source Normalized Impact per Paper) journal metric accounts for the average length of reference lists, it is surprising that this factor is insignificant for this data set (Leydesdorff and Opthof, 2010). Presumably, funding agencies weigh their selection criteria heavily on the established publishing record of researchers, which reinforces the Matthew effect (Ebadi and Schiffauerova, 2016). Most categories lie within 33% of the regression line but the regression model consistently underestimates the citations to the psychology categories and it overestimates many of the fine arts categories and some of the chemistry categories (Figure 5).
Figure 5

The equation accounts for 95% of the variance in the average number of citations per category.

The equation accounts for 95% of the variance in the average number of citations per category.

Conclusions

Publishing a highly cited paper is gratifying and confirms that the work has an impact on the scientific community. However, the number of citations the top articles accrue depends on factors other than quality and originality. We tabulate bibliometric indicators for the top 500 cited articles of 236 scientific categories and include the average impact factors of the journals that publish the articles, the category h-index and the total number of articles in each category. With this data, researchers, institutions and funding agencies can gauge their productivity and impact quantitatively. Citation rates, vary across research categories by several orders of magnitude as do the number of articles per category and the number of authors per article. Categories with more articles and more funding are cited more. Other factors that correlate with citations include the age of the references, journal impact factor and funding agencies. We assume that is related to bibliometric indicators and that 500 articles from categories with 100 000 articles (0.5%) are comparable to those with 5000 (10%). This comparison may exaggerate the differences between fields, but science endeavours that have orders of magnitude more researchers will have that much more impact. Most categories are within 33% of the regression equation. Other factors that may account for the difference may be related to the scope of the category. For instance, many researchers outside of the psychology field may be citing psychology papers, which would increase the number of citations beyond what we expect based on the bibliometric indicators. The correlation overestimates the number of citations for nursing and many engineering categories: here, the citation patterns might be narrower as only the people in these fields cite one another. A further limitation to the analysis relates to the limitations of WoS: coverage of the humanities, social sciences, business, and even mathematics are poorer than they are for natural sciences and health sciences. However, the number of funding agencies, which correlates with the number of authors (and the number of international collaborations), helps increase the visibility of research and its scientific impact.

Declarations

Author contribution statement

Gregory S. Patience: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. Christian A. Patience: Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Bruno Blais, Francois Bertrand: Contributed reagents, materials, analysis tools or data.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Competing interest statement

The authors declare no conflict of interest.

Additional information

Supplementary content related to this article has been published online at http://dx.doi.org/10.1016/j.heliyon.2017.e00300. No additional information is available for this paper.
  9 in total

1.  An index to quantify an individual's scientific research output.

Authors:  J E Hirsch
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-07       Impact factor: 11.205

2.  The history and meaning of the journal impact factor.

Authors:  Eugene Garfield
Journal:  JAMA       Date:  2006-01-04       Impact factor: 56.272

3.  National scientific facilities and their science impact on nonbiomedical research.

Authors:  A L Kinney
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-08       Impact factor: 11.205

4.  Good grades, but who gets the cash?

Authors:  Natasha Gilbert
Journal:  Nature       Date:  2009-01-01       Impact factor: 49.962

5.  Impact, not impact factor.

Authors:  Inder M Verma
Journal:  Proc Natl Acad Sci U S A       Date:  2015-06-15       Impact factor: 11.205

6.  The San Francisco Declaration on Research Assessment.

Authors:  Ross Cagan
Journal:  Dis Model Mech       Date:  2013-05-20       Impact factor: 5.758

7.  Navigate the World with College of Nursing, Taipei Medical University, Ranked No. 1 Nationally in Nursing Subject, Academic Ranking of World Universities (ARWU).

Authors:  Kuei-Ru Chou; Chiou-Fen Lin
Journal:  Asia Pac J Oncol Nurs       Date:  2018 Jan-Mar

8.  The measurement of the effect on citation inequality of differences in citation practices across scientific fields.

Authors:  Juan A Crespo; Yunrong Li; Yungrong Li; Javier Ruiz-Castillo
Journal:  PLoS One       Date:  2013-03-14       Impact factor: 3.240

9.  Does the committee peer review select the best applicants for funding? An investigation of the selection process for two European molecular biology organization programmes.

Authors:  Lutz Bornmann; Gerlind Wallon; Anna Ledin
Journal:  PLoS One       Date:  2008-10-22       Impact factor: 3.240

  9 in total
  10 in total

1.  The right to refuse unwanted citations: rethinking the culture of science around the citation.

Authors:  Jaime A Teixeira da Silva; Quan-Hoang Vuong
Journal:  Scientometrics       Date:  2021-05-08       Impact factor: 3.238

2.  Categorical and Geographical Separation in Science.

Authors:  Julian Sienkiewicz; Krzysztof Soja; Janusz A Hołyst; Peter M A Sloot
Journal:  Sci Rep       Date:  2018-05-29       Impact factor: 4.379

3.  Intellectual contributions meriting authorship: Survey results from the top cited authors across all science categories.

Authors:  Gregory S Patience; Federico Galli; Paul A Patience; Daria C Boffito
Journal:  PLoS One       Date:  2019-01-16       Impact factor: 3.240

Review 4.  Citation classics in general medical journals: assessing the quality of evidence; a systematic review.

Authors:  Suhaib Js Ahmad; Ahmed R Ahmed; Karl Friedrich Kowalewski; Felix Nickel; Kamran Rostami; Claire J Stocker; Sherif M Hakky; Rami Archid; Douglas McWhinnie; Ata Mohajer-Bastami; Dionysis Skiadopoulos Seimenis; Sami Ahmad; Sami Mansour; Mohamed H Ahmed; Dushyant Mital; Aristomenis K Exadaktylos
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2020

5.  Virtual and Augmented Reality Applications in Medicine: Analysis of the Scientific Literature.

Authors:  Andy Wai Kan Yeung; Anela Tosevska; Elisabeth Klager; Fabian Eibensteiner; Daniel Laxar; Jivko Stoyanov; Marija Glisic; Sebastian Zeiner; Stefan Tino Kulnik; Rik Crutzen; Oliver Kimberger; Maria Kletecka-Pulker; Atanas G Atanasov; Harald Willschke
Journal:  J Med Internet Res       Date:  2021-02-10       Impact factor: 5.428

6.  Microbial keratinase and the bio-economy: a three-decade meta-analysis of research exploit.

Authors:  Nonso E Nnolim; Uchechukwu U Nwodo
Journal:  AMB Express       Date:  2021-01-07       Impact factor: 3.298

Review 7.  Characteristics, quality and volume of the first 5 months of the COVID-19 evidence synthesis infodemic: a meta-research study.

Authors:  Rebecca Abbott; Alison Bethel; Morwenna Rogers; Rebecca Whear; Noreen Orr; Liz Shaw; Ken Stein; Jo Thompson Coon
Journal:  BMJ Evid Based Med       Date:  2021-06-03

8.  A scientometric study of Iran and the world countries' contribution to internal medicine (1996-2019).

Authors:  Ali Shabestani Monfared; Ali Bijani; Khadijeh Tahmasbi; Mousa Yaminfirooz
Journal:  Caspian J Intern Med       Date:  2022

Review 9.  A Bibliometric Perspective on AI Research for Job-Résumé Matching.

Authors:  Sergio Rojas-Galeano; Jorge Posada; Esteban Ordoñez
Journal:  ScientificWorldJournal       Date:  2022-10-03

10.  Drugs and Epigenetic Molecular Functions. A Pharmacological Data Scientometric Analysis.

Authors:  Dario Kringel; Sebastian Malkusch; Jörn Lötsch
Journal:  Int J Mol Sci       Date:  2021-07-06       Impact factor: 5.923

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.