Literature DB >> 18301760

Effectiveness of journal ranking schemes as a tool for locating information.

Michael J Stringer1, Marta Sales-Pardo, Luís A Nunes Amaral.   

Abstract

BACKGROUND: The rise of electronic publishing, preprint archives, blogs, and wikis is raising concerns among publishers, editors, and scientists about the present day relevance of academic journals and traditional peer review. These concerns are especially fuelled by the ability of search engines to automatically identify and sort information. It appears that academic journals can only remain relevant if acceptance of research for publication within a journal allows readers to infer immediate, reliable information on the value of that research. METHODOLOGY/PRINCIPAL
FINDINGS: Here, we systematically evaluate the effectiveness of journals, through the work of editors and reviewers, at evaluating unpublished research. We find that the distribution of the number of citations to a paper published in a given journal in a specific year converges to a steady state after a journal-specific transient time, and demonstrate that in the steady state the logarithm of the number of citations has a journal-specific typical value. We then develop a model for the asymptotic number of citations accrued by papers published in a journal that closely matches the data.
CONCLUSIONS/SIGNIFICANCE: Our model enables us to quantify both the typical impact and the range of impacts of papers published in a journal. Finally, we propose a journal-ranking scheme that maximizes the efficiency of locating high impact research.

Entities:  

Mesh:

Year:  2008        PMID: 18301760      PMCID: PMC2244807          DOI: 10.1371/journal.pone.0001683

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

As de Solla Price observed [3], the number of scientific journals and the number of papers published in those journals is increasing at an approximately exponential rate. The size and growth of the research literature places a tremendous burden on researchers—how are they to select what to browse, what to read, and what to cite from a large and quickly growing body of literature? This burden does not only affect researchers. Funding agencies, university administrators, and reviewers are called on to evaluate the productivity of researchers and institutions, as well as the impact of their work. Typically, these agents have neither the time nor the financial resources to obtain an in-depth evaluation of the actual research and must instead use indirect indicators of quality such as number of publications, h-index, number of citations, or journal rank [4]–[8]. Despite the oversimplification of using just a few numbers to quantify the scientific merit of a body of research, the entire science and technology community is relying more and more on citation-based statistics as a tool for evaluating the research quality of individuals and institutions [9]. An example of this trend is the widespread use of the Institute of Scientific Information (ISI) Journal Impact Factor (JIF) to rate scientific journals. This practice is pervasive enough that, despite evidence that the JIF can be misleading [10], [11], some countries pay researchers per paper published with the amount being determined by the JIF of the journal in which the paper is published [12]. This act of “judging a book by its cover” has caused researchers to note that we should judge a paper not by the number of citations that the journal in which it is published receives, but by the number of citations the paper itself receives [13]. This seemingly obvious fact is countered by one major challenge—administrators often want an estimate of the impact of a paper long before it has finished accumulating citations, which, as we show later, might take as long as 26 years. The need for an estimate of the ultimate impact of recently published articles is the reason that the JIF is often used as a proxy for quality of the research. Indeed, the premise of the peer-reviewing process is that reviewers are in fact able to assess the quality of a paper. Thus, the heuristic that the journal in which a paper is published is a good proxy for the ultimate impact of a paper is likely to be an adaptive one [14]. Like any heuristic, the evaluation of research using citation analysis has weaknesses. These weaknesses have been extensively explored in the literature [15], [16], however, as reviewed by Nicolaisen [17], there are plausible assumptions underlying the use of citation analysis as a heuristic. Here, we assume that the quality of a paper bears significant correlation with the ultimate impact of the paper, that is, the asymptotic total number of citations to that paper. We further assume that the actual relation between total number of citations and quality is uncertain, and may be field- and even journal-dependent. This latter assumption is prompted by the observation that many extrinsic factors for which we have no data can influence the number of citations that the paper receives. For example, because social influence may affect the citations to a paper, small differences in quality may lead to large differences in the number of citations [18]. In this article, we investigate two fundamental aspects concerning the prediction of the ultimate impact of a published research paper: (i) the time scale τ for the full impact of papers published in a given journal to become apparent, and (ii) the typical impact of papers published in a given journal. We find that τ varies from less than 1 year to 26 years, depending on the journal. Additionally, we find that there is a typical value and a well-defined range for the eventual impact of papers published in a given journal, which enables us to develop a model for the distribution of paper impacts that matches the data. These findings lead us to propose a method of ranking journals based on a natural criterion: the higher a journal is ranked, the higher the probability of finding a high impact paper published in that journal.

Results

We obtained the number of citations accrued by December 31, 2006 for 22,951,535 papers tracked in Thomson Scientific's Web of Science® (WoS) database. This database comprises information on papers published in ∼5,800 science and engineering journals, ∼1,700 social science journals, and ∼1,100 arts and humanities journals. Journals are typically covered from their inception or from the beginning of the WoS coverage for the research area (whichever is later) until the present date or until their demise (whichever is earlier). The beginning of WoS coverage for science and engineering, social science, and arts and humanities is 1955, 1956, and 1975 respectively. In this study, we restrict our analysis to journals publishing at least 50 articles per year for at least 15 years. This condition restricts our analysis to 19,372,228 articles published in 2,267 journals, and enables us to ensure good statistics on the journals that we include in the analysis. More information about the data is included in Appendix S1. Because the citation history of a paper may be field- and even journal-dependent, we first investigate , the probability distribution of ℓ, the logarithm of the number of citations accrued by each paper by December 31st of 2006, for articles published in journal J during year Y. We define ℓ aswhere n is the number of accrued citations. Figures display estimates of the probability density function for the Journal of Biological Chemistry for different years. Two patterns are apparent from the data. First, the distribution for each of the years considered shows a tendency to peak around a central value, that is, there is a characteristic value for ℓ. Second, after about 10 years, the distribution has converged to a steady-state functional form, . The explanation for this apparently counter-intuitive observation is that papers with a small number of citations have stopped accruing citations, while the trickle of citations to the most highly-cited papers is small when compared to the already accrued citations, and thus does not significantly change the value of the logarithm of the number of citations.
Figure 1

Time evolution of the distribution of number of citations of the papers published in a given academic journal.

(A) Probability density function , where Y is a year in the period 1998–2004, J is the Journal of Biological Chemistry, and ℓ≡log10(n) where n is the number of citations accrued by a paper between its publication date and December 31, 2006. Because the papers published in those years are still accruing citations by December 2006, the distributions are not stationary, but instead “drift” to higher values of ℓ. (B) for the Journal of Biological Chemistry and for Y in the period 1991–1993. For this period, the distributions are essentially identical, indicating that has converged to its steady-state form . The steady-state distribution is well described by a normal with mean 1.65 and standard deviation 0.35 (black dashed curve). (C) Time dependence of for three journals: Astrophysical Journal, Ecology, and Circulation. As for the Journal of Biological Chemistry, we find that after some transient period, reaches a stationary value (see Methods). The orange region highlights the set of years for which we consider that is stationary. The time scale τ(J) for reaching the steady-state strongly depends on the journal: τ(Astrophysical Journal) = 18 years, τ(Ecology) = 12 years, and τ(Circulation) = 9 years. Significantly, we find no correlations between τ(J) and , whose values are 1.44 for Astrophysical Journal, 1.70 for Ecology, and 1.66 for Circulation. (D) Pairwise comparison of citation distributions for different years for a given journal. We show the matrices of p-values obtained using the Kolmogorov-Smirnov test [29] for the Astrophysical Journal, Ecology, and Circulation. We color the matrix elements following the color code on the right. p-values close to one mean that it is likely that both distributions come from a common underlying distribution; p-values close to zero mean that is it very unlikely that both distributions come from a common underlying distribution. We then use a box-diagonal model [28] to identify contiguous blocks of years for which the p-value is large enough that the null hypothesis cannot be rejected. The white lines in the matrices indicate the best fit of a box-diagonal model. We identify the first box with more than 2 years for which to be the steady-state period (see Methods).

Time evolution of the distribution of number of citations of the papers published in a given academic journal.

(A) Probability density function , where Y is a year in the period 1998–2004, J is the Journal of Biological Chemistry, and ℓ≡log10(n) where n is the number of citations accrued by a paper between its publication date and December 31, 2006. Because the papers published in those years are still accruing citations by December 2006, the distributions are not stationary, but instead “drift” to higher values of ℓ. (B) for the Journal of Biological Chemistry and for Y in the period 1991–1993. For this period, the distributions are essentially identical, indicating that has converged to its steady-state form . The steady-state distribution is well described by a normal with mean 1.65 and standard deviation 0.35 (black dashed curve). (C) Time dependence of for three journals: Astrophysical Journal, Ecology, and Circulation. As for the Journal of Biological Chemistry, we find that after some transient period, reaches a stationary value (see Methods). The orange region highlights the set of years for which we consider that is stationary. The time scale τ(J) for reaching the steady-state strongly depends on the journal: τ(Astrophysical Journal) = 18 years, τ(Ecology) = 12 years, and τ(Circulation) = 9 years. Significantly, we find no correlations between τ(J) and , whose values are 1.44 for Astrophysical Journal, 1.70 for Ecology, and 1.66 for Circulation. (D) Pairwise comparison of citation distributions for different years for a given journal. We show the matrices of p-values obtained using the Kolmogorov-Smirnov test [29] for the Astrophysical Journal, Ecology, and Circulation. We color the matrix elements following the color code on the right. p-values close to one mean that it is likely that both distributions come from a common underlying distribution; p-values close to zero mean that is it very unlikely that both distributions come from a common underlying distribution. We then use a box-diagonal model [28] to identify contiguous blocks of years for which the p-value is large enough that the null hypothesis cannot be rejected. The white lines in the matrices indicate the best fit of a box-diagonal model. We identify the first box with more than 2 years for which to be the steady-state period (see Methods). These results are not restricted to the Journal of Biological Chemistry; displays these two characteristics for nearly all journals we analyzed (see Appendix S2). However, as illustrated in Figure , the mean value of ℓ in the steady state,and the time τ(J) needed to reach the steady state depend on the journal—for example, τ(Astrophysical Journal) is more than twice τ(Circulation), yet . The existence of a steady state for prompts us to investigate: (i) the functional form of , and (ii) whether there is a universal functional form for all journals. As others have noted [19], many papers remain uncited even decades after their publication. For those papers that do get cited, the total number of citations varies over five orders of magnitude (the most highly-cited paper in the data [20] had received 196,452 citations by the end of 2006). Nevertheless, ℓ follows a distribution that is approximately normal (Figures ). In order to explain our empirical findings, we develop a model for the asymptotic number of citations a paper published in journal J will receive. Our first assumption is that the papers published in journal J have a normal distribution of “quality”, q∈N(μ,σ), where μ and σ depend on J. The simplest model is to equate the ultimate impact with quality, ℓ≈q, so that n≈10. However, since 10 is a continuous random variable, whereas n is integer-valued, the model needs further refinement. In particular, the model must also specify how the continuous values of q map onto the discrete values of n. For generality, we introduce an additional parameter γ to the model, such that One can interpret γ as the value of q at which one can expect a paper to get cited once (Figure ). More generally, one could write n = floor(10−γ), where ε∈N(0,σ), to account for external influences to the number of citations. For example, assuming γ = 0 and q = 3, one would get n = 794 for ε = −0.1 and n = 1258 for ε = 0.1. However, if ε is independent of J, will not be significantly affected by ε. Thus, even though the number of citations to individual papers may change, the mean for a journal will not. To demonstrate the agreement between our model and the data, in Figure 2 we plot the moments of the empirical distributions for each journal together with the predictions of our model for those quantities. It is visually apparent that the model provides a close description of the data.
Figure 2

Modeling the steady-state distributions of the number of citations for papers published in a given journal.

(A) Our model assumes that the “quality” of the papers published by a journal obeys a normal distribution with mean μ and standard deviation σ. The number of citations of a paper with quality q∈N(μ,σ) is given by Eq. (3). Because the quality is a continuous variable whereas the number of citations is an integer quantity, the same number of citations will occur for papers with qualities spanning a certain range of q. In particular, all papers for which q

Modeling the steady-state distributions of the number of citations for papers published in a given journal.

(A) Our model assumes that the “quality” of the papers published by a journal obeys a normal distribution with mean μ and standard deviation σ. The number of citations of a paper with quality q∈N(μ,σ) is given by Eq. (3). Because the quality is a continuous variable whereas the number of citations is an integer quantity, the same number of citations will occur for papers with qualities spanning a certain range of q. In particular, all papers for which qfits). Notice that σ is almost independent of . The solid line corresponds to σ = 0.419, the mean of the estimated values of σ for all journals (see Methods). (C) Scatter plot of the estimated value of γ+1 for versus . Notice the strong correlation between the two variables. The solid line corresponds to (see Methods for details on the fit). (D) Fraction of uncited papers as a function of . For this and all subsequent panels, solid lines show the predictions of the model using , σ = 0.419, and a value of μ for each (see Methods). (E) Variance of ℓ as a function of . (F) Skewness of ℓ as a function of . The skewness of the normal distribution is zero. (G) Kurtosis excess of ℓ as a function of . The kurtosis excess of the normal distribution is zero. Note how, for the case of , the moments of the distribution of citations for cited papers deviate significantly from those expected for a normal distribution. In contrast, for , only a small fraction of papers remains uncited, so deviations from the expectations for a normal distribution are small.

Discussion

Our finding that the distribution of number of citations is log-normal is in agreement with recent generative models of the citation network [21], [22] that predict a log-normal distribution for subsets of papers related by content similarity. Note that this result is not in disagreement with prior claims about the power-law behavior of the citation distribution [23], as the convolution of many log-normal distributions with different means can yield a distribution that can be hard to distinguish from a power law. The findings reported in Figures 1 and 2 demonstrate that there is a quantity, related to the ultimate impact of a paper, which for papers published in a given journal is normally distributed. For all papers published in journal J, that quantity has a well-defined mean, q̅(J) = μ, implying that the average q of the papers is representative of the q of all the papers published in the journal and, thus, of the q of the journal. Our findings thus suggest the possibility of ranking journals according to q̅(J). To this end, we turn to a heuristic used in information retrieval called the Probability Ranking Principle [24]. This principle dictates that the optimal ranking of a set of journals will be the one that maximizes the probability that given a pair of papers (a,b) from journals A and B, respectively, q(a)>q(b) if A is above B in that ranking. This probability is also known as the multi-class “area under curve” (AUC) statistic [25]–[27] (see Methods and Appendix S1 for details). We rank journals in different fields according to both q̅(J) and the JIF. Figure 3 illustrates the effectiveness of these two ranking schemes for separating papers into different journals based on their impact. In Appendix S3, we provide rankings and the value of the multi-class AUC statistic for all fields. Our analysis demonstrates that the ranking scheme defined by q̅(J) is very similar to the optimal ranking.
Figure 3

Comparison of citation-based journal ranking schemes.

We present results for 13 journals that the ISI classifies primarily in experimental psychology, and 36 journals that the ISI classifies primarily in ecology (see Appendix S3 for other fields). For every pair of journals, J and J, belonging to the same field, we obtain the probability p that a randomly selected paper published in J has received more citations than a randomly selected paper published in J. We rank the journals in each field according to three schemes: (A) optimal ranking RAUC, that is, the ranking that maximizes p for R(i)

Comparison of citation-based journal ranking schemes.

We present results for 13 journals that the ISI classifies primarily in experimental psychology, and 36 journals that the ISI classifies primarily in ecology (see Appendix S3 for other fields). For every pair of journals, J and J, belonging to the same field, we obtain the probability p that a randomly selected paper published in J has received more citations than a randomly selected paper published in J. We rank the journals in each field according to three schemes: (A) optimal ranking RAUC, that is, the ranking that maximizes p for R(i) Our analysis also demonstrates that the mean number of citations and the JIF provide particularly inaccurate ranking schemes. This finding is particularly important because some journals and some fields benefit greatly in reputation from the biases in the JIF, while others are at a disadvantage (see Figure 4 and Tables 1 and 2).
Figure 4

Effect of JIF biases on the ranking of journals.

(A) Comparison of the rankings of journals obtained using the JIF and the AUC statistic. Though there are clear correlations between the two rankings, deviations can be extremely large. (B) Probability density function of ΔR(i) = RJIF(i)−RAUC(i). Positive values of ΔR indicate under-rating of the journal. (C) Probability density function of change in the median ranking of the journals primarily classified in a given field, for fields with at least two journals. The papers published in journals classified in fields that are over-rated tend to get cited quickly (probably because of faster publication times), whereas papers published in journals in under-rated fields take longer to start accruing citations. Table S1 lists the median change of rank for each field.

Table 1

Rankings for the field of ecology.

Rank nSteady state
AUCJIFJournal abbreviation σ Q2JIFperiod
11ECOLOGY1.750.3371.1524.7821974–1994
22AM NAT1.720.4080.4484.6601967–1992
34EVOLUTION1.670.3569.8434.2921973–1993
414BEHAV ECOL SOCIOBIOL1.600.3144.4362.3161978–1990
58J ANIM ECOL1.570.3447.5333.3901954–1996
65J ECOL1.550.3545.1324.2391973–1996
715MAR ECOL-PROG SER1.470.3133.6262.2861991–1995
86CONSERV BIOL1.420.4237.5233.7621988–1998
97FUNCT ECOL1.420.3229.6233.4171989–1996
109OIKOS1.410.3534.2223.3811974–1995
1110OECOLOGIA1.400.2927.5223.3331994–1997
1217J EXP MAR BIOL ECOL1.310.3023.5181.9191988–1995
133J APPL ECOL1.310.3625.6174.5271965–2000
1423BIOTROPICA1.300.3825.5171.3911975–1994
1513J VEG SCI1.280.3622.4162.3821989–1999
1622POLAR BIOL1.270.3320.8161.5021981–1994
1728ENVIRON BIOL FISH1.240.4221.0150.9341981–1990
1812BIOL CONSERV1.210.3822.1142.8541988–1996
1911J BIOGEOGR1.210.3620.4132.8781976–1998
2021J WILDLIFE MANAGE1.190.3418.3131.5381984–1995
2118J CHEM ECOL1.110.3114.4101.8961995–1998
2232AM MIDL NAT1.070.3614.590.6671964–1995
2326WILDLIFE RES1.050.3211.691.0321990–1997
2424PEDOBIOLOGIA0.990.4112.881.3471965–1997
2520AGR ECOSYST ENVIRON0.970.4411.471.8321982–2001
2619ECOL MODEL0.910.4011.361.8881977–1998
2730J RANGE MANAGE0.900.379.960.8591966–1995
2829BIOCHEM SYST ECOL0.890.369.260.9061980–1994
2931WILDLIFE SOC B0.870.408.760.8431983–1998
3025J ARID ENVIRON0.830.387.851.2381989–2000
3134SOUTHWEST NAT0.720.385.940.3091980–1994
3233J NAT HIST0.720.386.440.6311966–2000
3335CAN FIELD NAT0.690.405.830.0731983–1993
3416LANDSCAPE URBAN PLAN0.670.415.332.0291985–2004
3527J SOIL WATER CONSERV0.650.497.030.9491966–2002
3636NAT HIST-0.320.440.300.0591989–2005

We consider the 36 journals that are primarily classified in the field of ecology according to the ISI. We rank journals according to: (i) the maximization of the multi-class AUC statistic for the steady-state distributions and (ii) the JIF; q̅ and σ are the model parameters obtained using ; n̅ and Q2 are the mean and median number of citations in the steady state. We also show the steady-state period.

Table 2

Rankings for the field of experimental psychology.

Rank nSteady state
AUCJIFJournal abbreviation σ Q2JIFperiod
14J EXP PSYCHOL LEARN1.550.3547.5342.6011992–1995
26J EXP PSYCHOL HUMAN1.560.3852.1322.2611974–1995
32PSYCHOPHYSIOLOGY1.470.3641.8273.1591985–1995
41NEUROPSYCHOLOGIA1.480.4148.6273.9241964–1995
510MEM COGNITION1.380.4034.0211.5121977–1997
65BRAIN LANG1.250.3322.9162.3171992–1997
712J EXP ANAL BEHAV1.220.3823.8141.2211970–1991
811PERCEPT PSYCHOPHYS1.200.4123.5131.4821965–1996
98J EXP CHILD PSYCHOL1.150.3920.0122.0621963–1999
109PERCEPTION1.070.4517.791.5851973–1995
117ACTA PSYCHOL0.840.5513.252.0941955–2001
123BRAIN COGNITION0.730.619.132.8581995–1999
1313PERCEPT MOTOR SKILL0.540.424.520.3331970–1995

We consider the 13 journals that are primarily classified in the field of experimental psychology according to the ISI. We rank journals according to: (i) the maximization of the multi-class AUC statistic for the steady-state distributions and (ii) the JIF; q̅ and σ are the model parameters obtained using ; n̅ and Q2 are the mean and median number of citations in the steady state. We also show the steady-state period.

Effect of JIF biases on the ranking of journals.

(A) Comparison of the rankings of journals obtained using the JIF and the AUC statistic. Though there are clear correlations between the two rankings, deviations can be extremely large. (B) Probability density function of ΔR(i) = RJIF(i)−RAUC(i). Positive values of ΔR indicate under-rating of the journal. (C) Probability density function of change in the median ranking of the journals primarily classified in a given field, for fields with at least two journals. The papers published in journals classified in fields that are over-rated tend to get cited quickly (probably because of faster publication times), whereas papers published in journals in under-rated fields take longer to start accruing citations. Table S1 lists the median change of rank for each field. We consider the 36 journals that are primarily classified in the field of ecology according to the ISI. We rank journals according to: (i) the maximization of the multi-class AUC statistic for the steady-state distributions and (ii) the JIF; q̅ and σ are the model parameters obtained using ; n̅ and Q2 are the mean and median number of citations in the steady state. We also show the steady-state period. We consider the 13 journals that are primarily classified in the field of experimental psychology according to the ISI. We rank journals according to: (i) the maximization of the multi-class AUC statistic for the steady-state distributions and (ii) the JIF; q̅ and σ are the model parameters obtained using ; n̅ and Q2 are the mean and median number of citations in the steady state. We also show the steady-state period. The bias introduced by the JIF arises directly from the major methodological problems raised against using citation analysis to evaluate journals. First, the mean number of citations to papers published in a journal is not representative of the number of citations to each individual paper [11], a point that our analysis systematically confirms. However, we show that q̅(J) is representative of the q of the papers published in journal J, that being the reason why ranking according to q̅(J) is efficient. Second, citation behavior varies by field [11]. Our analysis again confirms this. Nevertheless, we show that by comparing the steady-state behavior of a set of journals and keeping comparisons to within fields, one can accurately rank a set of journals. Our findings provide a quantitative measure of the efficacy of academic journals, through the work of editors and reviewers, at organizing research based on their prediction of the ultimate impact of that research. Even though far from perfect, the journal system and the ranking of journals provides a powerful heuristic with which to locate the research that will ultimately have the largest impact.

Methods

Identifying steady-state regions

We use the time evolution of to identify transient and steady-state periods. (Figures 1) In the steady state, whereas in the transient period . Because of the noisy fluctuations in the time series, we use a moving average considering the five previous years of the derivative. We define the duration of the transient regime as τ = 2006−Y 0, where Y 0 is largest value of Y for which the moving average is <0.005. We also determine the periods during which the citation distribution is stable. To this end, we compare the citation distribution for all pairs of years using the Kolmogorov-Smirnov test and fit a box-diagonal model to the matrix of p-values. We then identify the periods for which we cannot reject the hypothesis that the citation distribution is stationary [28]. The distribution that we use for comparison is the most recent stationary period before Y 0.

Estimating μ, σ, and γ for a journal

For each steady-state citation distribution, our model (Eq. 3) has three parameters that must be estimated: μ, σ, and γ. To the best of our knowledge, no maximum likelihood estimation procedures exist for the parameters of this model, so we estimate the parameters by minimizing the χ 2 statistic (see Appendix S4 for plots of all the fits)where p is the fraction of papers with n citations, and is the probability of having a paper with n citations according to our model (Eq. 3) In practice, we bin the empirical data so that we have at least ten data points in each bin. This is especially important for the tails of the distribution. Then, the contribution to χ 2 iswhere , and . The fitting parameters suggest that σ has a slight dependency on (Figure ). In contrast, we find that there is a strong dependency of γ on (Figure )with C 0 = 0.91±0.02 and C 1 = 1.03±0.02. For simplicity, when comparing properties of the empirical distributions to model predictions (Figures ), we assume that σ = 0.419 and that . Assuming these two dependencies, one can then obtain a relationship between μ and as As shown in Figure , the estimated value of γ displays large fluctuations to which the remaining parameters in the fit (μ,σ) are very sensitive. In order to obtain a less noisy estimate for those parameters, we fix γ using the relationship in Eq. 7, and estimate μ and σ by minimizing χ 2. The estimate we obtain for μ = q̅ is the one we use for ranking journals (Figure and Tables , ).

Calculating multi-class AUC

We define the best ordering as the one that maximizes the value of the multi-class AUC statistic. For a set of journals and a journal ranking R, we define the multi-class AUC statistic M(F,R) as [27] We denote as p(R) the probability that given a pair of papers (a,b) from journals J and J such that R(A)q(b). We denote as w the weight we assign to each probability, which depends on the number of papers N and N published in journals J and J during the steady-state period, as follows In principle, one could calculate the multi-class AUC statistic for every permutation of the ordering of journal citation distributions, and choose the ordering that gives the highest value. However, the number of permutations of a sequence of even modest size is unwieldy. Fortunately, in almost all cases, the distributions obey the property of transitivity, that is, if a>b and b>c, then a>c, which simplifies the optimization task. In the few cases where the transitivity condition does not hold, we resort to brute-force optimization, and resolve the ambiguity in the ordering by permuting the order of each distribution and finding the permutation that maximizes the multi-class AUC statistic. Supporting information text, and description of other supporting information files. (0.06 MB PDF) Click here for additional data file. Citation history for the 2,266 journals included in our analysis in alphabetical order. For a detailed description of the plots see the caption of panel C in Figure 1. (19.10 MB PDF) Click here for additional data file. Comparison of ranking schemes for all the fields listed in the WoS. (12.95 MB PDF) Click here for additional data file. Fit to the steady-state citation distribution for the 2,266 journals included in our analysis in alphabetical order. (21.06 MB PDF) Click here for additional data file. Median change of rank from JIF to optimal ranking for all fields with at least two journals with more than 50 articles published during the steady-state period. (0.00 MB TXT) Click here for additional data file.
  12 in total

1.  An index to quantify an individual's scientific research output.

Authors:  J E Hirsch
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-07       Impact factor: 11.205

2.  Judge a paper on its own merits, not its journal's.

Authors:  Shu-Dong Zhang
Journal:  Nature       Date:  2006-07-06       Impact factor: 49.962

3.  Cash for papers: putting a premium on publication.

Authors:  Ichiko Fuyuno; David Cyranoski
Journal:  Nature       Date:  2006-06-15       Impact factor: 49.962

4.  Extracting the hierarchical organization of complex systems.

Authors:  Marta Sales-Pardo; Roger Guimerà; André A Moreira; Luís A Nunes Amaral
Journal:  Proc Natl Acad Sci U S A       Date:  2007-09-19       Impact factor: 11.205

5.  Experimental study of inequality and unpredictability in an artificial cultural market.

Authors:  Matthew J Salganik; Peter Sheridan Dodds; Duncan J Watts
Journal:  Science       Date:  2006-02-10       Impact factor: 47.728

6.  Research papers: who's uncited now?

Authors:  D P Hamilton
Journal:  Science       Date:  1991-01-04       Impact factor: 47.728

7.  Impact factors can mislead.

Authors:  H F Moed; T N van Leeuwen
Journal:  Nature       Date:  1996-05-16       Impact factor: 49.962

8.  Why the impact factor of journals should not be used for evaluating research.

Authors:  P O Seglen
Journal:  BMJ       Date:  1997-02-15

9.  Cleavage of structural proteins during the assembly of the head of bacteriophage T4.

Authors:  U K Laemmli
Journal:  Nature       Date:  1970-08-15       Impact factor: 49.962

10.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

View more
  26 in total

1.  Robotic-assisted versus standard unicompartmental knee arthroplasty-evaluation of manuscript conflict of interests, funding, scientific quality and bibliometrics.

Authors:  Leonardo Cavinatto; Michael J Bronson; Darwin D Chen; Calin S Moucha
Journal:  Int Orthop       Date:  2018-10-05       Impact factor: 3.075

2.  Highly cited German research contributions to the fields of radiation oncology, biology, and physics: focus on collaboration and diversity.

Authors:  C Nieder
Journal:  Strahlenther Onkol       Date:  2012-08-23       Impact factor: 3.621

3.  The spread of scientific information: insights from the web usage statistics in PLoS article-level metrics.

Authors:  Koon-Kiu Yan; Mark Gerstein
Journal:  PLoS One       Date:  2011-05-16       Impact factor: 3.240

4.  The impact of article titles on citation hits: an analysis of general and specialist medical journals.

Authors:  Thomas S Jacques; Neil J Sebire
Journal:  JRSM Short Rep       Date:  2010-06-30

Review 5.  Brain metastases research 1990-2010: pattern of citation and systematic review of highly cited articles.

Authors:  Carsten Nieder; Anca L Grosu; Minesh P Mehta
Journal:  ScientificWorldJournal       Date:  2012-09-17

6.  Correlation between article download and citation figures for highly accessed articles from five open access oncology journals.

Authors:  Carsten Nieder; Astrid Dalhaug; Gro Aandahl
Journal:  Springerplus       Date:  2013-06-13

7.  The impact of boundary spanning scholarly publications and patents.

Authors:  Xiaolin Shi; Lada A Adamic; Belle L Tseng; Gavin S Clarkson
Journal:  PLoS One       Date:  2009-08-18       Impact factor: 3.240

8.  The possible role of resource requirements and academic career-choice risk on gender differences in publication rate and impact.

Authors:  Jordi Duch; Xiao Han T Zeng; Marta Sales-Pardo; Filippo Radicchi; Shayna Otis; Teresa K Woodruff; Luís A Nunes Amaral
Journal:  PLoS One       Date:  2012-12-12       Impact factor: 3.240

9.  Move-by-move dynamics of the advantage in chess matches reveals population-level learning of the game.

Authors:  Haroldo V Ribeiro; Renio S Mendes; Ervin K Lenzi; Marcelo del Castillo-Mussot; Luís A N Amaral
Journal:  PLoS One       Date:  2013-01-30       Impact factor: 3.240

10.  In science "there is no bad publicity": papers criticized in comments have high scientific impact.

Authors:  Filippo Radicchi
Journal:  Sci Rep       Date:  2012-11-08       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.