Literature DB >> 35947579

Trends of research productivity across author gender and research fields: A multidisciplinary and multi-country observational study.

Milad Haghani1, Alireza Abbasi2, Clara C Zwack3, Zahra Shahhoseini4, Nick Haslam5.   

Abstract

Bibliographic properties of more than 75 million scholarly articles, are examined and trends in overall research productivity are analysed as a function of research field (over the period of 1970-2020) and author gender (over the period of 2006-2020). Potential disruptive effects of the Covid-19 pandemic are also investigated. Over the last decade (2010-2020), the annual number of publications have invariably increased every year with the largest relative increase in a single year happening in 2019 (more than 6% relative growth). But this momentum was interrupted in 2020. Trends show that Environmental Sciences and Engineering Environmental have been the fastest growing research fields. The disruption in patterns of scholarly publication due to the Covid-19 pandemic was unevenly distributed across fields, with Computer Science, Engineering and Social Science enduring the most notable declines. The overall trends of male and female productivity indicate that, in terms of absolute number of publications, the gender gap does not seem to be closing in any country. The trends in absolute gap between male and female authors is either parallel (e.g., Canada, Australia, England, USA) or widening (e.g., majority of countries, particularly Middle Eastern countries). In terms of the ratio of female to male productivity, however, the gap is narrowing almost invariably, though at markedly different rates across countries. While some countries are nearing a ratio of .7 and are well on track for a 0.9 female to male productivity ratio, our estimates show that certain countries (particularly across the Middle East) will not reach such targets within the next 100 years. Without interventional policies, a significant gap will continue to exist in such countries. The decrease or increase in research productivity during the first year of the pandemic, in contrast to trends established before 2020, was generally parallel for male and female authors. There has been no substantial gender difference in the disruption due to the pandemic. However, opposite trends were found in a few cases. It was observed that, in some countries (e.g., The Netherlands, The United States and Germany), male productivity has been more negatively affected by the pandemic. Overall, female research productivity seems to have been more resilient to the disruptive effect of Covid-19 pandemic, although the momentum of female researchers has been negatively affected in a comparable manner to that of males.

Entities:  

Mesh:

Year:  2022        PMID: 35947579      PMCID: PMC9365186          DOI: 10.1371/journal.pone.0271998

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


Introduction

The growth of global scholarly research activity and its dynamics have for years been a major focus of fields such as science of science and information science [1-4]. Previous studies have looked at the expansion of scholarly literature and have attempted to infer current patterns and predict future directions [5, 6]. Such information is essential for informing scientific communities and policy makers of how the landscape of research around the world is changing. Such information can help governments allocate research funding, research institutions guide hiring and promotion decisions, and individual researchers maximise the impact of their work. While progression of scholarly research can be measured through a variety of indicators [7], a primary indicator of the growth of research has traditionally been the number of scholarly publications [2, 4, 8]. Despite the imperfectness of this metric for measuring research productivity—both at the level of individual scientists [9-11] and aggregate levels [12-14]—it has remained the primary metric for quantifying research production [8]. As Riviera [15] has put it “Communication in science is realized through publications. Thus, scientific explanations, and in general scientific knowledge, are contained in written documents constituting scientific literature” (p. 1446). It has been established that the growth of science does not take place at an equal rate across fields [2]. The phenomenon of “field variation” or “field-dependence” [16, 17] has been document in several studies [18]. One component of our analyses is dedicated to identifying differential patterns of growth in different fields of science while also detecting potential disruptive impacts of Covid-19 pandemic on such patterns. Studies that have investigated or predicted the potential disruptive effects of Covid-19 pandemic on research productivity have mostly produced signs of negative impact [19-21]. These are predominantly based on the analysis of pre-prints of publications (or working papers). Negative impact has been documented in relation to the knowledge production of social scientists [22] and economists [23]. However, some early analyses predicted an increase in productivity in certain fields such as economics and finance [24]. It has also been argued that the pandemic will affect research fields and groups of researchers differently. Termini and Traver [19], for example, point out that “In response to the pandemic, research institutions have enacted strict changes to permitted research operations, requiring scientists to abide by social distancing guidelines in the laboratory, facility closures, and ramped down laboratory activities. While scientists at all stages in their careers have been impacted by these changes to the research environment, early career scientists such as postdoctoral fellows and junior faculty are particularly vulnerable during these unconventional times” (p1). Of particular interest in the present study is to determine which research fields experienced the largest impact of the pandemic disruption, based on publication output. This study is also focused on determining potential discrepancies in the disruption of productivity trends across male and female scientists. The issue of gender representation in scientific publishing is perhaps one of the most documented and most heavily discussed aspects in the science of science [25-27]. Several case studies have documented the gender inequality problem across different countries such as Italy [28], Canada [29], Sweden [30], and Norway [31], to name a few. The origins of the gender gap problem have been much debated in the literature and several explanations have been offered by previous studies [32-36]. Clearly, when considering the aggregate representation of male and female researchers in publications, one main factor that explains the gap is under-representation of female scientists in academic positions [37]. Other factors have also been investigated, most important of which being motherhood and childcare responsibilities [38-41], and the disparities in the extent of opportunities for collaborations across male and female scientists [27, 38, 42]. In view of these contributors to the gender gap in research, how might the Covid-19 pandemic have had a differential impact on female and male researchers? The dynamics of family responsibilities and childcare have undergone considerable changes in 2020 [43-48] as a result of the work-from-home arrangements and/or lack of access to external childcare services in many countries. This raises the question of whether this factor has played a role in overall (aggregate) productivity of female scientists. Covid-related loss of access to facilities, mentorship and in-person meetings with peers may have exacerbated the gender gap in research productivity by reducing opportunities for collaboration. Conversely, as online meetings and webinars became more prevalent since the onset of the pandemic, more opportunities may have arisen for collaborations, especially at an international level. Although there is insufficient research evidence to date, some writers have argued that the pandemic’s negative impacts will be fall disproportionately on women. It has been argued that “The coronavirus disease 2019 (COVID-19) pandemic has upended almost every facet of academia (1). Almost overnight the system faced a sudden transition to remote teaching and learning, changes in grading systems, and the loss of access to research resources. Additionally, shifts in household labour, childcare, eldercare, and physical confinement have increased students’ and faculty’s mental health needs and reduced the time available to perform academic work…Many women academics will likely bear a greater burden during the coronavirus disease 2019 (COVID-19) pandemic” (p. 27) [49]. Similarly the findings of a preliminary survey of American and European scientists in April 2020 predicted that “female scientists and those with young dependents were to be affected disproportionately” [50]. Nearly two years since the onset of the pandemic, now there is an opportunity to objectively quantify these effects. This study aims to examine trends in overall research productivity as well as the possible differential productivity across research fields (1970–2020) and author genders (2006–2020). While the trends in representation of male and female scholarly publication are revisited across different cultures (2006–2020), the potential disruptive impacts of Covid-19 are also quantified based on research production of male and female scientists during 2020, as compared with their respective trends. The data also provides insight into the trends in the gap between male and female representation in scholarly publications and the expected time frame for the gap to close in different world regions should existing trends continue.

Methods and data collection

Three sets of publication meta data were collected from WoS, each using a combination of search query strings. The query strings are all formulated for the “Advanced Search” engine of the WoS Core Collection. All analyses are based on meta data of publication counts and they reflect “all document types”.

Research fields

The first set of meta data contains records of publication counts of the top 100 WoS Categories, in terms of the volume of publications attributed to each category (out of the 254 categories recognised by the WoS). The list has been obtained by acquiring all publication records since 2000 and listing the WoS Categories in descending order based on the number of publications counts. Subsequently, 100 different search queries were formulated and entered into WoS Advanced Search. Each query reads as “WC = [the name of the category as specified by the WoS]”, where WC is the Field Tag for WoS Category. From the outcome of each query, the meta data of publication count (of all document types), during Jan 1970 till Dec 2020, were exported and stored. This time period was applied to all disciplines. While the data of each category is analysed separately, for the presentation purposes, these categories were also further categorised into “broad discipline areas”. These areas include (in alphabetical order) “Agricultural Sciences”, “Arts & Humanities, Interdisciplinary”, “Biology & Biochemistry”, “Chemistry”, “Clinical Medicine”, “Computer Science”, “Economics & Business”, “Engineering”, “Environment/Ecology”, “Geosciences”, “History & Archaeology”, “Literature & Language”, “Materials Science”, “Mathematics”, “Multidisciplinary”, “Philosophy & Religion”, “Physics”, “Plant & Animal Science”, “Psychiatry/Psychology”, “Social Sciences, General” and “Visual & Performing Arts”. Only broad areas that have at least one category in the list of top 100 get a mention in our analysis. Also, some of the WoS Categories have been attributed to more than one broad category. In such cases and for presentation purposes, we allocated the category randomly to one broad discipline.

Author gender

Two further sets of meta data were exported for the gender analyses. Determination of the author gender has traditionally been made based on the first name of the authors. This, however, is typically done on a pre-sampled set of publications, whereas here, we sought to establish a reverse approach where a search query is used to generate samples of papers with male and female authors. In doing so, we formulated our queries based on the Author(s) search function of the WoS. According to the WoS guidelines for searching names of authors, there is no way to only search for the first name of authors. The acceptable format is AU = [“surname” SPACE “first name”] (where AU is the Field Tag), and if only one entry is specified within the search, then that is regarded as the surname by the search engine. So, the question would be; is there any way that one can generate all publications with authors of a specific first name (in any position of authorship, first, middle or last, noting that the WoS search engine does not differentiate between authorship positions). It was discovered that this can be done through the asterisk (*) wildcard. Consider the query AU = A* Albert. Such query would return any publication on which an author with first name of “Albert” and a surname with the initial “A” is listed. When the query is extended to AU = (A* Albert OR B* Albert OR C* Albert OR … OR Z* Albert) (where OR is a Boolean operator), it generates all documents on which at least one Albert is listed as an author. This method constitutes the essence of our proposed search strategy. Assume that one develops a query string by including all common first male/female names of a certain language, say German. Then such query would (approximately) generate all publications on which at least one author with a German male/female first name is listed. Another consideration, however, is the limit for the number of search terms imposed by the WoS. A WoS advanced search query cannot contain more than 16,000 Boolean operators, and if a full list of male/female names of a certain language is to be repeated 26 times, then the list cannot contain a large number of names. In consideration of such limitation and also given the fact that for several languages that we sampled from there are several hundreds of male/female names, we adopted two different sampling methods. One is called here the “long list of first names, A-C surname initials” (hereafter, the A-C method) and the other is called “short list of first names, A-Z surname initials” (hereafter, the A-Z method). In the A-C method, for each given language, we composed a long (as comprehensive as possible) list of male first names and a similar list of female first names, but only considered the first three letter of the alphabet for the surname initials (so that the list of first names only has to be repeated 3 times). In the latter, however, we composed a list of 30 most popular names of each given language, one for females and one for males, but repeated that list 26 times to include every letter of the alphabet for the surname initials. Popular first names given to people born in the late 1950’s to 1990’s were obtained as the sample for this study to ensure that it was representative of scholars active in academia from 2000–2020 (given the average age of an academic is around 40 years). Government registries and records (i.e., registry of Births, Deaths and Marriages) were used to obtain the names, however, for certain countries (i.e., Iran, India) this information was not publicly available. Instead, a minimum of two non-government databases were used to compile the list of top 30 first names, and it was essential criteria that they had separate lists for respective decades (1950’s-1990’s). In countries where a common language is spoken (for example UK/Australia/USA, Brazil/Portugal), the list of top 30 names was compiled using a combination of the most common names in those countries. Unisex names were excluded from the lists, and if a first name had alternative spellings (i.e., addition of an accent), the name and its alternatives were counted as one. To ensure further accuracy, two and three letter names were excluded, as they were often shared across multiple languages. For example, ‘Jan’ is a common European male name, however, in English it is a female name, and is also an abbreviation of the word January. All lists of names are accessible in the Online Supplementary Material of the paper. In total, 14 different languages were considered for both A-C and A-Z methods. This includes (in alphabetical order) Arabic, Dutch, English, French, German, Hindi, Italian, Japanese, Korean, Persian, Portuguese, Russian, Spanish and Turkish. These languages were selected based on two criteria: (1) they are amongst the most widely spoken languages around the world, and (2) the languages allow for the female and male names to be easily distinguished. In order to increase specificity of the resultant data, for each method (A-C or A-Z) and each language, the search query was combined with the name of a country where that language is predominantly spoken (e.g., CU = Germany AND AU = (A* Albert OR B* Albert OR C* Albert …)) (where CU is the Field Tag for countries and AND is a Boolean operator). Only countries that are included in the list of top 100 in terms of the quantity of their scholarly publications were considered. This resulted in 37 different combinations of country and language. For each query (i.e., associated with the male/female A-C/A-Z list of each country-language combination), the meta data of publication count (for all document types) for the period of 2006–2020 was exported and stored for analysis. The reason for going back to only 2006 was that for some country-language combinations, the data before 2006 was scarce and/or lacking discernible patterns. Therefore, for the sake of consistency in comparisons, only the counts of publications from 2006 onwards were considered for all country-language combinations (i.e., a 15 year-long history of their publication records). It should also be noted that the resulting counts are based on number of publications with at least one female author or with at least one male author. Therefore, the counts do not take into account fractional contributions to papers and so cannot be inferred to directly reflect proportional contribution to publications. Most previous studies on the gender gap in authorship are based on determination of the gender in individual documents of a sample of publications. Such an approach, while more nuanced than that of ours, often places a constraint on the number of documents that can be analysed. Typically, studies that use the abovementioned method rely on a sample of a several thousands of articles/pre-prints [43, 51]. Contrastingly, our method is query based, a novel approach in this research area. It is more suited for providing a bigger picture and broadening the scope and scale of the investigation, but that comes at a cost. The control over certain details such as position of authorship, or proportion of male versus female authors on each document is compromised. The series of formulated queries can be used to directly generate the set of publications in which male/female scientists are listed as authors. This directly examines the records of more than 75 scholarly publications without the pre-requisite of exporting their bibliographic data. It also does not rely on the use of any application programming interfaces. The method works only with the meta-data of publications and exempts the analyst from collecting a sample of publications. As such, it can be applied to the entire record of the WoS, without the need to export those records. The method can therefore be used as a benchmark for tracking overall productivity of male and female academics and their relative degree of representation in published documents. Our query-based approach also provides a foundation for replicating the data of overall gender productivity and observing changes in the trends over time. Additionally, it can be further modified to investigate author gender patterns within specific scholarly disciplines.

Method for analysing the growth and gap

Consistent metrics were produced based on each of the abovementioned sets of meta data of publication count. Each metric is used to either quantify trends of productivity over the last 50 years (or 15 years, for the gender analyses), or to focus particularly on 2020 and the impact on the pandemic on such trends. For each dataset, the Actual Growth (AG) of publication counts were calculated associated with each year t, as Eq 1, where PubCnt(t) signifies publication count in year t. The averages of this quantity (AV-G) over the last five, 10 and 15 years (for genders) and five, 10, 20 and 50 years (for disciplines) are reported for each entity of analysis. This quantity is also specifically reported for the year 2020. From here on, when we mention AG without specifying a year, we refer to the 2020 version of this quantity. This is in contrast to the Deviation of Growth (DG) (in 2020) from the projected value. This quantity is calculated as below. Firstly, the data point of 2020 is excluded from the set of publication count. Then based on the historical data of 1970 (or, 2006)-2019 (depending on whether analysis of disciplines or genders is concerned), a polynomial curve of degree four is fitted to the data (this fitted curve has been visualised for each data set in the Results section). Using this regression analysis, we quantify the publications counts that was expected to manifest in 2020 (based on the record of 15 or 50 years of trend in publication count). This quantity is referred to as predicted counts of publications in 2020, pred2020 (as opposed to actual number of documents in 2020, act2020). The DG quantity is then calculated as Eq 2. The above are the metrics that are common between the disciplines and gender analyses. Additional quantities, however, we calculated in relation to gender analyses. For each set of data of country-language combination, the Gap (G) and Relative Gap (RG) between male and female publications as well as the Ratio (R) of female to male publications were calculated based on Eqs 3–5 (note that by male/female publications we mean publications on which at least one male/female author has been listed, at any authorship position). In this notation maleCnt and femaleCnt respectively represent counts of male and female publications for the country-language combination of interest. Using the observation of ratios from 2006 to 2020, we also conducted an additional regression analysis on each set of data and predicted the year in which the female to male ratio of publication counts reaches .3, .5, .7 and .9 for each country-language combination. When no reasonable number could be achieved as a solution to the respective equation (e.g., more than 200 years in the future), then a dash sign “–” is reported in the respective tables of outcomes.

Results

Overall research productivity trends

Fig 1 presents the number of documents and the cumulative number of documents per year, as recorded by Thompson Reuters Web of Science (hereafter, Web of Science [WoS]). During the current century, almost invariably the number of published documents grew compared to the previous year despite major challenges such as The Global Financial Crisis of 2007–2009. However, in 2020, the first year of Covid-19 pandemic, the number of publications declined from the previous year for the first time since 2002. This divergence from the established trend in 2020 (measured almost nine months into 2021, after stabilisation of 2020 publication count) prompted a revisiting of patterns in global research productivity. In this paper, the trends in productivity are revisited across different scientific disciplines (Jan 1970- Dec 2020) and author genders (Jan 2006- Dec 2020).
Fig 1

Cumulative growth of science during 1970–2020.

Analyses across research fields

Publication counts across all top 100 WoS categories for the years 1970–2020 are presented in Figs 2–8. Categories that belong to the same broader area have been placed next to each other, while the name of the broad category that they identify with has been overlaid on each part of the figures. The blue trend curves are visualisations of polynomial curves fitted based on the data of 1970–2019. The prediction of the curves for the 2020 count of publication based on the trend is contrasted with the manifested number of publications for each category. The dashed vertical lines overlaid on each plot facilitates this comparison. The figures show the deviation of the actual number from the projected number of publications within each category. Table 1 provides various statistics related to WoS, including the actual and predicted number of publications in 2020, AG and DG, as well as average growth over the last 5, 10, 20 and 50 years. The cells containing statistics of actual and deviated growth in 2020 have been colour-coded to facilitate the comparison, with red indicating a decrease and green indicating an increase.
Fig 2

Temporal trends of research productivity in various scientific disciplines (part 1).

Fig 8

Temporal trends of research productivity in various scientific disciplines (part 7).

Table 1

Statistics of temporal trends of productivity in various domains of science.

CategoryActual 2020Predicted 2020AG% 2020DG% 2020AV-G 5y%AV-G 10y%AV-G 20y%AV-G 50y%
Arts & Humanities, Interdisciplinary
HUMANITIES MULTIDISCIPLINARY2131928680-13.64-34.53-0.441.573.629.70
Biology & Biochemistry
BIOCHEMICAL RESEARCH METHODS1994720462-4.35-2.58-0.010.234.667.97
BIOCHEMISTRY MOLECULAR BIOLOGY95562883463.717.554.772.572.413.75
BIOLOGY1900381802604.635.153.982.973.413.92
BIOPHYSICS1869818693-2.500.02-2.36-0.451.833.43
BIOTECHNOLOGY APPLIED MICROBIOLOGY40174404481.86-0.681.832.374.917.09
CELL BIOLOGY51815543517.75-4.891.252.972.576.76
ENVIRONMENTAL SCIENCES12610810452822.3917.1117.4812.579.928.19
FOOD SCIENCE TECHNOLOGY416833767711.439.617.455.287.025.49
GENETICS HEREDITY3373334666-10.03-2.763.473.802.916.16
MICROBIOLOGY71978691218.643.974.053.844.865.91
NEUROSCIENCES6655473525-11.19-10.471.772.413.366.69
PATHOLOGY2050423178-8.91-13.04-0.950.173.933.78
PHARMACOLOGY PHARMACY78192733404.836.215.664.515.534.74
PHYSIOLOGY1446418752-19.38-29.65-0.190.891.082.30
TOXICOLOGY1968919968-3.80-1.424.383.195.186.47
ZOOLOGY18164181513.470.071.761.743.933.88
Chemistry
CHEMISTRY PHYSICAL88519865334.492.246.455.535.474.75
CHEMISTRY APPLIED24182220205.358.948.625.555.733.85
CHEMISTRY INORGANIC NUCLEAR1405313767-1.382.040.510.721.483.20
CHEMISTRY MEDICINAL2031820222-5.380.473.463.186.924.68
CHEMISTRY ORGANIC1991719208-2.523.56-0.92-0.161.021.83
CHEMISTRY PHYSICAL88519865334.492.246.455.535.474.75
ENERGY FUELS70603852480.47-20.746.1510.3310.257.62
ENGINEERING CHEMICAL5245956119-0.24-6.986.046.084.814.43
NANOSCIENCE NANOTECHNOLOGY57381588314.27-2.536.156.4414.8218.97
POLYMER SCIENCE29409274397.186.707.655.184.074.65
Clinical Medicine
CARDIAC CARDIOVASCULAR SYSTEMS6025861206.731.74-1.573.673.1454.4636.8078
CLINICAL NEUROLOGY7485172769.38-2.52.786.1884.3135.8557.3554
CRITICAL CARE MEDICINE2284622257.4111.222.58-7.0481.2076.820510.6254
DERMATOLOGY2209922446.55-0.65-1.573.9523.4626.2595.7086
ENDOCRINOLOGY METABOLISM3671736005.65-3.441.943.6261.5263.5135.8438
ENGINEERING BIOMEDICAL2304823806.15-11.32-3.290.2681.3076.158.0528
GASTROENTEROLOGY HEPATOLOGY3944841967.99-5.19-6.390.9322.5613.54157.9586
GENETICS HEREDITY3373334665.56-10.03-2.763.4663.8042.91356.158
HEALTH CARE SCIENCES SERVICES3226330746.617.914.76.6527.9888.379.3542
HEMATOLOGY3021933055.71-8.41-9.39-1.6041.0323.2289.1196
IMMUNOLOGY4785848799.120.01-1.974.343.3173.69256.3486
INFECTIOUS DISEASES3031926535.1522.0112.488.4045.3765.73359.066
MEDICINE GENERAL INTERNAL9402684526.7917.7510.18.2587.0975.5823.5112
MEDICINE RESEARCH EXPERIMENTAL5357355368.6210.17-3.355.3467.17.64155.35
NUTRITION DIETETICS2356821393.858.459.237.6227.4447.5327.0384
ONCOLOGY110408120383.123.31-9.033.7945.3768.27.9152
OPHTHALMOLOGY2185126114.41-11.7-19.511.2763.1494.57256.8134
ORTHOPEDICS2326122342.514.163.955.9085.6566.73257.9124
PEDIATRICS3719738252.522.56-2.845.4783.425.29455.7514
PERIPHERAL VASCULAR DISEASE2578826405-3.51-2.39-1.0540.8992.4066.6178
PSYCHIATRY4419345902.39-7.91-3.873.6144.3184.81254.757
PUBLIC ENVIRONMENTAL OCCUPATIONAL HEALTH7349264840.3415.3511.778.6825.9887.89655.5964
RADIOLOGY NUCLEAR MEDICINE MEDICAL IMAGING4446046476.37-4.74-4.541.7482.7174.53655.0382
RESPIRATORY SYSTEM3727944691.24-0.34-19.883.3267.6519.7159.0338
SURGERY10063693422.469.147.175.5484.4014.52955.8338
TOXICOLOGY1968919967.97-3.8-1.424.383.1945.18356.4662
UROLOGY NEPHROLOGY2892626467.839.348.52.8261.5696.7359.4898
Computer Science
AUTOMATION CONTROL SYSTEMS3453643057.71-8.77-24.67-1.584.0718.03559.0812
COMPUTER SCIENCE ARTIFICIAL INTELLIGENCE5630560081.84-15.71-6.710.94.837.76719.8746
COMPUTER SCIENCE HARDWARE ARCHITECTURE1745418952.39-10.27-8.58-5.5143.5095.99858.077
COMPUTER SCIENCE INFORMATION SYSTEMS7085472944.52-9.4-2.953.6286.8558.8279.2326
COMPUTER SCIENCE INTERDISCIPLINARY APPLICATIONS3632143295.23-15.52-19.2-2.644.4037.941511.9276
COMPUTER SCIENCE SOFTWARE ENGINEERING2378524902.11-13.71-4.7-0.7062.734.5738.6768
INFORMATION SCIENCE LIBRARY SCIENCE1217114167.87-10.36-16.41-3.174-1.2150.5794.041
Economics & Business
BUSINESS37153386761.35-4.101.232.807.765.52
ECONOMICS4565754363.78-9.86-19.070.0162.2586.99853.9658
Engineering
ENGINEERING CIVIL3846241131.868.15-6.949.6547.1437.2598.762
ENGINEERING ELECTRICAL ELECTRONIC169879242180-15.96-42.56-3.852.805.527.04
ENGINEERING MULTIDISCIPLINARY3858846104.52-0.58-19.482.2310.6959.22056.2816
MECHANICS3154636892.23-4.03-16.955.647.0915.466.056
Environment/Ecology
ECOLOGY2742927107.22-2.051.175.033.5144.73256.558
ENGINEERING ENVIRONMENTAL3409929726.1424.2812.8214.3328.7928.211.1178
WATER RESOURCES2583526606.2213.46-2.995.0645.8425.8678.7278
Geosciences
GEOSCIENCES MULTIDISCIPLINARY4310541771.7211.373.095.8467.0626.54155.8538
History & Archaeology
HISTORY4399354436.68-12.36-23.74-3.0160.3232.84454.7394
Material Sciences
MATERIALS SCIENCE MULTIDISCIPLINARY172164167687.835.662.68.9427.4437.0058.5788
PHYSICS CONDENSED MATTER4338140334.452.387.025.0322.9221.8674.3896
POLYMER SCIENCE2940927438.617.186.77.6525.1774.06554.65
Mathematics
MATHEMATICS8420482028.561.892.584.1963.2865.34154.7424
MATHEMATICS APPLIED3828439111.6-4.94-2.163.2421.3865.2096.6214
OPERATIONS RESEARCH MANAGEMENT SCIENCE1715316303.62-7.964.950.990.7426.22858.2826
Multidisciplinary
MULTIDISCIPLINARY SCIENCES95486113999.438.34-19.395.75813.2979.0323.6022
Philosophy & Religion
PHILOSOPHY2350329163.91-7.66-24.09-1.2621.6335.34955.5128
RELIGION2057926226.77-14.37-27.44-2.6621.6566.09713.6982
Physics
ASTRONOMY ASTROPHYSICS2514925920.55-3.11-3.070.362-0.0491.37553.6294
BIOPHYSICS1869818693.4-2.50.02-2.36-0.4491.83353.4324
PHYSICS APPLIED1037071091303.30-5.233.163.324.626.23
PHYSICS ATOMIC MOLECULAR CHEMICAL1944619589.666.05-0.740.521.7361.73553.8318
PHYSICS CONDENSED MATTER4338140334.452.387.025.0322.9221.8674.3896
PHYSICS MULTIDISCIPLINARY3169229105.890.288.161.8722.3643.2341.4366
PHYSICS PARTICLES FIELDS1587215527.293.782.171.9541.9242.37654.2846
Plant & Animal Science
PLANT SCIENCES3771437796.122.86-0.223.6623.9714.7024.062
Psychiatry/Psychology
PSYCHIATRY4419345902.39-7.91-3.873.6144.3184.81254.757
PSYCHOLOGY MULTIDISCIPLINARY2018422180.930.73-9.896.4427.0197.636.844
Social Sciences, General
EDUCATION EDUCATIONAL RESEARCH4695068752.09-18.61-46.44-1.3743.31111.6485.0056
POLITICAL SCIENCE2425028856.42-8.16-19-2.1740.8554.8382.8022
SOCIAL SCIENCES INTERDISCIPLINARY2016835393.32-11.27-75.49-5.966.5511.29854.4002

AG = Actual growth

DG = Deviation of growth

AV-G = Average growth

Green means sharp increase

Yellow means mild increase

Amber means mild decrease

Red means sharp decrease

5y means 5-year average, i.e., 2016–2020

10y means 10-year average, i.e., 2011–2020

15y means 15-year average, i.e., 2006–2020

AG = Actual growth DG = Deviation of growth AV-G = Average growth Green means sharp increase Yellow means mild increase Amber means mild decrease Red means sharp decrease 5y means 5-year average, i.e., 2016–2020 10y means 10-year average, i.e., 2011–2020 15y means 15-year average, i.e., 2006–2020 According to WoS records, the biggest categories of contemporary research are Biology, Engineering Electrical Electronic, Biochemistry Molecular Biology, Chemistry Multidisciplinary, Medicine General Internal, and Material Science Multidisciplinary. The fastest growing areas over the past 5 years are Environmental Sciences and Engineering Environmental. The average annual growth of publication counts in these two categories over that period have respectively been 17.5% and 14.3%. Most categories have been steadily growing, but some categories fluctuate. This includes categories such as Arts & Humanities Multidisciplinary, Biology & Biochemistry, Medicine General Internal, Medicine Research Experimental, Physics Multidisciplinary and Political Science. Categories such as Philosophy, Religion and Education Educational Research have had sustained periods of negligible growth in the past but have been increasing in recent years. In addition to Environmental categories, categories that also show exponential patterns of growth are Chemistry Analytical, Nanoscience Nanotechnology, Energy Fuels, Health Care Sciences Services, Medicine Research Experimental (since about 2000), Respiratory System, Engineering Civil, Engineering Multidisciplinary, Mechanics, Water Resources, Geosciences Multidisciplinary, Material Sciences Multidisciplinary and Multidisciplinary Sciences (since 2000). Within the various domains of science, this study uses the deviation from trends during 2020 as a proxy for the effect of the Covid-19 pandemic on research production. With this in mind, one can observe a mix of growth and decline in productivity across the broader areas. However, two areas appear to have been particularly affected are Computer Science and Social Sciences, General. In all categories of these two areas (within the list of top 100 categories), there have been fewer published papers in 2020 compared to 2019. Other categories, however, have grown markedly beyond expectation in 2020, i.e., their manifested number of publications in 2020 exceeded projected numbers. This includes categories such as Infectious Diseases, Environmental Sciences, Engineering Environmental, Medicine General Internal and Public Environmental Occupational Health.

Analyses of author genders

The temporal record of publication counts of male- and female-authored publications (i.e., publications on which at least a male/female author has been listed) across language-country combinations have been visualised in Figs 9 and 10 for the period of 2006–2020. Overlaid on each plot are also the trend curves (fitted on the data of 2006–2019) along with a dashed vertical line at year = 2020 to facilitate evaluation of departures from forecast. The last unit of the plot set shows the average data across all country-language combinations. In Figs 11 and 12, the absolute gap (in number of documents) between male- and female-authored publications (left vertical axes, black curves) along with the relative gap (right vertical axes, red curves) have been visualised for each language-country combination. Similarly, the ratios of female to male publication counts across all language-country combinations have been visualised in Figs 13 and 14. The results presented in Figs 9–14 are all based on the A-C sampling method. The corresponding results from the A-Z sampling method have been presented in Figs 15–20.
Fig 9

Temporal trends in male (blue lines) and female (red lines) total research productivity based on A-C method (part 1).

Fig 10

Temporal trends in male (blue lines) and female (red lines) total research productivity based on A-C method (part 2).

Fig 11

Gaps between male and female total research productivity, A-C method (part 1).

Fig 12

Gaps between male and female total research productivity, A-C method (part 2).

Fig 13

Ratio of female to male total research productivity, based on A-C method (part 1).

Fig 14

Ratio of female to male total research productivity, based on A-C method (part 2).

Fig 15

Temporal trends in male (blue lines) and female (red lines) total research productivity based on A-Z method (part 1).

Fig 20

Ratio of female to male total research productivity, based on A-Z method (part 2).

Tables 2 and 3 summarise the results of the analyses based on the A-C and A-Z methods, respectively. In these tables, the actual and projected number of publications of male and females in 2020, along with the actual growth (AG) and deviation from the predicted growth (DG), have been reported, along with the average of the (actual) growth over the last 5, 10 and 15 years. The last four columns of the tables report the year in which the ratio of female to male publication counts of the language-country combination is expected to reach values of .3, .5, .7 and .9 based on the current trends. When no feasible solution could be found for the equation, a dash sign “–” has been symbolically reported. We also considered estimating number of years for reaching absolute parity i.e., r = 1.0. However, for many of the countries, no reasonable number can be found as a solution to report. While the ratio for all countries is “asymptotically approaching 1.0”, a solid parity (r = 1.0) cannot be achieved for any country that is showing a divergence pattern based on absolute numbers. That would require the absolute gap to close too, and most of the countries will not have that no matter how infinitesimally close their ratio gets to 1.0. The entities of Tables 2 and 3 in the AG and DG columns have been colour-coded to better demonstrate the positive (green) and negative (red) growth during 2020.
Table 2

Statistics of temporal trends of productivity of male and female scientists based on the A-C method.

Language-countryActual 2020Predicted 2020AG% 2020DG% 2020AV-G 5y%AV-G 10y%AV-G 15y%Year ratio = .3Year ratio = .5Year ratio = .7Year ratio = .9
Arabic-Saudi Arabia_f3666365351.550.3530.2435.4533.71
Arabic-Saudi Arabia_m177501774739.690.0221.8926.0628.422045208521252166
Arabic-Egypt_f2796277936.260.6123.4324.6129.03
Arabic-Egypt_m99841000928.49-0.2518.7220.3122.972024204820732097
Arabic-Iraq_f45345328.330.0847.0250.8743.89
Arabic-Iraq_m2879290425.39-0.8833.6235.9230.712135
Arabic-Qatar_f40640328.080.8322.6033.80124.58
Arabic-Qatar_m1881188132.750.0221.3729.0129.882034206921032138
Arabic-U Arab Emirates_f63263121.310.1423.3625.6330.00
Arabic-U Arab Emirates_m2682270226.33-0.7423.3323.7823.622026205020752100
Arabic-Kuwait_f30130329.18-0.5215.6115.4219.34
Arabic-Kuwait_m1030104214.96-1.1516.2011.7914.682022205820942130
Arabic-Oman_f26526238.741.0817.8323.5726.64
Arabic-Oman_m1036104916.80-1.2717.1518.4319.432026205220782104
Arabic-Lebanon_f44443628.701.7625.3424.4226.06
Arabic-Lebanon_m99499619.61-0.1617.7019.9517.382014202420332043
Arabic-Jordan_f61761430.440.5530.9723.2928.63
Arabic-Jordan_m2680272315.32-1.6023.2016.1121.982029204720652084
Dutch-Netherlands_f49625096-0.64-2.718.7710.6115.69
Dutch-Netherlands_m1534015640-3.10-1.954.775.909.772020203920592079
Dutch-Belgium_f25582613-1.77-2.168.2810.7113.68
Dutch-Belgium_m830383920.18-1.075.717.3710.222020204720742101
English-USA_f1097571111093.73-1.236.587.1710.26
English-USA_m170534172961-1.14-1.422.993.796.982007201320272040
English-England_f30783313131.22-1.727.809.0612.85
English-England_m4701347710-1.05-1.484.235.279.092007201420252036
English-Canada_f19572199541.97-1.958.158.4112.31
English-Canada_m2809928607-0.86-1.814.825.459.422007200920232036
English-Australia_f18230185411.74-1.718.1110.4515.05
English-Australia_m2683927297-0.31-1.704.837.2211.312007201120232035
English-Scotland_f499550803.16-1.698.689.3314.35
English-Scotland_m76517787-1.49-1.785.285.4610.452007201320252036
English-Ireland_f390239603.01-1.4812.4712.3416.42
English-Ireland_m563257120.72-1.428.007.7511.182007201320222031
English-New Zealand_f302630561.89-1.009.2810.4414.37
English-New Zealand_m45814659-1.86-1.706.086.0610.952007201320242035
English-Wales_f173817872.78-2.8310.4810.9613.43
English-Wales_m26732737-3.78-2.405.666.759.912007201520262038
French-France_f18745189852.62-1.284.315.509.46
French-France_m3603736551-0.52-1.433.094.518.402007201920682118
German-Germany_f15071152635.12-1.279.089.0612.97
German-Germany_m3691237394-0.57-1.314.905.519.742012203320552076
Hindi-India_f5824588911.40-1.1110.1117.0520.95
Hindi-India_m151221508618.280.2410.7014.6718.412007203520652095
Italian-Italy_f363773675115.69-1.0310.3810.1613.20
Italian-Italy_m553425566813.63-0.598.158.3411.552007200720302056
Japanese-Japan_f308031400.39-1.954.075.098.67
Japanese-Japan_m15724159230.54-1.273.112.905.98204921012154
Korean-South Korea_f441644620.16-1.051.184.429.30
Korean-South Korea_m1026010430-0.97-1.651.373.718.292007203520852136
Persian-Iran_f8112813316.82-0.2618.0019.8730.99
Persian-Iran_m189071897511.56-0.3614.8816.3323.992011202720442060
Portuguese-Portugal_f683769772.84-2.0510.1811.4617.26
Portuguese-Portugal_m87858958-0.40-1.976.309.4214.862007200720182034
Portuguese-Brazil_f165891668110.04-0.5510.199.8617.63
Portuguese-Brazil_m24771249318.71-0.649.749.2115.162007200720232050
Russian-Russia_f627263216.14-0.7820.4222.1621.15
Russian-Russia_m921992622.05-0.4715.3217.6618.072007201320252037
Russian-Ukraine_f61962419.27-0.8322.1921.6018.92
Russian-Ukraine_m8378603.21-2.7113.8812.1812.702010201620232029
Spanish-Spain_f19785200489.88-1.339.9610.3714.78
Spanish-Spain_m42959434435.53-1.137.377.5111.822007202720502073
Spanish-Mexico_f398540522.68-1.6712.2312.7416.51
Spanish-Mexico_m11828119126.56-0.7110.4710.5714.162015204420742103
Spanish-Colombia_f104610569.99-0.9417.0415.9120.07
Spanish-Colombia_m581958424.43-0.3912.4213.3618.862113
Spanish-Argentina_f174217607.80-1.059.419.7014.74
Spanish-Argentina_m618961918.41-0.036.247.5311.392028207321182163
Spanish-Chile_f87387019.260.3117.9016.0819.43
Spanish-Chile_m6919691513.560.0512.7112.8915.962111
Turkish-Turkey_f99231003213.29-1.106.629.5213.69
Turkish-Turkey_m191541929713.43-0.754.297.1011.862007202020442067
All-Average_f9979101066.16-1.278.479.1012.67
All-Average_m18497187083.89-1.146.016.6110.072007201820382057

AG = Actual growth

DG = Deviation of growth

AV-G = Average growth

Green means sharp increase

Yellow means mild increase

Amber means mild decrease

Red means sharp decrease

5y means 5-year average, i.e., 2016–2020

10y means 10-year average, i.e., 2011–2020

15y means 15-year average, i.e., 2006–2020

Table 3

Statistics of temporal trends of productivity of male and female scientists based on the A-Z method.

Language-countryActual 2020Predicted 2020AG% 2020DG% 2020AV-G 5y%AV-G 10y%AV-G 15y%Year ratio = .3Year ratio = .5Year ratio = .7Year ratio = .9
Arabic-Saudi Arabia_f1695168751.070.5027.6940.2536.65
Arabic-Saudi Arabia_m168571680041.230.3421.4827.9530.06208221412200
Arabic-Egypt_f1669164939.901.2122.2327.8230.59
Arabic-Egypt_m156131557825.180.2217.5621.8524.5720942167
Arabic-Iraq_f25725916.29-0.7342.2447.0541.94
Arabic-Iraq_m3199323822.33-1.2134.9442.2640.192167
Arabic-Qatar_f23823445.121.8230.0836.3251.36
Arabic-Qatar_m1975195529.591.0119.6531.1830.7920872155
Arabic-U Arab Emirates_f35535717.94-0.5725.8732.3732.85
Arabic- U Arab Emirates_m2732274718.83-0.5521.3225.2324.652061210621522198
Arabic-Kuwait_f16516257.141.7722.9222.8818.85
Arabic-Kuwait_m8778948.94-1.9915.0013.7518.5520772149
Arabic-Oman_f12112119.800.2013.4820.6021.15
Arabic-Oman_m96896024.420.7917.3721.4323.27
Arabic-Lebanon_f22122215.71-0.4325.3034.8039.01
Arabic-Lebanon_m1058106411.72-0.6016.5221.0219.812033205820832109
Arabic-Jordan_f31831833.610.1034.4827.4930.82
Arabic-Jordan_m1733173622.91-0.1623.4019.4326.012043207521072139
Dutch-Netherlands_f11572117881.21-1.877.689.1914.46
Dutch-Netherlands_m2192922286-1.98-1.634.445.6110.422007202020332047
Dutch-Belgium_f469247484.90-1.199.5111.4714.74
Dutch-Belgium_m10378104732.72-0.925.487.5511.642008202720452063
English-USA_f1348911360553.28-0.866.638.2811.50
English-USA_m342983347646-2.03-1.361.943.006.292016203020452060
English-England_f3548535995-0.40-1.447.6410.1613.72
English-England_m8562786889-1.64-1.473.254.838.592014202720412054
English-Canada_f22402227731.63-1.669.2410.2614.36
English-Canada_m5016950928-0.78-1.513.694.578.502013202620382050
English-Australia_f22986233071.80-1.408.5912.0917.15
English-Australia_m50069508070.31-1.474.416.7710.952012202420362047
English-Scotland_f58155932-2.19-2.018.499.7515.26
English-Scotland_m1331013558-2.77-1.864.074.579.232013202520372050
English-Ireland_f469747780.26-1.7312.6813.3618.03
English-Ireland_m10182102981.55-1.147.056.5810.442014202320332043
English-New Zealand_f387638886.28-0.3210.4312.7718.20
English-New Zealand_m822383161.29-1.124.845.4510.182014202420342044
English-Wales_f21012157-1.32-2.659.1510.8115.09
English-Wales_m47644838-3.15-1.554.125.008.742014202620392051
French-France_f29327295891.61-0.893.235.048.94
French-France_m5654157131-0.65-1.041.903.487.482007202020532087
German-Germany_f30881311112.78-0.755.906.6511.57
German-Germany_m9124492364-1.86-1.233.974.839.482015205620972137
Hindi-India_f7622759314.720.3814.8823.7728.91
Hindi-India_m375143750818.090.0212.1917.0621.362031205320752097
Italian-Italy_f543435450317.38-0.2910.7311.4914.41
Italian-Italy_m878378832012.94-0.557.748.2211.432007201420302046
Japanese-Japan_f11085111134.79-0.267.827.6711.89
Japanese-Japan_m31621318542.67-0.745.996.439.802011205420962139
Korean-South Korea_f55025519-0.49-0.310.784.0510.64
Korean-South Korea_m65636684-1.49-1.841.173.257.882007200720102023
Persian-Iran_f149611494916.450.0819.5022.6631.49
Persian-Iran_m422694242110.87-0.3614.0615.4022.772018203220462059
Portuguese-Portugal_f12755129383.99-1.4310.5712.8218.76
Portuguese-Portugal_m2818928329-0.07-0.5014.2117.6117.8320072007
Portuguese-Brazil_f371343710210.870.0910.1611.1018.53
Portuguese-Brazil_m51828522128.38-0.749.128.9315.172007200720202036
Russian-Russia_f19575196133.05-0.1918.6823.4322.67
Russian-Russia_m2818928329-0.07-0.5014.2117.6117.832007201320232033
Russian-Ukraine_f1917191615.620.0721.9319.9718.37
Russian-Ukraine_m291729497.20-1.0914.4613.3112.632008201720252033
Spanish-Spain_f45013452449.25-0.518.8210.3114.69
Spanish-Spain_m68831694366.28-0.886.576.7810.872007201320252037
Spanish-Mexico_f914691987.26-0.5711.0112.9216.52
Spanish-Mexico_m17234173457.25-0.659.6910.0313.732007201920382058
Spanish-Colombia_f458945459.240.9614.3320.7829.64
Spanish-Colombia_m822481819.810.5212.4112.9118.002011201820262034
Spanish-Argentina_f701770235.08-0.086.178.9113.47
Spanish-Argentina_m854785497.60-0.026.196.9611.182007200720112026
Spanish-Chile_f4565454115.190.5213.7615.2117.82
Spanish-Chile_m9371938311.28-0.1311.7712.2215.622007202420462069
Turkish-Turkey_f130351304411.27-0.075.6410.2416.16
Turkish-Turkey_m272842751911.80-0.864.347.3011.832007202320392056
All-Average_f15208153145.98-0.708.5110.0213.86
All-Average_m33763341052.80-1.015.286.119.592010202620412057

AG = Actual growth

DG = Deviation of growth

AV-G = Average growth

Green means sharp increase

Yellow means mild increase

Amber means mild decrease

Red means sharp decrease

5y means 5-year average, i.e., 2016–2020

10y means 10-year average, i.e., 2011–2020

15y means 15-year average, i.e., 2006–2020

AG = Actual growth DG = Deviation of growth AV-G = Average growth Green means sharp increase Yellow means mild increase Amber means mild decrease Red means sharp decrease 5y means 5-year average, i.e., 2016–2020 10y means 10-year average, i.e., 2011–2020 15y means 15-year average, i.e., 2006–2020 AG = Actual growth DG = Deviation of growth AV-G = Average growth Green means sharp increase Yellow means mild increase Amber means mild decrease Red means sharp decrease 5y means 5-year average, i.e., 2016–2020 10y means 10-year average, i.e., 2011–2020 15y means 15-year average, i.e., 2006–2020 Data collected for all language-country combinations (from here on, “cultures” for simplicity) obtained from both methods confirm the existence of a notable gender gap between the scholarly publications of male and female scientists. The gap is observable across all 37 examined countries. Moreover, and perhaps most strikingly, we do not observe any trend that is indicative of the gap narrowing (in terms of absolute total number of publications) in any culture. However, pattern of temporal variations of this gap is distinctly different across cultures. According to the trends presented in Figs 9, 10, 15 and 16 (as well as Figs 11, 12, 17 and 18), three general patterns are differentiable across cultures. These patterns are discussed in the following paragraphs.
Fig 16

Temporal trends in male (blue lines) and female (red lines) total research productivity based on A-Z method (part 2).

Fig 17

Gaps between male and female total research productivity, A-Z method (part 1).

Fig 18

Gaps between male and female total research productivity, A-Z method (part 2).

The first pattern is a set of cultures in which the gap between total publications of male and female researchers has been exponentially widening over time. This includes almost all Arabic speaking countries (Egypt, Iraq, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, and United Arab Emirates). An exception is Tunisia, whose gap has shown signs of flattening out (or at least slowing down) over the last three to four years (although there is still no discernible sign of narrowing down in a consistent fashion). These are the countries for which the curve of absolute gap (Figs 11, 12, 17 and 18) has a shape of an increasing convex curve, meaning that, not only has the gap been widening every year in these cultures, but also that the gradient of this increase in the gap has been increasing. If the existing trends of these cultures continue, the gender gap will worsen every year, and at an increasing rate. Note that for some of these countries, when we look at the ratios of female to male total productivity (Figs 13, 14, 19 and 20), an extremely slow pattern of increase in the ratio is observed. In these cases, the gender gap is very gradually closing but the discrepancy in number of male- and female-authored publications is increasing due to the rapid overall growth in publications. Take the case of Arabic-Saudi Arabia as an example. When considering the temporal trend in the female to male ratio within this culture, one cannot predict the year when the total research productivity of their female scholars becomes 70% of that of their male scholars (see Tables 2 and 3), even a hundred years into the future. Also, to reach the 50% ratio, the optimistic forecast is more than 60 years from now, while the pessimistic forecast suggests 120 years from now.
Fig 19

Ratio of female to male total research productivity, based on A-Z method (part 1).

The second pattern concerns cultures in which disparity between male and female total productivity is also increasing (similar to that of the countries listed above), except, not at an increasing rate (i.e., exponential way), and rather, at an approximately linear rate. This includes several European countries (e.g., Belgium, Netherlands, Spain, Germany) whose curves of absolute gap have even started to become slightly concave shape, with slight signs of decrease or flattening out in recent years. Less clear examples of this pattern are seen in Italy, Japan, Iran, India and Brazil. These countries are not fast-tracking the closing of the gender gap, but the problem for them does not seem to be exponentially escalating either. As a result, their relative gap has been consistently decreasing and their female to male publication ratios show a more discernible upward trend. For a considerable portion of this cohort of countries, both A-C and A-Z methods predict that, by the middle of the current century they reach a ratio .7 in terms of their female- to male-authored publication (e.g., Italy, Netherlands, Iran, Spain, Brazil). A larger portion of these countries are predicted to reach .5 ratio by the said date. The third pattern concerns cultures that have managed to maintain a relatively constant absolute gap between total female and male productivity for a sustained amount of time and even have shown small signs of narrowing the gap in the very recent years. This pattern is almost exclusively observable in relation to the developed countries. It is noticeable for Australia, Canada, England, New Zealand, and USA. Both methods (A-C and A-Z) suggest that in all of these countries the absolute gap has had a decrease in 2020 compared to 2019, indicating that this could mark the beginning of a downward trend in gender gap for these countries. However, it is too early to make clear predictions, as no sustained downward trend has been observed in relation to any country yet. According to both sampling methods, the current trends in female to male publication ratios of these countries indicate that by the middle the current century all of them will have reached a ratio of .9. Focusing on the productivity during 2020 in contrast to the previous trends, striking patterns are observable in relation to male and female productivity. Firstly, both methods suggest that academics of Arabic countries (both genders) have shown the highest degree of actual growth in 2020, compared to their 2019 record of publications, whereas English speaking and some Western European countries have shown the opposite trend. Both male and female academics of Asian countries such as Iran and India have demonstrated positive growth in terms of total productivity, although this is to a lesser degree compared to Arabic countries. This pattern is also observable in relation to South American counties such as Chile, Argentina, Colombia and Mexico. For most Arabic speaking countries, both sampling methods suggest that female productivity has had a larger growth compared to male total productivity. This includes countries such as Saudi Arabia, Egypt, Kuwait, Lebanon, and Jordan. The only Arabic speaking country for which the growth of total productivity for male academics has been larger than females in 2020 (compared to 2019) is United Arab Emirates. Similarly, when considering the deviation from the projected growth, both methods suggest that female academics of Arabic speaking countries have shown lesser negative deviation from their productivity trend (i.e., their projected productivity) compared to their male counterparts. When considering English speaking and European countries that have experienced negative impacts on their academic productivity during 2020 compared to previous year(s), the pattern is slightly different. For most of these countries (e.g., Netherlands, USA, England, Canada, Australia, New Zealand) both sampling methods suggest that in terms of actual growth in 2020, female total productivity has been more resistant to the disruptive effects of the pandemic, compared to that of their male counterparts. In certain countries such as USA, Canada and Germany, male total productivity is found to have decreased in 2020 compared to 2019, whereas female total productivity has had positive AG in 2020, and this is confirmed by both sampling methods. However, when the deviation from projected growth is considered, this pattern becomes more mixed. This suggests that that while productivity of female academics seem to have been more resistant to disruptive effects the pandemic compared to male counterparts, their momentum has been rather noticeably affected in a comparable manner to that of males in those countries. Italy shows a growth in publications in 2020, and both sampling methods suggest that the growth has been larger in terms of female productivity. A similar pattern is also observable for Iran, Spain, Chile, Ukraine, Russia, and Brazil. The opposite pattern—i.e., male total productivity showing more resistance to the pandemic effects—is not a common observation in our data (at least when we expect the pattern to have been confirmed by both sampling methods). An exception is India. The productivity of male and female Indian scientists have both increased in 2020, but the amount of increase is estimated to have been larger for male than female, according to both sampling methods.

Discussion

The findings presented above offer a richly detailed picture of trends in research productivity over the last half-century, as these vary by research field and by author gender. They also allow some tentative inferences about the impact of the Covid-19 pandemic on research productivity based on departures in 2020 from projected trends.

Trends by research field

The general trends in annual publication numbers are consistently rising across all research fields, a conclusion that is unsurprising given the well documented relentless increase in global research productivity in recent decades. In most fields the rise since 1970 is approximately linear or exponential, with variation in the rate of change from gradual to very steep. Rates tend to be steepest in the more technological fields, such as environmental sciences and engineering disciplines, nanoscience, and geosciences, and also in some Clinical Medicine categories (e.g., Oncology and Health Care Sciences Services). Rises tend to be less steep in the social sciences and humanities disciplines. Nonlinear patterns, such as periods of stasis preceding rapid increases or recent plateaus, are also found in particular cases, and may be interpretable in light of local dynamics in these fields. One challenge facing the interpretations of all of these patterns is the degree to which the extent and timing of publication growth reflects endogenous change in the research fields themselves or changes in the publications indexed by WoS. Nevertheless, the clear pattern of rising global productivity is unambiguous.

Trends by author gender

The pattern of changes in productivity as a function of author gender, evaluated over a period of 15 rather than 50 years, reflects a similar combination of broad trends and local (country- rather than field-level) variation. The key broad trend is a gradual increase in the proportion of publications with at least one female author, with the ratio of these publications to those with at least one male author increasing from less than 0.4 to more than 0.5 over the study period. The rate of change is troublingly slow, however, with parity very distant and even a 0.7 ratio being forecast as two decades away. In the context of rapid increases in rates of publication, the disparity between the absolute numbers of female- and male-authored publications is growing globally rather than shrinking. Our findings demonstrate the magnitude of the research publication gender gap and the slow but steady rate at which it is narrowing.

Covid-19 impacts by research field

Inspection of departures of 2020 publication counts from 2019 figures and from long-term polynomial forecasts reveals a general pattern of negative growth that can be cautiously ascribed to the Covid-19 pandemic. However, where research fields are concerned the pattern is relatively weak, with substantial variability across fields. Of the 94 fields examined, a slender majority (51.1%) experienced a reduction in publications in 2020 relative to 2019, with a larger majority (59.6%) falling below forecast. However, many fields did not follow this pattern, some such as the study of infectious diseases and public health showing strong growth for reasons likely to be directly pandemic-related, and others such as environmental science probably as a result of growing attention to climate change issues. In contrast, the humanities and social sciences (e.g., philosophy, religion, history, political science, education research), were especially hard hit in 2020, showing substantial negative growth in publication counts: on average -12.0% relative to 2019 and an astonishing -37.2% compared to forecast. Negative growth was also common in engineering and computer science fields. The reasons for these negative impacts are not obvious. One possibility is that fields whose primary publication forum is conference proceedings papers, such as computer science, will show a reduction in publications when travel is restricted, and many conferences are suspended. Another is that the traditionally slower peer review processes in the social science and humanities are more impacted by pandemic-related disruptions, resulting in publication delays. These possibilities should be considered in future research, informed by field-specific knowledge. Further studies should also examine whether reduced publication counts in 2020 primarily represent delayed publications that will appear in future years, or whether it represents research that was not conducted due to the pandemic. Whether or not publication counts recover in 2021 and beyond may help to answer this question.

Covid-19 impacts by author gender

The potentially disproportionate impact of the Covid-19 pandemic on female researchers has been a focus of speculation and some preliminary research, but the present study offers the broadest scope of analysis conducted to date. It suggests that there has been no substantial gender difference in the disruption due to the pandemic. Across 37 national cases, each representing one language-country combination, the average growth in number of female-authored publications from 2019 to 2020 was approximately double the average growth in male-authored publications. Relative to forecast, however, the respective levels of growth are negative to similar degrees. By the A-C method of gender determination, female-authored publications revealed marginally more negative growth than male-authored publications (-1.27% [F] versus -1.14% [M]) but by the alternative A-Z method the pattern reversed (-0.70% [F] versus -1.01% [M]). Although there was notable variability across countries in these patterns, the average pattern is in no way atypical of what is seen in some of the research powerhouses. For example, in the USA, female-authored publications grew from 2019 to 2020 (+3.73% [A-C], +3.28% [A-Z]) while male-authored publications shrank (-1.14% [A-C], -2.03% [A-Z]), and female-authored publications (-1.23% [A-C], -0.86% [A-Z]) fell marginally less below forecast than male-authored publications (-1.42% [A-C], -1.36% [A-Z]). By implication, the Covid-19 pandemic may not have disproportionately disrupted the research productivity of female researchers as has been feared, at least insofar as 2020 publication outputs are concerned. It is possible that disproportionate impacts might emerge at a greater lag, or that they are specific to researchers in particular age or other demographic groups. Future studies should investigate these possibilities.

Limitations

These gender-related findings must also be considered tentative for two key reasons. First, our method for counting publications as female- and male-authored does not allow direct comparison of numbers of publications because publications with mixed-gender authorship are counted toward both categories. Our method also does not directly represent the relative magnitude of research contributions by female and male researchers because it does not count their fractional contribution to publications but only whether they are categorically present or absent. Second, our novel method for classifying author gender may be imprecise and requires further validation. Encouragingly, two different implementations of the method (A-C and A-Z) generated very similar patterns of findings. It is also important to recognize that although the method may have some error rate, there is no obvious reason to believe it would be systematically biased to allocate author to one gender or another, and it has the significant advantage of identifying author gender at much greater scale than other methods.

Directions for future research

A potential dimension that was not explored in this work is the investigation of gender gaps within specific research fields. Such investigation can potentially be undertaken using our proposed query-based method. Also, we note that a more representative metric for the investigation of gender productivity gap could perhaps be the frequency of first authorships (i.e., counts of male versus female first-authored publications). Not differentiating between ranks of authorships, our results with respect to the effect of Covid-19 pandemic are in contrast with another study [43], for example, who found 19% reduction in 2020 in female first-authorship representation in a sample of medical journals compared to 2019. The current paper considers any publication with mixed gender authorship toward both gender groups. The findings with respect to the effect of the pandemic on genders may as a result show some contrasts to the studies that considered pre-prints [22], those that considered first/last/corresponding authorship [51, 52], or the proportion of representation of male and female authors based on a sample of articles/pre-prints. This further adds to the mixture of evidence that already existed, particularly on the effect of pandemic on male and female productivity (see [53] and [54] for examples) and calls for more nuanced investigations of this problem. Currently, major scholarly reference databases (e.g., WoS or Scopus) do not offer search options that can differentiate between authorship orders. The detection of author names in such search engines is purely based on a binary determination of whether a name exists within the list of authors, regardless of the position of the author in the list. This places a limitation for the application of the query-based method proposed by this study. However, should such development be implemented by the WoS, then the same query-based method can be readily employed to examine gender disparity based on patterns of first authorships. Given the benefits that such differentiation could bring to academic inquiries of this nature, we recommend that WoS offers this possibility to its users. A complementary dimension to the analyses presented here is to explore whether country-specific lockdown measures explain the decline in research productivity (similar to the approach of Hipp and Konrad [44] in the context of impact on professional advancement). It is also important to note that our observations regarding the gender production gap remains limited to overall/gross productivity of male and female scholars and not productivity per individual male or female researchers. The observed gaps are, as such, partly a reflection of more male researcher presence in academia [55, 56]. While the results do speak to an existing and widening gender gap in many geographical regions, they do not have any bearing on whether male scholars on average have been more (or less) productive than their female counterparts. Matters of individual author productivity [46] were beyond the scope of our work. Also, further research could investigate how the pandemic impacted scholarly productivity across various career stages. However, we do not see a feasible pathway conducting such investigation using query-based methods, as proposed in this work. Traditional methods of sampling published articles or pre-prints as well as self-reported questionnaires could be the pathways for such investigation [55]. An existing study based on analysis of papers published by Brain, Behaviour and Immunity identifies clear impacts on female first-author representation during 2020 (compared to 2019) as well as a more pronounced impact on female first authorship than last authorship [57]. These existing sets of evidence provide indications that the impact of the pandemic might have been uneven across scholars of various career stages and that early career researchers might have experienced more pronounced setbacks. On a final note, the effect of any disruption on research activities is often reflected in publications with a time lag. This paper uses publications as the proxy of productivity, which has a lag between work input and publication. It is expected that the setback to academic activity, if any, may become more apparent in publication records during 2021 and onwards. The results presented by this work could at best provide some early indications of these disruptions whereas true effects may only manifest in the coming years. Whether these effects are transient and how long they might last before recovery is to be determined by future research.

Conclusions

The findings of this study provided an overall picture of quantitative trends of publications in a large sample of research fields. They have several practical implications for research institutions as well as individual researchers. Knowing what research fields are relatively bigger or are expanding faster would, for instance, be of great importance to research institutions when evaluating performances of scholars for matters such as career promotion. This is in consideration of the correlation that exists between the quantity of research articles that are published within a field and the number of citations that researchers of that field receive each year. Similarly, such performance assessments during the pandemic years need to take into account the overall differential impacts that various research fields have endured from the pandemic disruptions. These differential impacts are comprehensively documented in this study. In relation to gender disparity in overall research production, the findings of this work could guide policy makers who aim to effect changes in long-lasting academic gender gaps. Across different regions of the world, highly differential patterns of gender-related research production gap were unambiguously observable. These observations are particularly important for informing policy makers in countries where the gender gap in research production is not on a tangible path of closing in the foreseeable future, unless effective interventions in academic education, recruitment, research funding allocation, and mentorship are implemented [33, 58]. The findings also exemplify countries that have notably accelerated the closing of their academic gender gap (at least as reflected in the metric of total productivity). This may encourage the exchange of information, experiences, and policy guides between policy makers of these countries and those that seek to intervene with their persistent academic gender gaps. Moreover, while we observed that in countries that endured a larger impact of the pandemic on research productivity, female productivity was often more resilient (e.g., The Netherlands), it should be noted that the effect on the momentum of male and female productivity was closely comparable in nearly every case. There was no evidence of any pattern indicating that one gender endured greater impact on its productivity momentum.

Data for WoS categories.

(ZIP) Click here for additional data file.

Data for author gender.

(ZIP) Click here for additional data file.
  32 in total

1.  The "gender gap" in authorship of academic medical literature--a 35-year perspective.

Authors:  Reshma Jagsi; Elizabeth A Guancial; Cynthia Cooper Worobey; Lori E Henault; Yuchiao Chang; Rebecca Starr; Nancy J Tarbell; Elaine M Hylek
Journal:  N Engl J Med       Date:  2006-07-20       Impact factor: 91.245

2.  Universality of citation distributions: toward an objective measure of scientific impact.

Authors:  Filippo Radicchi; Santo Fortunato; Claudio Castellano
Journal:  Proc Natl Acad Sci U S A       Date:  2008-10-31       Impact factor: 11.205

3.  Computer science. Future science.

Authors:  James A Evans
Journal:  Science       Date:  2013-10-04       Impact factor: 47.728

4.  Longitudinal Analysis of Gender Differences in Academic Productivity Among Medical Faculty Across 24 Medical Schools in the United States.

Authors:  Anita Raj; Phyllis L Carr; Samantha E Kaplan; Norma Terrin; Janis L Breeze; Karen M Freund
Journal:  Acad Med       Date:  2016-08       Impact factor: 6.893

5.  Opinion: In the wake of COVID-19, academia needs new solutions to ensure gender equity.

Authors:  Jessica L Malisch; Breanna N Harris; Shanen M Sherrer; Kristy A Lewis; Stephanie L Shepherd; Pumtiwitt C McCarthy; Jessica L Spott; Elizabeth P Karam; Naima Moustaid-Moussa; Jessica McCrory Calarco; Latha Ramalingam; Amelia E Talley; Jaclyn E Cañas-Carrell; Karin Ardon-Dryer; Dana A Weiser; Ximena E Bernal; Jennifer Deitloff
Journal:  Proc Natl Acad Sci U S A       Date:  2020-06-17       Impact factor: 11.205

6.  Gender Differences in First and Corresponding Authorship in Public Health Research Submissions During the COVID-19 Pandemic.

Authors:  Michelle L Bell; Kelvin C Fong
Journal:  Am J Public Health       Date:  2020-11-19       Impact factor: 9.308

7.  The rate of growth in scientific publication and the decline in coverage provided by Science Citation Index.

Authors:  Peder Olesen Larsen; Markus von Ins
Journal:  Scientometrics       Date:  2010-03-10       Impact factor: 3.238

8.  Gender gap in women authors is not worse during COVID-19 pandemic: Results from Research and Practice in Thrombosis and Haemostasis.

Authors:  Mary Cushman
Journal:  Res Pract Thromb Haemost       Date:  2020-06-21

9.  How Support of Early Career Researchers Can Reset Science in the Post-COVID19 World.

Authors:  Erin M Gibson; F Chris Bennett; Shawn M Gillespie; Ali Deniz Güler; David H Gutmann; Casey H Halpern; Sarah C Kucenas; Clete A Kushida; Mackenzie Lemieux; Shane Liddelow; Shannon L Macauley; Qingyun Li; Matthew A Quinn; Laura Weiss Roberts; Naresha Saligrama; Kathryn R Taylor; Humsa S Venkatesh; Belgin Yalçın; J Bradley Zuchero
Journal:  Cell       Date:  2020-06-12       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.