Literature DB >> 34785264

The global prevalence of depression and anxiety among doctors during the covid-19 pandemic: Systematic review and meta-analysis.

G Johns¹, V Samuel², L Freemantle³, J Lewis⁴, L Waddington².

Abstract

BACKGROUND: This review provides an estimate of the global prevalence of depression and anxiety symptoms among doctors, based on analysis of evidence from the first year of the COVID-19 pandemic.
METHODS: A systematic review was conducted to identify suitable studies. Final searches were conducted on 3rd March 2021. Papers were initially screened by title and abstract, based on pre-agreed inclusion criteria, followed by full-text review of eligible studies. Risk of bias was assessed using the Joanna Briggs Checklist for Prevalence Studies. Data from studies rated as low or medium risk of bias were pooled using a random-effects meta-analysis. Sensitivity and subgroup analyses were conducted to explore heterogeneity.
RESULTS: Fifty-five studies were included after full-text review. Of these, thirty studies were assessed as low or medium risk of bias and were included in primary analyses. These comprised twenty-six studies of depression (31,447 participants) and thirty studies of anxiety (33,281 participants). Pooled prevalence of depression and anxiety was 20.5% (95% CI 16.0%-25.3%) and 25.8% (95% CI 20.4%-31.5%) respectively.
INTERPRETATION: Evidence from the first year of the pandemic suggests that a significant proportion of doctors are experiencing high levels of symptoms of depression and anxiety, although not conclusively more so than pre-pandemic levels. Differences in study methodology and variation in job demands may account for some of the observed heterogeneity. LIMITATIONS: Findings must be interpreted with caution due to the high heterogeneity and moderate risk of bias evident in the majority of included studies.

Entities: Chemical

Keywords: Coronavirus; Covid-19; Doctors; Physicians; anxiety; depression

Mesh：

Year: 2021 PMID： 34785264 PMCID： PMC8596335 DOI： 10.1016/j.jad.2021.11.026

Source DB: PubMed Journal: J Affect Disord ISSN： 0165-0327 Impact factor: 4.839

Introduction

On the 30th January 2020 the world health organisation (WHO) declared the coronavirus disease 2019 (COVID-19) outbreak a Public Health Emergency of International Concern, its highest level of alarm. An unparalleled global response followed, with local and national ‘lockdowns’, quarantines, travel restrictions, and physical distancing measures introduced in attempts to curb transmission rates. At the time of writing, there have been over 114 million confirmed cases and more than 2.5 million reported COVID-associated deaths (WHO, 2021). In response to the unprecedented pressure on global health systems, there has been enhanced focus on the mental wellbeing of healthcare staff. In April 2020, The Lancet published a position paper outlining their suggested research priorities for the pandemic: “The immediate research priorities are to monitor and report rates of anxiety, depression, self-harm, suicide, and other mental health issues both to understand mechanisms and crucially to inform interventions. This should be adopted across the general population and vulnerable groups, including front-line workers.” (, p5) Poor mental health and wellbeing among healthcare staff has organisational implications for patient safety, experience, and satisfaction (Wallace et al., 2009), in addition to financial costs, impact on productivity, and the direct effects on the individual (Royal College of Psychiatrists, 2015). High pressured working environments, heavy workload, long hours, limited resources, organisational restructuring, and a culture of blame and fear have all been implicated as contributory factors (Wilkinson, 2015; Lemaire and Wallace, 2017); all factors that have become increasingly salient within the context of the current global crisis. The Job Demand-Resources (JD-R) model of occupational stress (Demerouti et al., 2001) offers a framework to understand these problems. The model hypothesises that as job demands increase so too does emotional strain, which negatively affects performance. Whereas greater access to job resources is associated with enhanced engagement and performance. Job demands are conceptualised as the physical, psychological, social, and organisational features of a job that require sustained physical and/or psychological effort. Examples of job demands are high workload or emotionally demanding interactions with patients. Job resources are defined as the physical, psychological, social, or organizational aspects of a job that facilitate achievement of work-based goals, reduce job demands, and stimulate personal growth, learning, and development. Examples of job resources are performance feedback, autonomy, and skill variety. The theory suggests that job demands are associated with health-impairments (e.g., poor mental or physical health), whereas job resources are associated with engagement and motivational processes (Bakker and Demerouti, 2017). The current pandemic can be considered a universal job demand on health care systems across the world. However, there will also be additional localised variability in job demands and resources. For example, insufficient staffing levels and underfunded services may create additional strain for healthcare workers. Medics form an essential part of the global frontline pandemic response. Studies conducted outside of global crises have highlighted that medical students and doctors are already at increased risk of psychological distress, depression, anxiety, burnout, and suicidality, compared with the general population (De Sio et al., 2020; Dong et al., 2020; Tian-Ci Quek, 2019; Hayes et al., 2017; Dai et al., 2015; Dyrbye et al., 2006). As a result, there have been calls to improve the conceptual definition and measurement of wellbeing in medics (Brady et al., 2018; Wallace et al., 2009). Studies conducted during the 2003 outbreak of severe acute respiratory syndrome (SARS) indicated significant psychological distress in 18% to 57% of health care workers (Tam et al., 2004; Chan and Huak, 2004; Phua et al., 2005; Nickell et al., 2004; Maunder et al., 2004). A study conducted one to two years post-SARS outbreak found high levels of burnout, psychological distress, and posttraumatic stress in healthcare workers (Maunder et al., 2006). However, a similar study by Lancee et al. (2008) found incidence of new episodes of psychiatric disorders in community populations were similar to, or higher than, those observed in health care workers two years post-outbreak. Although a number of studies have focused on the prevalence of mental health outcomes in doctors during the current COVID-19 pandemic, to the author's knowledge, there have been no systematic reviews conducted to analyze and synthesize data relating exclusively to doctors. Some meta-analyses of healthcare workers of multiple professions have included doctors (Santabárbara et al., 2021; Pappa et al., 2020; Salari et al., 2020; Luo et al., 2020), and sub-group analyses provide some evidence of high levels of psychological distress among medics. However, outcomes from these analyses are limited by review design (e.g., rapid reviews), and underpowered sub-group meta-analyses for doctors. In addition, given the rate of publications during the pandemic, an up-to-date review is needed. The current review will focus on the prevalence of symptoms of depression and anxiety during the COVID-19 pandemic. Previous meta-analyses have estimated the global prevalence of major depressive disorder and anxiety disorders to be 4.7% (4.4–5.0%) (Ferrari et al., 2013) and 7.3% (4.8–10.9%) (Baxter et al., 2013) respectively. The core features of depression are persistent depressed mood and anhedonia; other symptoms included psychomotor agitation or retardation, appetite changes, sleep problems, fatigue, feelings of low self-worth, poor concentration, and suicidal ideation (American Psychiatric Association 2013) Anxiety is characterized by psychological and somatic symptoms, including autonomic arousal (e.g., palpitations, sweating, trembling, dry mouth, difficulty breathing, chest pain, nausea), restlessness, fatigue, difficulty concentrating, irritability, and sleep problems (American Psychiatric Association 2013). Depression and anxiety are associated with impairments in cognitive functioning, including poorer performance on tests of memory, attention, executive function and motor function (Rock et al., 2014; Hallion et al., 2017; Moran, 2016; Eysenck et al., 2007; Runswick et al., 2018; Wilson, 2012). These cognitive, physiological, and behavioural consequences may be of particular concern among medical doctors, given the potential implications for professional competence and patient safety, as well as personal wellbeing. The aim of this systematic review and meta-analysis is to analyze the evidence emerging from the first year of the COVID-19 pandemic to answer the following research questions: What is the global prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic? What factors might explain differences in the prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic?

Methods

This systematic review and meta-analysis was conducted in accordance with PRISMA (Page et al., 2021) and MOOSE (Meta-analyses of Observational Studies in Epidemiology) guidelines (Stroup et al., 2000). The review protocol was registered with PROSPERO and is available online (CRD42021228667).

Eligibility criteria

The CoCoPop framework (Condition, Context, Population), for prevalence and incidence reviews, was used to develop the following inclusion criteria: (i) assessment of depression and/ or general anxiety symptoms using a standardised and validated measure; (ii) conducted during the COVID-19 pandemic; (iii) practicing medical doctors working in any speciality, across the world. Studies were excluded studies based on the following criteria: (i) studies conducted outside of the pandemic timeframe; (ii) studies using non-standardised or unvalidated measures; (iii) studies that do not report prevalence for the target population or do not provide sufficient information to calculate prevalence; (iv) studies that have not separated professions in the data; (v) studies relating exclusively to medical students, non-practicing doctors, or non-medical doctors; (vi) pre-prints, or studies not published in a peer reviewed journal; (vii) studies with a sample size <139 (calculated according to minimum expected prevalence from previous literature (Vaughan and Morrow, 1989)).; (viii) qualitative studies; (ix) articles inaccessible for full review or not published in English; (x) studies not reporting original research (e.g., literature review, article, commentary); (xi) studies focussing on mental health outcomes other than depression and/ or general anxiety (e.g., stress, burnout, specific anxiety disorders).

Search strategy

A search strategy was developed following consultation with an expert librarian. Search terms were selected to identify records reporting on prevalence data for depression and anxiety in doctors during the COVID-19 pandemic. Full text searches were conducted using the following key search terms: (covid OR covid-19 OR “sars cov 2″ OR “sarscov2" OR “corona virus") AND (doctor* OR physician* OR medic OR medics) AND (anxiety OR “anxiety symptoms” OR “anxiety disorder” OR anxious OR “generali?ed anxiety” OR panic OR worry OR depress* OR “mental health” OR “mental illness” OR “mental disorder*”). Four electronic databases (PubMed, CINAHL, Embase, PsychInfo) and one preprint database (MedRxiv) were searched. Final searches were conducted on 3rd March 2021. Search strategies were adapted for each database, where necessary. No restrictions were applied. An example of the search terms used is included in Supplementary Information 1 (SI1). Identified records were extracted to Zotero and then uploaded to (Covidence systematic review software 2021).

Selection process

Two independent reviewers (G.J. and L.F.) screened titles and abstracts, followed by all eligible full text papers, based on the pre-agreed inclusion criteria. Inter-rater reliability was substantial (K = 0.66/ 0.68). Two research supervisors (L.W. and V.S.) were available to resolve any disagreements.

Data extraction

Data extraction was conducted independently by G.J. and a third reviewer (J.L.), and cross-checked for reliability. Where essential data was missing, the corresponding authors were contacted to request information. The following data items were extracted: author, publication year, study design, recruitment method, data collection timeframe, geographical location, measures used, cut-off and severity thresholds. The following data were extracted for the target population only (i.e., doctors): sample size, sex, age, number of positive cases of depression and anxiety, response rate. In cases where prevalence information was missing, relevant calculations were made, where possible. The primary outcome was the total number of positive cases of depression and/ or anxiety among doctors during the pandemic, determined by the number of participants scoring above a pre-defined threshold on a validated depression or anxiety measure. Frequency data were collected for total sample (N), anxiety and/or depression cases (n), and resulting proportions with 95% confidence intervals (CI).

Study risk of bias assessment

Risk of bias was independently assessed by G.J. and J.L. for all included studies using the Joanna Briggs Inventory (JBI) Checklist for Prevalence Studies tool (Munn et al., 2015). The tool was developed for the purpose of increasing consistency in systematic reviews of prevalence data and has been recommended as the most appropriate tool for studies of this kind (Migliavaca et al., 2020). Study risk of bias was evaluated based on the following nine criteria: 1) Was the sample frame appropriate to address the target population? 2) Were study participants recruited in an appropriate way? 3) Was the sample size adequate? 4) Were the study subjects and setting described in detail? 5) Was data analysis conducted with sufficient coverage of the identified sample? 6) Were valid methods used for the identification of the condition? 7) Was the condition measured in a standard, reliable way for all participants? 8) Was there appropriate statistical analysis? 9) Was the response rate adequate, and if not, was the low response rate managed appropriately? Within the existing literature (Islam et al., 2020; Sarria-Santamera et al., 2021), level of bias is assessed by calculating the total number of criteria with a yes response and converting this score into a percentage (n/9). Studies scoring <50% are considered high risk of bias, 50–69% medium risk of bias, and ≥70% low risk of bias. The quality assessment tool was first piloted on a small number of studies. L.W. and V.S. were available for consultation and to resolve any disagreements.

Data analysis

Studies assessed as high risk of bias were excluded from the primary analysis. Following consultation with expert statisticians, a meta-analysis for proportional data was conducted using the Metaprop (Nyaga et al., 2014) command of the software package STATA version 16.1 (StataCorp, 2019). To address potential weighting issues that can occur when including studies with proportions close to one or zero, which can disproportionately skew the outcome of meta-analysis, proportions were transformed using the Freeman-Tukey double arcsine method (Freeman & Tukey, 1950), and back-transformed for ease of interpretation (Barendregt et al., 2013). A DerSimonian & Laird (1986) random effects model was used to extract pooled prevalence, given the assumed differences in regional demographics and study design. The I2 statistic was used to assess the statistical heterogeneity (Higgins et al., 2003). I2 values < 50% are considered low, 50–75% moderate, and >75% high. Subgroup analyses were conducted to explore sources of heterogeneity, as expected in meta-analyses of cross-sectional studies.

Sensitivity and subgroup analysis

Sensitivity analysis was conducted to explore the impact of individual studies (leave one out and cumulative analyses), and the impact of study quality and design (risk of bias, measure, severity threshold, and survey timeframe). Survey timeframe was split into first three months of the pandemic (January to March 2020), and April 2020 onwards. In line with the JD-R model, subgroup analysis was conducted to explore the potential for variability in job demands and resources to explain heterogeneity of outcomes during the pandemic. gross domestic product (GDP) per capita and doctors per 10,000 population were used as potential indicators of job demands and resources for each study. GDP per capita was split into three groups <$10,000 per capita, $10–15,000 per capita, and >$25,000 per capita. Doctors per 10,000 population was split into four groups <15.5, 15.5–19, 20–29, and >30. Geographical region was also explored as a potential source of heterogeneity, with studies grouped by continent; two studies were omitted from this sub-group analysis due to their global coverage. Sub-group analysis was only conducted for categories with a minimum of four studies. Reported outcomes are proportion (p), confidence interval (CI) and percentage prevalence (p × 100%). All statistical analyses were two-tailed and p=<0.05 was considered statistically significant.

Publication bias

Publication bias was assessed via visual inspection of funnel plots (SI3 and SI4) and Egger's test (Egger et al., 1997), with p = <0.05 indicating publication bias.

Certainty assessment

The grading of recommendations assessment, development, and evaluation (GRADE) system was used to assess the quality of the overall body of evidence and the level of confidence in the conclusions drawn (Guyatt et al., 2008). GRADE assessment considers factors over and above individual study risk of bias, such as imprecision, inconsistency, indirectness, study limitations and publication bias. Overall quality of evidence may be rated as high, moderate, low, or very low. All observational research begins as low quality and can be (less commonly) upgraded or (more commonly) downgraded, based on the five criteria outlined above (Balshem et al., 2011).

Results

Study selection

A total of 2359 records were identified following systematic review of four databases and one pre-print server. After removal of duplicates, 1680 records were screened by title and abstract. Full text review was conducted on 161 papers, of which a further 106 studies were excluded. Fifty-five studies (see SI2 for references) were included in the quality assessment process. A further 22 studies were excluded from the primary analysis due to high risk of bias, leaving 33 studies assessed as medium or low risk of bias. Twenty-three studies reported data for depression and anxiety, seven reported data exclusively for anxiety, and three reported data exclusively for depression. Study characteristics and prevalence data for high risk of bias studies are presented in Supplementary Tables 1 and 2 (ST1 and ST2). A PRISMA diagram detailing the flow of information is presented in Fig. 1

Fig. 1

Prisma diagram presenting flow of information.

Risk of bias in studies

Risk of bias ratings for all 55 studies, assessed using the JBI Checklist for Prevalence Studies tool, are presented in ST3. Five studies were assessed as low, 28 as medium, and 22 as high risk of bias. Most studies used appropriate methods to identify and measure the condition(s) and reported appropriate statistical analysis. Setting and characteristics were also largely well described, although a small number of studies reporting on a wide range of health care workers were downgraded on this item, due to the lack of sufficient detail pertaining specifically to the target population of interest for this review (i.e., doctors). The predominant use of non-probability sampling methods reduced scores for many studies. This methodology typically indicates the absence of a sampling frame and random sampling approach, an inability to calculate a response rate, and introduces coverage bias. Some studies lost additional points due to inadequate reporting of data (e.g., absence of numerator and/or denominator), and some did not report sample size calculation, or provide sufficient information to calculate retrospectively.

Study characteristics

The sample size of the studies ranged from 149 to 10,178. All studies employed a cross-sectional design. Full study characteristics are summarised in ST4.

Characteristics of studies assessing depression

A total of 31,447 participants from 26 studies were included; ten studies were based in Asia, seven in Europe, four in North America, two in South America, two in Africa, and one multi-national. Participants’ mean (SD) age ranged from 28.0 (3) to 45.2 (13.3). The proportion of female participants ranged from 3.4% to 80.1%. The median number of participants per study was 467.5. Male vs female split was 45.9% vs 54.0% respectively (NB. sex data not reported for some studies).

Characteristics of studies assessing anxiety

A total of 33,281 participants from 30 studies were included. Ten studies were based in Asia, nine in Europe, five in North America, three in South America, two in Africa, and two were multi-national. The mean (SD) age of sample size ranged 28.0 (3) to 52.0 (11). The proportion of female participants ranged from 8.3% to 80.1%. The median number of participants per study was 502.5. Male vs female split was 46.6% vs 53.8%.

Measures

Seventeen studies used the Generalised Anxiety Disorder Scale-7 Item (GAD-7; Spitzer et al., 2006), thirteen used the Patient Health Questionnaire- 9 Item (PHQ-9; Kroenke et al., 2001), seven used the Hospital Anxiety and Depression Scale (HADS; Zigmond and Snaith, 1983), three used the Depression Anxiety Stress Scale-21 item (DASS-21, short version of the DASS; Lovibond and Lovibond, 1995), three used the Patient Health Questionnaire- 2 Item (PHQ-2; Löwe et al., 2005), one used the Generalised Anxiety Disorder Scale-2 Item (GAD-2; Kroenke et al., 2007) one used the Beck Anxiety Inventory (BAI; Beck et al., 1988), and one used the Patient-Reported Outcomes Measurement Information System– Anxiety (PROMIS; Cella et al., 2010).

Results of individual studies

Point prevalence of depression ranged from 6.1% (95% CI 5.5–6.8%) (Li et al., 2020)to 73.4% (95% CI 65.9–79.7%) (Elhadi et al., 2020). Point prevalence of anxiety ranged from 5.9% (95% CI 4.1–8.3%) (Skoda et al., 2020)to 74.2% (95% CI 70.3–77.8%) (Jain et al., 2020), although only two out of the 26 depression studies and two out of the 30 anxiety studies reported prevalence of <10%. Point prevalence and confidence intervals for all individual studies are presented in ST5.

Results of synthesis

The pooled prevalence of depression for the 26 included studies was 20.5% (95% CI 16.0–25.3%), with a high degree of heterogeneity (I2 = 98.931%), as presented in Fig 2 . The pooled prevalence of anxiety for the 30 included studies was 25.8% (95% CI 20.4–31.5%), with a similarly high degree of heterogeneity (I2 = 99.190%), presented in Fig. 3

Fig. 2

Forest plot showing the global prevalence of depression symptoms among doctors.

Fig. 3

Forest plot showing the global prevalence of anxiety symptoms among doctors.

Forest plot showing the global prevalence of depression symptoms among doctors. Forest plot showing the global prevalence of anxiety symptoms among doctors.

Sensitivity analysis

Sensitivity analysis for studies of depression

One study affected the pooled prevalence of depression by ≥1%. The study in question (Elhadi et al., 2020) changed pooled prevalence by 1.7%. After running the analysis without this study, pooled prevalence was 18.8% (95% CI 14.6–23.3%). Cumulative analysis revealed heterogeneity only reached acceptability for a subset of thirteen studies (Chatzittofis et al., 2021, Civantos et al., 2020a, Civantos et al., 2020b, Fauzi et al., 2020, Florin et al., 2020, Hilmi et al., 2020, Khanna et al., 2020, Lai et al., 2020, Que et al., 2020, Vallée et al., 2020, Wang et al., 2020, Wang et al., 2020, Elhadi and Msherghi, 2021) all with proportions falling within a 7% range (95% CI 10.6–17.4%). For these studies, heterogeneity was reduced to moderate (I2=65.063) and pooled prevalence was 13.5% (95% CI 12.2–14.8%). As presented in Table 1 , between-group heterogeneity was not significant when analysed by measure (p = 0.062), severity threshold (p = 0.330), survey timeframe (p = 0.681), or risk of bias (p = 0.600).

Table 1

Sensitivity analysis for depression studies.

Depression	Studies, N	Pooled,%	95% CI	I²	p
Measure†:
PHQ9	13	16.1	10.4–22.8	99.316	0.062
HADS-D	7	27.5	17.6–38.6	98.174	0.062
Severity:					0.330
Mild	10	23.5	15.0–33.2	99.487
Moderate	16	18.5	14.6–22.9	96.395
Timeframe:
First 3 months	7	18.9	10.9–28.3	97.994	0.681
April onwards	19	21.1	15.7–27.0	99.117	0.681
Risk of bias:
Low	4	18.5	12.3–25.7	96.856	0.600
Medium	22	20.9	15.6–26.7	98.849	0.600

† Measures with fewer than four studies omitted *p = < 0.05.

Sensitivity analysis for depression studies. † Measures with fewer than four studies omitted *p = < 0.05.

Sensitivity analysis for studies of anxiety

Three studies affected the pooled prevalence of anxiety by ≥1% (Jain et al., 2020, Elhadi et al., 2020, Thomaier et al., 2020), the largest impact was a 1.5% change (Jain et al., 2020). After removing the three largest influencing studies, pooled prevalence was 21.8% (95% CI 17.3–26.7%). Cumulative analysis revealed that heterogeneity only reached acceptability for a subset of ten studies (Civantos et al., 2020a, Civantos et al., 2020b, Fauzi et al., 2020, Imran et al., 2020, Malgor et al., 2021, Shalhub et al., 2021, Wang et al., 2020, Wang et al., 2020, Elhadi and Msherghi, 2021, Kannampallil et al., 2020), all with proportions falling within an 8.5% range (95% CI 15.2–23.6%). For these studies, heterogeneity was reduced to moderate (I2=58.054) and pooled prevalence was 20.9% (95% CI 19.5–22.4%). As presented in Table 2 , between-group heterogeneity was statistically significant when analysed by measure (p = 0.034), severity threshold (p = 0.013), and survey timeframe (p = 0.038), but not by risk of bias (p = 0.089).

Table 2

Sensitivity analysis for anxiety studies.

Anxiety	Studies, N	Pooled,%	95% CI	I²	p
Measure†:
GAD7	17	20.3	14.3–27.2	99.293	0.034*
HADS-A	7	35.5	23.2–49	98.639	0.034*
Severity:
Mild	10	37.2	25.0–50.4	99.665	0.013*
Moderate	20	20.5	15.9–25.6	97.690	0.013*
Timeframe:
First 3 months	8	17.2	9.7–26.3	98.206	0.038*
April onwards	22	29.2	22.5–36.4	99.327	0.038*
Risk of bias:
Low	5	19.4	14.7–24.6	94.133	0.089
Medium	25	27.1	20.0–35.0	98.314	0.089

† Measures with fewer than four studies omitted * p = < 0.05.

Sensitivity analysis for anxiety studies. † Measures with fewer than four studies omitted * p = < 0.05.

Secondary analysis

Secondary analysis was performed with all studies (i.e., including those assessed as high risk of bias). The prevalence of depression symptoms for the 16 studies assessed as high risk of bias was 34.6% (95% CI 23.8–46.1%, I2=98.467). When compared with the 26 primary studies assessed as medium or low risk of bias, between-group heterogeneity was statistically significant (p = 0.018) (see SI5). By contrast, the prevalence of anxiety symptoms for the twenty-two studies assessed as high risk of bias (27.0%, 95% CI 20.5–34.0%, I2=98.918) was not significantly different from the 30 studies assessed as medium or low risk of bias (p = 0.787) (see SI6).

Subgroup analysis

Subgroup categorical information for each study is provided in ST6.

Subgroup analysis for studies of depression

As presented in Table 3 , between-group heterogeneity was statistically significant for studies of depression when analysed by GDP per capita (p = 0.014). Further analysis revealed significant heterogeneity between the <$10,000 and $10–15,000 groups (p = 0.005) but differences were not significant between other groups. Differences were not explained by geographical region (p = 0.282), or by doctors per 10,000 population (p = 0.198).

Table 3

Subgroup analysis for studies of depression symptoms.

Depression	Studies, N	Pooled,%	95% CI	I²	p
Region†:					0.282
Asia	10	14.8	9.0–21.7	99.339
Europe	7	21.3	13.4–30.5	97.697
North America	4	24.5	12.8–38.6	98.064
GDP per capita:
>$25,000	9	20.1	12.8–28.6	98.150	0.014*
$10–15,000	8	13.3	9.0–18.4	97.349
<$10,000	7	28.8	19.1–39.6	99.030
Doctors per 100K:					0.198
>30	4	16.3	12.2–20.8	90.204
20–29	8	27.7	17.2–39.7	98.073
15.5–19	8	15.0	8.4–23.0	98.541
<15.5	4	20.3	12.2–29.8	99.095

† Regions with fewer than four studies omitted * p = < 0.05 ** p = <0.01.

Subgroup analysis for studies of depression symptoms. † Regions with fewer than four studies omitted * p = < 0.05 ** p = <0.01.

Subgroup analysis for studies of anxiety

As presented in Table 4 , between-group heterogeneity was statistically significant among anxiety studies when analysed by doctors per 10,000 population (p = 0.003). As expected, the highest pooled prevalence of anxiety was calculated for the group of studies with the lowest number of doctors per 10,000 population (<15.5) at 37.9% (95% CI 20.6–56.9%). However, the lowest rates of anxiety were not observed in either of the categories with the highest numbers of doctors per 10,000 population (20–29, >30) but rather for the group of studies within the 15.5–19 doctors per 10,000 population range, with a prevalence of 14.7% (95% CI 9.0–21.5%). Further analysis revealed significant heterogeneity between the 15.5–19 group, when compared with the <15.5 group (p = 0.013), and when compared with the 20–29 group (p = 0.001). GDP per capita was on the threshold of significance (p = 0.054). Differences were not explained by geographical region (p = 0.145).

Table 4

Subgroup analysis.

Anxiety	Studies, N	Pooled,%	95% CI	I²	p
Region†:
Asia	10	21.5	13.1–21.3	99.508	0.145
Europe	8	21.0	12.7–30.8	98.255
North America	5	35.3	23.0–48.6	98.330
GDP per capita:					0.054
>$25,000	10	25.6	16.0–36.6	98.980
$10–15,000	10	16.4	9.4–24.9	98.870
>$10,000	8	32.7	22.3–44.1	98.934
Doctors per 100K:					0.003**
>30	5	19.1	9.1–31.6	98.681
20–29	11	32.4	24.0–41.4	97.796
15.5–19	8	14.7	9.0–21.5	98.050
<15.5	4	37.9	20.6–56.9	99.51

† Regions with fewer than four studies omitted * p = < 0.05 ** p = <0.01.

Subgroup analysis. † Regions with fewer than four studies omitted * p = < 0.05 ** p = <0.01. Egger's test revealed that publication bias was not statistically significant for studies reporting prevalence of depression symptoms (p = 0.6765), nor for studies reporting anxiety symptoms (p = 0.8973) (see SI3 and SI4 for visual funnel plots).

Discussion

Summary of evidence

The objective of this systematic review and meta-analysis was to provide an estimate of the global prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic. The overall pooled prevalence of depression, calculated from 26 studies and 31,447 participants, was 20.5% (95% CI 16.0–25.3%). The overall pooled prevalence of anxiety, calculated from 30 studies and 33,281 participants, was 25.8% (95% CI 20.4–31.5%).

Comparison with existing evidence

Findings are broadly comparable to earlier estimates for doctors, conducted within the first three to six months of the pandemic. Pappa et al. (2020) conducted a meta-analysis of health care workers up until mid-April 2020. Their subgroup analysis of six studies reporting anxiety data specifically for doctors revealed a pooled prevalence of 21.7% (95% CI 15.3–29.0%); while five studies reported depression data with a pooled prevalence of 25.4% (95% CI 16•6–35.2%). In Santabárbara et al. (2021) meta-analysis of anxiety in health care workers, conducted up until mid-September 2020, a sub-group analysis of 13 studies of doctors reported a more modest pooled prevalence of 17% (95% CI 12.0–22.0%) for anxiety. This figure is comparable to the proportion calculated from the eight studies conducted in the first three months in the current study, but somewhat lower than the overall pooled estimate. However, direct comparisons are difficult due to the wide and overlapping confidence intervals and significant heterogeneity found across reviews. The prevalence of depression and anxiety symptoms among doctors also falls within the range reported in research conducted during the SARS epidemic ranging from 18% to 57% (Tam et al., 2004; Chan and Huak, 2004; Phua et al., 2005; Nickell et al., 2004; Maunder et al., 2004; Koh et al., 2005). However, these studies reported data on the prevalence of psychological distress rather than symptoms of depression and anxiety. Furthermore, many of these studies focussed on the broader population of healthcare workers, rather than doctors, so a direct comparison is not possible The results of the current study are also broadly consistent with previous studies conducted prior to the pandemic, indicating very high prevalence of depression and anxiety among doctors. However, evidence of a clear increase compared with pre-pandemic estimates is lacking. As above, direct comparisons are difficult to make as much of the pre-pandemic literature reports the prevalence of psychological distress and/ or burnout, rather than depression and anxiety, for this population. To the author's knowledge, there has only been one systematic review of depression and anxiety in qualified doctors prior to the pandemic (Beyond Blue, 2010); however, pooled prevalence was not calculated due to the wide variation in point prevalence. The narrative summary reported depression as ranging from 14% to 60%, and anxiety ranging from 18% to 55%. Subsequently, a cross-sectional study based in the Netherlands reported prevalence of depression and anxiety among doctors to be 29% and 24% respectively (Ruitenburg et al., 2012). In 2017, a study conducted in Ireland reported 16.6% and 14.4% of doctors with symptoms of depression and anxiety of moderate severity or above (Hayes et al., 2017); although these figures are more modest (particularly in relation to anxiety symptoms) than those reported in the current study, they remain considerably higher than rates in the general population. Previous research has also found higher levels of job demands are associated with reduced wellbeing in doctors (Khan et al., 2018, Lee et al., 2013, Teoh et al., 2021). A tentative hypothesis is that the absence of a clear increase in prevalence of depression and anxiety among doctors during the COVID-19 pandemic, compared with previous estimates, might suggest either a ceiling effect of job demands has been reached, or that greater job resources have been made available during the pandemic to offset the increased demands. Interestingly, a meta-analysis conducted for the general population, up to June 2020, estimated the global prevalence as 28.0% (95% CI 25.0–31.2%) for depression and 26.9% (95% CI 24.0–30.0%) for anxiety (Nochaiwong et al., 2021). These rates are significantly higher than pre-pandemic global estimates for the general population of 4.7% (4.4–5.0%) for depression (Ferrari et al., 2013) and 7.3% (4.8–10.9%) for anxiety (Baxter et al., 2013). This suggests there may have been a large increase in depression and anxiety symptoms among the general population within the first few months of the pandemic, reaching the consistently high levels reported among doctors. Furthermore, while levels of anxiety in the Nochaiwong study appear similar to those reported for doctors in the current study (26.9% vs 25.8%), levels of depression appear significantly higher in the global general population compared to those observed in doctors in the current study (28.0% vs 20.5%). Given that reduced activity is associated with depression, this finding might be explained by the presumed greater levels of inactivity within the general population, due to lockdown restrictions. Whereas doctors, as essential workers, may have experienced a less severe loss of routine. It is also of note that the pre-pandemic Ferrari and Baxter meta-analyses used studies that estimated prevalence based on ‘gold standard’ diagnostic interview procedures rather than self-report, which may account for some of the difference in outcomes. The data from this study suggests that doctors continue to be a population at high risk of depression and anxiety, but the evidence does not support a clear increase in symptoms, compared with pre-pandemic data.

Sub-group heterogeneity

The subgroup analyses conducted in this review (geographical region, doctors per 10,000 population, GDP per capita) were able to explain some of the heterogeneity in depression and anxiety studies, but not consistently. When comparing prevalence based on GDP per capita, there was significant between-group heterogeneity for depression (p = 0.014), and threshold significance for anxiety (p = 0.054). As expected, the highest prevalence rates were recorded for the lowest GDP per capita (<$10,000 studies), with pooled prevalence of 28.8% (95% CI 19.1–39.6%) for depression and 32.7% (95% CI 22.3–44.1%) for anxiety. However, notably, for both sub-group analyses, the lowest levels of depression and anxiety were not reported for countries with the highest GDP per capita (>$25,000), but for studies in the $10–15,000 level, with prevalence of depression at 13.3% (95% CI 9.0–18.4%) and of anxiety at 16.4% (95% CI 9.4–24.9%). These findings are consistent with previous research that suggests that beyond a certain level of wealth and resource, additional benefit to emotional wellbeing is minimal (Kahneman and Deaton, 2010). Findings are somewhat consistent with the JD-R model, which was used to select the subgroup comparisons of GDP per capita and doctors per 10,000 population as factors that may be expected to increase job demands and reduce job resources for doctors during the pandemic. Lowest GDP corresponded with highest rates of depression symptoms, and lowest numbers of doctors per 10,000 corresponded with highest rates of anxiety.

Methodological heterogeneity

The methodological differences explored via sensitivity analyses (risk of bias, measure, severity threshold, survey timeframe) did not explain the heterogeneity for depression studies, apart from when comparing high risk of bias with low/ medium risk of bias studies (p = 0.018). High risk of bias studies produced a prevalence of 34.6% (23.8–46.1%) whereas low/medium risk of bias studies produced a prevalence of 20.5% (16.0–25.3%). Conversely, all of the methodological differences were relevant in explaining the heterogeneity in anxiety studies, apart from risk of bias (high vs low/medium p = 0.787). The type of measure used in depression studies did not produce statistically significant differences in estimates (p = 0.062). Pooled prevalence was 16.1% (95% CI 10.4–22.8%) for the PHQ9 and 27.5% (95% CI 17.6–38.6%) for the HADS-D. However, for anxiety, there was a significant difference between studies using the GAD7 vs those using the HADS-A (p = 0.034). Pooled prevalence was 20.3% (95% CI 14.3–27.2%) for the GAD7 and 35.5% (95% CI 23.2–49.1%) for the HADS-A. This may be explained by potential differences in the underlying factor being measured. For example, a meta confirmatory factor analysis of the HADS identified a strong general factor. The authors suggested that it does not provide good separation between symptoms of anxiety and depression and recommended it may be best used as a measure of general distress (Norton et al., 2013). Reporting of mild vs moderate and above symptoms did not produce statistically different prevalence estimates for depression (p = 0.330) but did for anxiety (p = 0.013). Studies reporting mild and above symptoms of anxiety produced a pooled prevalence of 37.2% (95% CI 25.0–50.4%) whereas studies reporting moderate and above symptoms produced a more modest estimate of 20.5% (95% CI 15.9–25.6%). The lack of consensus and consistency across studies regarding what constitutes clinically significant levels of anxiety symptoms, and the poor equivalence when comparing severity levels across different measures, presents a challenge when attempting to estimate an overall prevalence (Clover et al., 2020). The timeframe of data collection was not significant for depression studies (p = 0.681) but was for anxiety studies (p = 0.038). Interestingly, the pooled prevalence of anxiety symptoms was significantly lower in studies conducted within the first three months of the pandemic (17.2%, 95% CI 9.7–26.3%) compared with studies reporting data from April onwards (29.2%, 95% CI 22.5–36.4%). Although this was based on a small subgroup of eight studies. This finding is in contrast to research in the UK general population between 23rd March and 9th August 2020 that suggest symptoms of anxiety were higher in the first few months before gradually declining (Fancourt et al., 2021). This finding might be understood as the consequence of chronic stress on the medical workforce as the pandemic progressed. However, it is also of note that findings from the UK-based study (Fancourt et al., 2021) are not consistent with the pooled prevalence reported in a similar timeframe from the global meta-analysis (Nochaiwong et al., 2021). This inconsistency is reflective of the overall high variability in the evidence.

Limitations

This review has several limitations. Firstly, there are a number of limitations associated with the methodology of the studies of interest. As with all observational research, causation cannot be inferred. The predominant use of non-probability sampling methods introduced the highest levels of bias. This methodology means that a sampling frame and stratified random sampling approach is typically absent, which has implications for coverage bias and the ability to calculate a response rate. In addition, the widespread use of online-only survey, although appropriate given the global context, may have introduced further coverage bias by excluding people who were too busy or overwhelmed to access their emails or social media. Other potential sources of bias include self-selection bias, which may be introduced by disproportionately attracting doctors with a past history or particular interest in mental health. Conversely, social desirability bias can also be introduced by the use of self-report measures. All of which can influence study results. Another significant limitation is the high heterogeneity observed across studies. Heterogeneity is inherent in meta-analyses of this type of data, but limits confidence in the conclusions drawn. Given the between-study variability in geographical location, settings, and specialities, generalisability may be limited. Lack of consistency in methodological approaches also limits confidence in conclusions, including the use of a wide variety of questionnaires, differences in cut-offs and severity thresholds, and absence of ‘gold standard’ diagnostic interviews. There are also several limitations associated with the methodology of the overall review. High risk of bias studies were excluded, with the aim of reducing overall bias and increasing homogeneity (Higgins et al., 2011; Detweiler et al., 2016). However, a drawback of analysis with a reduced sample is a reduction in overall precision. Sensitivity analysis incorporating high risk studies indicated that omitting these studies from the primary analyses of anxiety was not sufficient to explain heterogeneity. However, the significant difference in pooled prevalence in depression studies highlights the potential utility of this approach in avoiding overestimation of distress. Inter-rater reliability for risk of bias ratings was not an available as a function within the software used. Reporting bias may have been introduced by the exclusion of gray literature, non-English language papers, and inaccessible papers. While this study covered symptoms of depression and anxiety, specific anxiety disorders and other mental health conditions were excluded. It may also have been useful to consider the influence of additional variables, including indicators of more localised job demands, such as local infection rates during the timeframe for each study, and indicators of resources, such as organisational, social and psychological factors. Finally, although this review covers more than twelve months of research conducted during the pandemic, any studies published after the 3rd March 2021 will be absent from analyses. Given the rate at which new studies are being published, a more updated meta-analysis may soon be required.

Quality of evidence

The overall quality of evidence likely falls within the low to very low range, as per GRADE assessment guidelines. All observational research begins as low quality. Given the wide-ranging point prevalence observed across studies, the broad confidence intervals around pooled prevalence estimates, and the high level of heterogeneity observed, this assessment appears to be a fair reflection. This means that the estimate of effect is uncertain and future research may change this estimate. Recommendations for improving the quality of future research are outlined below.

Strengths

Despite these limitations, this review has a number of strengths. Firstly, risk of bias assessment highlighted a number of strengths in the individual studies. The vast majority of studies used appropriate and valid methods to identify depression and/or anxiety and measured the condition(s) in a standard and reliable way for all participants. Most studies appropriately described and reported the statistical analyses conducted. Setting and characteristics were also largely well described. In consideration of the overall review, to our knowledge, this is the first systematic review and meta-analysis of the global prevalence of symptoms of depression and anxiety among doctors during the pandemic. The number of studies returned in our searches was unexpectedly high; enabling us to be more selective in the quality of the studies included for full analysis. Although high risk of bias studies were excluded from the primary analyses, secondary analysis was also conducted to compare high vs medium/ low risk of bias studies. While between-group heterogeneity was not significant when comparing the risk of bias for anxiety studies, heterogeneity was significant for depression studies. The more modest pooled prevalence for depression, using just the lower risk studies, may therefore be considered a more accurate estimate. Data were extracted for cases above clinical cut-off thresholds; for the majority of studies, reported cut-offs were within the moderate severity range. In the few studies where a specific cut-off score was not reported, data were extracted for cases in the moderate and above categories. Studies reporting prevalence estimates based on predominantly mild symptoms are likely to provide an overinflated estimation of mental health conditions in this population; therefore, the pooling of predominantly moderate and above estimates may offer a more accurate reflection of the prevalence of clinically relevant symptoms in doctors than studies including data for all levels of symptom severity. Further strengths include the large number of overall participants from across the globe, spanning a wide range of clinical specialities and settings. Subgroup analyses, exploring the potential impact of job demands, provides some additional insight into factors that may be influencing prevalence.

Recommendations

Given the evidence for high levels of depression and anxiety symptoms among doctors across the world, health care services should consider multi-level approaches to support (Bakker and Demerouti, 2018). Firstly, organisational and structural changes are needed to ensure doctors have access to the most fundamental resources, such as time to sleep, eat, exercise, and spend time with others (Unadkat and Farquhar, 2020). Ongoing efforts should be made to destigmatise discussions around mental health (Galbraith et al., 2020). Formal and informal peer support systems may help to facilitate these conversations and should be encouraged (Behrman et al., 2020). Schwartz rounds are increasing in popularity, are well received by staff (Flanagan et al., 2020), and can normalize conversations around the emotional impact of work and reduce stigma. Similarly, formal and informal psychology input should be embedded within health services. Services should consider incorporating evidence based and high-quality interventions, such as those based on mindfulness and cognitive-behavioural therapy, which have been found to be effective in reducing stress, anxiety, and depression for doctors and nurses (Melnyk et al., 2020; Murray et al., 2016). Systems to monitor the wellbeing of doctors should be in place, and in cases where one-to-one psychological support is required there should be clear and discreet pathways to referral. Further longitudinal research is needed to monitor long-term outcomes and to explore potential differences in trajectory of mental health outcomes for doctors compared with other populations. Future research may benefit from greater consideration of individual, social and organisational demands and resources. Improvements to research methodology would also increase the overall quality of the evidence base and enable greater confidence in conclusions. Specifically, the adoption of random probability sampling methods is needed. There also needs to be more consistency in measurement. Future studies would benefit from adopting ‘gold standard’ diagnostic interview methods, using only measures with the strongest psychometric properties, utilizing cut-offs that optimize sensitivity and specificity in identifying clinically relevant symptoms, and reporting on a broader range of cut-offs in order to facilitate better comparisons with studies using alternative measures (Clover et al., 2020; Cameron et al., 2008).

Conclusion

This systematic review and meta-analysis provides a comprehensive analysis of the global prevalence of depression and anxiety symptoms among doctors during the first twelve months of the COVID-19 pandemic. Symptoms of depression and anxiety are elevated among doctors, compared with earlier research from the general population, but not conclusively more so than pre-pandemic levels among doctors. Differences in study design and variation in job demands may account for some of the observed heterogeneity. Findings may help to quantify the needs of this population and guide health care systems to plan support as we recover from the pandemic, and prepare for other times of national or global crisis.

Role of funding sources

This review was conducted as part of doctoral training and is funded by NHS Wales.

CRediT authorship contribution statement

GJ designed the study and protocol, conducted literature searches, screening, data extraction, quality assessment, statistical analysis, and wrote the manuscript. VS contributed to the design and coordination of the study and provided input into the final drafts. LF conducted independent screening and full text review. JS conducted independent data extraction and quality assessment. LW contributed to the design and coordination of the study, provided input into the final drafts and was the primary supervisor. All authors have approved the final manuscript.

Registration and protocol

The protocol was registered on and can be accessed online (CRD42021228667).

Conflict of Interest

The authors declare no conflicts of interest.

9 in total

1. The Effects of Long Working Hours on Mental Health Among Resident Physicians in Saudi Arabia.

Authors: Daniah Bondagji; Mutaz Fakeerh; Hassan Alwafi; Adeel Ahmed Khan
Journal: Psychol Res Behav Manag Date: 2022-06-20

2. Prevalence and predictors of mental health outcomes in UK doctors and final year medical students during the COVID-19 pandemic.

Authors: G Johns; L Waddington; V Samuel
Journal: J Affect Disord Date: 2022-05-13 Impact factor: 6.533

3. Prevalence of depression in Uganda: A systematic review and meta-analysis.

Authors: Mark Mohan Kaggwa; Sarah Maria Najjuka; Felix Bongomin; Mohammed A Mamun; Mark D Griffiths
Journal: PLoS One Date: 2022-10-20 Impact factor: 3.752

4. Psychological Distress Was Still Serious Among Anesthesiologists Under the Post COVID-19 Era.

Authors: Fei Guo; Ruili Han; Ting Luo; Shengyang Jin; Yuting Yan; Jun Wang; Xude Sun; Changjun Gao
Journal: Psychol Res Behav Manag Date: 2022-03-28

Review 5. Anxiety Linked to COVID-19: A Systematic Review Comparing Anxiety Rates in Different Populations.

Authors: Hafsah Saeed; Ardalan Eslami; Najah T Nassif; Ann M Simpson; Sara Lal
Journal: Int J Environ Res Public Health Date: 2022-02-15 Impact factor: 3.390

Review 6. Policies on mental health in the workplace during the COVID-19 pandemic: A scoping review.

Authors: David Villarreal-Zegarra; C Mahony Reátegui-Rivera; Iselle Sabastizagal-Vela; Miguel Angel Burgos-Flores; Nieves Alejandra Cama-Ttito; Jaime Rosales-Rimache
Journal: PLoS One Date: 2022-07-28 Impact factor: 3.752

7. Mental health problems and needs of frontline healthcare workers during the COVID-19 pandemic in Spain: A qualitative analysis.

Authors: Roberto Mediavilla; Anna Monistrol-Mula; Kerry R McGreevy; Mireia Felez-Nobrega; Audrey Delaire; Pablo Nicaise; Santiago Palomo-Conti; Carmen Bayón; María-Fe Bravo-Ortiz; Beatriz Rodríguez-Vega; Anke Witteveen; Marit Sijbrandij; Giulia Turrini; Marianna Purgato; Cécile Vuillermoz; Maria Melchior; Papoula Petri-Romão; Jutta Stoffers-Winterling; Richard A Bryant; David McDaid; A-La Park; José Luis Ayuso-Mateos
Journal: Front Public Health Date: 2022-07-27

8. E-Health Psychological Intervention for COVID-19 Healthcare Workers: Protocol for its Implementation and Evaluation.

Authors: Alejandro Dominguez-Rodriguez; Reyna Jazmín Martínez-Arriaga; Paulina Erika Herdoiza-Arroyo; Eduardo Bautista-Valerio; Anabel de la Rosa-Gómez; Rosa Olimpia Castellanos Vargas; Laura Lacomba-Trejo; Joaquín Mateu-Mollá; Miriam de Jesús Lupercio Ramírez; Jairo Alejandro Figueroa González; Flor Rocío Ramírez Martínez
Journal: Int J Environ Res Public Health Date: 2022-10-05 Impact factor: 4.614

9. Mental Health and Associated Demographic and Occupational Factors among Health Care Workers during the COVID-19 Pandemic in Latvia.

Authors: Laura Valaine; Gunta Ancāne; Artūrs Utināns; Ģirts Briģis
Journal: Medicina (Kaunas) Date: 2021-12-18 Impact factor: 2.430

9 in total