Literature DB >> 35637660

Ethnic disparities in publicly-available pulse oximetry databases.

Fatemeh Y Sinaki1, Rabab Ward2, Derek Abbott3, John Allen4, Richard Ribon Fletcher5,6, Carlo Menon7,8, Mohamed Elgendi1,2,7,8.   

Abstract

Inaccuracies have been reported in pulse oximetry measurements taken from people who identified as Black. Here, we identify substantial ethnic disparities in the population numbers within 12 pulse oximetry databases, which may affect the testing of new oximetry devices and impact patient outcomes.
© The Author(s) 2022.

Entities:  

Keywords:  Biomarkers; Diagnostic markers; Public health

Year:  2022        PMID: 35637660      PMCID: PMC9142514          DOI: 10.1038/s43856-022-00121-8

Source DB:  PubMed          Journal:  Commun Med (Lond)        ISSN: 2730-664X


There has been an unprecedented demand for pulse oximetry—a method of determining the oxygen saturation (SpO2) of the blood—during the COVID-19 pandemic to aid in medical decision-making. Pulse oximetry data are also widely used for medical research and algorithm development. The measurement of SpO2 involves shining light onto tissue at two separate wavelengths and derives the oxygen saturation from the relative changes in light absorption with each heartbeat. This pulsatile component is independent of skin pigmentation; however, other factors, such as the specific properties of the light source and the algorithms used by the product manufacturer, can produce variations that depend on skin pigmentation[1]. A recent study revealed the clinical importance of racial disparities in pulse oximetry readings[2]. Specifically, when compared to measurements of arterial oxygen saturation, the pulse oximetry algorithms in these devices were found to produce systematically higher saturation values in Black patients compared to white patients. Such systematic racial biases could adversely affect clinical decision-making, such as triage for supplemental oxygen, due to the pulse oximetry readings of Black patients appearing artificially higher. Such inaccuracies may disproportionately increase the risk of unrecognized low oxygen saturation levels in Black patients under certain circumstances, for example in people with COVID-19[3]. Recent advancements in artificial intelligence (AI) have relied on using public databases to undertake feature extraction with pulse oximetry signals to assess hypertension[4], estimate lung function[5], and validate algorithms developed for monitoring patients with COVID-19[6]. To prevent potential disparities in the calibration and accuracy of pulse oximetry devices and their algorithms, the pulse oximeter signals within such public databases need to be representative of the diverse populations on which these devices are used. Since inaccuracies in pulse oximetry readings have been attributed to differences in skin pigmentation and skin pigmentation varies with race and ethnicity, it is essential to clarify these terms as used in this article. We have predominantly chosen to use the term ethnicity because that is the specific term used in the public datasets and is also the term that is identified by the patients themselves. For the purpose of discussing health disparities, while both race and ethnicity are social constructs, ethnicity has emerged as the preferred one since it encompasses cultural aspects of social identity[7] that extend beyond the more simplistic view of race that is primarily based on shared skin pigmentation or physical characteristics[8]. While members of a given ethnicity can express a range of skin pigmentation, it is generally agreed that those patients who self-identify as Black generally have a darker skin pigmentation than other ethnic groups. In order to investigate the proportion of individual ethnicities represented in publicly available pulse oximetry databases, we conducted a comprehensive assessment of accessible databases from 1st January 2012–1st January 2022 using PubMed consisting of Medical Subject Headings (MeSH) terms and Title/Abstract keywords. Applying the inclusion and exclusion criteria defined in Fig. 1 resulted in 12 research articles describing 12 publicly-available datasets to assess different medical conditions using pulse oximeter data.
Fig. 1

Flow chart of study identification, inclusion and exclusion criteria.

Presentation of the literature search and selection procedure together with the numbers of records at each stage sourced from PubMed from 1 January 2012 to 1 January 2022 using the following keywords: ((“critical care”[MeSH Terms] OR (“critical”[All Fields] AND “care”[All Fields]) OR “critical care”[All Fields] OR (“oximetry”[MeSH Terms] OR “oximetry”[All Fields] OR (“oxygen”[All Fields] AND “saturation”[All Fields]) OR “oxygen saturation”[All Fields]) OR (“oximetry”[MeSH Terms] OR “oximetry”[All Fields] OR (“o2”[All Fields] AND “saturation”[All Fields]) OR “o2 saturation”[All Fields]) OR “PPG”[All Fields] OR (“photoplethysmogram”[All Fields] OR “photoplethysmograms”[All Fields]) OR (“photoplethysmography”[MeSH Terms] OR “photoplethysmography”[All Fields])) AND ((“publicly”[All Fields] AND (“availabilities”[All Fields] OR “availability”[All Fields] OR “available”[All Fields])) OR (“freely”[All Fields] AND (“access”[All Fields] OR “accessed”[All Fields] OR “accesses”[All Fields] OR “accessibilities”[All Fields] OR “accessibility”[All Fields] OR “accessible”[All Fields] OR “accessing”[All Fields])))) AND ((humans[Filter]) AND (english[Filter])). Papers that met the inclusion criteria, discussing the development and publication of original pulse oximetry datasets, were selected for analysis. This resulted in 12 research papers representing 12 publicly available datasets. Here n refers to the number of studies, where m refers to the number of publicly available databases.

Flow chart of study identification, inclusion and exclusion criteria.

Presentation of the literature search and selection procedure together with the numbers of records at each stage sourced from PubMed from 1 January 2012 to 1 January 2022 using the following keywords: ((“critical care”[MeSH Terms] OR (“critical”[All Fields] AND “care”[All Fields]) OR “critical care”[All Fields] OR (“oximetry”[MeSH Terms] OR “oximetry”[All Fields] OR (“oxygen”[All Fields] AND “saturation”[All Fields]) OR “oxygen saturation”[All Fields]) OR (“oximetry”[MeSH Terms] OR “oximetry”[All Fields] OR (“o2”[All Fields] AND “saturation”[All Fields]) OR “o2 saturation”[All Fields]) OR “PPG”[All Fields] OR (“photoplethysmogram”[All Fields] OR “photoplethysmograms”[All Fields]) OR (“photoplethysmography”[MeSH Terms] OR “photoplethysmography”[All Fields])) AND ((“publicly”[All Fields] AND (“availabilities”[All Fields] OR “availability”[All Fields] OR “available”[All Fields])) OR (“freely”[All Fields] AND (“access”[All Fields] OR “accessed”[All Fields] OR “accesses”[All Fields] OR “accessibilities”[All Fields] OR “accessibility”[All Fields] OR “accessible”[All Fields] OR “accessing”[All Fields])))) AND ((humans[Filter]) AND (english[Filter])). Papers that met the inclusion criteria, discussing the development and publication of original pulse oximetry datasets, were selected for analysis. This resulted in 12 research papers representing 12 publicly available datasets. Here n refers to the number of studies, where m refers to the number of publicly available databases. In total, as of January 28th 2022, these databases have been used to produce over 6214 citations according to Google Scholar including 3544 citations for Medical Information Mart for Intensive Care (MIMIC III)[9]; 1049 citations for MIMIC II[10]; 531 citations for IEEEPPG Dataset[11]; 243 citations for Multiparameter Intelligent Monitoring in Intensive Care I (MIMIC I)[12]; 239 citations for WESAD[13]; 215 citations for Vortal Dataset[14]; 102 citations for the CapnoBase Dataset[15]; 87 citations for the University of Queensland Vital Signs Dataset[16]; 86 citations for PPG-DaLiA[17]; 63 citations for PPG-BP Dataset[18]; 50 citations for Wrist PPG Signals Recorded during Exercise[19]; and 5 citations for Medical Information Mart for Intensive Care IV (MIMIC-IV)[20]. We evaluated the existence of potentialdisparities in ethnicity based on the existing patient records as reported in the publicly available databases. In the absence of such information, the numbers of subjects of each category were inferred and quantified based on the authors’ research institutions’ locations or where the data was collected, as shown in Table 1.
Table 1

Summary of all the 12 publicly available datasets.

TitleAuthors (year of publication)Name of databaseNumber of subjectsCountry of data locationEthnicity is clearly stated or inferredEthnicityNumber of subjects based on ethnic group
A database to support development and evaluation of intelligent intensive care monitoringMoody and Mark[12] (1996)MIMIC I93USClearly stated

2.4% Asian

9.1% Black

19.1% Other

70.3% White

2 Asian

8 Black

18 Other

65 White

CapnoBase: signal database and tools to collect, share and annotate respiratory signalsKarlen et al.[15] (2010)CapnoBase42CanadaInferred

11.0% Asian

3.5% Black

12.6% Other

72.9% White

4 Asian

1 Black

6 Other

31 White

Multiparameter intelligent monitoring in intensive care II (MIMIC-II): a public-access intensive care unit databaseSaeed et al.[10] (2011)MIMIC-II32000USClearly stated

2.5% Asian

10.6% Black

17.7% Other

69.2% White

800 Asian

3392 Black

5664 Other

22144 White

University of Queensland vital signs dataset: development of an accessible repository of anesthesia patient monitoring data for researchLiu et al.[16] (2012)University of Queensland Vital Signs32AustraliaInferred

3.1% Asian

0.4% Black

27.7% Other

69.2% White

1 Asian

0 Black

9 Other

22 White

TROIKA: a general framework for heart rate monitoring using wrist-type photoplethysmographic (PPG) signals during intensive physical exerciseZhang et al.[11] (2015)IEEEPPG12ChinaInferred

100.0% Asian

0.0% Black

0.0% Other

0.0% White

12 Asian

0 Black

0 Other

0 White

MIMIC-III, a freely accessible critical care databaseJohnson et al.[9] (2016)MIMIC-III53423USClearly stated

2.4% Asian

7.7% Black

18.6% Other

71.3% White

1282 Asian

4113 Black

9937 Other

38091 White

An assessment of algorithms to estimate respiratory rate from the electrocardiogram and photoplethysmogramCharlton et al.[14] (2016)Vortal45UKInferred

7.5% Asian

3.4% Black

9.1% Other

80.0% White

4 Asian

2 Black

3 Other

36 White

Description of a database containing wrist PPG signals recorded during physical exercise with both accelerometer and gyroscope measures of motionJarchi and Casson[19] (2016)Wrist PPG Signals Recorded during Exercise8UKInferred

7.5% Asian

3.4% Black

0.1% Other

80.0% White

1 Asian

1 Black

0 Other

6 White

Introducing WESAD, a multimodal dataset for wearable stress and affect detectionSchmidt et al.[13] (2018)WESAD15GermanyInferred

2.5% Asian

1.0% Black

8.3% Other

88.2% White

0 Asian

0 Black

2 Other

13 White

A new, short-recorded photoplethysmogram dataset for blood pressure monitoring in ChinaLiang et al.[18] (2018)PPG-BP219ChinaInferred

92.9% Asian

0.0% Black

7.1% Other

0.0% White

203 Asian

0 Black

16 Other

0 White

Deep PPG: large-scale heart rate estimation with convolutional neural networksReiss et al.[17] (2019)PPG-DaLiA15GermanyInferred

2.5% Asian

1.0% Black

8.3% Other

88.2% White

0 Asian

0 Black

2 Other

13 White

MIMIC-IVJohnson et al.[20] (2021)MIMIC-IV60000USClearly stated

3.0% Asian

10.0% Black

10.0% Other

77.0% White

1800 Asian

6000 Black

6000 Other

46200 White

The numbers of subjects and distribution of different ethnic groups as identified or inferred in the publicly available datasets. In case of the absence of patient ethnicity information, the location of the authors’ research institutions’ orthe data collection location was used to infer ethnicity based on local statistics. For Vortal and Wrist PPG Signals Recorded during Exercise databases, we inferred the ethnicity of subjects based on the Institute of Race Relations (https://irr.org.uk/research/statistics/ethnicity-and-religion/). For CapnoBase database, we inferred the ethnicity of subjects based on Statistics Canada (https://www.statcan.gc.ca/en/start) reviewing “Census Profile, 2016 Census” (https://www12.statcan.gc.ca/census-recensement/2016/dp-pd/prof/details/page.cfm?Lang=E&Geo1=PR&Code1 = 01&Geo2=PR&Code2 = 01&Data=Count&SearchText=canada&SearchType=Begins&SearchPR=01&B1 = All&TABID = 1). As an example, the Black population accounts for 3.5% of Canada’s total population (https://www.statcan.gc.ca/en/dai/smr08/2022/smr08_259). For University of Queensland Vital Signs, we inferred the ethnicity of subjects based on Australian Bureau of Statistics (https://www.abs.gov.au/). For WESAD and PPG-DaLiA databases, we inferred the ethnicity of subjects based on Statistisches Bundesamt (https://www.destatis.de/EN/Home/_node.html). For PPG-BP, we inferred the ethnicity of subjects based on National Bureau of Statistics of China (http://www.stats.gov.cn/english/).

Summary of all the 12 publicly available datasets. 2.4% Asian 9.1% Black 19.1% Other 70.3% White 2 Asian 8 Black 18 Other 65 White 11.0% Asian 3.5% Black 12.6% Other 72.9% White 4 Asian 1 Black 6 Other 31 White 2.5% Asian 10.6% Black 17.7% Other 69.2% White 800 Asian 3392 Black 5664 Other 22144 White 3.1% Asian 0.4% Black 27.7% Other 69.2% White 1 Asian 0 Black 9 Other 22 White 100.0% Asian 0.0% Black 0.0% Other 0.0% White 12 Asian 0 Black 0 Other 0 White 2.4% Asian 7.7% Black 18.6% Other 71.3% White 1282 Asian 4113 Black 9937 Other 38091 White 7.5% Asian 3.4% Black 9.1% Other 80.0% White 4 Asian 2 Black 3 Other 36 White 7.5% Asian 3.4% Black 0.1% Other 80.0% White 1 Asian 1 Black 0 Other 6 White 2.5% Asian 1.0% Black 8.3% Other 88.2% White 0 Asian 0 Black 2 Other 13 White 92.9% Asian 0.0% Black 7.1% Other 0.0% White 203 Asian 0 Black 16 Other 0 White 2.5% Asian 1.0% Black 8.3% Other 88.2% White 0 Asian 0 Black 2 Other 13 White 3.0% Asian 10.0% Black 10.0% Other 77.0% White 1800 Asian 6000 Black 6000 Other 46200 White The numbers of subjects and distribution of different ethnic groups as identified or inferred in the publicly available datasets. In case of the absence of patient ethnicity information, the location of the authors’ research institutions’ orthe data collection location was used to infer ethnicity based on local statistics. For Vortal and Wrist PPG Signals Recorded during Exercise databases, we inferred the ethnicity of subjects based on the Institute of Race Relations (https://irr.org.uk/research/statistics/ethnicity-and-religion/). For CapnoBase database, we inferred the ethnicity of subjects based on Statistics Canada (https://www.statcan.gc.ca/en/start) reviewing “Census Profile, 2016 Census” (https://www12.statcan.gc.ca/census-recensement/2016/dp-pd/prof/details/page.cfm?Lang=E&Geo1=PR&Code1 = 01&Geo2=PR&Code2 = 01&Data=Count&SearchText=canada&SearchType=Begins&SearchPR=01&B1 = All&TABID = 1). As an example, the Black population accounts for 3.5% of Canada’s total population (https://www.statcan.gc.ca/en/dai/smr08/2022/smr08_259). For University of Queensland Vital Signs, we inferred the ethnicity of subjects based on Australian Bureau of Statistics (https://www.abs.gov.au/). For WESAD and PPG-DaLiA databases, we inferred the ethnicity of subjects based on Statistisches Bundesamt (https://www.destatis.de/EN/Home/_node.html). For PPG-BP, we inferred the ethnicity of subjects based on National Bureau of Statistics of China (http://www.stats.gov.cn/english/). To avoid any uncertainty in the results of ethnic disparity analysis for a given population, databases with inferred ethnicity information were excluded from the statistical analysis. Four databases for which data for ethnicity was clearly stated, MIMIC, MIMIC-II, MIMIC-III and MIMIC-IV, were included in the statistical analysis. The distribution of ethnic groups in the four databases is shown in Fig. 2.
Fig. 2

Box plots of the ethnic makeup by proportion in all databases.

This figure combines all the databases used in the four publicly available pulse oximeter databases that clearly reported the distribution of ethnic groups. The data supports the hypothesis that disparities exist here. Significant differences are evident between white and Black (p < 0.0001), white and Asian (p < 0.0001), and Black and Asian populations (p = 0.021). All pairs of groups were tested by using a simultaneous pairwise Tukey test. The bottom and the top of the box are the 25th and 75th percentiles, and the line inside the box is the 50th percentile (median). Whiskers from minimum to maximum are determined with a 95% confidence interval.

Box plots of the ethnic makeup by proportion in all databases.

This figure combines all the databases used in the four publicly available pulse oximeter databases that clearly reported the distribution of ethnic groups. The data supports the hypothesis that disparities exist here. Significant differences are evident between white and Black (p < 0.0001), white and Asian (p < 0.0001), and Black and Asian populations (p = 0.021). All pairs of groups were tested by using a simultaneous pairwise Tukey test. The bottom and the top of the box are the 25th and 75th percentiles, and the line inside the box is the 50th percentile (median). Whiskers from minimum to maximum are determined with a 95% confidence interval. We tested the statistical significance among all the subjects in the four databases considering a p-value <0.05 as statistically significant and analyzed the variance using a one-way ANOVA followed by post hoc test to provide simultaneous two-way interactions using the Tukey’s honest significant difference criterion. The results indicated that there was a significant difference between the mean distributions of all racial groups; Asian and Black (p = 0.021), Asian and white (p = 4.10 × 10−14), and Black and white (p = 5.01 × 10−13). The same trend was observed between Other and Asian (p = 9.43 × 10−05), Other and Black (p = 0.026), and Other and white (p = 4.82 × 10−12). The results also suggested a higher proportion of white subjects compared to Asian, Black and other populations. These results demonstrate the existence of clear disparities in these key databases. Detailed results on the statistical separability tests for all pairs of demographic groups are provided in Table 2.
Table 2

Tukey simultaneous tests for differences of means.

Difference of levelsDifference of meansSE of difference95% CIT-valueAdjusted P-value
Asian-black6.772.00(0.84, 12.71)3.390.02
Asian-other13.782.00(7.84, 19.71)6.89<0.0001
Asian-white69.382.00(63.44, 75.31)34.69<0.0001
Black-other7.002.00(1.06, 12.94)3.500.02
Black-white62.602.00(56.66, 68.54)31.30<0.0001
Other-white−55.602.00(−61.54, −49.66)−27.80<0.0001
Tukey simultaneous tests for differences of means. In the remaining databases in which ethnicity was not explicitly stated, the ethnic disparity is not known. However, if we examine the demographic statistics of each data set, based on location, we see that significant potential for disparity exists. For example, the Vortal dataset was collected in the UK in 2016, and the authors did not provide the race of each participant. Based on government records, we can infer the ethnic distributions based on UK ethnicity statistics: 7.5% Asian, 3.4% Black, 0.1% Other, and 80.0% white. The same method to infer ethnicity was used for the remaining databases, as shown in Table 1. Furthermore, since the racial groups were not clearly defined, it does suggest a lax approach to the matter of constructing reference databases, mainly when applied to vascular optical measurement technology that can be influenced by skin color characteristics. White subjects appeared in all four MIMIC databases where the ethnicity was clearly stated, constituting an average of 73.19% of the total population. However, Black subjects only accounted for an average of 9.29% of the sample population. In addition, Asian subjects comprised an average of 2.67% of the total population investigated. Such distributions highlight the potential for racial and ethnic biases in algorithms and devices, leading to possible challenges in their wider application in medicine. Our findings highlight clear disparities in pulse oximetry databases. As these biased databases would be used during the premarket phase to adjust pulse oximeter accuracy and to develop algorithms for oxygen saturation determination, they place subjects with darker skin pigmentation at increased risk of unrecognized health conditions[3]. Such health inequalities necessitate the development of new pulse oximeter databases with more racially balanced populations. Our recommendation does not deny the value of exploiting existing biased databases; rather, it attempts to benefit from using these publicly available databases when testing developed algorithms, as well as aiming for more balanced populations in future databases. Asian and Black populations have low representation in existing databases and it would also be beneficial to create an increased number of databases from different geographical regions. Since last year, the US Food and Drug Administration has started to issue new guidelines to evaluate pre- and post-market pulse oximeters[3], and to increase awareness of racial and ethnic disparities that can affect the accuracy of pulse oximetry algorithms. As publicly-accessible databases are commonly used for the development of many biomedical algorithms and devices, our findings highlight the need to improve device algorithms and expand these databases to better represent a diversity of skin pigmentations regardless of the racial or ethnic group. Improving diversity in public databases would help improve the general accuracy of AI algorithms, especially for measurements that involve frequently life-threatening conditions such as COVID-19. Supplementary Data 1 Description of Additional Supplementary Files Peer Review File
  14 in total

1.  Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database.

Authors:  Mohammed Saeed; Mauricio Villarroel; Andrew T Reisner; Gari Clifford; Li-Wei Lehman; George Moody; Thomas Heldt; Tin H Kyaw; Benjamin Moody; Roger G Mark
Journal:  Crit Care Med       Date:  2011-05       Impact factor: 7.598

2.  The foundation of modern racial categories and implications for research on black/white disparities in health.

Authors:  Nina T Harawa; Chandra L Ford
Journal:  Ethn Dis       Date:  2009       Impact factor: 1.847

3.  A new conceptualization of ethnicity for social epidemiologic and health equity research.

Authors:  Chandra L Ford; Nina T Harawa
Journal:  Soc Sci Med       Date:  2010-04-29       Impact factor: 4.634

4.  University of Queensland vital signs dataset: development of an accessible repository of anesthesia patient monitoring data for research.

Authors:  David Liu; Matthias Görges; Simon A Jenkins
Journal:  Anesth Analg       Date:  2011-12-20       Impact factor: 5.108

Review 5.  A Wearable Tele-Health System towards Monitoring COVID-19 and Chronic Diseases.

Authors:  Wei Jiang; Sumit Majumder; Samarth Kumar; Sophini Subramaniam; Xiaohe Li; Ridha Khedri; Tapas Mondal; Mansour Abolghasemian; Imran Satia; M Jamal Deen
Journal:  IEEE Rev Biomed Eng       Date:  2022-01-20

6.  Deep PPG: Large-Scale Heart Rate Estimation with Convolutional Neural Networks.

Authors:  Attila Reiss; Ina Indlekofer; Philip Schmidt; Kristof Van Laerhoven
Journal:  Sensors (Basel)       Date:  2019-07-12       Impact factor: 3.576

Review 7.  Pulse Oximetry for Monitoring Patients with COVID-19 at Home. Potential Pitfalls and Practical Guidance.

Authors:  Andrew M Luks; Erik R Swenson
Journal:  Ann Am Thorac Soc       Date:  2020-09

8.  An assessment of algorithms to estimate respiratory rate from the electrocardiogram and photoplethysmogram.

Authors:  Peter H Charlton; Timothy Bonnici; Lionel Tarassenko; David A Clifton; Richard Beale; Peter J Watkinson
Journal:  Physiol Meas       Date:  2016-03-30       Impact factor: 2.833

9.  A new, short-recorded photoplethysmogram dataset for blood pressure monitoring in China.

Authors:  Yongbo Liang; Zhencheng Chen; Guiyong Liu; Mohamed Elgendi
Journal:  Sci Data       Date:  2018-02-27       Impact factor: 6.444

Review 10.  The use of photoplethysmography for assessing hypertension.

Authors:  Mohamed Elgendi; Richard Fletcher; Yongbo Liang; Newton Howard; Nigel H Lovell; Derek Abbott; Kenneth Lim; Rabab Ward
Journal:  NPJ Digit Med       Date:  2019-06-26
View more
  2 in total

1.  Assessment of Blood Pressure Using Only a Smartphone and Machine Learning Techniques: A Systematic Review.

Authors:  Fridolin Haugg; Mohamed Elgendi; Carlo Menon
Journal:  Front Cardiovasc Med       Date:  2022-06-13

2.  Subject-Based Model for Reconstructing Arterial Blood Pressure from Photoplethysmogram.

Authors:  Qunfeng Tang; Zhencheng Chen; Rabab Ward; Carlo Menon; Mohamed Elgendi
Journal:  Bioengineering (Basel)       Date:  2022-08-18
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.