Variation in COVID-19 Data Reporting Across India: 6 Months into the Pandemic.

Varun Vasudevan1, Abeynaya Gnanasekaran1, Varsha Sankar2, Siddarth A Vasudevan3, James Zou4.   

Abstract

India reported its first case of COVID-19 on January 30, 2020. Six months since then, COVID-19 continues to be a growing crisis in India with over 1.6 million reported cases. In this communication, we assess the quality of COVID-19 data reporting done by the state and union territory governments in India between July 12 and July 25, 2020. We compare our findings with those from an earlier assessment conducted in May 2020. We conclude that 6 months into the pandemic, the quality of COVID-19 data reporting across India continues to be highly disparate, which could hinder public health efforts. © Indian Institute of Science 2020.

Year:  2020        PMID: 33078049      PMCID: PMC7557231          DOI: 10.1007/s41745-020-00188-z

Source DB:  PubMed          Journal:  J Indian Inst Sci        ISSN: 0019-4964


Introduction

Two key components in containing the COVID-19 pandemic are public awareness and public trust in the government. These components critically depend on timely and accessible dissemination of COVID-19 data by the government1. While there are studies showing disparities in personal healthcare access in India, very little was known about the quality of access to public health data across India, especially during the early months of the COVID-19 pandemic2,3. To address this problem, we developed a semi-quantitative framework to assess the quality of COVID-19 data reporting, and used it to calculate a COVID-19 Data Reporting Score (CDRS) for 29 state and union territory (UT) governments of India4. This assessment was done during the 2-week period from May 19 to June 1, 2020. The study showed a strong disparity in the quality of COVID-19 data reporting across India: CDRS varied from 0.61 (good) to 0.0 (poor) across the country, with a median value of 0.26.

COVID-19: Coronavirus disease 2019 is an infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).

Pandemic: A pandemic is defined as an epidemic occurring worldwide, or over a very wide area, crossing international boundaries and usually affecting a large number of people.

In this communication, we present the findings from a second assessment of the quality of COVID-19 data reporting across India. This study was done during the 2-week period from July 12 to July 25, 2020, and includes 35 states1 and UTs of India. Hereafter, this 2-week period is referred to as the scoring period. Lakshadweep was excluded from the study as it did not have any COVID-19-positive cases as of July 12, 2020. Hereafter, the first assessment done during May is referred to as study-1 and the second assessment from July is referred to as study-2.

Methods

Our scoring framework consists of 45 indicators spanning four key dimensions of public health data reporting: availability, accessibility, granularity, and privacy4,5. These indicators capture the presence or absence of a piece of information in the reported data and the format in which it is reported. We would like to emphasize that our framework does not assess the “accuracy of the reported data.” In the availability dimension, we check the availability of basic data such as the daily and cumulative number of confirmed cases, deaths, and recoveries in the state5. To assess the accessibility of data, we check for the presence of trend graphics, availability of data in English, and the ease of getting to the web page where data are reported. Trend graphics are important because they make it easier to see patterns in the data. To evaluate the granularity of data, we check whether the state is reporting cumulative data stratified by age, gender, comorbidity, and districts. Granular data help a layperson connect with the data at a personal level. To assess if a state is ensuring privacy while reporting data, we check if any personally identifiable information of COVID-19 suspects or patients is made publicly available on the state’s COVID-19 data reporting page. The report items shown as column headers in Table 1 represent five possible stages in which an individual can find themselves during the pandemic.
Table 1:

CDRS scoring table. Each “Metric-Report Item” pair is an indicator. Overall there are 45 indicators. The scores that an indicator can take are listed in the table. NA denotes not applicable. This table is filled for each state by inspecting the COVID-19 data reported by that state

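| Dimension | Metric | Confirmed | Deaths | Recovered | Quarantine | ICU |
|---|---|---|---|---|---|---|
| Availability | Total (cumulative) | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| | Daily | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| | Historical daily data | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| Accessibility | Ease of access | 0, 1 | | | | |
| | Availability in English | 0, 1 | | | | |
| | Total trend graphics | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| | Daily trend graphics | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| Granularity | Total stratified by age | 0, 1 | 0, 1 | 0, 1 | NA | 0, 1 |
| | Total stratified by gender | 0, 1 | 0, 1 | 0, 1 | NA | 0, 1 |
| | Total stratified by comorbidity | 0, 1 | 0, 1, 2 | 0, 1 | NA | 0, 1 |
| | Total stratified by districts | 0, 1 | 0, 1 | 0, 1 | 0, 1 | 0, 1 |
| Privacy | Compromise in privacy | −1, 1 | | | | |

The five right-hand columns are the report items. Rows with a single entry (Ease of access, Availability in English, Compromise in privacy) carry one score per state rather than one per report item.
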
Trend graphics: This refers to the time-series line chart of a variable with date on the horizontal x-axis and the value of the variable on the vertical y-axis.

Each “Metric-Report Item” pair shown in Table 1 is an indicator. The entries in the table represent the possible scores an indicator can earn4. This table is filled for each state during the scoring period by checking the data reported by that state. For example, if a state is reporting total confirmed COVID-19 cases, then a score of 1 is assigned to that indicator. The scores recorded in the table are collectively referred to as the scoring data. Using the scoring data, four categorical scores, one for each dimension, and an overall score are calculated for each state. The categorical scores are obtained by summing the scores earned by the indicators in that dimension. The overall score is the normalized sum of the four categorical scores, and is referred to as the COVID-19 Data Reporting Score (CDRS). For further details on the scoring metrics, scoring process, and score calculation, refer to our article introducing the CDRS framework4.
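The scoring rules above can be illustrated with a minimal sketch that encodes the possible scores from Table 1, counts the indicators, and normalizes a categorical score by the difference between the maximum and minimum score possible in its dimension. The names `TABLE_1`, `per_item`, `score_range`, and `normalize` are hypothetical and chosen only for this illustration. The sketch assumes that no indicator is marked not applicable for the state being scored, and it does not attempt the overall CDRS normalization, whose details are given in the framework article.

```python
# Possible scores per indicator, transcribed from Table 1.
# None marks an "NA" cell. Metrics scored once per state use the key "State".
REPORT_ITEMS = ["Confirmed", "Deaths", "Recovered", "Quarantine", "ICU"]
BINARY = (0, 1)


def per_item(overrides=None):
    """Return {report item: allowed scores}, defaulting every item to (0, 1)."""
    cells = {item: BINARY for item in REPORT_ITEMS}
    cells.update(overrides or {})
    return cells


TABLE_1 = {
    "Availability": {
        "Total (cumulative)": per_item(),
        "Daily": per_item(),
        "Historical daily data": per_item(),
    },
    "Accessibility": {
        "Ease of access": {"State": BINARY},
        "Availability in English": {"State": BINARY},
        "Total trend graphics": per_item(),
        "Daily trend graphics": per_item(),
    },
    "Granularity": {
        "Total stratified by age": per_item({"Quarantine": None}),
        "Total stratified by gender": per_item({"Quarantine": None}),
        "Total stratified by comorbidity": per_item(
            {"Quarantine": None, "Deaths": (0, 1, 2)}
        ),
        "Total stratified by districts": per_item(),
    },
    "Privacy": {"Compromise in privacy": {"State": (-1, 1)}},
}


def indicators(dimension):
    """Allowed-score tuples of all applicable indicators in a dimension."""
    return [
        scores
        for metric in TABLE_1[dimension].values()
        for scores in metric.values()
        if scores is not None
    ]


def score_range(dimension):
    """Minimum and maximum categorical score attainable in a dimension."""
    cells = indicators(dimension)
    return sum(min(c) for c in cells), sum(max(c) for c in cells)


def normalize(dimension, raw_score):
    """Divide a raw categorical score by (max - min) for its dimension."""
    low, high = score_range(dimension)
    return raw_score / (high - low)


total_indicators = sum(len(indicators(d)) for d in TABLE_1)
print(total_indicators)                         # number of indicators
print({d: score_range(d) for d in TABLE_1})     # (min, max) per dimension
print(round(normalize("Privacy", -1), 2))       # a privacy compromise
```

Under these assumptions the indicator count comes to 45, matching the caption of Table 1, and a privacy score of −1 normalizes to −0.50.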

Results and Discussion

CDRS and the normalized categorical scores for the states in India are tabulated in Table 2. The categorical scores are normalized by the difference between the maximum and minimum score possible in that category. The spread of CDRS values across states indicates a strong disparity in the quality of COVID-19 data reporting in India. The five-number summary of CDRS is: minimum = 0.00, first quartile = 0.20, median = 0.30, third quartile = 0.35, and maximum = 0.63. The geographical disparity in CDRS is evident from the map2 shown in Fig. 1.
Table 2:

CDRS and the normalized categorical scores for the states in India. States are listed in the alphabetical order.

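| No. | State / Union Territory | Accessibility score | Availability score | Granularity score | Privacy score | CDRS |
|---|---|---|---|---|---|---|
| 1 | Andaman and Nicobar Islands | 0.17 | 0.27 | 0.00 | 0.50 | 0.17 |
| 2 | Andhra Pradesh | 0.08 | 0.60 | 0.17 | 0.50 | 0.30 |
| 3 | Arunachal Pradesh | 0.17 | 0.20 | 0.00 | 0.50 | 0.13 |
| 4 | Assam | 0.17 | 0.20 | 0.17 | 0.50 | 0.20 |
| 5 | Bihar | 0.00 | 0.00 | 0.00 | NA | 0.00 |
| 6 | Chandigarh | 0.42 | 0.27 | 0.00 | -0.50 | 0.20 |
| 7 | Chhattisgarh | 0.08 | 0.40 | 0.22 | 0.50 | 0.26 |
| 8 | Dadra and Nagar Haveli and Daman and Diu | 0.17 | 0.47 | 0.22 | 0.50 | 0.30 |
| 9 | Delhi | 0.17 | 0.67 | 0.00 | 0.50 | 0.28 |
| 10 | Goa | 0.17 | 0.60 | 0.06 | 0.50 | 0.28 |
| 11 | Gujarat | 0.08 | 0.60 | 0.17 | 0.50 | 0.30 |
| 12 | Haryana | 0.42 | 0.47 | 0.33 | 0.50 | 0.41 |
| 13 | Himachal Pradesh | 0.17 | 0.20 | 0.00 | 0.50 | 0.13 |
| 14 | Jammu and Kashmir | 0.08 | 0.47 | 0.17 | 0.50 | 0.26 |
| 15 | Jharkhand | 0.17 | 0.67 | 0.17 | 0.50 | 0.35 |
| 16 | Karnataka | 0.67 | 0.73 | 0.50 | 0.50 | 0.63 |
| 17 | Kerala | 0.75 | 0.67 | 0.33 | 0.50 | 0.57 |
| 18 | Ladakh | 0.42 | 0.73 | 0.22 | 0.50 | 0.46 |
| 19 | Madhya Pradesh | 0.08 | 0.60 | 0.17 | 0.50 | 0.30 |
| 20 | Maharashtra | 0.42 | 0.47 | 0.17 | 0.50 | 0.35 |
| 21 | Manipur | 0.17 | 0.33 | 0.00 | 0.50 | 0.19 |
| 22 | Meghalaya | 0.17 | 0.20 | 0.00 | 0.50 | 0.13 |
| 23 | Mizoram | 0.17 | 0.67 | 0.07 | 0.50 | 0.33 |
| 24 | Nagaland | 0.50 | 0.40 | 0.17 | 0.50 | 0.35 |
| 25 | Odisha | 0.67 | 0.60 | 0.28 | 0.50 | 0.50 |
| 26 | Puducherry | 0.67 | 0.67 | 0.22 | 0.50 | 0.50 |
| 27 | Punjab | 0.17 | 0.73 | 0.17 | -0.50 | 0.33 |
| 28 | Rajasthan | 0.17 | 0.27 | 0.11 | 0.50 | 0.20 |
| 29 | Sikkim | 0.17 | 0.47 | 0.00 | 0.50 | 0.24 |
| 30 | Tamil Nadu | 0.50 | 0.60 | 0.33 | 0.50 | 0.48 |
| 31 | Telangana | 0.25 | 0.67 | 0.00 | 0.50 | 0.30 |
| 32 | Tripura | 0.17 | 0.27 | 0.22 | 0.50 | 0.24 |
| 33 | Uttar Pradesh | 0.00 | 0.00 | 0.00 | NA | 0.00 |
| 34 | Uttarakhand | 0.17 | 0.47 | 0.22 | 0.50 | 0.30 |
| 35 | West Bengal | 0.17 | 0.67 | 0.22 | 0.50 | 0.37 |
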
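As a quick consistency check, the five-number summary quoted in the text can be recomputed from the CDRS column of Table 2. The sketch below is illustrative only: the name `five_number_summary` is hypothetical, and the quartile convention (the default method of Python's `statistics.quantiles`) is an assumption, since the article does not state which convention was used.

```python
import statistics

# CDRS values transcribed from Table 2, in the table's alphabetical state order.
CDRS = [
    0.17, 0.30, 0.13, 0.20, 0.00, 0.20, 0.26, 0.30, 0.28, 0.28,
    0.30, 0.41, 0.13, 0.26, 0.35, 0.63, 0.57, 0.46, 0.30, 0.35,
    0.19, 0.13, 0.33, 0.35, 0.50, 0.50, 0.33, 0.20, 0.24, 0.48,
    0.30, 0.24, 0.00, 0.30, 0.37,
]


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max), rounded to two decimals."""
    # Assumption: quartiles follow statistics.quantiles' default convention.
    q1, median, q3 = statistics.quantiles(values, n=4)
    return tuple(round(x, 2) for x in (min(values), q1, median, q3, max(values)))


summary = five_number_summary(CDRS)
print(summary)
```

With the values as transcribed, this yields 0.00, 0.20, 0.30, 0.35, and 0.63, in agreement with the summary reported above.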
Figure 1:

Filled map showing CDRS across India. The map represents the disparity in the quality of COVID-19 data reporting across India. Dark green (red) indicates states that have high (low) quality data reporting.

Figure 2 lists states in decreasing order of CDRS. As seen in the figure, Karnataka is at the top, and Bihar and Uttar Pradesh are at the bottom. Bihar and Uttar Pradesh get a CDRS of 0 because they do not release any COVID-19 data on their government or health department websites. Figure 2 also shows the incremental change in CDRS from its previous value calculated during study-1, conducted between May 19 and June 1, 2020. As seen in Fig. 2, CDRS has increased in 12 states and decreased in 5 states since the previous study. Figure 3 presents boxplots showing CDRS across India from study-1 and study-2. As seen in the figure, the median value has increased slightly from 0.26 to 0.30.
Figure 2:

Left: A dot plot showing the spread of CDRS values. States are sorted in the decreasing order of CDRS. Right: The incremental change in CDRS since study-1. Incremental change is not shown for states (marked by an *) that were excluded in study-1.

Figure 3:

Boxplots showing CDRS across India from the assessments conducted during May (study-1) and July 2020 (study-2). In the boxplot for July the outlier denotes Karnataka.

Figure 4 shows the number of states that get a non-zero score on an indicator in our framework. Among the 35 states assessed in this study, 33 states report some data on the COVID-19 situation in the state. Bihar and Uttar Pradesh continue to not publish any data on their government or health department website. The remaining 33 states report the total deaths and recovered cases, while only 32 of them report the total confirmed cases. Gujarat does not report the total confirmed cases but reports the number of active cases.
Figure 4:

Table shows the number of states that get a non-zero score on an indicator. For example, (1) total confirmed is 32 indicating that 32 states report total confirmed COVID-19 cases, (2) availability in English is 29 indicating that 29 states are reporting data in English. Privacy indicator is not shown in this table.

The CDRS of 12 states has improved in study-2 compared with study-1. Nine of the 12 states, namely Andhra Pradesh, Chhattisgarh, Goa, Haryana, Karnataka, Kerala, Ladakh, Uttarakhand, and West Bengal, have started reporting more granular data. This is encouraging and a step in the right direction. In general, the states continue to score the lowest in the granularity dimension. Jharkhand, which had the highest granularity score in study-1, has stopped reporting age- and gender-stratified data for the total confirmed cases, deaths, and recoveries since June 8, 2020. Hence, its normalized granularity score dropped from 0.50 to 0.17 in this study. It might be worthwhile to investigate what led the Jharkhand government to stop reporting age- and gender-stratified data.

Punjab and Chandigarh compromised the privacy of individuals under quarantine by releasing personally identifiable information on their official websites. Chandigarh releases the name and address of people under home quarantine on a daily basis. Punjab released the name, age, gender, and mobile number of persons inbound to the state from New Delhi on May 10, 20204. As of July 25, 2020, the document is still present on the Punjab government’s health department website.

Additional Comments

Testing: The strategy recommended by the Indian Council of Medical Research (ICMR) for COVID-19 testing in India has evolved over time6–8. The degree of relevance of testing data in understanding the spread of COVID-19 within a state depends on the testing strategy (e.g., how people are chosen for testing). Therefore, we did not include an indicator in our framework to score the reporting of testing data. However, we note that all the states in India report some data on testing, but the reported testing data in most states do not distinguish total samples tested from total persons tested. In other words, most states report total samples tested without specifying how many of them are unique. This is an important limitation of the data available to track testing in a state9. For instance, in the case of Tamil Nadu, which reports both total samples and total persons tested, the difference between those two numbers is more than a lakh (100,000) as of August 7, 202010.

Age brackets: Karnataka, Odisha, and Tamil Nadu report the total number of confirmed cases stratified by age. Karnataka and Kerala report the total number of deaths stratified by age. However, the number of age brackets used by each of these states is different, making it difficult to compare the age distribution of confirmed and deceased individuals across states. For example, Karnataka, Odisha, and Tamil Nadu use eight, four, and three age brackets, respectively, to report the total number of confirmed cases stratified by age.

Aarogya Setu mobile app: On April 2, 2020, the Indian government launched the Aarogya Setu mobile app with the objective of enabling Bluetooth-based contact tracing, mapping of likely hotspots, and dissemination of relevant COVID-19 information11. To use the app, one has to register with a mobile number, agree to its data sharing policy, and give it access to Bluetooth and location information. While access to phone number, Bluetooth, and location information might be necessary for contact tracing, we believe that expecting people to provide such information just to access critical COVID-19 data is unreasonable. Therefore, we did not consider data reported through the Aarogya Setu app while scoring the states. However, we would like to mention that the app reports cumulative and daily data for confirmed cases, deaths, and recoveries, both as text and trend graphics, for all states.

Data aggregation platforms: covid19india.org is a volunteer-driven nationwide COVID-19 data aggregation initiative that collects and reports COVID-19 data from across the country. While the initiative is noteworthy, it does not replace the need for high-quality data reporting on official government websites, for the following reason. The initiative can fill in gaps in the accessibility dimension described in our framework. However, it cannot fill the gaps along the availability and granularity dimensions that result from the lack of corresponding data released by the government.

Conclusion

Our assessment informs the public health efforts in India about the disparity in the quality of COVID-19 data reporting across the country. The available evidence shows that an improvement in the quality of data reporting is required all across India. The disparity in CDRS shows the lack of a unified framework for reporting COVID-19 data in India, and highlights the need for a national agency like the Indian Council of Medical Research (ICMR) to monitor or audit the quality of data reporting done by the states. The disparate reporting score also reflects inequality in individual access to public health information and privacy protection based on the state of residence4. Overall, there is an urgent need to fill the gaps in COVID-19 data reporting across the states. There has been only a marginal improvement in the quality of COVID-19 data reporting done by the states between May and July. With the pandemic being far from over, it is imperative that the states continue to learn from each other and improve their data reporting. We conclude this communication by quoting the following from the Economic Survey of India: “Given that sophisticated technologies already exist to protect privacy and share confidential information, governments can create data as a public good within the legal framework of data privacy. In the spirit of the Constitution of India, data should be ‘of the people, by the people, for the people’.”12

Sources for scoring data

| No. | State/Union Territory | Data reporting websites |
|---|---|---|
| 1 | Andaman and Nicobar Islands | https://dhs.andaman.gov.in/ |
| 2 | Andhra Pradesh | http://hmfw.ap.gov.in/covid_19_dailybulletins.aspx |
| | | http://hmfw.ap.gov.in/covid_dashboard.aspx |
| 3 | Arunachal Pradesh | http://covid19.itanagarsmartcity.in/ |
| 4 | Assam | https://covid19.assam.gov.in/ |
| 5 | Bihar | No sources |
| 6 | Chandigarh | http://chdcovid19.in/ |
| 7 | Chhattisgarh | http://cghealth.nic.in/ehealth/covid19/pages/index.html |
| 8 | Dadra and Nagar Haveli and Daman and Diu | https://dddcovid19.in/ |
| 9 | Delhi | https://delhifightscorona.in/ |
| | | http://web.delhi.gov.in/wps/wcm/connect/doit_health/Health/Home/Covid19/Bulletin+July+2020 |
| | | https://coronabeds.jantasamvad.org/ |
| 10 | Goa | https://www.goa.gov.in/covid-19/ |
| | | https://nhm.goa.gov.in/corona-virus-important-links-iec/ |
| 11 | Gujarat | https://gujcovid19.gujarat.gov.in/ |
| 12 | Haryana | http://www.nhmharyana.gov.in/page.aspx?id=208 |
| | | https://gisgmda.maps.arcgis.com/apps/dashboards/5cade394ece3496a9e0c4f168f9536a2 |
| 13 | Himachal Pradesh | http://www.nrhmhp.gov.in/ |
| 14 | Jammu and Kashmir | https://www.jkinfonews.com/index.aspx |
| 15 | Jharkhand | https://www.jharkhand.gov.in/Home/Covid19Dashboard |
| | | http://jrhms.jharkhand.gov.in/Press-Release.aspx |
| 16 | Karnataka | https://covid19.karnataka.gov.in/english |
| 17 | Kerala | https://dashboard.kerala.gov.in/index.php |
| 18 | Ladakh | http://covid.ladakh.gov.in/ |
| 19 | Madhya Pradesh | http://mphealthresponse.nhmmp.gov.in/covid/ |
| 20 | Maharashtra | https://www.covid19maharashtragov.in/mh-covid/dashboard |
| | | https://arogya.maharashtra.gov.in/1175/Novel–Corona-Virus |
| 21 | Manipur | http://nrhmmanipur.org/?page_id=621 |
| 22 | Meghalaya | http://meghalayaonline.gov.in/covid/login.htm |
| 23 | Mizoram | https://mcovid19.mizoram.gov.in/ |
| | | https://health.mizoram.gov.in/posts |
| | | https://dipr.mizoram.gov.in/posts |
| 24 | Nagaland | https://nagahealth.nagaland.gov.in/ |
| | | https://covid19.nagaland.gov.in/ |
| 25 | Odisha | https://statedashboard.odisha.gov.in/ |
| 26 | Puducherry | https://covid19dashboard.py.gov.in/ |
| | | https://covid19.py.gov.in/ |
| 27 | Punjab | https://dronamaps.com/corona.html#/ |
| | | http://pbhealth.gov.in/media-bulletin.htm |
| | | https://corona.punjab.gov.in/ |
| 28 | Rajasthan | http://www.rajswasthya.nic.in/ |
| 29 | Sikkim | https://covid19sikkim.org/ |
| 30 | Tamil Nadu | https://stopcorona.tn.gov.in/ |
| 31 | Telangana | https://covid19.telangana.gov.in/ |
| 32 | Tripura | https://tripura.gov.in/covid-test |
| | | https://covid19.tripura.gov.in/ |
| | | https://covid19.tripura.gov.in/Visitor/ViewStatus.aspx |
| 33 | Uttar Pradesh | No sources |
| 34 | Uttarakhand | http://health.uk.gov.in/pages/view/101-covid19-health-bulletin-for-uttarakhand |
| 35 | West Bengal | https://www.wbhealth.gov.in/ |