Literature DB >> 32090164

Data on water, sanitation, and hygiene in six select metro cities of India.

Shashi Kala Saroj1, Srinivas Goli1,2, Md Juel Rana1,3, Bikramaditya K Choudhary1.   

Abstract

The purpose of this data article is to describe the data and provide the methodological notes on the construction of availability, accessibility, and overall Water, Sanitation and Hygiene (WASH) performance index using a set of thirteen indicators for six metro cities in India. It also presents the details on survey design and the nature of data collected on WASH indicators in India Human Development Survey for 2004-05 (IHDS-I) and 2011-12 (IHDS-II). The principal component analysis (PCA) procedure was used in the construction of the WASH indices. The IHDS is the only survey that provides comprehensive data on WASH indicators for six metro cities in India (Delhi, Mumbai, Kolkata, Chennai, Hyderabad, & Bangalore). The IHDS has been jointly conducted by researchers from the National Council of Applied Economic Research (NCAER), New Delhi and the University of Maryland, the United States of America (USA). The database is hosted in the public repository at the Inter-University Consortium for Political and Social Research (ICPSR) and the reference number for IHDS-I and IHDS-II are ICPSR 22626 and ICPSR 36151 respectively. The data are publicly available through ICPSR. Interpretation of the present data can be found in the research article titled "Availability, accessibility, and inequalities of water, sanitation, and hygiene (WASH) services in Indian metro cities" (Saroj et al., 2019) [9].
© 2020 Published by Elsevier Inc.

Entities:  

Keywords:  India; India human development survey; Metro cities; WASH

Year:  2020        PMID: 32090164      PMCID: PMC7025195          DOI: 10.1016/j.dib.2020.105268

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Tables The data used in this article are freely available in the public domain at the ICPSR. The data for two panels (2004–05) & (2011–12) are processed and uploaded with this article so that other researchers working in this area will not have to download and process the raw data again. IHDS is a nationally representative, a multi-topic repeat survey with a huge sample of 41,554 and 42,152 households, also for the first-time separately available for eight major states of India. The datasets provide guidance on the analytical methods to research, monitor, and evaluate the WASH program in India. The information on WASH for eight major cities are valuable for the policy and practice in academia and the government.

Data description

The datasets presented in this article was collected as a part of India Human Development Survey (IHDS)-I (2004–5) and IHDS-II (2011–12) [5,6] with a special representative sampling design adopted for six largest populated cities in India, namely, Mumbai, Delhi, Bangalore, Hyderabad, Chennai, and Kolkata (Table 1). They are coded as 1–6 in the variable “Metro6”. These metro cities are identified according to the census 2001, the definition of “urban agglomerations” [3,4]. These six cities also represent the major four geographical regions of India (Mumbai from the West, Delhi from the North, Kolkata from the East and Chennai, Bangalore and Hyderabad from the South). This article shows the data description and methodological note for preparing the WASH indices. Table 1 presents the descriptive statistics of the sample for selected six metro cities from the IHDS-I and IHDS-II. Table 2 describes the socio-economic characteristics of the households: educational level of the head of the household, economic status, occupation of the household's head; social and religious affiliation of the households among the six metro cities in both the rounds of IHDS. Table 3 describes the outcome variables, which were used in the WASH availability and accessibility index construction by the six metro cities for 2004–05 and 2011–12. About 13 indicators have been selected to create WASH availability and accessibility indices. Table 4 describes the Eigenvalues of the WASH availability and accessibility indices for 2004–05 and 2011–12. Similarly, Fig. 1 displays the scree plot of the Eigenvalues of the WASH availability components for the year 2004–05 and 2011–12; while Fig. 2 displays the scree plot of Eigenvalues for the WASH accessibility components, for both the year.
Table 1

Descriptive statistics of the data for six Metro cities.

Metro cities2004–05
2011–12
nPercentnPercent
Mumbai58514.1552413.39
Delhi132632.08126632.36
Kolkata111426.95107927.58
Chennai29107.0425906.62
Bangalore36008.7135108.97
Hyderabad45711.0643311.07
Total41331003912100
Table 2

Description of households’ background characteristics among the six Metro cities of India, 2004–05 and 2011–12.

Background CharacteristicsMumbai
Delhi
Kolkata
Chennai
Bangalore
Hyderabad
Total
2004–052011–122004–052011–122004–052011–122004–052011–122004–052011–122004–052011–122004–052011–12
Education level of the head of the household
Illiterate29.5727.8645.2751.9844.8351.8636.3143.5325.0062.3982.1980.2144.8351.22
Primary27.1826.5305.1004.2214.7515.0902.5503.8411.6705.9806.6804.5912.7812.04
Secondary36.5838.1739.6433.5330.5426.3255.8641.7953.6124.7908.7712.0034.6529.28
Higher06.6707.4409.9910.2809.8906.7305.2810.8409.726.8402.3503.1907.7407.46
Economic status
Poor13.1600.5711.2806.3316.4213.5404.7414.2703.3304.8407.2904.4911.6008.07
Non Poor86.8499.4388.7293.6783.5886.4695.2685.7396.6795.1692.7195.5188.4091.93
Occupation of the head of the household
Primary37.4459.5436.4438.9449.3747.5247.6027.1536.9439.6069.1768.8246.3648.57
Secondary11.7910.5014.3424.2212.5823.9921.7843.1114.1724.5008.8413.9913.3921.99
Tertiary43.0827.2935.1734.8627.7327.4719.0223.4238.8933.6216.4816.3830.2527.39
No Occupation7.6902.6714.0501.9810.3301.0211.6006.3210.0002.2805.5200.8110.0002.04
Social categories
General Hindu43.4248.2839.3332.4040.8140.545.2004.9327.7811.6812.3810.6432.5631.31
OBC27.0128.0521.3023.7012.6310.4844.8240.5740.0051.2853.2249.4427.5526.78
SC17.9515.4618.5324.5825.5027.2544.2546.2911.6720.5126.1229.3824.2026.19
ST01.0300.9500.4301.1103.0003.0200.0000.0000.8302.8500.4101.3001.3301.75
Muslims05.6403.6316.4915.3317.8718.5202.4203.6108.6107.6906.3508.2211.4411.76
Christians and Others04.9603.6303.9102.8800.1800.1903.3004.6011.1105.9801.5101.0202.9302.21
n585524132612661114107929125936035145743341333912
Table 3

Description of variables used in WASH Availability and Accessibility index in the six Metro cities of India, 2004–05 and 2011–12.

VariablesMumbai
Delhi
Kolkata
Chennai
Bangalore
Hyderabad
Total
2004–052011–122004–052011–122004–052011–122004–052011–122004–052011–122004–052011–122004–052011–12
WASH Availability index
Source of drinking water
(0) Open well, river, pond, truck, others01.0300.1904.8208.9900.4700.9404.6203.4001.1108.8303.7707.2202.3303.94
(1) Piped water, tube well, hand pump, covered well, rain and bottled water98.9799.8195.1891.0199.5399.0695.3896.6098.8991.1796.2392.7897.6796.06
Timing of water supply
(0) < 1 hour28.5556.8733.1540.4665.0761.1474.9252.5270.0030.2080.9352.9255.3852.51
(1) > 1 hour71.4543.1366.8559.5434.9338.8625.0847.4830.0069.8019.0747.0844.6247.49
Toilet facility available
(0) Open defecation54.8713.3617.3014.0333.6613.4536.3328.512.2205.9844.0740.8934.8118.19
(1) Traditional latrine, VIP latrine, flush toilet45.1386.6482.7085.9766.3486.5563.6771.4997.7894.0255.9359.1165.1981.81
Hand wash after defecation
(0) No00.1721.3700.6209.3200.3123.1500.2259.2806.1246.6901.0828.32
(0) Yes99.8378.6399.3890.6899.6976.8599.7840.7293.8853.3198.9271.68
Drinking water storage
(0) Unhygienic storage (without lid)01.5408.4014.2529.1613.3617.7401.3418.6049.7214.8125.8010.1019.28
(1) Hygienic storage (with lid)98.4691.6085.7570.8486.6482.2698.6681.4050.2885.1974.2089.9080.72
Housing space
(0) More than 3 persons per room28.7205.5330.4831.2115.8924.5319.9910.3618.0613.9620.6719.0922.3619.71
(1) 3 or less persons per room71.2894.4769.5268.7984.1175.4780.0189.6481.9486.0479.3380.9177.6480.29
Kitchen (Cooking place)
(0) Within living area34.5326.3428.9835.2135.6830.7635.4020.443.0606.2744.8441.2733.6729.84
(1) Separate from living area65.4773.6671.0264.7964.3269.2464.6079.5696.9493.7355.1658.7366.3370.16
Type of house
(0) Kutcha39.1514.1227.1815.1160.8852.9028.8421.8830.5622.2292.4447.9449.2333.01
(1) Pucca60.8585.8872.8284.8939.1247.1071.1678.1269.4477.787.5652.0650.7766.99
WASH Accessibility index
Time taken to access to water
(0) More than half an hour (>30 minutes)08.3806.8719.8247.4345.4179.0662.1555.4602.7834.4756.3539.9634.4449.50
(1) Less than half an hour (<30 minutes)91.6293.1380.1852.5754.5920.9437.8544.5497.2265.5343.6560.0465.5650.50
Location of toilet
(0) Open defecation01.8802.8609.8308.5116.4111.1730.0523.0201.9401.9919.7436.6013.5513.10
(1) Within dwelling, inside or outside the premises, public toilet98.1297.1490.1791.4983.5988.8369.9576.9898.0698.0180.2663.4086.4586.90
The material used for hand wash
(0) Ash, mud, water only06.3203.0510.4107.6638.2827.4358.5646.5602.7822.2255.6442.2129.5122.51
(1) Soap93.6896.9589.5992.3461.7272.5741.4453.4497.2277.7844.3657.7970.4977.49
Purification of drinking water
(0) Rarely, sometimes, never86.1575.0085.2579.3990.7991.9199.0689.8197.7878.6391.0972.2990.1982.70
(1) Always13.8525.0014.7520.619.218.090.9410.192.2221.378.9127.7109.8117.30
Method of Pouring water
(0) Cups, utensils, hand68.8964.8939.6158.6377.1082.4595.6097.7193.8959.2698.4393.4474.2776.02
(1) Long ladle or tap31.1135.1160.3941.3722.9017.5504.4002.296.1140.7401.5706.5625.7323.98
Type of fuel used in cooking
(0) Firewood, cow dung, crop residue, coal, or charcoal83.2561.4590.0692.6279.2575.5593.5786.1896.1180.0689.4878.9385.9378.04
(1) LPG and kerosene16.7538.5509.9407.3820.7524.4506.4313.8203.8919.9410.5221.0714.0721.96
n585524132612661114107929125936035145743341333912

Note: (0) represents the disadvantageous category, (1) represents the advantageous category.

(−) represents the lack of samples.

Table 4

Eigenvalues of the WASH indexes 2004–5 and 2011–12.

Components2004–05
2011–12
Eigen valuesDifferenceEigen valuesDifference
WASH availability
11.9950.7391.8470.641
21.2560.2581.2060.235
30.9980.0720.9710.078
40.9260.2160.8930.113
50.7100.1190.7800.106
60.5910.0680.7040.075
70.523.0.599.
WASH accessibility
11.5630.5351.6500.596
21.0280.0251.0540.097
31.0030.0740.9570.083
40.9290.0830.8740.083
50.8460.2150.7910.117
60.631.0.674.
Fig. 1

Scree plots of Eigenvalues of WASH availability index in 2004–05 and 2011-12 Source: IHDS (2004–05 & 2011–12).

Fig. 2

Scree plots of Eigenvalues of WASH accessibility index in 2004–05 and 2011-12 Source: IHDS (2004–05 & 2011–12).

Descriptive statistics of the data for six Metro cities. Description of households’ background characteristics among the six Metro cities of India, 2004–05 and 2011–12. Description of variables used in WASH Availability and Accessibility index in the six Metro cities of India, 2004–05 and 2011–12. Note: (0) represents the disadvantageous category, (1) represents the advantageous category. (−) represents the lack of samples. Eigenvalues of the WASH indexes 2004–5 and 2011–12. Scree plots of Eigenvalues of WASH availability index in 2004–05 and 2011-12 Source: IHDS (2004–05 & 2011–12). Scree plots of Eigenvalues of WASH accessibility index in 2004–05 and 2011-12 Source: IHDS (2004–05 & 2011–12).

Experimental design, materials, and methods

Data

Sample

Table 1 describe the sample size of the selected six metro cities of India. The IHDS longitudinal surveys collected the household data for the selected cities in 2004–05 [N = 4133], and 2011–12 [N = 3912] [3]. The share of the sample in 2004–05, was 32.1%, 26.9%, 14.2%, 11.1%, 08.7%, and 07.0% for Delhi [n = 1326], Kolkata [n = 1114], Mumbai [n = 585], Hyderabad [n = 457], Bangalore [n = 360], and Chennai [n = 291] respectively. Almost, same pattern is visualized in the 2011-12 samples of these six metro cities.

Background variables

The socio-economic characteristics of the households are described in detail in Table 2. The table shows that the percentage of households with non-poor economic status has increased from 88.4% to 91.9% from 2004–05 to 2011–12. In 2004–05, among the socio-religious groups, the majority of the population belongs to the Hindu General Castes (32.6%), followed by Other Backward Classes (OBC) (27.6%), Scheduled Caste (SC) (24.2%), Muslims (11.4%), and Christian population (02.9%). In 2011–12, except for the SC population, the percentage of all other socio-religious categories have decreased during 2004–05 to 2011–12. In 2004–05, 46.4% of the households heads involved in the primary occupation, while 30.25% of them work in the tertiary sector followed by 13.4% in secondary sector occupation. In 2011–12, there is a decline in the tertiary sector, while an increase in primary and secondary occupations. Mumbai, Delhi, and Bangalore have a greater share of households with high socio-economic status vice-versa the Kolkata, Hyderabad, and Chennai.

Outcome indicators (WASH availability and accessibility)

Table 3 presents the description of the variables which were used in the construction of WASH availability and accessibility indices. In 2004–05, among these six metro cities, Mumbai (98.9%), Kolkata (99.5%), and Bangalore (98.8%) has larger number of households with improved source of drinking water (piped, tube well, hand pump, covered well, rain and bottled water) than Delhi (95.2%), Chennai (95.3%), and Hyderabad (96.2%). Except for Mumbai and Chennai, other cites show the decline in the availability of improved sources of drinking water from 2004–05 to 2011–12. All city average of availability of improved sources of drinking water has declined from 97.7% in 2004–05 to 96.1% in 2011–12. However, more than one-hour water supply has slightly increased from 44.6% to 47.5% from 2004–05 to 2011–12, especially in Hyderabad (19.1%–47.1%), Chennai (25.1%–47.5%) and Bangalore (30.0%–69.8%). The availability of improved toilet facilities has increased from 65.2% to 81.8%, mainly in Mumbai (45.1%–86.6%), Kolkata (66.3%–86.6%), Chennai (63.7%–71.5%), Delhi (82.7%–85.9%), and Hyderabad (55.9%–59.1%). Similarly, the housing density (three or < 3 persons per room) has also improved from 77.6% to 80.3% from 2004–05 to 2011–12, in which Mumbai (71.3%–94.5%), Chennai (80.0%–89.6%), and Bangalore (81.9%–86.1%) have greater improvement than others. Similarly, the overall percentages of pucca houses have got better from 50.8% in 2004–05 to 66.9% in 2011–12, almost in all six cities. Furthermore, the separate kitchen from the living area has also increased from 66.3% to 70.2% for both the years, notably in Chennai (64.60%–79.56%), Mumbai (65.5%–73.7%), Kolkata (64.3%–69.2%), and Hyderabad (55.2%–58.7%), but Bangalore (96.9%–93.7%) and Delhi (71.0%–64.8%) have slight decline. In addition, the improved material used for hand wash (soap) is increased from 70.5% to 77.5%. Similarly, the percentage of households’ always using purified drinking water (09.8%–17.3%) and improved cooking fuel (LPG and Kerosene) (14.1%–21.9%) has increased between 2004–05 and 2011–12. Whereas, the safe drinking water storage (hygienic storage with lid) has reduced from 89.9% to 80.7% and 25.7%–23.9%, respectively, within the same period.

Statistical analysis

The WASH availability, accessibility, and overall performance have been estimated, separately, for both the period, 2004–05 and 2011–12. The indicators taken for the construction of the composite indices of the WASH availability, accessibility, and overall performance are presented in Table 3. The variables considered for the availability and accessibility indicators were transformed into a generalized linear form by creating dummy variables, which dichotomized as ‘0’ for disadvantageous and ‘1’ for the advantageous group. PCA has been used to estimate the score for availability and accessibility separately for both survey rounds. Scree plots from the PCA have been displayed in Fig. 1, Fig. 2. The Eigenvalues in the PCA of WASH availability and accessibility show the deviation between the components. The first two components (components 1 and 2) have Eigenvalue more than 1 and gives an ‘elbow’ like shape. The steep slope between the component 1 and 2 shows that the majority of the variables are explained by the first component, in both the WASH availability and accessibility indexes, for both the rounds [7]. Table 4 shows the values of the Eigenvalue of components, in which around 50% of the variance is explained by the first two dimensions in the component space. The overall performance of WASH is the average of availability and accessibility scores. The scores for availability, accessibility, and overall performance were ranked in ascending order and divided into three equal classes for both rounds of the survey which labeled as ‘poor’, ‘middle’ and ‘better-off’ WASH performance. The items used for the construction of both availability and accessibility were tested for their validity, reliability, and suitability. All analyses are carried out using the STATA software, version 13 (pca, predict p1, and xtile) (StataCorp) [10]. The WASH inequality had measured at the city level by using the Gini, Theil and Atkinson indices. The Gini coefficients provide city-wise households level inequality estimation. Along with the city level inequality estimates, the Theil and Atkinson indices allow estimating intra and inter-city inequalities. The Theil's index is the single parameters of the General Entropy class (GE a = 1); while the Atkinson index incorporates the social value judgment of the people about inequality in the society [1,2,8]. Association between the WASH and socioeconomic factors of the households was analyzed by using the order logit regression model as the WASH is the ordinal outcome variable [poor (1), middle (2) and better-off (3)]. Three separate regression models were carried out for availability, accessibility, and overall performance of WASH (see reference [9] for the results). The background characteristics are economic status, educational level, socio-religious groups and occupational status of the households is mentioned in Table 2.

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Specifications Tables

Subject AreaEconomics, Geography, Demography
Specific subject areaHousehold basic amenities (water, sanitation, & hygiene)
Type of DataTables and Figures
How data was acquiredThe unit-level data of the IHDS-I & II were allowed to download upon registration and permission from the ICPSR.
Data FormatThe unit-level raw data was recoded and analyzed in Stata and presented in the form of tables and figures https://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/22626; https://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/36151; https://data.mendeley.com/datasets/fjgkp5mcnj/1.
Parameters for data collectionIHDS data is a multi-topic panel survey of households in India. It covered the topics related to household characteristics, amenities, health, education, employment, economic status, demographic characteristics and gender relations, and social capital. It also surveyed separately the children's (8–11 years old) education quality (reading, writing and arithmetic tests). Additional information also collected at the village, school and medical facility levels. Further, information on the above-said indicators were also collected for the six metro cities [Delhi, Mumbai, Kolkata, Chennai, Bangalore, & Hyderabad] separately.
Description of data collectionIHDS data were collected in all states and union territories in India, except Andaman & Nicobar and Lakshadweep. It had surveyed household samples in IHDS-I round [N = 41,554] and re-interviewed 83% in IHDS-II round [42,152 households]. A stratified random sampling method was used to collect rural samples in 1503 villages and the probability proportional to size (PPS) method was used to collect urban samples in 971 blocks. Detail description of the data collection method has given in the following links: https://www.icpsr.umich.edu/icpsrweb/content/DSDR/idhs-data-guide.html & https://www.icpsr.umich.edu/icpsrweb/content/DSDR/idhs-II-data-guide.html
Data source locationNational Council of Applied Economic Research (NCAER), New Delhi and the University of Maryland, USA.
Data accessibilityThe raw data is located in the public repository of ICPSR: https://doi.org/10.3886/ICPSR22626.v12 for 2004–05 & ICPSR: https://doi.org/10.3886/ICPSR36151.v6 for 2011–12.
The data is also included in this research.
Related research articleShashi Kala Saroj, Srinivas Goli, Md. Juel Rana, and Bikramaditya K. Choudhary, Availability, Accessibility, and Inequalities of Water, Sanitation, and Hygiene (WASH) Services in Indian Metro Cities, Sustainable Cities and Society 2019, https://doi.org/10.1016/j.scs.2019.101878/In Press.
Value of the Data

The data used in this article are freely available in the public domain at the ICPSR.

The data for two panels (2004–05) & (2011–12) are processed and uploaded with this article so that other researchers working in this area will not have to download and process the raw data again.

IHDS is a nationally representative, a multi-topic repeat survey with a huge sample of 41,554 and 42,152 households, also for the first-time separately available for eight major states of India.

The datasets provide guidance on the analytical methods to research, monitor, and evaluate the WASH program in India.

The information on WASH for eight major cities are valuable for the policy and practice in academia and the government.

  1 in total

1.  The Scree Test and the Number of Factors: a Dynamic Graphics Approach.

Authors:  Rubén Daniel Ledesma; Pedro Valero-Mora; Guillermo Macbeth
Journal:  Span J Psychol       Date:  2015-03-17       Impact factor: 1.264

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.