Literature DB >> 21031103

Age heaping and accuracy of age data collected during a community survey in the yavatmal district, maharashtra.

Geeta S Pardeshi1.   

Abstract

BACKGROUND: Age is an important variable in epidemiological studies and an invariable part of community-based study reports. AIMS: The aim was to assess the accuracy of age data collected during community surveys. SETTINGS AND
DESIGN: A cross-sectional study was designed in rural areas of the Yavatmal district.
MATERIALS AND METHODS: Age data were collected by a house-to-house survey in six villages. An open-ended questionnaire was used for data collection. STATISTICAL ANALYSIS: Age heaping and digit preference were measured by calculating Whipple's index and Myers' blended index. Age Ratio Scores (ARS) and Age Accuracy Index (AAI) were also calculated.
RESULTS: Whipple's index for the 10-year age range, i.e., those reporting age with terminal digit "0" was 386.71. Whipple's index for the 5-year range, i.e., those reporting age with terminal digit '0' or '5' was 382.74. Myer's blended index calculated for the study population was 41.99. AAI for the population studied was 14.71 with large differences between frequencies of males and females at certain ages.
CONCLUSION: The age data collected in the survey were of very poor quality. There was age heaping at ages with terminal digits '0' and '5', indicating a preference in reporting such ages and 42% of the population reported ages with an incorrect final digit. Innovative methods in data collection along with measuring and minimizing errors using statistical techniques should be used to ensure the accuracy of age data which can be checked using various indices.

Entities:  

Keywords:  Age accuracy index; Myer’s blended index; Whipple’s index; age heaping

Year:  2010        PMID: 21031103      PMCID: PMC2963876          DOI: 10.4103/0970-0218.69256

Source DB:  PubMed          Journal:  Indian J Community Med        ISSN: 0970-0218


Introduction

Age is an important study variable in demography and epidemiological studies. It is a socio-demographic variable related to the host in descriptive studies and also a commonly assessed risk factor in analytical studies. The accuracy of age data collected by house-tohouse surveys varies in different set-ups and depends on numerous factors. This is clearly indicated in studies which describe the age-related data of census from different countries.(12) Different set-ups have different social values attached to age. A variety of irregularities and misstatements have been noted with respect to age-related data.(3) Misstatement of age is a common example of content error in census and surveys. Of these irregularities, age heaping is a common phenomenon. Age data frequently display excess frequencies at round or attractive ages, such as even numbers and multiples of 5 leading to age heaping. Age heaping is considered to be a measure of data quality and consistency. This study describes age heaping and assesses the accuracy of age data collected during a community survey in the Yavatmal district of Maharashtra state.

Materials and Methods

Data collection

The age data were collected during a survey in six villages of the Yavatmal district in Maharashtra state in September 2006. The six villages were selected by simple random sampling using the lottery method. The final year students of a college of social work were selected and trained for data collection. A pretested questionnaire was used for data collection. A house-to-house survey was conducted in the selected villages. The age data were collected as per the information given orally by the respondents present in the house during the survey. The age of the persons not available during the survey was obtained from their family members who acted as proxy informants. No records, birth registers, or birth certificates were crosschecked to confirm the age data. A meeting with the interviewers was conducted to note their experiences during data collection. The investigator supervised data collection to observe the process of reporting age.

Measurement of age heaping and age accuracy

Age heaping and digit preference were measured by calculating Whipple’s index and Myers’ blended index. Age Ratio Scores (ARS) and Age Accuracy Index (AAI) were also calculated. Whipple’s index detects a preference for ages ending in 0, 5, or both. Whipple’s index is constructed for the age group of 23–62 years using the following formula: Whipple’s index varies from 0 to 500. A value of 0 indicates that digits ‘0’ and ‘5’ are not reported, 100 means there is no preference for ‘0’ or ‘5’, and a maximum of 500 is seen when only the digits ‘0’ and ‘5’ are reported in the age data. The inference about age distribution based on this index is as follows: <105 = highly accurate; 105–109.9 = fairly accurate; 110–124.9 = approximate; 125–174.9 = rough; ≥175 = very rough. Myer’s blended index is calculated for the age above 10 years and shows the excess or deficit of people in ages ending in any of the 10 digits expressed as percentages. It is based on the assumption that the population is equally distributed among the different ages. The steps in the calculation of Myers’ blended index are as follows: Sum of populations ending in each digit over the whole range starting with the lower limit of the range (e.g., 10, 20, 30, 40,….; 11, 21, 31,….) Ascertain sum excluding the first population combined in step 1 (e.g., 20, 30, 40,….; 21, 31, 41,….) Weight the sums in steps 1 and 2 and add the results to obtain a blended population (e.g., weights 1 and 9 for 0 digit, weights 2 and 8 for 1, etc.) Convert distribution in step 3 into percentages. Take the deviation of each percentage in step 4 from 10.0, which is the expected value for each percentage. A summary index of preference for all terminal digits is derived as one half of the sum of the deviations from 10.0%, each without regard to signs. ARS are calculated for age up to 74 years and are defined here as the ratio of the population in a given age group to one-third the sum of the population in that age group and in the preceding and following groups, multiplied by 100. The age ratio is expressed for a 5-year age group as follows: where 5Pa is the population in the given age group, 5Pa-5 is the population in the preceding age group, and 5Pa+5is the population in the following age group. In the absence of extreme fluctuations in the past vital events, the age ratios for all age groups should be about equal to 100. The sum of the deviations from 100 of the age ratios for males divided by number of age groups gives the mean deviation for males and the same procedure also gives the mean deviation for females. The average of the mean deviations of males and females is a measure of the overall accuracy of the age data, i.e., age accuracy index.

Results

The age data of a total of 4304 people in 823 households were collected during the survey. The total population in the age group ‘23–62’ was 2017. Among them, the population reporting age ending in ‘0’ was 780 and those reporting age with the terminal digit of ‘5’ were 764. Thus Whipple’s index for the 10-year age range, i.e., for those reporting age with terminal digit ‘0’, was 386.71. Whipple’s index for the 5-year range, i.e., for those reporting age with terminal digit ‘0’ or ‘5’, was 382.74. The total population in the age group ‘above 10 years’ was 3571. Table 1 describes the steps in the calculation of Myers blended index. The Myers blended index calculated for the study population was 41.99.
Table 1

Calculation of preferences indices for terminal digits by Myers’ blended method

Terminal digit, aPopulation with terminal digit
Weight for
Blended population
Starting at 10+aStarting at 20+aColumn 1Column 2Number (1*3+2*4)Percent distributionDeviation of percentage from 10 (6-10)
1234567
113878289002.8776-7.12
23121933722877.31-2.68
3187864612644.04-5.95
4152545510303.29-6.70
595285964914829.2419.24
6196857316275.20-4.79
7156748213964.46-5.53
82561319124357.784-2.21
994281009403.00-6.99
Total (irrespective of sign)357126023128110083.99
Summary index of age preference41.99
Calculation of preferences indices for terminal digits by Myers’ blended method Figure 1 describes the deviations of the percentage of the blended population from 10 along each of the terminal digits. The most preferred terminal digits while reporting age were ‘0’ and ‘5’ and most least mentioned were ‘1’, ‘9’, and ‘4’.
Figure 1

Myers index by terminal digit

Myers index by terminal digit The total population aged up to 74 years was 4197. Table 2 describes the calculation of the age ratios for males and females in this population. The age ratio for males was 13.06 and for females was 16.35. The AAI for the population studied was 14.71. Maximum positive deviations in males was observed in the age group of 65–69 years (26%) while in females it was at 60–64 years (44%). The maximum negative deviations were noted in the 55–59 years age group (33%) in males and in the 55–59 year age group (28%) in females.
Table 2

Age accuracy index for males and females

Age (years)Analysis of age ratio
Population
Male
Female
MaleFemaleRatioDeviation from 100RatioDeviation from 100
123456
<5161151
5–9207206102.312.31101.311.31
10–14239253103.313.31110.3110.32
15–19248229103.913.91107.847.85
20–24229155110.6310.6380.44-19.55
25–2914419485.21-14.79114.5614.57
30–3413415987.96-12.0487.36-12.64
35–39179193111.8811.88127.5327.53
40–44167102111.8311.8379.89-20.10
45–491028884.30-15.7098.14-1.86
50–549479116.0516.05105.805.80
55–59475766.82-33.1872.45-27.54
60–6470100107.697.69143.5443.54
65–697852126.4926.4980-20.00
70–743743
Total (irrespective of sign)169.80212.61
Mean13.0616.35
Age accuracy index for males and females Figure 2 describes the age ratios according to sex for the 5-year age group. The curve is not smooth but shows sharp jumps and clustering at certain ages indicating large differences between frequencies of populations in adjacent groups. A comparison of the curves for males and females indicates large differences between frequencies of males and females at certain ages. For example, in the age group of 20–24, the males show a positive deviation while in the case of females, a sharp dip is noted. A reverse phenomenon is seen in the age group of 25–29 years.
Figure 2

Age ratios by sex for five year age groups

Age ratios by sex for five year age groups When the interviewers introduced themselves and requested the household members to provide information regarding age, one of the family members usually volunteered to give the information. If he/she was not sure about someone’s age, it was cross-checked with other members. Some of the responses which indicate difficulties in data collection were as follows: Tumhich bagha ata majhe vay kiti asel te (You only decide my age) Majhe vay andaje liha (Write my approximate age) Mahit nahi (I do not know my age) Majhe vay pastis te chalis asel (My age must be around 35–40) Some wanted to know the objective of collecting data. Kasha sathi? Ya mule kay fayda ahe? (Why do you want to know my age? What benefit will I get?) In such situations a few hints were given by the interviewers such as age at marriage, duration of marriage, duration after marriage when the first child was born, and in the case of children, age was ascertained depending on the class of school in which the child was studying. Some of the interviewers stated that they made only rough estimates of the age in such cases.

Discussion

The methodology for data collection used in this study was similar to the method used in nearly all communitybased studies. Considering the three indices studied, the quality of age data collected in the survey can be inferred to be of very poor quality. There was age heaping at ages with terminal digits ‘0’ and ‘5’, indicating a preference in reporting such ages. The accuracy of age data should be assessed using various indices in studies in which age is an important variable. Whipple’s index is considered to be a fair measure of general reliability of age distribution. The ages of early childhood and old age are excluded from the formula because they are more frequently influenced by other types of errors and issues than digit preference. Whipple’s index of more than 175 indicates that age distribution is very rough with age heaping at ages with terminal digits ‘0’ and ‘5’.(4) Myers’ blended index for the study group indicates that a minimum of 40% of the population reported ages with an incorrect final digit. No cut-off values for AAI have been described. Among the 42 countries for which AAI was calculated in a study, a majority had AAI less than 10 except for two countries, namely, UAE (United Arab Emirates) and Russia in which AAI was more than 14, and were categorized as high AAI.(5) Lower the value of AAI, the more accurate the age data. In this study, AAI brings out irregularities other than age heaping at terminal digits ‘0’ and ‘5’. These include differences in the frequency of population in adjacent age groups and in males and females in the age groups studied. The approximation of age awareness manifests itself in the phenomenon of age heaping in self-reported or proxy age data. Individuals lacking knowledge of their age rarely state this openly, but choose instead a figure they think plausible. They do not choose randomly but have a systematic tendency to prefer attractive numbers such as those ending in ‘0’ and ‘5’ or even numbers or in some societies, numbers with other specific terminal digits. Age heaping indicates ignorance of one’s own age or a tendency to round ages. Age awareness is quite low and many have only a vague idea about their age. In cases where age is reported by proxy respondents, the response is more likely to be an approximation or a guess.(6) The role of the respondents and interviewers leading to age heaping has not been differentiated in this study. Age heaping has been noted in studies which have analyzed age data in census, Demographic and Health Survey (DHS), and National Family Health Survey (NFHS) in India.(178) A number of determinants of age heaping such as literacy, household size, degree of interaction with administration, use of calendars, astrology, etc. have been studied.(89)A strong and statistically significant association has been found between age heaping and illiteracy and age heaping has been used as an indicator of human capital.(10) The impact of such misreporting can lead to misclassification bias and wrong assessment of demographic rates and interfere with planning effective interventions. The official records such as birth certificates and school certificates can be a valid source of information regarding age but such records may not be available in many households. NFHS-3 survey in Maharashtra has reported that among children under 5 years of age, 80% births were registered; but in 35% children, births were registered but their birth certificates were not available.(11) Other methods of data collection which ensure the accuracy of age data need to be evolved. In a study, a local time path calendar was used in which the interviewer took the respondent back in time using the local calendar and the memory of respondents was triggered by relating events to Indian festivals and other landmarks in the lives of people, enabling them to reply in their own time perspective.(12) The findings indicated significantly less heaping in the durations of postpartum amenorrhea, breastfeeding, postpartum abstinence, and contraceptive use. The quality of age data is important because age sex distribution is not only an invariable part of a survey report but the bias introduced in studies can lead to wrong inferences. Innovative methods in data collection along with measuring and minimizing errors using statistical techniques should be used to ensure accuracy of age data.
  1 in total

1.  Quality of age data in patients from developing countries.

Authors:  Srdjan Denic; Falah Khatib; Hussein Saadi
Journal:  J Public Health (Oxf)       Date:  2004-06       Impact factor: 2.341

  1 in total
  16 in total

1.  Measurement of smoking behavior: Comparison of self-reports, returned cigarette butts, and toxicant levels.

Authors:  Melissa D Blank; Alison B Breland; Paul T Enlow; Christina Duncan; Aaron Metzger; Caroline O Cobb
Journal:  Exp Clin Psychopharmacol       Date:  2016-06-27       Impact factor: 3.157

2.  The reliability of calendar data for reporting contraceptive use: evidence from rural Bangladesh.

Authors:  Rebecca L Callahan; Stan Becker
Journal:  Stud Fam Plann       Date:  2012-09

3.  Population and fertility by age and sex for 195 countries and territories, 1950-2017: a systematic analysis for the Global Burden of Disease Study 2017.

Authors: 
Journal:  Lancet       Date:  2018-11-08       Impact factor: 79.321

4.  Measurement of cigarette smoking: Comparisons of global self-report, returned cigarette filters, and ecological momentary assessment.

Authors:  Jenny E Ozga; Colleen Bays; Ilana Haliwa; Nicholas J Felicione; Stuart G Ferguson; Geri Dino; Melissa D Blank
Journal:  Exp Clin Psychopharmacol       Date:  2021-02-25       Impact factor: 3.492

5.  Measurement and explanation of socioeconomic inequality in catastrophic health care expenditure: evidence from the rural areas of Shaanxi Province.

Authors:  Yongjian Xu; Jianmin Gao; Zhongliang Zhou; Qinxiang Xue; Jinjuan Yang; Hao Luo; Yanli Li; Sha Lai; Gang Chen
Journal:  BMC Health Serv Res       Date:  2015-07-03       Impact factor: 2.655

6.  Assessing the validity of respondents' reports of their partners' ages in a rural South African population-based cohort.

Authors:  Guy Harling; Frank Tanser; Tinofa Mutevedzi; Till Bärnighausen
Journal:  BMJ Open       Date:  2015-03-06       Impact factor: 2.692

7.  Trends in Demographic and Health Survey data quality: an analysis of age heaping over time in 34 countries in Sub Saharan Africa between 1987 and 2015.

Authors:  Mark Lyons-Amos; Tara Stones
Journal:  BMC Res Notes       Date:  2017-12-20

8.  Prevalences and trends of chronic diseases in Shaanxi Province, China: Evidence from representative cross-sectional surveys in 2003, 2008 and 2013.

Authors:  Sha Lai; Jianmin Gao; Zhongliang Zhou; Xiaowei Yang; Yongjian Xu; Zhiying Zhou; Gang Chen
Journal:  PLoS One       Date:  2018-08-23       Impact factor: 3.240

9.  Household characteristics for older adults and study background from SAGE Ghana Wave 1.

Authors:  Richard B Biritwum; George Mensah; Nadia Minicuci; Alfred E Yawson; Nirmala Naidoo; Somnath Chatterji; Paul Kowal
Journal:  Glob Health Action       Date:  2013-06-11       Impact factor: 2.640

10.  Exploring status and determinants of prenatal and postnatal visits in western China: in the background of the new health system reform.

Authors:  Xiaojing Fan; Zhongliang Zhou; Shaonong Dang; Yongjian Xu; Jianmin Gao; Zhiying Zhou; Min Su; Dan Wang; Gang Chen
Journal:  BMC Public Health       Date:  2017-07-20       Impact factor: 3.295

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.