Literature DB >> 28451063

Bayesian adjustment of gastric cancer mortality rate in the presence of misclassification.

Nastaran Hajizadeh1, Mohamad Amin Pourhoseingholi1, Ahmad Reza Baghestani1, Alireza Abadi1, Mohammad Reza Zali1.   

Abstract

AIM: To correct for misclassification error in registering causes of death in Iran death registry using Bayesian method.
METHODS: National death statistic from 2006 to 2010 for gastric cancer which reported annually by the Ministry of Health and Medical Education included in this study. To correct the rate of gastric cancer mortality with reassigning the deaths due to gastric cancer that registered as cancer without detail, a Bayesian method was implemented with Poisson count regression and beta prior for misclassified parameter, assuming 20% misclassification in registering causes of death in Iran.
RESULTS: Registered mortality due to gastric cancer from 2006 to 2010 was considered in this study. According to the Bayesian re-estimate, about 3%-7% of deaths due to gastric cancer have registered as cancer without mentioning details. It makes an undercount of gastric cancer mortality in Iranian population. The number and age standardized rate of gastric cancer death is estimated to be 5805 (10.17 per 100000 populations), 5862 (10.51 per 100000 populations), 5731 (10.23 per 100000 populations), 5946 (10.44 per 100000 populations), and 6002 (10.35 per 100000 populations), respectively for years 2006 to 2010.
CONCLUSION: There is an undercount in gastric cancer mortality in Iranian registered data that researchers and authorities should notice that in sequential estimations and policy making.

Entities:  

Keywords:  Bayesian method; Cause of death; Gastric cancer; Iran; Misclassification

Year:  2017        PMID: 28451063      PMCID: PMC5390301          DOI: 10.4251/wjgo.v9.i4.160

Source DB:  PubMed          Journal:  World J Gastrointest Oncol


Core tip: In some mortality cases, causes of deaths are registered as causes that cannot or should not be considered as the underlying causes of death like cancer without mentioning the type. These cases are not included in the estimations of cause specific mortality rates and leads to under-estimate health risks and burden of disease. The aim of this study is to correct the misclassification of gastric cancer deaths in cancer without label group using a Bayesian method.

INTRODUCTION

Cancer is one of the major health problems in the world and is the third cause of death (after cardiovascular disease and injuries) in Iran[1]. Gastric cancer is a disease in which the cells of the inner lining of the stomach start to divide abnormally and uncontrollably, that forming a mass called tumor[2]. Gastric cancer is the seventh cause of all deaths in Iran and is the first cause of cancer death in Iranian men and the second cause of cancer death (after breast cancer) in Iranian women[3]. The mortality of gastric cancer is high because this cancer does not show symptoms in early stages and diagnosed when the cancer is in its final stages[4]. Burden of disease is used to evaluate the health status of a country and determining priority of risk factors in order to setup cancer control programs. Cancer registry data are important to estimate the burden of disease, monitoring the screening programs effects, early diagnostics and other prognostic factors, and can be used to guide policy makers to appropriate cancer prevention programs. Among medical indices, mortality is a familiar projection to assess the burden of diseases. But achieving this aim requires a reliable death registry systems that reports death statistics accurately and completely[5-7]. In Iran, among four vital events (births, marriages, divorces and mortality) which were registered by the National Organization for Civil Registration (NOCR), mortality was the worst in quality. There was some progress in registering deaths but some problems like delayed registration and inaccurate recording of causes of death remained until 2002, that Ministry of health and medical education Deputy of Research and Technology, started up a system to record the causes of deaths. This system did not allow to delayed deaths registry, but the causes of death were susceptible to information bias due to misclassification[8]. Most high-income and many middle-income countries have a complete vital registration system in which the majority of deaths get a death certificate completed by a physician[9]. But still, a number of causes of death in the process of completing death certificates and the coding of underlying cause of death based on standardized international rules, remains challenging[10-13]. In some cases, especially in developing countries, the cause of death is recorded with error[14,15]. For example if a death due to gastric cancer being labeled as a death due to any other cause, the misclassification error in outcome is occurs. Misclassification error makes the registered data inaccurate and often leads to major problems like biased estimates of burden and health risks in epidemiological analysis[16,17]. According to the Iranian death registry, about 15% to 20% of death statistics are recorded in misclassified categories such as cardiopulmonary arrest, old age without dementia, septicemia, unknown, cancer without mention of details, and other ill-defined conditions. Murray and Lopez in 1996, for the first time, introduced the term “garbage coding” for assigning deaths to causes that are not useful for public health analysis of cause-of-death data[18-21]. In developing countries like Iran that registration is not completely accurate, statistical methods can be very helpful to overcome this problem. Two statistical approaches are recommended to deal with misclassification; first is using a small valid sample and extending the results to the population[22] and the second is Bayesian analysis which is a flexible method that makes the possibility of combining the prior information regarding the subset of the parameters with the observed data to achieve a posterior distribution which will be the basis of inferences to correct the statistics. Bayesian models also can easily accommodate unobserved variables such as an individual’s true information in the presence of Misclassification error[23]. The aim of this study is to use Bayesian method to estimate the rate of misclassification that occurs by registering cancer (with no label) as the cause of death instead of deaths that have occurred because of gastric cancer in Iran’s cancer registry system.

MATERIALS AND METHODS

Mortality rates due to gastric cancer and also cancer without label from 2006 to 2010 are extracted from Iranian annual of death statistics which reported annually by Iran’s Ministry of Health and Medical Education, in two sex groups (male and female) and four age groups (under 15 years - 15 to 49 years - 50 to 69 years - 70 years and more). To reassign deaths from garbage codes to valid causes, the approach can be divided into three steps: The first is identifying garbage codes. The second is identifying the target causes where the deaths assigned to a garbage code should in principle be reassigned to; for example if a death cause is registered as cancer and the type of cancer is not mentioned, we face with a garbage code that should be reassigned to a specific cancer. The third step is choosing the fraction of deaths that are assigned to the garbage code that should be reallocated to the target cause[13]. In this study we consider cancer without label as garbage code because cancer with no label is most likely to be registered as cause of death instead of a specific cancer like gastric cancer. The data were entered to the Bayesian model by two vectors y1 = [y11, y21,...,yr1] for gastric cancer and y2 = [y12, y22,...,yr2] for cancer without label. Both y1 and y2 are count data and follow the Poisson distribution. The subscript r shows the number of covariate patterns that is made by age and sex group combinations. θ is considered to be the probability of incorrectly register a mortality from gastric cancer as mortality due to cancer without label group. To perform Bayesian inference, an informative beta prior distribution was assumed for the misclassified parameter, i.e., θ ~ beta (a, b). The initial value for the parameter of beta distribution are taken to be a = 20 and b = 80, based on Iranian annual cancer registration reports. Since θ(misclassified parameter) is an unknown parameter, a latent variable approach was employed to simplify the full conditional models; considering Ui | θ, y1, y2 ~ Binomial (yi2, Pi) as the number of counts from the first group that are incorrectly labeled as being in the misclassified group that Pi = (λi1θ)/(λi1θ + λi2), finally the posterior distribution appears in the following form; θ | Ui, y1, y2 ~ Beta (∑iUi + a, ∑iyi + b). The misclassified parameter is estimated using a Gibbs sampling algorithm and averaging of the outcome. Analyses were done using R software version 3.2.0.

RESULTS

Mortality data consisting of all deaths due to gastric cancer from 2006 to 2010 were considered in this study. Age standardized rate (ASR) of gastric cancer mortality was 9.69 per 100000 populations in 2006, 10.2 per 100000 populations in 2007, 9.93 per 100000 populations in 2008, 9.76 per 100000 populations in 2009 and 9.67 per 100000 populations in 2010 respectively. According to the Bayesian estimation, in year 2006, there was between 3% to 7% misclassification in registering cause of death as cancer without mentioning details while the underlying cause of death has been gastric cancer. The estimated percent of misclassification based on implemented Bayesian method for year 2006 to 2010 is shown in Table 1. This percent were subtracted from deaths that had registered as cancer without mentioning details and added to the number of deaths due to gastric cancer. The age standardized rate per 100000 populations for gastric cancer was estimated to be 10.17 in 2006, 10.51 in 2007, 10.23 in 2008 10.44 in 2009 and 10.35 in 2010, after Bayesian correction respectively. The age standardizes rate of gastric cancer before and after Bayesian correction for 2006 to 2010 is visualized in Figure 1. The number of gastric cancer death before and after Bayesian correction of misclassification for years 2006 to 2010 is shown in Table 1 and its trend is shown in Figure 2.
Table 1

Misclassified parameter and the number of gastric cancer death before and after Bayesian correction and percent of increase in number of deaths after Bayesian correction in Iran’s death registry 2006-2010

YearMisclassified parameterBefore Bayesian correction
After Bayesian correction
MaleFemaleTotalMaleFemaleTotal
20065%388716425529408117245805
20073%400116905691412117405861
20083%391216525564402917025731
20097%390716505557418017665946
20107%394416655609422017826002
Figure 1

Age standardized rate of gastric cancer mortality in Iran from 2006 to 2010, before and after Bayesian correction of misclassification in causes of death. ASR: Age standardized rate.

Figure 2

Crude number of gastric cancer mortality in Iran from 2006 to 2010, before and after Bayesian correction of misclassification in causes of death.

Misclassified parameter and the number of gastric cancer death before and after Bayesian correction and percent of increase in number of deaths after Bayesian correction in Iran’s death registry 2006-2010 Age standardized rate of gastric cancer mortality in Iran from 2006 to 2010, before and after Bayesian correction of misclassification in causes of death. ASR: Age standardized rate. Crude number of gastric cancer mortality in Iran from 2006 to 2010, before and after Bayesian correction of misclassification in causes of death.

DISCUSSION

Iran’s death registry is subject to misclassification in reporting the underlying cause of death. About 3%-7% of deaths due to gastric cancer are registered as cancer without mentioning the type of cancer. After correcting misclassification error in death registry data, the number of deaths due to gastric cancer and its age standardized rate were increased. Gastric cancer crude mortality count in Iran had an increasing trend from year 2006 to 2010 except for 2008 that might be because of incompleteness of data; but the age standardized rate of gastric cancer was decreasing from year 2007 onward (except for 2008). About two-thirds of gastric cancer occurs in developing countries[24-27] and its rates are generally about twice as high in men as in women[28]. The age standardized rate (ASR) of gastric cancer incidence and mortality per 1000000 populations based on GLOBOCAN report 2012 is shown in Table 2. The rates show that the ASR of gastric cancer incidence (15.8 per 100000) and also the ASR of gastric cancer mortality (11.7 per 100000) is highest in Asia compared to other continents; It is moderate in Europe and South America and lowest in Northern America and most parts of Africa[3,28].
Table 2

Incidence and mortality age standardized rates per 100000 populations due to gastric cancer for some continents, reported by GLOBOCAN 2012

ContinentIncidence ASRMortality ASR
World12.18.9
Asia15.811.7
Europe9.46.9
South America10.38.5
North America4.02.1
Africa3.83.5

ASR: Age standardized rates.

Incidence and mortality age standardized rates per 100000 populations due to gastric cancer for some continents, reported by GLOBOCAN 2012 ASR: Age standardized rates. The age standardized rates of incidence and mortality per 100000 populations in different regions of Asia based on GLOBOCAN report 2012 are shown in Table 3. The incidence and mortality rates are also higher in Eastern Asia in comparison with other Asian regions. This region includes China, Japan and South Korea, that are three countries with the highest gastric cancer incidence and mortality rates[29]. Gastric cancer is the most frequently diagnosed form of cancer in Iran[30], with incidence rate 15.3 per 100000 and mortality rate 12.9 per 100000 populations based on GLOBOCAN report 2012[3]. A steady decline has been observed in gastric cancer incidence and mortality rates in the most of countries in Northern America and Europe since the middle of the 20th century[31,32]. In recent years similar decreasing trends have been noted in areas with high rates of gastric cancer history, including some countries in Asia (Japan, China, and South Korea), Latin America (Colombia and Ecuador), and Europe (Ukraine)[33]. This reduction maybe due to improved sanitation and antibiotics and consequently reduction in chronic H. pylori infection[34]. Although the age-adjusted rates have been decreased, it is estimated to have a substantial rise in the crude rates between the years 2000 to 2020 because of the increasing the size and age of the world population, especially in developing countries[35,36].
Table 3

Incidence and mortality age standardized rates per 100000 populations due to gastric cancer for different regions of Asia, reported by GLOBOCAN 2012

RegionIncidence ASRMortality ASR
Eastern Asia24.216.5
Western Asia9.58.1
South-Central Asia6.76.1
South-Eastern Asia6.05.3

ASR: Age standardized rates.

Incidence and mortality age standardized rates per 100000 populations due to gastric cancer for different regions of Asia, reported by GLOBOCAN 2012 ASR: Age standardized rates. Gastric cancer is a major health problem in the world, especially in Asia. So it is needed to make appropriate policy making for allocation of resources for gastric cancer control and prevention. To achieve this aim an accurate registry system is needed, while there are some misclassifications in registering causes of death especially in developing countries[14,15]. Misclassification of causes of death has been a concern in cancer trends analysis and researches on cancer epidemiology for decades[14]. Misclassification error leads to under-estimation of cause specific mortality rates and consequently under-estimation in burden of disease and influences the policy makings and health risk prioritizations[10-12,37]. In the study of Khosravi et al[38], validated data from hospital death was used to measure the impact of misclassification on rates of cardiovascular disease mortality. But they didn’t employ Bayesian method. Bayesian approach has received much attention to correct for misclassification in mortality data. Whittemore and Gong[39] used a Bayesian approach to estimate cervical cancer mortality rates and Sposto et al[40] developed maximum likelihood method for assessing the effect of diagnostic misclassification on non-cancer and cancer mortality in atomic-bomb survivors. Stamey et al[41] provided a Bayesian approach, which extends the models introduced by Whittemore and Gong[39] and Sposto et al[40]. They assume that the misclassification parameters are unknown. They used the prior information on the misclassification parameters instead of using valid data. They applied their Bayesian approach for estimating the number of deaths due to cancer and non-cancer after correcting for misclassification in registering causes of deaths among survivors of Hiroshima and Nagasaki after atomic bombings[41]. Pourhoseingholi et al[42] extended the models proposed by Stamey et al[41] to re-estimate the rates of cause specific deaths in cancer registry data after correcting for misclassification[25,42,43]. Based on his study on gastric cancer mortality in Iranian population from 1995 to 2004, there were between 30%-40% misclassification in recording deaths due to gastric cancer[44]. The current study reveals that the accuracy of death registration in Iran is getting better in recent years. In conclusion there is an undercount of gastric cancer mortality in Iranian registration system Because of misclassification error in registering causes of death. Although it seems that the misclassification rate has been reduced, it still exists as a major problem. So, policy makers who use mortality data to determine priorities for disease control and prevention, should notice to this underreported data and registration of causes of deaths should be done more accurately. Increase in data accuracy, requires more expert staffing, refining foundations, and powerful hardware and software resources[45]. In the absence of valid data, Bayesian approach is a good and flexible alternative to reduce the effects of Misclassification in registered cancer mortality data.

COMMETNS

Background

Mortality data registries are subject to misclassification; because some deaths assigned to causes that cannot considered as underlying death cause. For example if mortality due to a special cancer be registered as cancer without mentioning the type of cancer, misclassification error occurs. The aim of this study is to estimate the rate of misclassification in registering deaths due to gastric cancer in cancer without label group using a Bayesian method and re-estimate the rate of gastric cancer mortality in Iran.

Research frontiers

In Iran, death registries data is subject to misclassification. Reviewing the medical records or verbal autopsy as a practical solution for misclassification is time consuming. The hotspot of this study is using the Bayesian method for estimating the rate of misclassification in registering causes of death, which is rapid and cost-effective.

Innovations and breakthroughs

By using the Bayesian method, it is not needed to valid the data for estimating the rate of misclassification. Data validation is very costly and time consuming and in many cases it is not possible to obtain valid data. For implementing the Bayesian method only prior information about the misclassification rate is enough.

Applications

Since registered mortality data is used for health policy making and estimating the burden of disease, after correcting the misclassification in death registry system, more precise estimates of death rates and cause specific burden of disease will be achieved. Consequently there will be a better planning for disease control and prevention.

Terminology

Misclassification is lack of agreement between the observed value and the true value in categorical data. Bayesian method is one of the statistical approaches that assign a distribution or a probability to events or parameters based on previous experience or an expert’s idea and revise those probabilities and distributions after obtaining experimental data with applying Bayes’ theorem.

Peer-review

This is an interesting research.
  36 in total

1.  An age-period-cohort analysis of gastric cancer mortality from 1950 to 2007 in Europe.

Authors:  Matteo Malvezzi; Martina Bonifazi; Paola Bertuccio; Fabio Levi; Carlo La Vecchia; Adriano Decarli; Eva Negri
Journal:  Ann Epidemiol       Date:  2010-12       Impact factor: 3.797

2.  Counting the dead and what they died from: an assessment of the global status of cause of death data.

Authors:  Colin D Mathers; Doris Ma Fat; Mie Inoue; Chalapati Rao; Alan D Lopez
Journal:  Bull World Health Organ       Date:  2005-03-16       Impact factor: 9.408

3.  Impact of misclassification on measures of cardiovascular disease mortality in the Islamic Republic of Iran: a cross-sectional study.

Authors:  Ardeshir Khosravi; Chalapati Rao; Mohsen Naghavi; Richard Taylor; Nahid Jafari; Alan D Lopez
Journal:  Bull World Health Organ       Date:  2008-09       Impact factor: 9.408

4.  A prospective study of 1152 hospital autopsies: II. Analysis of inaccuracies in clinical diagnoses and their significance.

Authors:  H M Cameron; E McGoogan
Journal:  J Pathol       Date:  1981-04       Impact factor: 7.996

5.  The global health burden of infection-associated cancers in the year 2002.

Authors:  Donald Maxwell Parkin
Journal:  Int J Cancer       Date:  2006-06-15       Impact factor: 7.396

6.  Algorithms for enhancing public health utility of national causes-of-death data.

Authors:  Mohsen Naghavi; Susanna Makela; Kyle Foreman; Janaki O'Brien; Farshad Pourmalek; Rafael Lozano
Journal:  Popul Health Metr       Date:  2010-05-10

7.  Prognostic factors in gastric cancer using log-normal censored regression model.

Authors:  M A Pourhoseingholi; Bijan Moghimi-Dehkordi; Azadeh Safaee; Ebrahim Hajizadeh; Ali Solhpour; M R Zali
Journal:  Indian J Med Res       Date:  2009-03       Impact factor: 2.375

8.  Years of life lost (YLL) from cancer is an important measure of population burden--and should be considered when allocating research funds.

Authors:  N G Burnet; S J Jefferies; R J Benson; D P Hunt; F P Treasure
Journal:  Br J Cancer       Date:  2005-01-31       Impact factor: 7.640

Review 9.  Burden of gastrointestinal cancer in Asia; an overview.

Authors:  Mohamad Amin Pourhoseingholi; Mohsen Vahedi; Ahmad Reza Baghestani
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2015

Review 10.  Burden of gastrointestinal cancers and problem of the incomplete information; how to make up the data?

Authors:  Abdolhamid Sharifian; Mohamad Amin Pourhoseingholi; Ahmadreza Baghestani; Nastaran Hajizadeh; Sepideh Gholizadeh
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2016
View more
  2 in total

1.  Trend of Gastric Cancer after Bayesian Correction of Misclassification Error in Neighboring Provinces of Iran.

Authors:  Nastaran Hajizadeh; Ahmad Reza Baghestani; Mohamad Amin Pourhoseingholi; Sara Ashtari; Hadis Najafimehr; Luca Busani; Mohammad Reza Zali
Journal:  Galen Med J       Date:  2019-07-09

2.  Bayesian correction model for over-estimation and under-estimation of liver cancer incidence in Iranian neighboring provinces.

Authors:  Nastaran Hajizadeh; Ahmad Reza Baghestani; Mohamad Amin Pourhoseingholi; Hadis Najafimehr; Zeinab Fazeli; Luca Bosani
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2017
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.