Literature DB >> 34466473

Trend of Gastric Cancer after Bayesian Correction of Misclassification Error in Neighboring Provinces of Iran.

Nastaran Hajizadeh1, Ahmad Reza Baghestani2, Mohamad Amin Pourhoseingholi3, Sara Ashtari1, Hadis Najafimehr3, Luca Busani4, Mohammad Reza Zali1.   

Abstract

BACKGROUND: Some errors may occur in the disease registry system. One of them is misclassification error in cancer registration. It occurs because some of the patients from deprived provinces travel to their adjacent provinces to receive better healthcare without mentioning their permanent residence. The aim of this study was to re-estimate the incidence of gastric cancer using the Bayesian correction for misclassification across Iranian provinces.
MATERIALS AND METHODS: Data of gastric cancer incidence were adapted from the Iranian national cancer registration reports from 2004 to 2008. Bayesian analysis was performed to estimate the misclassification rate with a beta prior distribution for misclassification parameter. Parameters of beta distribution were selected according to the expected coverage of new cancer cases in each medical university of the country.
RESULTS: There was a remarkable misclassification with reference to the registration of cancer cases across the provinces of the country. The average estimated misclassification rate was between 15% and 68%, and higher rates were estimated for more deprived provinces.
CONCLUSION: Misclassification error reduces the accuracy of the registry data, in turn causing underestimation and overestimation in the assessment of the risk of cancer in different areas. In conclusion, correcting the regional misclassification in cancer registry data is essential for discerning high-risk regions and making plans for cancer control and prevention. Copyright
© 2019, Galen Medical Journal.

Entities:  

Keywords:  Bayesian Analysis; Gastric Cancer; Incidence

Year:  2019        PMID: 34466473      PMCID: PMC8344079          DOI: 10.31661/gmj.v0i0.1223

Source DB:  PubMed          Journal:  Galen Med J        ISSN: 2322-2379


Introduction

Gastric cancer is the fourth most prevalent form of malignancy accounting for 8% of all new cases (989,600 diagnoses) [1]. It is the most prevalent form of cancer in men and the third most common form of cancer (after breast and colorectal cancers) in women in Iran [2]. Its incidence is approximately twice among men as compared to women [3], and over 70% of the cases occur in developing countries [1]. There is a large geographical dispersion in the incidence of gastric cancer on a global scale [3]. Furthermore, there is a large alteration in cancer incidence rate across populations at the lowest and highest risk of gastric cancer [4]. Cancer has been considered as one of the leading causes of death worldwide [5]. It makes population-based and accurate knowledge of cancer occurrence sorely precious to recognize trends and the risk factors that create those trends [6]. The cancer registry data are the main source of data on the burden of cancers by principled registering of the cancer incidence, prevalence, survival, and mortality [7]. Nowadays, their work has expanded into the assessment of cancer screening plans and interventions for cancer control. However, the deficiencies in the registering individuals’ information, including patient’s residence, the primary site of the tumor, date of diagnosis, and date of death [6], make the registered data inaccurate for use in future planning. Most patients prefer to get medical services in the capital of the country or at their neighboring provinces, which are equipped with better medical facilities. Because of the lack of adequate healthcare in their city [8], the patients prefer to register at their neighboring provinces. This is the cause of misclassification. The expected coverage rate of cancer is an indicator of misclassification error in registering cancer incidence, as the expected coverage is reported to be more than 100% in some medical universities and less than 100% in others [9]. Two approaches exist to refine misclassification. The first approach is validating a sample of data by rechecking medical records and expanding its results to the target population [10]. The second approach for correcting the misclassification error is by using the Bayesian method. In this method, the researcher takes prior evidence into account in the analysis [11] by determining prior distribution on the parameters [12]. This study aimed to inquire about the trend of gastric cancer after estimating the misclassification rate in the registry system using the Bayesian method and re-estimating the incidence rate in each province of Iran.

Materials and Methods

Gastric cancer incidence data from 2004 to 2008 were extracted from the National Cancer Registry (NCR) of Iran, which is published annually by the Ministry of Health (MoH) [9]. The NCR collects cancer incidence data by collaborating with medical universities of the country. Each medical university makes a dataset of new cases of cancer, which are certified by pathology centers. The new cases that are collected are entered into a software that is designed by the MoH. In this stage, duplicate cases are removed, and the remaining recorded cancer cases are coded according to the international coding of disease (10th revision). The MoH sends back the prepared dataset of cancer cases to medical universities. For each medical university, an expected coverage of new cancer cases is calculated, which has been set to 113 per 100,000 population covered by that university. Data were entered into the model in 2 vectors. The first vector contained the age-standardized rate (ASR) for males and females in 4 age groups for the province with less than 100% expected coverage, and the second vector contained the same data for a province with more than 100% of the expected coverage, which is in the neighborhood [13, 14]. Patients were divided into the following 4 groups: those aged 14 years, 15 to 49 years, 50 to 69 years, and more than 70 years. As vectors y1 and y2 contain count data, Poisson distribution was considered for them [15, 16]. For the misclassified parameter (θ), which is considered as the probability of recording data in the wrong group, an informative beta prior distribution was assumed. Hence, θ~beta(a,b)[17, 18]. Prior values for beta parameters (a and b) were selected based on the expected coverage of cancer cases in each province. Expectation of this distribution a/(a+b) converges to the misclassification rate. The misclassified parameter is not a known parameter; hence, a latent variable (U) was applied as the number of cases that in fact belonged to the first group but were wrongly assigned to the second group. A binomial distribution was assumed for the latent variable, that is, Ui | θ,y1,y2~Binomial(yi2,Pi), and Pi=(λi1θ)/(λi1θ+λi2), which is the probability of wrong classification in the second group. A sample size of 100,000 is produced from the posterior distribution Beta(∑i Ui+a,∑i yi1+b) by Gibbs sampling [19, 20, 21]. Misclassification rate was estimated by averaging the produced sample from the posterior distribution. Analyses were conducted using the R software version 3.2.0.

Results

The registered cases of gastric cancers from 2004 to 2008 in Iran were investigated. The ASR of gastric cancer for females increased from 6.42 per 100,000 populations (1439 persons) in 2004 to 10.00 per 100,000 (2243 persons) in 2008. Similarly, the ASR of gastric cancer for males increased from 7.03 per 100,000 populations (3770 persons) in 2004 to 19.16 per 100,000 (5165 persons) in 2008. The trend of gastric cancer incidence from 2004 to 2008 for both sexes is shown in Figure-1. Among 30 provinces of Iran, the data of 21 provinces were entered into the Bayesian model, two by two. Other nine provinces had a coverage of cancer cases that was almost equal to their expected number of cancer patients; hence, the rates of cancer in those provinces remained unchanged. As an example, the percentage of expected cases for Tehran (the capital of Iran), which is a high-facility province from the perspective of existence of equipped healthcare centers and professional doctors in the central part of the country, was 155.63% in 2008, whereas the Qom, Qazvin, and Markazi provinces that are adjacent to Tehran had just covered 53.9%, 66.3%, and 69.6% of their expected number of new cancer cases, respectively. Thus, Tehran has observed 55.63% more cases than its expected number, and Qom, Qazvin, and Markazi provinces observed fewer cancer cases than their expectation. Expected coverage rates for different provinces of Iran from 2004 to 2008 are based on NCR annuals [9]. After performing the Bayesian analysis, 37% misclassification was estimated between Tehran and Qom, 32% misclassification between Tehran and Qazvin, and 43% misclassification between Tehran and Markazi in 2008. Estimated misclassification rates in other provinces are presented in Table-1. The rate of gastric cancer in the study period, before and after the Bayesian correction of errors, is reported in Table-2.
Figure 1
Table 1

Estimated Rate of Misclassification Among Provinces Using Bayesian Method

Facilitate province Divested province Estimated rate of misclassification
2004 2005 2006 2007 2008
Razavi khorasanSouth khorasan-0.280.660.560.43
TehranMarkazi0.440.320.260.270.43
Razavi khorasanSistan0.730.630.670.650.72
TehranQom0.370.230.220.230.37
TehranGhazvin0.290.210.220.20.32
KhozestanIlam0.190.510.150.210.26
KhozestanBushehr0.360.540.440.40.54
MazandaranGolestan0.460.370.330.320.43
Razavi khorasanNorth khorasan-0.280.580.480.43
IsfahanChaharmahal0.290.180.210.190.08
IsfahanKohgilouye0.130.180.210.170.08
FarsHormozgan0.690.450.450.520.63
East azarbaijanArdebil0.110.120.110.280.21
East azarbaijanWest azarbaijan0.180.140.130.330.36
Table 2

Age-Standardized Rate of Gastric Cancer Before and After Bayesian Correction

Provinces Before After
2004 2005 2006 2007 2008 2004 2005 2006 2007 2008
South khorasan -8.383.104.538.66.16.117.6310.7117.66
Razavikhorasan 13.468.7115.1315.1217.2111.965.769.629.0711.61
Tehran 6.617.808.567.9416.504.966.607.536.9814.78
Markazi 7.146.378.918.038.7114.3911.0713.2811.8014.09
Sistan 1.981.962.833.112.567.286.4912.7513.829.95
Qom 8.159.2611.059.5810.9213.8213.2614.9213.1918.41
Ghazvin 11.5810.4610.8711.5613.1016.7413.8414.2214.7319.42
Khozestan 5.724.084.475.0210.804.341.593.183.588.02
Ilam 8.745.808.777.7911.7514.5716.2112.7611.7519.50
Bushehr 3.883.322.273.253.758.799.625.708.2411.84
Golestan 7.9910.4312.8712.3311.5615.2518.0420.1119.1019.20
Mazandaran 17.9116.1116.8115.4922.0514.7511.7812.8711.6617.90
North khorasan -8.234.286.269.00-15.7210.4112.9620.12
Chaharmahal 5.698.839.2912.4211.039.7512.7414.9618.2113.36
Isfahan 6.215.696.937.993.254.733.664.915.792.89
Kohgilouye 13.919.159.5213.8911.1621.4716.0316.4121.8714.72
Hormozgan 2.482.962.663.454.379.198.207.4310.5218.85
Fars 5.916.205.469.259.614.444.613.986.945.06
Ardebil 26.6118.4518.4918.5226.3331.2021.9221.2826.4935.11
East azarbaijan 10.357.497.1119.5219.356.324.243.9212.3511.51
West azarbaijan 16.3015.1615.4115.4412.9719.8817.7418.0621.6119.74

Discussion

There was a remarkable misclassification error with respect to the registration of gastric cancer among adjacent areas in Iran. Besides, there was an increase in gastric cancer incidence during the years considered in this study. This increase was higher in males than in females. Highest rates of estimated misclassifications belonged to more deprived provinces such as Sistan, Hormozgan, South Khorasan, North Khorasan, and Bushehr. Also, there was no significant reduction in misclassification rate during the years considered in this study. It indicates that still sufficient effort is not made to prepare healthcare facilities and improve the registration system in all provinces. The well-known risk factors for gastric cancer are Helicobacter pylori infections, family history of gastric cancer, and smoking. However, some populations with a high prevalence of H. pylori infection and low rates of gastric cancer show that other factors may also be important [3]. Also, the incidence rate among immigrants tends to be similar to those in the country to which they move rather than to those in their country of origin. It can be concluded that environmental factors play a large role in the incidence rates [22, 23]. Thus, it is anticipated that the incidence of cancer is similar in adjacent regions that are exposed to similar circumstances, but there are major differences in the incidence of gastric cancer, which can be justified by misclassification error in recording domicile of patients that causes overestimation or underestimation in the rate of cancer in neighboring areas. Acquiring knowledge about the diffusion of disease among different communities in different areas is an appropriate method for recognizing the factors that influence disease incidence [24] and quantifying the potentials for disease control and prevention [25]. However, usually, spatial analysis is performed based on registered data for finding the geographic pattern of disease and determining high-risk areas. In those types of studies, the existence of misclassification is often ignored. As a result, wrong estimates of risk are achieved in different regions.

Conclusion

Our study indicates that some misclassification exists in registering cancer incidence. As registered data are the basic source for health policymakers to identify high-risk areas that are in need of more healthcare facilities, misclassification error should be accounted and corrected. Otherwise, it affects the need assessments to dedicate the facilities to the provinces and leads to the allocation of fewer facilities to the provinces that in fact are in need of more healthcare facilities. When valid data are not available, the Bayesian method is a fast and cost-effective way to account for and correct regional misclassification error.

Acknowledgment

This study was performed inGastroenterology and Liver Diseases Research Center of Shahid Beheshti University of Medical Sciences and supported by grant number 10127.

Conflict of interest

There is no any conflict of interest regarding the publication of this article. Trend of gastric cancer for two genders (2004 to 2008) in Iran
  17 in total

1.  Modelling risk when binary outcomes are subject to error.

Authors:  Pat McInturff; Wesley O Johnson; David Cowling; Ian A Gardner
Journal:  Stat Med       Date:  2004-04-15       Impact factor: 2.373

Review 2.  The evolution of the population-based cancer registry.

Authors:  Donald M Parkin
Journal:  Nat Rev Cancer       Date:  2006-08       Impact factor: 60.716

3.  A comparison of sensitivity-specificity imputation, direct imputation and fully Bayesian analysis to adjust for exposure misclassification when validation data are unavailable.

Authors:  Marine Corbin; Stephen Haslett; Neil Pearce; Milena Maule; Sander Greenland
Journal:  Int J Epidemiol       Date:  2017-06-01       Impact factor: 7.196

4.  Burden of hepatocellular carcinoma in Iran; Bayesian projection and trend analysis.

Authors:  Mohamad Amin Pourhoseingholi; Zeinab Fazeli; Mohammad Reza Zali; Seyed Moayed Alavian
Journal:  Asian Pac J Cancer Prev       Date:  2010

Review 5.  Review of cancer registration and cancer data in Iran, a historical prospect.

Authors:  Mohammad Ali Mohagheghi; Alireza Mosavi-Jarrahi
Journal:  Asian Pac J Cancer Prev       Date:  2010

6.  The global health burden of infection-associated cancers in the year 2002.

Authors:  Donald Maxwell Parkin
Journal:  Int J Cancer       Date:  2006-06-15       Impact factor: 7.396

7.  Bayesian analysis of risk factors for anovulation.

Authors:  Yan Liu; Wesley O Johnson; Ellen B Gold; Bill L Lasley
Journal:  Stat Med       Date:  2004-06-30       Impact factor: 2.373

Review 8.  Epidemiology of stomach cancer.

Authors:  Hermann Brenner; Dietrich Rothenbacher; Volker Arndt
Journal:  Methods Mol Biol       Date:  2009

9.  Trend of hepatocellular carcinoma incidence after Bayesian correction for misclassified data in Iranian provinces.

Authors:  Nastaran Hajizadeh; Ahmad Reza Baghestani; Mohamad Amin Pourhoseingholi; Sara Ashtari; Zeinab Fazeli; Mohsen Vahedi; Mohammad Reza Zali
Journal:  World J Hepatol       Date:  2017-05-28

10.  Bayesian adjustment of gastric cancer mortality rate in the presence of misclassification.

Authors:  Nastaran Hajizadeh; Mohamad Amin Pourhoseingholi; Ahmad Reza Baghestani; Alireza Abadi; Mohammad Reza Zali
Journal:  World J Gastrointest Oncol       Date:  2017-04-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.