Literature DB >> 33664570

Development and Assessment of Prediction Models for the Development of COPD in a Typical Rural Area in Northwest China.

Yide Wang1, Zheng Li1,2, Feng-Sen Li1,2.   

Abstract

OBJECTIVE: This study aimed to construct and evaluate a clinical predictive model for the development of COPD in northwest China's rural areas.
METHODS: A cross-sectional study of a natural population was performed in rural northwest China. After assessing demographic and disease characteristics, a clinical prediction model was developed. First, we used the least absolute shrinkage and selection operator regression model to screen possible factors influencing COPD. Then construct a logistic regression model and draw a nomogram. The discriminability of the model was further evaluated by the calibration diagram, C-index and ROC curve system. Clinical benefit was analyzed using the decision curve. Finally, the 1000 bootstrap resamples and Harrell's C-index was used for internal verification of the nomogram.
RESULTS: Among 3249 patients in the local rural natural population, 394 (12.13%) were diagnosed with COPD. The LASSO regression model was used to find the optimal combination of parameters, and the screened influencing factors included age, gender, barbeque, smoking, passive smoking, energy type, ventilation system and Post-Bronchodilator FEV1. These predictors are used to construct a nomogram. C index is 0.81 (95% confidence interval:0.79-0.83). The combination of the calibration curve and ROC curve indicates that the model has high discriminability. The decision curve shows benefits in clinical practice when the threshold probability is >6% and <58%, respectively. The internal verification results using Harrell's C-Index were 0.80 (95% confidence interval: 0.78-0.83).
CONCLUSION: Combining information such as age, sex, barbeque, smoking, passive smoking, type of energy, ventilation systems, and Post-Bronchodilator FEV1 can be easily used to predict the risk of COPD in local rural areas.
© 2021 Wang et al.

Entities:  

Keywords:  COPD; China; nomograms; predictive models

Mesh:

Year:  2021        PMID: 33664570      PMCID: PMC7924122          DOI: 10.2147/COPD.S297380

Source DB:  PubMed          Journal:  Int J Chron Obstruct Pulmon Dis        ISSN: 1176-9106


Introduction

Chronic obstructive pulmonary disease (COPD), as chronic progressive lung disease, is an important cause of the global increase in morbidity and mortality.1 The high incidence of COPD, which is expected to rise to the third leading cause of death in the world by 2030, will impose a heavy social and economic burden.2 It is important to note that most of the world’s COPD morbidity and mortality occurs in low - and middle-income countries, and there are significant urban-rural differences—the rural situation is more severe than the urban one. There is even a growing body of research confirming that rural residence is an independent and important risk factor for COPD.3,4 Prognosis varies widely in different stages of diagnosis of COPD, and high mortality is likely associated with inadequate early diagnosis of the disease. Prevention strategies for early detection and early diagnosis are important for identifying people at increased risk of COPD or those with early disease, and for implementing early interventions in that population as much as possible, thereby reducing the disease burden.5 A clinical prediction model based on multiple risk factors plays an important role in efficiently and economically identifying high-risk groups.6–9 In addition to common risk factors include smoking, secondhand smoke exposure, sex, body mass index, age, socioeconomic status, history of asthma, and respiratory infections. Besides, biomass fuel, coal exposure, poverty, and low levels of education in rural areas are also important contributors to the high incidence and mortality of COPD.10,11 At present, Our understanding of the risk factors for the disease is still very limited. Several clinical predictive models of COPD have been published, with problems of poor predictive ability and lack of model evaluation and validation.12 There is not even a predictive model based on rural areas of grimmer conditions. Therefore, in this study, we will develop and validate a risk prediction model for rural areas. Considering that Moyu County in northwest China is characterized by low urbanization level, prominent rural characteristics, small mobility of local personnel and high stability, this is taken as the investigation point.

Methods

Participants

To provide a scientific basis for accurate prevention of chronic diseases, China has carried out several natural population cohort studies and health follow-up studies in recent years. As one of the keys funded projects, “Natural Population Cohort Study in Northwest China” (SQ2017YFSF090013) aims to study the aetiology and disease burden of characteristic chronic diseases in Northwest China and analyze the relationship between chronic diseases and health in northwest China, including natural environment and behavioural patterns. “Study on cohort Construction and Health Follow-up of Xinjiang Multi-ethnic natural Population” (2017YFC0907203) is a sub-project of this project. Our study population is all from this project. We conducted a cross-sectional study of a natural population in rural northwestern China from June to December 2018. Inclusion criteria:1. The age above 35 is not limited to gender; 2. Participate voluntarily and sign the informed consent;3. Able to communicate normally without obvious cognitive impairment;4. Permanent residents;5. Effective lung function tests and complete data. The criteria for COPD were: people with persistent respiratory symptoms or a history of risk factors such as smoke, combined with lung function test – post-bronchodilator FEV1/FVC ratio<0.70. Ethics Committee of Xinjiang Uygur Autonomous Region Chinese Medicine Hospital approved this study (IRB ID:2018XE0108), and all study participants signed written informed consent. Our study also complied with the Declaration of Helsinki.

Predictors and Definitions

Trained professionals performed the acquisition of data, questionnaires and lung function tests. Considering the Global Initiative for Chronic Obstructive Lung Disease (GOLD)13 and practical operability, we included candidates that include: height (a medical height meter with an accuracy of 0.1cm requires removal of shoes and caps during measurement); Bodyweight (using an electronic medical scale with an accuracy of 0.1kg, taking off the coat, shoes and hat during measurement); Waist circumference (using a soft tape measure, accurate to 0.1cm, with the lower edge of the tape measure placed at the highest point on either side of the hip bone and then around the waist horizontally for one week); Gross annual income, yuan (< 10,000, ≥10,000); Smoking is defined as daily or almost daily smoking; Passive smoking was defined as more than two times of passive tobacco per week. A ventilation system is defined as the installation of an exhaust fan or a range hood; Barbeque was defined as eating barbeque every day. Lung function was detected using MasterScreen Pneumo version2.13.

Statistical Analysis

R software (version3.6.3; ) was used for data analysis and mapping. Continuous variables were expressed as mean ± SD for T-test. The classified data were described as frequencies and percentages, and the chi-square test was performed. First of all, in order to screen the possible influencing factors of COPD, we took the presence or absence of COPD as the dependent variable, investigated relevant indicators as the independent variable, and carried out Univariate Logistic Regression Analyses. Indicators with statistically significant differences were incorporated into the LASSO model to screen potential predictors.14–16 The above candidate factors were incorporated into the logistic regression model, and their correlation with COPD was analyzed. The statistical significance level was bilateral, and the nomogram was further constructed.17,18 Then, we used the calibration curve, calibration C index and ROC curve to evaluate the discriminant performance of the model, respectively.19–21 To assess the usefulness of this model in clinical practice, we used the decision curve for evaluation.22 Finally, the internal validation of the model is carried out by using the random data extraction and C-index method.20

Results

Patients’ Characteristics

A total of 3249 patients from Local rural areas had complete data for model derivation. The mean age was 52.8±9.4 years. The rates of smoking, passive smoking, coal and biofuel use and ventilation system use were 14.2%, 7.9%, 95.5% and 17.3%, respectively. Besides, education is a mainly illiterate, semi-literate and primary school. Finally, a total of 394 cases (12.13%) were diagnosed with COPD (). Compared with the control group, the COPD group had higher active smoking rate, passive smoking rate, daily barbeque consumption rate, furnace utilization rate and biofuel coal utilization rate.In addition, old age, male age, tuberculosis, lack of ventilation, emaciation, and low level of economic income may also be risk factors for COPD (Table 1).
Table 1

Univariate Logistic Regression Analyses of COPD Influencing Factors

VariablesOR95% CIP value
Age, year
 <401.000
 40–490.9380.626–1.4040.754
 50–591.8351.231–2.7360.003
 60–693.1502.073–4.787<0.001
 70–796.0002.602–13.835<0.001
Sex
 Male1.000
 Female1.5451.244–1.917<0.001
BMI (kg/m2)1.0731.044–1.103<0.001
Waist (cm)1.0091.000–1.0180.051
Education level
 Illiterate1.000
 Primary or Higher0.6720.508–0.8900.006
Marital status
 Married1.000
 Divorce and so on0.9490.706–1.2760.706
Gross annual income (yuan)
 <10,0001.000
 ≥10,0000.7370.594–0.9140.005
Type of medical insurance
 Rural cooperative medical care1.000
 Self-funded and so on1.0010.474–2.1110.999
Alcohol intake status
 Yes1.000
 No1.3450.847–2.1360.209
Tea consumption
 Yes1.000
 No1.0900.858–1.3850.479
Fresh fruits consumption
 Never or rarely1.000
 1–3times/month0.8580.301–2.4480.775
 1–3 times/week0.6440.255–1.6270.352
 4–6 times/week1.0920.826–1.4430.538
 Everyday0.8910.691–1.1480.372
Barbecue
 Yes1.000
 No0.7430.563–0.9810.036
Active smoking
 No1.000
 Yes1.4701.117–1.9350.006
Passive smoking
 No1.000
 Yes1.5511.101–2.1860.012
Main source of energy
 Biomass or coal1.000
 Electricity or gas0.3560.165–0.7670.008
Ventilation system
 No1.000
 Yes0.5690.410–0.7890.001
Stove
 Yes1.000
 No0.6050.382–0.9570.032
Tuberculosis
 No1.000
 Yes2.0771.235–3.4930.006
Physical exercise
 No1.000
 Yes1.2390.812–1.8900.319
Insomnia
 No1.000
 Yes0.5990.276–1.3000.195

Abbreviations: OR, odds ratio; CI, confidence interval; BMI, body mass index.

Univariate Logistic Regression Analyses of COPD Influencing Factors Abbreviations: OR, odds ratio; CI, confidence interval; BMI, body mass index.

Feature Selection

Among the many potential influencing factors, eight factors were screened out by the LASSO regression model. The above predictors are all non-zero coefficients in the regression model. These features include age, sex, barbeque, smoking, passive smoking, type of energy, ventilation systems, and Post-Bronchodilator (Figure 1).
Figure 1

Screening predictors using LASSO binary logistic regression model.

Screening predictors using LASSO binary logistic regression model.

Developing Nomogram

According to the LASSO Regression Model, the eight regression predictors screened were considered the best combination. We used these factors to construct logistic regression models. To visualize the model, we developed a nomogram (Figure 2).
Figure 2

A nomogram to predict the development of COPD.

A nomogram to predict the development of COPD.

Assessment of Nomogram

The calibration curve of the COPD prediction model is shown in Figure 3. C index was 0.81 (95% CI:0.79–0.83), combined with the ROC curve (Figure 4), the results showed that the model had high accuracy. We used the decision curve to analyze the nomogram’s clinical application. The results in Figure 5 show that using this model to predict COPD development is more beneficial when the threshold probability is >6% and <58%, respectively. Internally verified results show that the non-compliant nomogram C index is 0.80 (95% CI:0.78–0.83).
Figure 3

Calibration curve to predict the development of COPD.

Figure 4

ROC curve to predict the development of COPD.

Figure 5

Decision curve of prediction models for the development of COPD.

Calibration curve to predict the development of COPD. ROC curve to predict the development of COPD. Decision curve of prediction models for the development of COPD.

Discussion

To the best of our knowledge, this is the first based rural area COPD risk prediction model that compensates for the shortcomings of existing models. The prediction tool showed good resolution, calibration power and satisfactory clinical practicality. As the largest developing country globally, China ranks among the top three in COPD mortality rates.23 Especially in rural areas of China, the prevalence of COPD is 13.7%,4,24 much higher than in urban areas. As global aging trends continue to intensify, COPD has become a major public health challenge that has drawn attention in many countries. In the latest global initiative for the diagnosis, treatment, and Prevention of Chronic obstructive pulmonary disease, COPD has been upgraded from an irreversible airflow limitation to a reversible disease that has been treated earlier.25 Studies have shown that early diagnosis of COPD patients is of great significance for improving prognosis, reducing disease progression, and reducing the economic burden.26 As a practical tool, clinical prediction models can be stratified according to the population’s risk characteristics, facilitating early detection and diagnosis of COPD, and facilitating effective individualized treatment and management.27–30 Therefore, it is necessary to construct a practical COPD prediction tool in typical rural areas. This study shows that smoke exposure, including active and passive smoking, remains an important risk factor for COPD. Consistent with previously published predictive models,30–33 active smoking is an important predictor of our risk model. COPD is a result of the interaction between genes and the environment. Passive smoking, as an important environmental risk factor, is getting more and more attention.34 In this study, after screening by the LASSO regression model, passive smoking was also included in the prediction model. Age was a strong predictor of COPD in our model. The prevalence of COPD increased significantly with age compared to younger adults. However, it is not clear whether age-related aging contributes to COPD, or whether exposure to risk factors accumulates with age, leading to the disease. In addition, our study found a significantly higher prevalence in men than in women. It is consistent with most previous epidemiological studies.35,36 But some recent studies have shown the opposite. Recent studies in human pathology and animal studies have found that women have a greater burden of small-airway disease when exposed to the same amount of smoke.37–40 Surprisingly, of the four clinical prediction models for COPD published, only one study included lung function parameters.32 Several studies have suggested that low lung function predicts future COPD risk.41,42 The above evidence supports the use of lung function tests to predict the future risk of COPD in high-risk populations. The study also showed significant statistical differences in socioeconomic status factors such as education level and income level between the COPD and non-COPD populations. Combined with the research conclusions of Townend, J. et al, it is suggested that low socioeconomic levels will increase the risk of this disease.43 But it is not clear what factors of poverty are at work. However, our study shows that reducing Traditional fuel use, such as coal and biofuels, and increasing ventilation systems may significantly reduce COPD’s risk. Therefore, it is reasonable to speculate that the association between low socioeconomic level and increased risk of disease may reflect increased particle exposure caused by poverty, high biofuel utilization, lack of ventilation, etc. Compared with other rural areas, the local diet is quite special. Barbeque is a common food eaten by local people. We included daily and occasional barbeque consumption as binary variables in the prediction model. The cooking process for local barbeques, which often involves burning traditional biofuels such as wood and crop stalks or coal, can cause high indoor pollution levels. We do not yet know whether this association between barbeques and increased COPD risk is related to increased exposure to particles or other causes for the high frequency of barbeque consumption. The nomogram has good discrimination for COPD patients and other populations. This risk prediction tool can be used by health care professionals in clinical practice to provide opportunities for local high-risk populations in rural areas to correct possible risk factors and provide more cost-effective interventions. For example: promoting smoking cessation and tobacco control in public places; Reduce the use of coal and traditional bio-fuels such as wood, animal waste, crop stalks, etc. Add ventilation system and optimize barbeque cooking environment; A close look at people with low lung function. Simultaneously, this visual prediction tool can serve as a bridge between doctors and the general public: clinicians can scientifically disseminate more accessible, intuitive, and rational information about the prevention of COPD; Potential at-risk populations can refer to this model to avoid possible risk factors. Inevitably, there are also several limitations to our current study. First of all, we only carried out internal verification of the model’s robustness, without external verification by other groups. Therefore, this nomogram needs to be externally evaluated in a wider population to further assess its universal applicability. Second, although we have included as many potential influencing factors as possible, there may still be some characteristics that have not been analyzed due to the limitations of realistic conditions. Finally, our findings are based on a cross-sectional study of a natural population, so prospective studies are needed to verify this tool before being widely used clinically.

Conclusion

We developed and validated a new COPD prediction model for rural northwest China. The influencing factors of this nomogram are easy to obtain and have good discrimination and internal verification. Our clinical prediction model complements existing research. Of course, this nomogram’s general applicability still needs to be further verified by more external people. Finally, we believe this model will be an important step in establishing early screening and diagnostic criteria for COPD in rural areas.
  43 in total

1.  Passive smoking and chronic obstructive pulmonary disease mortality: findings from the Japan collaborative cohort study.

Authors:  Shigekazu Ukawa; Akiko Tamakoshi; Hiroshi Yatsuya; Kazumasa Yamagishi; Masahiko Ando; Hiroyasu Iso
Journal:  Int J Public Health       Date:  2017-02-17       Impact factor: 3.380

Review 2.  Nomograms in oncology: more than meets the eye.

Authors:  Vinod P Balachandran; Mithat Gonen; J Joshua Smith; Ronald P DeMatteo
Journal:  Lancet Oncol       Date:  2015-04       Impact factor: 41.316

3.  Sex differences in severe pulmonary emphysema.

Authors:  Fernando J Martinez; Jeffrey L Curtis; Frank Sciurba; Jeanette Mumford; Nicholas D Giardino; Gail Weinmann; Ella Kazerooni; Susan Murray; Gerard J Criner; Donald D Sin; James Hogg; Andrew L Ries; MeiLan Han; Alfred P Fishman; Barry Make; Eric A Hoffman; Zab Mohsenifar; Robert Wise
Journal:  Am J Respir Crit Care Med       Date:  2007-04-12       Impact factor: 21.405

4.  COPD phenotypes and machine learning cluster analysis: A systematic review and future research agenda.

Authors:  Vasilis Nikolaou; Sebastiano Massaro; Masoud Fakhimi; Lampros Stergioulas; David Price
Journal:  Respir Med       Date:  2020-07-28       Impact factor: 3.415

5.  Assessing the calibration of mortality benchmarks in critical care: The Hosmer-Lemeshow test revisited.

Authors:  Andrew A Kramer; Jack E Zimmerman
Journal:  Crit Care Med       Date:  2007-09       Impact factor: 7.598

6.  Association between Household Air Pollution Exposure and Chronic Obstructive Pulmonary Disease Outcomes in 13 Low- and Middle-Income Country Settings.

Authors:  Trishul Siddharthan; Matthew R Grigsby; Dina Goodman; Muhammad Chowdhury; Adolfo Rubinstein; Vilma Irazola; Laura Gutierrez; J Jaime Miranda; Antonio Bernabe-Ortiz; Dewan Alam; Bruce Kirenga; Rupert Jones; Frederick van Gemert; Robert A Wise; William Checkley
Journal:  Am J Respir Crit Care Med       Date:  2018-03-01       Impact factor: 30.528

7.  Prediction models for the development of COPD: a systematic review.

Authors:  Melanie C Matheson; Gayan Bowatte; Caroline J Lodge; Shyamali C Dharmage; Jennifer L Perret; Adrian J Lowe; Chamara V Senaratna; Graham L Hall; Nick de Klerk; Louise A Keogh; Christine F McDonald; Nilakshi T Waidyatillake; Peter D Sly; Deborah Jarvis; Michael J Abramson
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2018-06-14

8.  Impact of COPD diagnosis timing on clinical and economic outcomes: the ARCTIC observational cohort study.

Authors:  Kjell Larsson; Christer Janson; Björn Ställberg; Karin Lisspers; Petter Olsson; Konstantinos Kostikas; Jean-Bernard Gruenberger; Florian S Gutzwiller; Milica Uhde; Leif Jorgensen; Gunnar Johansson
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2019-05-13

9.  Continuing to Confront COPD International Patient Survey: methods, COPD prevalence, and disease burden in 2012-2013.

Authors:  Sarah H Landis; Hana Muellerova; David M Mannino; Ana M Menezes; MeiLan K Han; Thys van der Molen; Masakazu Ichinose; Zaurbek Aisanov; Yeon-Mok Oh; Kourtney J Davis
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2014-06-06

10.  Chronic respiratory diseases and risk factors in 12 regions of the Russian Federation.

Authors:  Alexander G Chuchalin; Nikolai Khaltaev; Nikolay S Antonov; Dmitry V Galkin; Leonid G Manakov; Paola Antonini; Michael Murphy; Alexander G Solodovnikov; Jean Bousquet; Marcelo H S Pereira; Irina V Demko
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2014-09-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.