| Literature DB >> 35559265 |
Xuening Wu1, Chengsheng Yin2,3, Xianqiu Chen2, Yuan Zhang2, Yiliang Su2, Jingyun Shi4, Dong Weng2, Xing Jiang2, Aihong Zhang5, Wenqiang Zhang1, Huiping Li2.
Abstract
Background: Idiopathic pulmonary fibrosis (IPF) needs a precise prediction method for its prognosis. This study took advantage of artificial intelligence (AI) deep learning to develop a new mortality risk prediction model for IPF patients.Entities:
Keywords: artificial intelligence (AI); deep learning; disease severity grade; idiopathic pulmonary fibrosis (IPF); pulmonary fibrosis stage; semantic segmentation
Year: 2022 PMID: 35559265 PMCID: PMC9086624 DOI: 10.3389/fphar.2022.878764
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.988
FIGURE 1Case screening process. In total, 232 cases were diagnosed as IPF according to the 2018 IPF diagnosis and treatment guidelines. A total of 26 patients were excluded, two patients were diagnosed as interstitial pneumonia with autoimmune features (IPAF) during follow-up; 24 patients had incomplete CT and lung function data. Finally, 206 cases were included in the retrospective analysis (including 16 cases of lung transplantation): 93 surviving cases, including 11 lung transplants; 81 deaths, out of which 10 died from lung cancer, 67 died from acute exacerbation of IPF, and 4 died after lung transplantation; and 32 patients failed to follow up, including one failed to follow up after lung transplantation.
Comparison of different pulmonary staging methods.
| Scoring method | Parameter | Advantage | Disadvantage | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gender | Age | FVC% | DLco% | Tlc% | FEV1% | Lung capacity (vtg) | HRCT | X-ray | PaO2 | SpO2% | Smoking | Clubbing finger | Extent of dyspnea | |||
| GAP | √ | √ | √ | √ | Simple | Lack HRCT and PaO2 data | ||||||||||
| CPI | √ | √ | √ | Can reflect combined emphysema | Lack HRCT and PaO2 data | |||||||||||
| CRP | √ | √ | √ | √ | √ | √ | √ | Require many parameters | Complex and lack HRCT and lung function data | |||||||
| JRS | √ | √ | Simple | Lack HRCT and PaO2 data | ||||||||||||
| Accessibility | Easy | Easy | Easy | Easy | Easy | Easy | Require a comprehensive device to measure lung function | Easy | Easy but images overlap | Require arterial blood | Easy | Difficult for quantification | Vary greatly in individuals | Require a complex scoring system and may be influenced by subjective bias | ||
| Importance | Y | Y | Y | Y | Affected by multiple factors | Correlate to airway disease | ? | Y | Y | Y | Y | ? | ? | Y | ||
| Parameters in our method | √ | √ | √ | √ | √ | √ | ||||||||||
Notes: Y: the parameter is important. ? the importance of the parameter is currently unknown. √: the parameter was included in the model of this study.
SpO2%: oxygen saturation of peripheral blood. SpO2 is the resting arterial oxygen saturation measured at fingertips. FVC: forced vital capacity. FVC%pred: the percentage of the actual FVC over the predicted FVC. FEV1: forced expiratory volume in one second. FEV1%pred: the percentage of the actual FEV1 over the predicted FEV1. DLco: diffusing capacity of the lung for carbon monoxide. DLco%pred: the percentage of the actual DLco over the predicted DLco. FEV1/FVC%: the percentage of FEV1 over FVC. GAP (gender, age, and physiologic variables) stage followed the recommendation by Brett Ley, and a higher stage represented a greater death risk. CPI: composite physiologic index. In 2002, Athol U. Wells and others proposed to use CPI, which combined chest CT and pulmonary functional parameters, to assess the severity of interstitial lung diseases (ILDs). A higher CPI represents a more severe ILD. CRP: clinical-radiographic-physiologic. Leslie C. Watters et al. published the CRP system in 1986. JRS: Ryo Okuda et al. proposed the IPF staging method in 2004. HRCT: high-resolution computed tomography.
Patients’ general clinical characteristics.
| Patient data (n-206) | Value |
|---|---|
| Median age years | 64.1 ± 7.9 |
| Male/female | 196/10 |
| Smokers/non-smokers | 150/56 |
| Survival time (months) | 28.7 ± 19.3 |
| SpO2% | 95.2 ± 3.5 |
| FVC%pred | 71.9 ± 20.1 |
| FEV1%pred | 75.1 ± 20.6 |
| DLco%pred | 52.0 ± 28.4 |
| FEV1/FVC% | 83.5 ± 7.8 |
| CT score values by AI | 14.1 ± 11.30 |
| CT score values by radiologists | 24.5 ± 13.8 |
| CT stage I/II/III | 56/114/36 |
| PF stage a/b/c | 95/80/31 |
| GAP stage I/II/III | 108/63/35 |
| CPI | 44.6 ± 21.0 |
Notes: Measurement data are presented as mean ± standard deviation (SD). Count data are presented as percentage or proportion.
SpO2%: oxygen saturation of peripheral blood. SpO2 is the resting arterial oxygen saturation measured at fingertips. FVC: forced vital capacity. FVC%pred: the percentage of the actual FVC over the predicted FVC. FEV1: forced expiratory volume in one second. FEV1%pred: the percentage of the actual FEV1 over the predicted FEV1. DLco: diffusing capacity of the lung for carbon monoxide. DLco%pred: the percentage of the actual DLco over the predicted DLco. FEV1/FVC%: the percentage of FEV1 over FVC. CT score values were calculated by AI according to the method in the article. CT-based stage: the stage was determined by using CT score values by AI following the criteria described in Table 3. PF-based grade: the grade was determined by using the pulmonary function and physiological parameters (age, gender, FVC%pred, DLco%pred, and SpO2%) and following the description in Table 3. The grade was defined as (a) mild, (b) moderate, and (c) severe. GAP (gender, age, and physiologic variables) stage followed the recommendation by Brett Ley, and a higher stage represented a greater death risk. CPI: composite physiologic index. In 2002, Athol U. Wells and others proposed to use CPI, which combined chest CT and pulmonary functional parameters, to assess the severity of interstitial lung diseases (ILDs). A higher CPI represents a more severe ILD.
Criteria for CT-based pulmonary fibrosis staging and PF-based severity grading (patent no: ZL 2019 1 0514972.5).
| PF Scoring criteria | SpO2% | FVC%pred | DLco%pred | Age (year) | Gender | Total severity score | Criteria for severity grading | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ≥95% | 90–94% | ≤89% | >75% | 50–75% | <50% | >55% | 36–55% | <36% | Cannot complete | ≤60 | 61–65 | >65 | M | F | 10 | a (mild) | b (moderate) | c (severe) | |||
| 0 | 1 | 2 | 0 | 1 | 2 | 0 | 1 | 2 | 3 | 0 | 1 | 2 | 1 | 0 | 0–3 | 4–6 | 7–10 | ||||
| CT-based staging criteria | I | Honeycomb lesion area is <5% of the entire lung | |||||||||||||||||||
| II | Honeycomb lesion area is 5–25% of the entire lung | ||||||||||||||||||||
| III | Honeycomb lesion area is >25% of the entire lung | ||||||||||||||||||||
| CTPF stage presentation example | Fibrosis stage/severity | Definition | |||||||||||||||||||
| II a | Fibrosis stage II and IPF severity grade a (mild) | ||||||||||||||||||||
Notes: SpO2%: oxygen saturation of peripheral blood. SpO2% is the resting arterial oxygen saturation measured at fingertips. FVC: forced vital capacity. FVC%pred: the percentage of the actual FVC over the predicted FVC. DLco: diffusing capacity of the lung for carbon monoxide. DLco%pred: the percentage of the actual DLco over the predicted DLco.
FIGURE 2Correlation of AI–CT fibrosis score and lung function parameters. (A) Correlation between CT-score and FVC%pred, Spearman correlation coefficient rs = -0.40, p < 0.01; (B) correlation between CT-score and DLco%pred, Spearman correlation coefficient rs = -0.66, p < 0.01; (C) correlation between CT-score and SpO2%, Spearman correlation coefficient rs = -0.44, p < 0.01; (D) correlation between CT-score and CPI, Spearman correlation coefficient rs = 0.65, p < 0.01; and (E) correlation between CT-score and manual-CT scores by radiologists, Spearman correlation coefficient rs = 0.80, p < 0.01.
FIGURE 3Analysis of CT stage and PF grading and mortality.(A) shows the relationship between CT staging and mortality risk based on Fine–Gray regression CT staging univariate analysis, which might be mixed with the influence of PF grade. (B) shows the same relationship in the multivariate analysis of CT staging and PF grading. Adjusted PF grading means the effect of PF grading was eliminated. The results showed that CT stage, with both PF grade adjusted and unadjusted, was positively correlated with mortality risk. (C) shows the relationship between PF grade and mortality risk based on Fine–Gray regression PF grade univariate analysis, which might be mixed with the influence of CT staging. (D) shows the same relationship between the multivariate analysis of CT staging and PF classification. CT staging adjusted means the effect of the CT stage was eliminated. The results show that the PF grade, with both CT staging adjusted and unadjusted, is positively correlated with mortality risk.
Patients’ clinical characteristics of the training set and validation set.
| Characteristic | Combined set | Training set | Validation set |
|
|---|---|---|---|---|
| No.[n (%)] | 206 (100) | 165 (80) | 41 (20) | |
| FVCpred [mean (SD)] | 71.91 (20.12) | 71.95 (20.45) | 71.74 (18.97) | 0.953 |
| Fibrosis rate [ | 11.34 (4.61,20.74) | 11.45 (4.86,20.28) | 9.62 (3.92,23.2) | 0.390 |
| Emphysema rate [median (Q1,Q3)] | 0.16 (0.02,1.23) | 0.18 (0.02,1.23) | 0.16 (0.01,1.05) | 0.931 |
| Age [median (Q1,Q3)] | 64.5 (60,69) | 65 (59,70) | 64 (60,68) | 0.542 |
| SaO2 [median (Q1,Q3)] | 96 (94.4,97.3) | 96 (94.2,97.1) | 96.8 (95.4,97.8) | 0.156 |
| FEV1pred [median (Q1,Q3)] | 73.75 (60.7,88.7) | 74.1 (60.8,87.6) | 69 (60.4,91.6) | 0.764 |
| DLCO pred [median (Q1,Q3)] | 51.6 (36.9,70.1) | 51.5 (36.9,69.6) | 51.7 (37.4,73.9) | 0.977 |
| Survivetime [median (Q1,Q3)] | 27 (13,40) | 26 (14,40) | 27 (10,38) | 0.441 |
| GAP stage I/II/III | 108/63/35 | 86/49/30 | 22/14/5 | 0.631 |
| PF grade a/b/c | 95/80/31 | 74/65/26 | 21/15/5 | 0.729 |
| CT stage I/II/III | 56/114/36 | 43/94/28 | 13/20/8 | 0.636 |
Notes: Measurement data are presented as mean ± standard deviation (SD). Count data are presented as percentage or proportion.
SpO2%: oxygen saturation of peripheral blood. SpO2 is the resting arterial oxygen saturation measured at fingertips. FVC: forced vital capacity. FVC%pred: the percentage of the actual FVC over the predicted FVC. FEV1: forced expiratory volume in one second. FEV1%pred: the percentage of the actual FEV1 over the predicted FEV1. DLco: diffusing capacity of the lung for carbon monoxide. DLco%pred: the percentage of the actual DLco over the predicted DLco. CT-based stage: the stage was determined by using the average score of the two radiologists and following the criteria described in Table 3. PF-based grade: the grade was determined by using the pulmonary function and physiological parameters (age, gender, FVC%pred, DLco%pred, and SpO2%) and following the description in Table 3. The grade was defined as (a) mild, (b) moderate, and (c) severe. GAP (gender, age, and physiologic variables) stage followed the recommendation by Brett Ley, and a higher stage represented a greater death risk.
Discrimination of different models in the training and validation cohort.
| Prediction time | Model | Training set | Validation set | ||
|---|---|---|---|---|---|
| AUC | Brier score | AUC | Brier score | ||
| 1 year | CTPF | 74.3 [63.2,85.4] | 8.6 [2.4,14.8] | 92.0 [83.4,100.0] | 8.1 [0.5,15.7] |
| CT | 66.4 [55.4,77.4] | 8.9 [2.5,15.4] | 86.2 [70.9,100.0] | 8.5 [0.5,16.5] | |
| PF | 71.8 [60.7,82.9] | 8.7 [2.4,15.1] | 84.6 [71.6,97.5] | 8.4 [0.4,16.4] | |
| GAP | 73.5 [63.0,84.0] | 8.7 [2.4,15.0] | 75.3 [60.0,90.5] | 8.7 [0.5,16.9] | |
| 2 years | CTPF | 78 [70.2,85.9] | 16.0 [10.1,22.0] | 75.0 [57.1,92.9] | 14.3 [6.6,22.1] |
| CT | 69.6 [61.6,77.5] | 17.5 [11.2,23.7] | 73.7 [58.0,89.4] | 14.3 [7.2,21.3] | |
| PF | 74.1 [65.4,82.7] | 16.5 [10.2,22.8] | 65.2 [43.8,86.6] | 15.2 [6.5,23.8] | |
| GAP | 71.2 [62.3,80.0] | 17.0 [10.9,23.1] | 62.3 [41.9,82.7] | 15.3 [7.0,23.5] | |
| 3 years | CTPF | 72.8 [58.3,87.3] | 18.2 [11.9,24.6] | 76.0 [56.8,95.2] | 17.6 [9.4,25.9] |
| CT | 64.2 [51.0,77.5] | 19.8 [13.2,26.5] | 65.2 [46.8,83.7] | 20.1 [12.1,28.1] | |
| PF | 70.6 [56.5,84.8] | 18.5 [12.0,25.0] | 72.5 [54.3,90.7] | 18.3 [9.7,26.9] | |
| GAP | 69.9 [55.6,84.3] | 18.7 [12.4,25.0] | 68.8 [50.8,86.8] | 19.0 [10.7,27.3] | |
Notes: CI: confidence interval. Model CT: CT-based stage was used in the univariate Fine–Gray death risk regression analysis. Model PF: PF-based grade was used in the univariate Fine–Gray death risk regression analysis. Model CTPF: CTPF comprehensive stage was used in the multivariate Fine–Gray death risk regression analysis. Model GAP: GAP stage proposed by Brett Ley was used in univariate Fine–Gray death risk regression analysis. AUC: area under curve. The AUC value reflects the model’s capability of discrimination. The higher the AUC value, the higher the model’s ability to identify the mortality risk. Brier score is an indicator that comprehensively reflects the discrimination and calibration of the model. The smaller the Brier score, the better the discrimination and calibration predicted by the model.
FIGURE 4Model prediction nomogram and calibration curve. (A) Mortality nomogram of CTPF as the predictive model. (B–D) Calibration curves after cross-validation using CT staging, PF staging, CTPF comprehensive staging, and GAP staging to predict patients’ cumulative mortality risk at 1, 2, and 3 years. The CTPF prediction model has the best AUC value, Brier score, and stability.
CTPF model-predicted 1-, 2-, and 3-year accumulative survive rate of patients at different CTPF stage.
| CTPF stage | 1-year cumulative survival rate% | 2-year cumulative survival rate% | 3-year cumulative survival rate % |
|---|---|---|---|
| I a | 96.90 | 91.77 | 89.11 |
| I b | 94.15 | 84.83 | 80.20 |
| I c | 90.68 | 76.56 | 69.88 |
| II a | 92.79 | 81.51 | 76.02 |
| II b | 86.64 | 67.60 | 59.15 |
| II c | 79.23 | 52.96 | 42.63 |
| III a | 91.23 | 77.84 | 71.46 |
| III b | 83.89 | 61.90 | 52.55 |
| III c | 75.18 | 45.89 | 35.18 |
Notes: CTPF, stage: CTPF-based comprehensive stage. I a: CT, stage I and PF, grade a; I b: CT, stage I and PF, grade b; I c: CT, stage I and PF, grade c; II a: CT, stage II, and PF, grade a; II b: CT, stage II, and PF, grade b; II c: CT, stage II, and PF, grade c; III a: CT, stage III, and PF, grade a; III b: CT, stage III, and PF, grade b; III c: CT, stage III, and PF, grade c.
CT I: honeycomb lesion area was <5% of the entire lung. CT II: honeycomb lesion area was 5–25% of the entire lung. CT III: honeycomb lesion area was >25%. The PF-based grade was determined by assessing the scores of age, gender, FVC%pred, DLco%pred, and SpO2% according to the criteria in Table 3 and adding the scores. PF (a): score 0–3. PF(b): score 4–6. PF(c): score 7–10.
FIGURE 5Examples of patient’s original lung CT image, honeycomb lung region segmentation, and staging. (A-1,2,3) are the original CT images, the segmented lung region, and honeycomb lung region identified by the deep learning model of patient Zhang. The corresponding stage of this patient is II c. Similarly, (B-1,2,3) are the corresponding images of patient Xu, whose stage is Ia. (C) shows their comprehensive CTPF staging; patient Zhang’s comprehensive stage is IIc; the comprehensive stage of patient Xu is Ia.