| Literature DB >> 33521746 |
Hui Liu1,2, Jing Chen3,4, Qin Yang5, Fang Lei1,6,3, Changjiang Zhang6,3,7, Juan-Juan Qin6,3, Ze Chen1,6,3, Lihua Zhu6,3, Xiaohui Song3, Liangjie Bai3, Xuewei Huang6,3, Weifang Liu3, Feng Zhou3,5, Ming-Ming Chen6,3, Yan-Ci Zhao6,3, Xiao-Jing Zhang1,6,3, Zhi-Gang She6,3, Qingbo Xu8, Xinliang Ma9, Peng Zhang1,3,5, Yan-Xiao Ji3,5, Xin Zhang3, Juan Yang3, Jing Xie6, Ping Ye10, Elena Azzolini11,12, Alessio Aghemo11,12, Michele Ciccarelli11,12, Gianluigi Condorelli11,12, Giulio G Stefanini11,12, Jiahong Xia13, Bing-Hong Zhang14, Yufeng Yuan15, Xiang Wei13, Yibin Wang16, Jingjing Cai1,17, Hongliang Li1,6,3,5.
Abstract
BACKGROUND: To develop a sensitive risk score predicting the risk of mortality in patients with coronavirus disease 2019 (COVID-19) using complete blood count (CBC).Entities:
Keywords: COVID-19; complete blood count; latent markov model; mortality; prediction model; risk score
Mesh:
Year: 2021 PMID: 33521746 PMCID: PMC7831644 DOI: 10.1016/j.medj.2020.12.013
Source DB: PubMed Journal: Med (N Y) ISSN: 2666-6340
Figure 1Flowchart for patient selection and distribution of the training and the validation cohorts
aExcluded due to leukemia, bwith at least 2 complete blood count (CBC) records, cwith CBC records at arbitrary time points, dwith only 1 CBC test, and eremained in hospitals at the end of follow-up date. LMM, latent Markov model.
Baseline characteristics of the Hubei cohort
| Variables | All (n = 12,759) | Non-severe survivor (n = 6,593) | Severe survivor (n = 4,246) | Death (n = 984) |
|---|---|---|---|---|
| Median age (IQR), y | 59 (46–68) | 56.0 (42.0–66.0) | 60.0 (48.0–68.0) | 70.0 (63.0–78.0) |
| Male sex, no./total no. (%) | 6,157 (48.3) | 3,016/6,593 (45.8) | 2,031/4,246 (47.8) | 648/984 (65.9) |
| Median heart rate (IQR), bpm | 84 (78–96) | 81.0 (77.0–90.0) | 94.0 (80.0–107.0) | 89.0 (78.0–101.0) |
| Median respiratory rate (IQR) | 20 (19–21) | 20.0 (19.0–20.0) | 20.0 (20.0–22.0) | 21.0 (20.0–25.0) |
| Median systolic blood pressure (IQR), mmHg | 128 (120–140) | 128.0 (120.0–140.0) | 127.0 (117.0–140.0) | 131.0 (120.0–146.0) |
| Median diastolic blood pressure (IQR), mmHg | 79 (71–87) | 80.0 (72.0–88.0) | 78.0 (70.0–85.0) | 77.0 (69.0–86.0) |
| Fever, no./total no. (%) | 9,351 (73.3) | 4,617/6,593 (70.0) | 3,278/4,246 (77.2) | 782/984 (79.5) |
| Median SpO2 (IQR), % | 97 (95–98) | 98.0 (96.0–99.0) | 97.0 (95.0–98.0) | 90.0 (81.0–96.0) |
| Days from symptom to hospitalization (IQR), days | 11 (7–20) | 12.0 (6.0–22.0) | 11.0 (7.0–19.0) | 10.0 (6.0–14.0) |
| Median follow-up time (IQR), days | 17 (11–26) | 16.0 (10.0–24.0) | 22.0 (14.0–32.0) | 9.0 (5.0–17.0) |
| Diabetes mellitus (any type), n (%) | 2,200 (17.2) | 852 (12.9) | 872 (20.5) | 290 (29.5) |
| Chronic obstructive pulmonary disease, n (%) | 150 (1.18) | 51 (0.8) | 50 (1.2) | 36 (3.7) |
| Hypertension, n (%) | 4,648 (36.4) | 2,020 (30.6) | 1,680 (39.6) | 559 (56.8) |
| Coronary arterial disease, n (%) | 1,168 (9.2) | 454 (6.9) | 397 (9.4) | 202 (20.5) |
| Heart failure, n (%) | 99 (0.8) | 11 (0.2) | 31 (0.7) | 46 (4.7) |
| Cerebrovascular disease, n (%) | 413 (3.2) | 148 (2.2) | 125 (2.9) | 81 (8.2) |
| Renal insufficiency, n (%) | 500 (3.9) | 168 (2.6) | 132 (3.1) | 150 (15.2) |
| Neoplastic disease, n (%) | 385 (3.0) | 180 (2.7) | 110 (2.6) | 61 (6.2) |
| Liver disease, n (%) | 276 (2.2) | 125 (1.9) | 103 (2.4) | 32 (3.3) |
| WBC count >9.5 × 109/L, no./total no. (%) | 1,296/12,759 (10.2) | 320/6,593 (4.9) | 431/4,246 (10.2) | 411/984 (41.8) |
| Neutrophil count >6.3 × 109/L, no./total no. (%) | 1,980/12,759 (15.5) | 501/6,593 (7.6) | 695/4,246 (16.4) | 574/984 (58.3) |
| Lymphocyte count <1.1 × 109/L, no./total no (%) | 5,150/12,759 (40.4) | 1,832/6,493 (27.8) | 1,958/4,246 (46.1) | 841/984 (85.5) |
| Platelet count <125 × 109/L, no./total no. (%) | 1,236/12,759 (9.7) | 439/6,493 (6.7) | 374/4,246 (8.8) | 304/984 (30.9) |
| CRP > ULN, no./total no. (%) | 3,565/7,247 (49.2) | 1,550/4,157 (37.3) | 1,055/1,871 (56.4) | 529/542 (97.6) |
| Procalcitonin > ULN, no./total no. (%) | 4,303/10,212 (42.1) | 1,444/4,927 (29.3) | 1,859/3,629 (51.2) | 696/854 (81.5) |
| Alanine transaminase >40 U/L, no./total no. (%) | 2,717/12,213 (22.3) | 1,260/6,245 (20.2) | 1,005/4,159 (24.2) | 261/949 (27.5) |
| BUN > ULN, no./total no. (%) | 1,178/12,361 (9.5) | 302/6,308 (4.8) | 296/4,188 (7.1) | 449/959 (46.8) |
| CK-MB > ULN, no./total no. (%) | 462/8,339 (5.5) | 138/3,870 (3.6) | 109/2,960 (3.7) | 177/748 (23.7) |
| Total cholesterol >5.17 mmol/L, no./total no. (%) | 1,302/10,301 (12.6) | 762/4,936 (15.4) | 409/3,745 (10.9) | 49/820 (6.0) |
| D-dimer > ULN, no./total no. (%) | 5,497/11,384 (48.3) | 2,038/5,618 (36.3) | 2,175/3,985 (54.6) | 819/930 (88.1) |
| LDL-C >3.37 mmol/L, no./total no. (%) | 1,302/9,220 (14.1) | 731/4,604 (15.9) | 421/3,103 (13.6) | 57/720 (7.9) |
bpm, beats per minute; BUN, blood urea nitrogen; CK-MB, creatinine kinase-myocardial band; CRP, C-reactive protein; IQR, interquartile range; LDL-C, low-density lipoprotein cholesterol; SpO2, peripheral oxygen saturation; ULN, upper limit of normal; WBC, white blood cell.
ULN was defined according to criteria in each hospital.
Figure 2Dynamic trajectories of 13 CBC parameters in patients with COVID-19
Smooth trajectories of the values of CBC parameters by the severity of the disease with 95% confidence intervals were plotted based on locally weighted regression and smoothing scatterplots. The horizontal dotted lines represent the empirical upper limit of normal (ULN) or lower limit of normal (LLN) of these CBC parameters. M, males; F, females.
Figure 3Flowchart for variable selection
A total of 38 CBC factors—13 numeric and 25 categorized variables—were included in the selection process. Generalized linear mixed models (GLMMs) with each variable as a fixed effect were built. The multivariate GLMM with stepwise forward selection following the Akaike information criterion (AIC) ranking established from the univariate fixed-effects models was applied. The significance levels for entry and stay were set to 0.05, and the multivariate GLMM was further controlled for age. A multivariate model with 5 variables was selected as the optimal model and was used to develop the risk assessment score.
Factors generating from GLMM and point distribution according to the coefficiency in the Cox model
| Covariates | Estimates | SE | z | p | Points |
|---|---|---|---|---|---|
| NLR > 4.06 | 3.50 | 0.11 | 14.46 | <0.001 | 6 |
| NLR 2.22–4.06 | 1.03 | 0.12 | 4.35 | <0.001 | 2 |
| Platelet counts decrease | 0.75 | 0.03 | 10.42 | <0.001 | 2 |
| Neutrophil counts increase | 0.46 | 0.04 | 7.59 | <0.001 | 1 |
| WBC counts increase | 0.15 | 0.04 | 2.86 | 0.004 | 1 |
| 50–59 | 0.51 | 0.06 | 2.76 | 0.006 | 1 |
| 60–69 | 0.47 | 0.06 | 2.85 | 0.004 | 1 |
| ≥70 | 0.75 | 0.06 | 4.68 | <0.001 | 2 |
Platelet count decrease indicates platelet count <100 × 109/L, neutrophil count increase indicates neutrophil count >6.3 × 109/L, and WBC count increase indicates WBC count >9.5 × 109/L. GLMM, generalized linear mixed model; NLR, neutrophil:lymphocyte ratio; SE, standard error; WBC, white blood cell.
Cox model: ℎ(t,X) = ℎ_0 (t)exp(3.5[NLR > 4.06] + 1.03[NLR 2.22–4.06] + 0.75[platelet counts decrease] + 0.46[neutrophil counts increase] + 0.15[WBC counts increase] + 0.51[age 50–59] + 0.47 [age 60–69] + 0.75[Age ≥ 70]).
Performance of PAWNN score in Wuhan and Italian validation datasets
| Hubei participants with 1 CBC test | Italian participants | |
|---|---|---|
| No. patients | 2,739 | 227 |
| AUROC (95% CI) | 0.97 (0.96–0.98) | 0.80 (0.74–0.86) |
| Total accuracy, % (95% CI) | 91.13 (86.05–92.85) | 76.65 (65.20–81.94) |
| Sensitivity, % (95% CI) | 93.84 (90.51–98.10) | 68.83 (58.44–94.81) |
| Specificity, % (95% CI) | 90.90 (85.13–92.84) | 80.67 (49.33–87.33) |
| PPV, % (95% CI) | 46.30 (35.27–52.01) | 64.77 (49.32–73.97) |
| NPV, % (95% CI) | 99.44 (99.13–99.81) | 83.67 (79.00–93.94) |
AUROC, area under the subject operating characteristic curve; CBC, complete blood count; CI, confidence interval; NPV, negative predictive value; PPV, positive predictive value.
Figure 4Status prevalence and transition probabilities between the statuses at subsequent time points
The number of survivors and non-survivors with their means and standard deviations of the PAWNN score of each status at each time point are recorded in the boxes. The transition probabilities represent the probability that a member of a given status at a specified time point will transition to another given status at the next time point. Transition probabilities are represented by arrows. Time 1, time 2, and time 3 were selected based on days 0–7 after admission (time 1), value of CBC records during days 8–14 (time 2) of hospitalization, and CBC during hospitalization ≥15 days (time 3).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| R-3.6.3 | R Foundation for Statistical Computing | |
| Adobe illustrator CC 2019 | Adobe company | |
| Ggplot-3.3.2 | Wickham | |
| pRoc-1.16.2 | Robin et al. | |
| lme4- 1.1-26 | Bates et al. | |
| Caret- 6.0-86 | Max Kuhn | |
| Data.table- 1.13.4 | Matt Dowle | |
| Effects- 4.2-0 | Fox and Weisberg, | |
| Lmest-3.0.1 | Bartolucci et al. | |
| cAIC4-0.9 | Saefken et al. | |
| LCAvarsel-1.1 | Fop et al., | |
| Survival- 3.2-7 | Therneau and Grambsch | |
| Survminer-0.4.8 | Alboukadel Kassambara | |