| Literature DB >> 33550007 |
Wufeng Xue1, Chunyan Cao2, Jie Liu2, Yilian Duan2, Haiyan Cao2, Jian Wang3, Xumin Tao3, Zejian Chen3, Meng Wu4, Jinxiang Zhang5, Hui Sun6, Yang Jin7, Xin Yang3, Ruobing Huang3, Feixiang Xiang2, Yue Song2, Manjie You2, Wen Zhang2, Lili Jiang2, Ziming Zhang2, Shuangshuang Kong2, Ying Tian2, Li Zhang2, Dong Ni8, Mingxing Xie9.
Abstract
The outbreak of COVID-19 around the world has caused great pressure to the health care system, and many efforts have been devoted to artificial intelligence (AI)-based analysis of CT and chest X-ray images to help alleviate the shortage of radiologists and improve the diagnosis efficiency. However, only a few works focus on AI-based lung ultrasound (LUS) analysis in spite of its significant role in COVID-19. In this work, we aim to propose a novel method for severity assessment of COVID-19 patients from LUS and clinical information. Great challenges exist regarding the heterogeneous data, multi-modality information, and highly nonlinear mapping. To overcome these challenges, we first propose a dual-level supervised multiple instance learning module (DSA-MIL) to effectively combine the zone-level representations into patient-level representations. Then a novel modality alignment contrastive learning module (MA-CLR) is presented to combine representations of the two modalities, LUS and clinical information, by matching the two spaces while keeping the discriminative features. To train the nonlinear mapping, a staged representation transfer (SRT) strategy is introduced to maximumly leverage the semantic and discriminative information from the training data. We trained the model with LUS data of 233 patients, and validated it with 80 patients. Our method can effectively combine the two modalities and achieve accuracy of 75.0% for 4-level patient severity assessment, and 87.5% for the binary severe/non-severe identification. Besides, our method also provides interpretation of the severity assessment by grading each of the lung zone (with accuracy of 85.28%) and identifying the pathological patterns of each lung zone. Our method has a great potential in real clinical practice for COVID-19 patients, especially for pregnant women and children, in aspects of progress monitoring, prognosis stratification, and patient management.Entities:
Keywords: Contrastive learning; Lung ultrasound; Multi-modality; Multiple instance learning
Mesh:
Year: 2021 PMID: 33550007 PMCID: PMC7817458 DOI: 10.1016/j.media.2021.101975
Source DB: PubMed Journal: Med Image Anal ISSN: 1361-8415 Impact factor: 8.545
Fig. 1Two examples for patient LUS data and clinical information. Unknown number of missing lung zones and the different format of LUS videos and images for the rest zones make the data highly heterogeneous; the clinical information provides helpful cues for analysis and also brings difficulty of combining two totally different sources of modalities; Learning the highly nonlinear mapping between the patient severity and the input LUS data is of great challenge.
Fig. 7Segmentation examples for a representative frame from five lung zones with different severity. Cyan: pleural line, Green: A-line, Brown: B-line, Purple: consolidation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Overall architecture. The representation learning follows a pipeline of LUS pattern segmentation, LUS zone score prediction, and patient-level severity assessment, during which the representation of LUS data transfers progressively into a discriminative feature. The variable at the bottom-right of the rectangle means the repetitions of the operation inside that rectangle. For video with N frames, the segmentation has to be repeated N times; for patients with data of K lung zones available, the LUS zone score prediction has to be repeated K times. The dashed line represents the two representation transfer in our training procedure.
Fig. 3Dual-level supervised multiple instance learning. Both the instance-level and the bag-level supervision are used to help patient representation learning.
Patient severity results (%). For each configuration, average performance of 10 learned model is reported. The standard deviations of the accuracies are also displayed. These results reveal the effectivenesses of the staged learning procedure, the DSA-MIL module, and the MA-CLR module. CI: clinical information.
| config. | modality | Patient severity | Severe or non-severe | ||||
|---|---|---|---|---|---|---|---|
| accuracy | F1-score | accuracy | recall | precision | F1-score | ||
| MZS | LUS | 49.38 | 45.89 | 79.50 | 78.50 | 80.11 | 79.29 |
| A-MIL1 | LUS | 64.25 | 66.05 | 84.0 | 76.75 | 82.57 | |
| A-MIL2 | LUS | 64.25 | 64.57 | 84.75 | 83.77 | 84.91 | |
| DSA-MIL | LUS | 67.63 | 67.91 | 84.88 | 78.75 | 89.83 | 83.84 |
| MLP | CI | 56.75 | 56.15 | 79.88 | 80.75 | 79.60 | 80.03 |
| CONCAT | LUS and CI | 55.25 | 57.37 | 83.13 | 80.25 | 85.36 | 82.52 |
| MA-CLR | LUS and CI | 72.75 | 72.31 | 86.5 | 82.25 | 89.91 | 85.90 |
| MA-CLR (max-vote) | LUS and CI | 85.00 | 89.47 | ||||
Fig. 4Modality alignment contrastive learning of representation. Patients are of different severities. The blue arrow maximizes the agreement between LUS and clinical information of the same patient, therefore make the two modalities aligned; the red arrow minimize the inter-class agreement of two patients with different severities, therefore keep the discriminativeness; an additional constraint makes this contrastive pattern consistent. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Dataset splitting for patient severity assessment.
| Patient severity | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| Training | 12 | 145 | 63 | 13 |
| Test | 20 | 20 | 20 | 20 |
Distribution of LUS zone score for the training and test dataset. According to the split in Table 1, the LUS zone score also follow an even distribution.
| Zone severity | 0 | 1 | 2 | 3 | 4 |
|---|---|---|---|---|---|
| Training | 697 | 476 | 206 | 113 | 34 |
| Test | 55 | 53 | 53 | 52 | 52 |
Numbers of images/frames with presence of the corresponding ultrasound patterns. A: A-line; P: pleural line; B: B-line; C: consolidation.
| LUS patterns | A | P | B | C | Total |
|---|---|---|---|---|---|
| Training | 666 | 4338 | 3611 | 340 | 4398 |
| Test | 205 | 2438 | 2295 | 481 | 2528 |
p-value of significant test of the severity assessment performance for different configuration. p-value less then 0.05 means the performance difference of the two methods is significant.
| MZS | A-MIL1 | A-MIL2 | DSA-MIL | MLP | CONCAT | |
|---|---|---|---|---|---|---|
| accuracy of patient severity | ||||||
| A-MIL1 | 3.12E-11 | |||||
| A-MIL2 | 1.64E-10 | 1 | ||||
| DSA-MIL | 4.54E-10 | 1.97E-04 | 7.72E-05 | |||
| MLP | 2.61E-07 | 2.96E-07 | 2.96E-07 | 3.36E-10 | ||
| CONCAT | 8.42E-07 | 1.35E-07 | 1.35E-07 | 1.60E-09 | 2.56E-03 | |
| MA-CLR | 2.15E-12 | 5.37E-08 | 1.47E-08 | 1.84E-06 | 2.80E-13 | 1.24E-12 |
| accuracy of severe/non-severe | ||||||
| A-MIL1 | 1.73E-03 | |||||
| A-MIL2 | 5.74E-05 | 5.09E-01 | ||||
| DSA-MIL | 7.81E-06 | 4.42E-01 | 9.06E-01 | |||
| MLP | 3.94E-01 | 6.29E-03 | 1.18E-04 | 3.18E-06 | ||
| CONCAT | 3.13E-05 | 3.90E-01 | 7.68E-02 | 4.45E-02 | 7.46E-04 | |
| MA-CLR | 1.96E-08 | 1.68E-02 | 2.05E-02 | 1.33E-02 | 3.01E-07 | 2.09E-05 |
Fig. 5Confusion matrix for patient severity assessment under different configurations. 1, mild; 2, moderate; 3, severe; 4, critical severe.
Performance comparison with existing work for binary severe/non-severe prediction (%).
| References | Modality | Patient | accuracy | recall | precision |
|---|---|---|---|---|---|
| CT | 102 | - | 83.3 | 75.0 | |
| CT | 176 | 87.5 | 93.3 | - | |
| CT | 242 | 98.5 | 95.2 | 97.5 | |
| ours | LUS+CI | 313 | 87.5 | 85.0 | 89.47 |
Fig. 6Performance and confusion matrix for LUS zone score prediction with (right) and without (left) the pre-trained segmentation model. Both 5-way and 2-way classifications are evaluated. It’s obvious that including the rich semantic information in the pre-trained model brings great performance improvement for LUS zone score prediction.
Evaluation of LUS segmentation task. A: A-line; P: pleural line; B: B-line; C: consolidation.
| Hit Ratio | Dice | mIoU | Precision | Recall | |
|---|---|---|---|---|---|
| A | 0.284 | 0.258 | 0.302 | 0.444 | 0.280 |
| P | 0.704 | 0.620 | 0.649 | 0.730 | 0.645 |
| B | 0.804 | 0.715 | 0.695 | 0.743 | 0.737 |
| C | 0.160 | 0.170 | 0.201 | 0.379 | 0.173 |