Literature DB >> 35647616

Improving Joint Learning of Chest X-Ray and Radiology Report by Word Region Alignment.

Zhanghexuan Ji1, Mohammad Abuzar Shaikh1, Dana Moukheiber1, Sargur N Srihari1, Yifan Peng2, Mingchen Gao1.   

Abstract

Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level and the local image region-word level for visual-textual matching. Both are bidirectionally constrained on Cross-Entropy based and ranking-based Triplet Matching Losses. The region-word matching is calculated using the attention mechanism without direct supervision about their mapping. The pre-trained multi-modal representation learning paves the way for downstream tasks concerning image and/or text encoding. We demonstrate the representation learning quality by cross-modality retrievals and multi-label classifications on two datasets: OpenI-IU and MIMIC-CXR. Our code is available at https://github.com/mshaikh2/JoImTeR_MLMI_2021.

Entities:  

Keywords:  Attention; Multi-modality; Self-supervised learning

Year:  2021        PMID: 35647616      PMCID: PMC9134785          DOI: 10.1007/978-3-030-87589-3_12

Source DB:  PubMed          Journal:  Mach Learn Med Imaging


  4 in total

1.  Deep Visual-Semantic Alignments for Generating Image Descriptions.

Authors:  Andrej Karpathy; Li Fei-Fei
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-08-05       Impact factor: 6.226

2.  Preparing a collection of radiology examinations for distribution and retrieval.

Authors:  Dina Demner-Fushman; Marc D Kohli; Marc B Rosenman; Sonya E Shooshan; Laritza Rodriguez; Sameer Antani; George R Thoma; Clement J McDonald
Journal:  J Am Med Inform Assoc       Date:  2015-07-01       Impact factor: 4.497

3.  Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment.

Authors:  Geeticka Chauhan; Ruizhi Liao; William Wells; Jacob Andreas; Xin Wang; Seth Berkowitz; Steven Horng; Peter Szolovits; Polina Golland
Journal:  Med Image Comput Comput Assist Interv       Date:  2020-09-29

4.  MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports.

Authors:  Alistair E W Johnson; Tom J Pollard; Seth J Berkowitz; Nathaniel R Greenbaum; Matthew P Lungren; Chih-Ying Deng; Roger G Mark; Steven Horng
Journal:  Sci Data       Date:  2019-12-12       Impact factor: 6.444

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.