Sarah M Ryan1, Nichole E Carlson1, Harris Butler1, Tasha E Fingerlin1,2,3, Lisa A Maier4,5,6, Fuyong Xing1. 1. University of Colorado-Denver, Department of Biostatistics and Informatics, Anschutz Medical Campus, Aurora, Colorado, United States. 2. National Jewish Health, Department of Biomedical Research, Denver, Colorado, United States. 3. University of Colorado-Denver, Department of Epidemiology, Anschutz Medical Campus, Aurora, Colorado, United States. 4. National Jewish Health, Department of Medicine, Denver, Colorado, United States. 5. University of Colorado-Denver, Department of Medicine, Anschutz Medical Campus, Aurora, Colorado, United States. 6. University of Colorado-Denver, Department of Environmental and Occupational Health, Anschutz Medical Campus, Aurora, Colorado, United States.
Abstract
Purpose: An open question in deep clustering is how to explain what in the image is driving the cluster assignments. This is especially important for applications in medical imaging when the derived cluster assignments may inform decision-making or create new disease subtypes. We develop cluster activation mapping (CLAM), which is methodology to create localization maps highlighting the image regions important for cluster assignment. Approach: Our approach uses a linear combination of the activation channels from the last layer of the encoder within a pretrained autoencoder. The activation channels are weighted by a channelwise confidence measure, which is a modification of score-CAM. Results: Our approach performs well under medical imaging-based simulation experiments, when the image clusters differ based on size, location, and intensity of abnormalities. Under simulation, the cluster assignments were predicted with 100% accuracy when the number of clusters was set at the true value. In addition, applied to computed tomography scans from a sarcoidosis population, CLAM identified two subtypes of sarcoidosis based purely on CT scan presentation, which were significantly associated with pulmonary function tests and visual assessment scores, such as ground-glass, fibrosis, and honeycombing. Conclusions: CLAM is a transparent methodology for identifying explainable groupings of medical imaging data. As deep learning networks are often criticized and not trusted due to their lack of interpretability, our contribution of CLAM to deep clustering architectures is critical to our understanding of cluster assignments, which can ultimately lead to new subtypes of diseases.
Purpose: An open question in deep clustering is how to explain what in the image is driving the cluster assignments. This is especially important for applications in medical imaging when the derived cluster assignments may inform decision-making or create new disease subtypes. We develop cluster activation mapping (CLAM), which is methodology to create localization maps highlighting the image regions important for cluster assignment. Approach: Our approach uses a linear combination of the activation channels from the last layer of the encoder within a pretrained autoencoder. The activation channels are weighted by a channelwise confidence measure, which is a modification of score-CAM. Results: Our approach performs well under medical imaging-based simulation experiments, when the image clusters differ based on size, location, and intensity of abnormalities. Under simulation, the cluster assignments were predicted with 100% accuracy when the number of clusters was set at the true value. In addition, applied to computed tomography scans from a sarcoidosis population, CLAM identified two subtypes of sarcoidosis based purely on CT scan presentation, which were significantly associated with pulmonary function tests and visual assessment scores, such as ground-glass, fibrosis, and honeycombing. Conclusions: CLAM is a transparent methodology for identifying explainable groupings of medical imaging data. As deep learning networks are often criticized and not trusted due to their lack of interpretability, our contribution of CLAM to deep clustering architectures is critical to our understanding of cluster assignments, which can ultimately lead to new subtypes of diseases.
Authors: Elizabeth A Regan; John E Hokanson; James R Murphy; Barry Make; David A Lynch; Terri H Beaty; Douglas Curran-Everett; Edwin K Silverman; James D Crapo Journal: COPD Date: 2010-02 Impact factor: 2.409
Authors: Sarah M Ryan; Tasha E Fingerlin; Margaret Mroz; Briana Barkes; Nabeel Hamzeh; Lisa A Maier; Nichole E Carlson Journal: Eur Respir J Date: 2019-08-29 Impact factor: 16.671
Authors: David R Moller; Laura L Koth; Lisa A Maier; Alison Morris; Wonder Drake; Milton Rossman; Joseph K Leader; Ronald G Collman; Nabeel Hamzeh; Nadera J Sweiss; Yingze Zhang; Scott O'Neal; Robert M Senior; Michael Becich; Harry S Hochheiser; Naftali Kaminski; Stephen R Wisniewski; Kevin F Gibson Journal: Ann Am Thorac Soc Date: 2015-10
Authors: Nadera J Sweiss; Rafah Salloum; Seema Gandhi; Seema Ghandi; Maria-Luisa Alegre; Ray Sawaqed; Maria Badaracco; Kenneth Pursell; David Pitrak; Robert P Baughman; David R Moller; Joe G N Garcia; Timothy B Niewold Journal: PLoS One Date: 2010-02-05 Impact factor: 3.240
Authors: Jordan Alexander Zach; John D Newell; Joyce Schroeder; James R Murphy; Douglas Curran-Everett; Eric A Hoffman; Philip M Westgate; MeiLan K Han; Edwin K Silverman; James D Crapo; David A Lynch Journal: Invest Radiol Date: 2012-10 Impact factor: 6.016