| Literature DB >> 30444907 |
Masami Kawagishi1, Takeshi Kubo2, Ryo Sakamoto2,3, Masahiro Yakami2,3, Koji Fujimoto4, Gakuto Aoyama1, Yutaka Emoto5, Hiroyuki Sekiguchi2, Koji Sakai6, Yoshio Iizuka1, Mizuho Nishio2,3, Hiroyuki Yamamoto1, Kaori Togashi2.
Abstract
We aimed to describe the development of an inference model for computer-aided diagnosis of lung nodules that could provide valid reasoning for any inferences, thereby improving the interpretability and performance of the system. An automatic construction method was used that considered explanation adequacy and inference accuracy. In addition, we evaluated the usefulness of prior experts' (radiologists') knowledge while constructing the models. In total, 179 patients with lung nodules were included and divided into 79 and 100 cases for training and test data, respectively. F-measure and accuracy were used to assess explanation adequacy and inference accuracy, respectively. For F-measure, reasons were defined as proper subsets of Evidence that had a strong influence on the inference result. The inference models were automatically constructed using the Bayesian network and Markov chain Monte Carlo methods, selecting only those models that met the predefined criteria. During model constructions, we examined the effect of including radiologist's knowledge in the initial Bayesian network models. Performance of the best models in terms of F-measure, accuracy, and evaluation metric were as follows: 0.411, 72.0%, and 0.566, respectively, with prior knowledge, and 0.274, 65.0%, and 0.462, respectively, without prior knowledge. The best models with prior knowledge were then subjectively and independently evaluated by two radiologists using a 5-point scale, with 5, 3, and 1 representing beneficial, appropriate, and detrimental, respectively. The average scores by the two radiologists were 3.97 and 3.76 for the test data, indicating that the proposed computer-aided diagnosis system was acceptable to them. In conclusion, the proposed method incorporating radiologists' knowledge could help in eliminating radiologists' distrust of computer-aided diagnosis and improving its performance.Entities:
Mesh:
Year: 2018 PMID: 30444907 PMCID: PMC6239329 DOI: 10.1371/journal.pone.0207661
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of notations.
| Notation | Description | Example |
|---|---|---|
| “Diagnosis” as the inference target node (random variable) | NA | |
| state of random variable | ||
| imaging findings and clinical data as the other nodes (random variable) | shape, tumor marker | |
| state of random variable | ||
| Evidence, as a set of | { | |
| posterior probability of | NA | |
| inference diagnosis with the highest posterior probability among | ||
| reason candidate (a proper subset of | If | |
| | | the number of elements of | If |
| { | ||
| influence of | NA | |
| prior probability of the inference diagnosis | NA | |
| difference between | NA | |
| the performance metric of inference model | NA | |
| explanation (reasoning) adequacy of inference model | NA | |
| inference accuracy of inference model | NA | |
| Reference reasons (1–7 imaging findings and/or clinical data chosen by radiologists) | “shape is polygon,” “diameter is small and cavitation exists,” and “satellite lesion exists” | |
| Reasons derived by the inference system | “shape is polygon” and “diameter is small and cavitation exists” |
Abbreviation: NA, not available
Fig 1An example of a Bayesian network (directed acyclic graphical model).
The Bayesian network has nodes (circles) and directed links (arrows). Each node and directed link represent a random variable and relationship, respectively. Each node can have a discriminate value (state).
Fig 2An example of probability propagation.
Curved arrows represent the propagation direction, dotted curved arrow with an X indicates no propagation, and gray circle (X) represents a node where Evidence is given. (a) Model A: Propagation does not occur from X to D. (b) Model B: Propagation occurs from X to D.
Fig 3Three types of update to the graphical model.
Delete denotes unlinking an existing link, reverse denotes reversing an existing link, and join denotes creating a new link.
Performance of the best three inference models with and without prior knowledge.
| Training data | Test data | ||||||
|---|---|---|---|---|---|---|---|
| Prior knowledge | Model | F-measure ( | Accuracy ( | Metric ( | F-measure ( | Accuracy ( | Metric ( |
| with | Best | 0.399 | 75.9 | 0.579 | 0.411 | 72.0 | 0.566 |
| 2nd | 0.324 | 70.9 | 0.516 | 0.325 | 76.0 | 0.542 | |
| 3rd | 0.363 | 70.9 | 0.536 | 0.328 | 74.0 | 0.534 | |
| without | Best | 0.342 | 72.2 | 0.532 | 0.274 | 65.0 | 0.462 |
| 2nd | 0.314 | 74.7 | 0.530 | 0.222 | 63.0 | 0.426 | |
| 3rd | 0.361 | 77.2 | 0.566 | 0.250 | 60.0 | 0.425 | |
Fig 4Frequencies of subjective ranks recoded by two radiologists.
Note: Ranks 5, 3, and 1 in the 5-point scale represent beneficial, appropriate, and detrimental, respectively.
Fig 5An example of misclassification and inadequate reasoning by the inference system.
A benign lung nodule (arrow) was classified as metastasis.