| Literature DB >> 31388566 |
Abstract
Machine learning (ML) and its parent technology trend, artificial intelligence (AI), are deriving novel insights from ever larger and more complex datasets. Efficient and accurate AI analytics require fastidious data science-the careful curating of knowledge representations in databases, decomposition of data matrices to reduce dimensionality, and preprocessing of datasets to mitigate the confounding effects of messy (i.e., missing, redundant, and outlier) data. Messier, bigger and more dynamic medical datasets create the potential for ML computing systems querying databases to draw erroneous data inferences, portending real-world human health consequences. High-dimensional medical datasets can be static or dynamic. For example, principal component analysis (PCA) used within R computing packages can speed & scale disease association analytics for deriving polygenic risk scores from static gene-expression microarrays. Robust PCA of k-dimensional subspace data accelerates image acquisition and reconstruction of dynamic 4-D magnetic resonance imaging studies, enhancing tracking of organ physiology, tissue relaxation parameters, and contrast agent effects. Unlike other data-dense business and scientific sectors, medical AI users must be aware that input data quality limitations can have health implications, potentially reducing analytic model accuracy for predicting clinical disease risks and patient outcomes. As AI technologies find more health applications, physicians should contribute their health domain expertize to rules-/ML-based computer system development, inform input data provenance and recognize the importance of data preprocessing quality assurance before interpreting the clinical implications of intelligent machine outputs to patients.Entities:
Keywords: Health occupations; Medical ethics
Year: 2019 PMID: 31388566 PMCID: PMC6599029 DOI: 10.1038/s41746-019-0138-5
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1Data matrix decomposition for accelerated dynamic MRI of three contrast enhanced image phases.[18] Compressed sensing (CS) images reflect sparsity only, while low rank plus sparse (L + S) images provide improved spatiotemporal resolution resulting from the automated background suppression of the sparse (S) components, enhancing contrast resolution. (Reproduced with the permission of the Institute of Electrical and Electronics Engineers)
Fig. 2Change detection method of robust principal component analysis (PCA) combined with low rank plus sparse (L + S) matrix decomposition.[18] The original image (left), low-rank clean component (center), and sparse component image (right) reflect disease progression primarily localized to the retinal fundus.[28] (Reproduced with the permission of the Institute of Electrical and Electronics Engineers)