| Literature DB >> 35639561 |
Chelsea Chandler1,2, Peter W Foltz2, Brita Elvevåg3,4.
Abstract
OBJECTIVES: Machine learning (ML) and natural language processing have great potential to improve efficiency and accuracy in diagnosis, treatment recommendations, predictive interventions, and scarce resource allocation within psychiatry. Researchers often conceptualize such an approach as operating in isolation without much need for human involvement, yet it remains crucial to harness human-in-the-loop practices when developing and implementing such techniques as their absence may be catastrophic. We advocate for building ML-based technologies that collaborate with experts within psychiatry in all stages of implementation and use to increase model performance while simultaneously increasing the practicality, robustness, and reliability of the process.Entities:
Keywords: active learning; machine learning; natural language processing; safeguards
Mesh:
Year: 2022 PMID: 35639561 PMCID: PMC9434423 DOI: 10.1093/schbul/sbac038
Source DB: PubMed Journal: Schizophr Bull ISSN: 0586-7614 Impact factor: 7.348
Glossary of technical terms
| Artificial Intelligence (AI) | AI is a general term for computer systems that exhibit intelligent behavior and are able to learn, explain, and advise their users. |
|---|---|
| Machine Learning (ML) | ML is a subset of AI that harnesses statistical algorithms to learn |
| Natural Language Processing (NLP) | NLP is another type of AI that incorporates both statistical and linguistic knowledge to understand human language. |
| Features | A measurable property of the data (eg, the number of words spoken is a |
| Hyperparameters | Parameters that are set before the final training of a model rather than learned during the process (eg, the number of iterations of training used to train a model). |
| Classification | A category of ML models that are trained to predict a category. It can be binary (e.g., mentally ill or healthy) or multiclass (eg, classifying speech as “schizophrenic-like”, “manic-like”, or non-disordered). |
| Regression | A category of ML models that are trained to predict a numerical, continuous-valued output (eg, a clinical rating). |
| Neural Network | A type of ML model that is a system of nodes, composed in layers. Each node learns some nonlinear equation on a subset of training data and when all are combined, a categorical or real valued output can be predicted. Modern neural networks have hundreds to thousands of nodes and layers and are trained on large datasets. |
| K-Means Clustering | An unsupervised method of partitioning a dataset into K clusters with the goal of minimizing within-cluster variance. Each data point belongs to a single cluster determined by its nearest mean or centroid. |
| Edge Case | Any situation that occurs near a decision boundary, at the extremes of the inputs, an exception to a learned rule, or anything that may require additional or special handling. |
| Overfitting | A situation where a model too closely fits its training data. This is an issue because it may learn to fit to spurious correlations in the data rather than learning a generalized solution to the problem itself. |
Fig. 1.Top row: stages of the traditional ML framework applied to story recall scoring with minimal human involvement. Bottom row: active learning and human-in-the-loop safeguards which must be incorporated for translational value.
Fig. 2.Active learning process for story recall scoring. Adapted from Human-in-the-Loop Machine Learning [31].
Fig. 3.Scatter plot of unlabeled story recall responses chosen via uncertainty (N = 45), diversity (N = 30 clustering, N = 10 real world), and random (N = 10) sampling in the first active learning iteration, visualized with the first two components of PCA.
Fig. 4.Line plot of story recall model prediction and human rating correlations when harnessing active learning (solid lines) versus random sampling (dashed lines), testing on the evaluation set (blue lines with overlaid circles) and the transfer set (orange lines without overlaid circles). Both approaches continue until the full dataset is harnessed and correlations converge. For interpretation of the references to color in figures, the reader is referred to the web version of this article.