| Literature DB >> 35527306 |
Jasmine Fardouly1, Ross D Crosby2, Suku Sukunesan3.
Abstract
Advances in machine learning and digital data provide vast potential for mental health predictions. However, research using machine learning in the field of eating disorders is just beginning to emerge. This paper provides a narrative review of existing research and explores potential benefits, limitations, and ethical considerations of using machine learning to aid in the detection, prevention, and treatment of eating disorders. Current research primarily uses machine learning to predict eating disorder status from females' responses to validated surveys, social media posts, or neuroimaging data often with relatively high levels of accuracy. This early work provides evidence for the potential of machine learning to improve current eating disorder screening methods. However, the ability of these algorithms to generalise to other samples or be used on a mass scale is only beginning to be explored. One key benefit of machine learning over traditional statistical methods is the ability of machine learning to simultaneously examine large numbers (100s to 1000s) of multimodal predictors and their complex non-linear interactions, but few studies have explored this potential in the field of eating disorders. Machine learning is also being used to develop chatbots to provide psychoeducation and coping skills training around body image and eating disorders, with implications for early intervention. The use of machine learning to personalise treatment options, provide ecological momentary interventions, and aid the work of clinicians is also discussed. Machine learning provides vast potential for the accurate, rapid, and cost-effective detection, prevention, and treatment of eating disorders. More research is needed with large samples of diverse participants to ensure that machine learning models are accurate, unbiased, and generalisable to all people with eating disorders. There are important limitations and ethical considerations with utilising machine learning methods in practice. Thus, rather than a magical solution, machine learning should be seen as an important tool to aid the work of researchers, and eventually clinicians, in the early identification, prevention, and treatment of eating disorders.Entities:
Keywords: Artificial learning; Chatbot; Detection; Eating disorder; Ethical concerns; Machine learning; Social media; Statistics; Treatment
Year: 2022 PMID: 35527306 PMCID: PMC9080128 DOI: 10.1186/s40337-022-00581-2
Source DB: PubMed Journal: J Eat Disord ISSN: 2050-2974
Research using ML to detect eating disorder risk via survey responses
| Study | Sample | Predictors variables | Outcome variables | Best performing ML approach | Explanatory power of best performing model | Limitations |
|---|---|---|---|---|---|---|
| Buscema et al. [ | 172 females with a diagnosed ED | 124 different variables: generic information, alimentary behaviour, eventual treatment and hospitalization, substance use, menstrual cycles, weight and height, hematochemical and instrumental examinations, psychodiagnostic tests | ED diagnosis | Feed forward neural networks | 87% | Cross-sectional study Small sample |
| Forrest et al. [ | 191 adults with BED in a randomised controlled trial | Treatment condition, demographic information, baseline clinical characteristics | (1) Binge eating abstinence; or (2) reduction; (3) ED psychopathology; (4) perceived weight loss; and (5) actual weight loss | Elastic net | (1) 51%; (2) 4%; (3) 27%; (4) 12%; (5) 68% | Cross-sectional study Small sample |
| Haynos et al. [ | 415 female adults with a diagnosed ED, of which 320 completed measures at Year 1, and 277 completed measures at Year 2 | Demographics, psychiatric treatment, ED symptoms, other psychiatric diagnoses and symptoms, self-esteem | (1) ED diagnosis; (2) objective binge eating; (3) compensatory behaviours; and (4) underweight BMI | Elastic net regularized logistic regressions | At year 1: (1) 62%; (2) 77%; (3) 88%; (4) 93% At year 2: (1) 61%; (2) 71%; (3) 85%; (4) 89% | Small sample |
| Krug et al. [ | 1402 adolescents and adults (92% female), with (n = 588) or without (n = 760) a diagnosed ED | Cross-cultural risk factors for EDs before the age of 12 | (1) ED onset; (2) differential ED diagnoses | Penalised logistic regression (LASSO) | (1) 89%; (2) 70% | Cross-sectional study |
| Linardon et al. [ | 1341 adults (91% females), with (n = 512) and without (n = 829) recurrent binge eating | Intuitive eating behaviours, flexible restraint behaviours, rigid restraint behaviours, rigid restraint cognitions | Recurrent binge eating behaviour | Decision tree classification | 70% | Cross-sectional study |
| Orru et al. [ | 107 females, with (n = 53) and without (n = 54) a diagnosed ED | Presence of manic/hypomanic and depressive symptoms, AN and BN symptoms | ED status | Naïve bayes | 91% | Cross-sectional study Small sample |
| Ren et al. [ | 830 non-clinical young females | Psychological distress, eating inflexibility, body image inflexibility, body dissatisfaction, emotional overeating, loss of control overeating, body mass index | ED risk | Decision tree classification | 85% | Cross-sectional study |
| Rosenfield and Linstead [ | 44 female young adults, with (n = 20) and without (n = 24) a previous diagnosis of AN | ED symptoms, psychosocial impairment, symptoms of autism spectrum disorder | ED status | K-means clustering | 78% | Cross-sectional study Small sample |
ED, eating disorder; AN, anorexia nervosa; BN, bulimia nervosa; BED, binge eating disorder; BMI, body mass index; ML, machine learning
Research using ML to detect eating disorder risk via social media and internet data
| Study | Sample | Predictors variables | Outcome variables | Best performing ML approach | Explanatory power of best performing model | Limitations |
|---|---|---|---|---|---|---|
| Benítez-Andrades et al. [ | 494,025 posts containing ED-related words on Twitter | Posts with ED-related words | (1) Posts written by users who identify online as suffering from EDs; (2) posts that promoted having an ED; (3) informative posts; (4) scientific posts | Bidirectional encoder representations from transformer–based models (RoBERTa) | (1) 83%; (2) 89%; (3) 84%; (4) 94% | Outcome based on content of posts not validated measures of EDs Only identifies content of people who publicly acknowledge their ED on Twitter Specific predictors unknown |
| Chancellor et al. [ | 62,000 posts with removed pro-ED content or ED content remaining publicly available on Instagram | Combinations and frequencies of different ED hashtags and captions | Whether the post was removed or still publicly available | Logistic regression | 69% | Only identifies content of people who publicly acknowledge their ED on Instagram |
| Chancellor et al. [ | 26 million posts from 100,000 users who post pro-ED content on Instagram | Mental illness severity (MIS; low, medium, high) in a user’s previous posts based on the content of hashtags | MIS (low, medium, high) based on the content of hashtags | Multinomial logistic regression | 81% | MIS inferred from posts not validated measures |
| Chancellor et al. [ | 877,000 pro-ED photo posts shared on Tumblr, 569 of which were removed by Tumblr | Text, hashtag, and photo content from Tumblr posts | Whether the post would be/was removed by Tumblr for violating community guidelines | Deep neural network | 89% | Specific predictors unknown |
| De Choudhury [ | 55,334 posts collected from 18,923 blogs on Tumblr who mentioned common ED and anorexia symptomatology tags | Social, affective, cognitive, and linguistic style expression in posts | (1) Whether a post shares any kind of anorexia related content; (2) Whether a post relates to the proana or the pro-recovery community | Support vector machine classifier | (1) 83%; (2) 81% | Outcome based on content of posts not validated measures of EDs Only identifies content of people who publicly acknowledge their ED on Tumblr |
| Hwang et al. [ | 185,950 posts and 3,528,107 comments from a weight management subcommunity on Reddit | 4 types of emotional eating behaviours and 5 types of feedback based on Latent Dirichlet Allocation topic modelling method | Emotional eating diagnosis based on authors’ expertise | Stochastic gradient descent | 91% | Outcome based on content of posts not validated measures of EDs |
| Sadeh-Sharvit et al. [ | 231 adult women on Prolific who contributed their internet browsing history over the past 6-months | Keywords related to EDs, daily visits to social media, fraction of searches on Google or Bing, activity rates, participant age | ED status (clinical/subclinical ED, high risk for an ED, or no ED) based on responses to validated surveys | GentleBoost | 53% | Small sample size |
| Wang et al. [ | 119,825,361 posts on Twitter from 72,047 users, of which 1,797,239 posts were from 3380 users who self-identify with an ED on Twitter | User engagement and activity, posting preference, interaction diversity, psychometric properties of posts | ED status (ED or non-ED user) | Support vector machine | 97% | ED status determined by self-identifying ED on Twitter Only identifies content of people who publicly acknowledge their ED on Twitter |
| Yan et al. [ | 4759 posts from 6 ED-related subcommunities on Reddit | Relationships between key words within the text of each post | Whether users need immediate mental health support for an ED based on expertise of two clinical psychologists | Logistic regression | 96% | Required human coders with extensive expertise |
| Zhou et al. [ | 18,288 posts on Twitter with ED-related words | ED-related words in posts | ED-related topic clusters/themes | Correlation Explanation (CorEx) topic model | 78% | Outcome based on content of posts not validated measures of EDs Only identifies content of people who publicly acknowledge their ED on Twitter |
| Zhou et al. [ | 123,977 posts on Twitter with ED-related words | Posts with ED-related words | (1) ED-relevant and ED-irrelevant posts; (2) ED-promotional and education and ED-laypeople posts | Convolutional neural network (CNN) and long short-term memory (LSTM) | (1) 89%; (2) 90% | Outcome based on content of posts not validated measures of EDs Only identifies content of people who publicly acknowledge their ED on Twitter |
ED, eating disorder; ML, machine learning
Research using ML to detect eating disorder risk via physiological predictors
| Study | Sample | Predictors variables | Outcome variables | Best performing ML approach | Explanatory power of best performing model | Limitations |
|---|---|---|---|---|---|---|
| Cerasa et al. [ | 36 females, with an ED diagnosis ( | Structural magnetic resonance images | ED status | Support vector machine | 85% | Small sample size |
| Cyr et al. [ | 84 female adolescents, who met criteria for BN ( | Functional magnetic resonance images of fronto-striatal regions during performance of a Simon task | ED status | Support vector machine | BN v. HC = 58% SBN v. HC = 64% Train BN v. HC, Test SBN v. HC = 66% | Small sample |
| Guo et al. [ | 13,206 adolescent and adult females, who have AN ( | Whole genome genotyping data | AN status | Logistic regression with LASSO penalty | 69% | |
| Ioannidis et al. [ | 3937 observations from 36 AN inpatients | Physiological parameters, blood investigations over a 1-year period | Medical risk defined by independent clinical rates of deteriorating cases | Random forest | 98% | Small sample |
| Lavagnino et al. [ | 30 females, with an AN diagnosis ( | Structural neuroimaging scans | ED status | Least absolute shrinkage and selection operator (LASSO) | 83% | Small sample |
| Lavagnino et al. [ | 67 adult females, who have restrictive-type AN ( | Structural brain scans to test cortical thickness | ED status | Linear relevance vector machine | AN v. HC = 74% | Small sample |
| Strigo et al. [ | 1 adolescent female with mixed ED, depressive, and gastrointestinal symptoms | Functional magnetic resonance images during a pain anticipation program and psychological survey responses created on previous samples | Which diagnostic phenotype most closely approximates the patient | Support vector machine | 56% based on brain activation 84% based on psychological variables | Case study |
| Weygandt et al. [ | 70 females, who met criteria for binge eating disorder (BED; | Functional imaging from a whole-body tomograph while viewing food or neutral images | ED status | Support vector machine | BED v. C-NW = 86% BN v. C-NW = 78% BED v. C-OW = 71% BN v. C-OW = 86% BED v. BN = 84% | Small sample size |
ED, eating disorder; AN, anorexia nervosa; BN, bulimia nervosa; BED, binge eating disorder; HC, healthy controls; ML, machine learning