Literature DB >> 33355678

Development of a Machine Learning Model Using Multiple, Heterogeneous Data Sources to Estimate Weekly US Suicide Fatalities.

Daejin Choi1, Steven A Sumner2, Kristin M Holland3, John Draper4, Sean Murphy4, Daniel A Bowen3, Marissa Zwald3, Jing Wang3, Royal Law5, Jordan Taylor6, Chaitanya Konjeti6, Munmun De Choudhury6.   

Abstract

Importance: Suicide is a leading cause of death in the US. However, official national statistics on suicide rates are delayed by 1 to 2 years, hampering evidence-based public health planning and decision-making. Objective: To estimate weekly suicide fatalities in the US in near real time. Design, Setting, and Participants: This cross-sectional national study used a machine learning pipeline to combine signals from several streams of real-time information to estimate weekly suicide fatalities in the US in near real time. This 2-phase approach first fits optimal machine learning models to each individual data stream and subsequently combines predictions made from each data stream via an artificial neural network. National-level US administrative data on suicide deaths, health services, and economic, meteorological, and online data were variously obtained from 2014 to 2017. Data were analyzed from January 1, 2014, to December 31, 2017. Exposures: Longitudinal data on suicide-related exposures were obtained from multiple, heterogeneous streams: emergency department visits for suicide ideation and attempts collected via the National Syndromic Surveillance Program (2015-2017); calls to the National Suicide Prevention Lifeline (2014-2017); calls to US poison control centers for intentional self-harm (2014-2017); consumer price index and seasonality-adjusted unemployment rate, hourly earnings, home price index, and 3-month and 10-year yield curves from the Federal Reserve Economic Data (2014-2017); weekly daylight hours (2014-2017); Google and YouTube search trends related to suicide (2014-2017); and public posts on suicide on Reddit (2 314 533 posts), Twitter (9 327 472 tweets; 2015-2017), and Tumblr (1 670 378 posts; 2014-2017). Main Outcomes and Measures: Weekly estimates of suicide fatalities in the US were obtained through a machine learning pipeline that integrated the above data sources. Estimates were compared statistically with actual fatalities recorded by the National Vital Statistics System.
Results: Combining information from multiple data streams, the machine learning method yielded estimates of weekly suicide deaths with high correlation to actual counts and trends (Pearson correlation, 0.811; P < .001), while estimating annual suicide rates with low error (0.55%). Conclusions and Relevance: The proposed ensemble machine learning framework reduces the error for annual suicide rate estimation to less than one-tenth of that of current forecasting approaches that use only historical information on suicide deaths. These findings establish a novel approach for tracking suicide fatalities in near real time and provide the potential for an effective public health response such as supporting budgetary decisions or deploying interventions.

Entities:  

Year:  2020        PMID: 33355678      PMCID: PMC7758810          DOI: 10.1001/jamanetworkopen.2020.30932

Source DB:  PubMed          Journal:  JAMA Netw Open        ISSN: 2574-3805


  23 in total

1.  Super learner.

Authors:  Mark J van der Laan; Eric C Polley; Alan E Hubbard
Journal:  Stat Appl Genet Mol Biol       Date:  2007-09-16

2.  Internet monitoring of suicide risk in the population.

Authors:  Michael J McCarthy
Journal:  J Affect Disord       Date:  2009-09-12       Impact factor: 4.839

3.  Social sciences. Social media for large studies of behavior.

Authors:  Derek Ruths; Jürgen Pfeffer
Journal:  Science       Date:  2014-11-28       Impact factor: 47.728

4.  Tracking search engine queries for suicide in the United Kingdom, 2004-2013.

Authors:  V S Arora; D Stuckler; M McKee
Journal:  Public Health       Date:  2016-03-11       Impact factor: 2.427

5.  Increase in Suicide Mortality in the United States, 1999-2018.

Authors:  Holly Hedegaard; Sally C Curtin; Margaret Warner
Journal:  NCHS Data Brief       Date:  2020-04

6.  Facebook language predicts depression in medical records.

Authors:  Johannes C Eichstaedt; Robert J Smith; Raina M Merchant; Lyle H Ungar; Patrick Crutchley; Daniel Preoţiuc-Pietro; David A Asch; H Andrew Schwartz
Journal:  Proc Natl Acad Sci U S A       Date:  2018-10-15       Impact factor: 11.205

7.  Detecting relapse in youth with psychotic disorders utilizing patient-generated and patient-contributed digital data from Facebook.

Authors:  M L Birnbaum; S K Ernala; A F Rizvi; E Arenare; A R Van Meter; M De Choudhury; J M Kane
Journal:  NPJ Schizophr       Date:  2019-10-07

8.  Social media use in the United States: implications for health communication.

Authors:  Wen-ying Sylvia Chou; Yvonne M Hunt; Ellen Burke Beckjord; Richard P Moser; Bradford W Hesse
Journal:  J Med Internet Res       Date:  2009-11-27       Impact factor: 5.428

9.  Influenza forecasting with Google Flu Trends.

Authors:  Andrea Freyer Dugas; Mehdi Jalalpour; Yulia Gel; Scott Levin; Fred Torcaso; Takeru Igusa; Richard E Rothman
Journal:  PLoS One       Date:  2013-02-14       Impact factor: 3.240

10.  Advanced Daily Prediction Model for National Suicide Numbers with Social Media Data.

Authors:  Kyung Sang Lee; Hyewon Lee; Woojae Myung; Gil-Young Song; Kihwang Lee; Ho Kim; Bernard J Carroll; Doh Kwan Kim
Journal:  Psychiatry Investig       Date:  2018-04-05       Impact factor: 2.505

View more
  3 in total

Review 1.  Leveraging data science to enhance suicide prevention research: a literature review.

Authors:  Avital Rachelle Wulz; Royal Law; Jing Wang; Amy Funk Wolkin
Journal:  Inj Prev       Date:  2021-08-19       Impact factor: 3.770

2.  Opportunities and challenges of using social media big data to assess mental health consequences of the COVID-19 crisis and future major events.

Authors:  Martin Tušl; Anja Thelen; Kailing Marcus; Alexandra Peters; Evgeniya Shalaeva; Benjamin Scheckel; Martin Sykora; Suzanne Elayan; John A Naslund; Ketan Shankardass; Stephen J Mooney; Marta Fadda; Oliver Gruebner
Journal:  Discov Ment Health       Date:  2022-06-27

3.  Estimating Weekly National Opioid Overdose Deaths in Near Real Time Using Multiple Proxy Data Sources.

Authors:  Steven A Sumner; Daniel Bowen; Kristin Holland; Marissa L Zwald; Alana Vivolo-Kantor; Gery P Guy; William J Heuett; DeMia P Pressley; Christopher M Jones
Journal:  JAMA Netw Open       Date:  2022-07-01
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.