| Literature DB >> 31651969 |
Brett Beaulieu-Jones1, Samuel G Finlayson2, Corey Chivers3, Irene Chen4, Matthew McDermott4, Jaz Kandola5, Adrian V Dalca1,4, Andrew Beam6, Madalina Fiterau7, Tristan Naumann8.
Abstract
Importance: The use of machine learning applications related to health is rapidly increasing and may have the potential to profoundly affect the field of health care. Objective: To analyze submissions to a popular machine learning for health venue to assess the current state of research, including areas of methodologic and clinical focus, limitations, and underexplored areas. Design, Setting, and Participants: In this data-driven qualitative analysis, 166 accepted manuscript submissions to the Third Annual Machine Learning for Health workshop at the 32nd Conference on Neural Information Processing Systems on December 8, 2018, were analyzed to understand research focus, progress, and trends. Experts reviewed each submission against a rubric to identify key data points, statistical modeling and analysis of submitting authors was performed, and research topics were quantitatively modeled. Finally, an iterative discussion of topics common in submissions and invited speakers at the workshop was held to identify key trends. Main Outcomes and Measures: Frequency and statistical measures of methods, topics, goals, and author attributes were derived from an expert review of submissions guided by a rubric.Entities:
Year: 2019 PMID: 31651969 PMCID: PMC6822089 DOI: 10.1001/jamanetworkopen.2019.14051
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Data Sets and Data Access for Each Type of Data
| Data Type | Total No. of Data Sets | No. (%) of Data Sets | |
|---|---|---|---|
| Public or Apply for Access | Private, Institutional Access, Self-collected, or Synthetic (Not Shared) | ||
| Total | 166 | 97 (58.4) | 69 (41.6) |
| Structured | 54 | 28 (51.9) | 26 (48.1) |
| Image | 39 | 28 (71.8) | 11 (28.2) |
| Text | 34 | 19 (55.9) | 15 (44.1) |
| Biological sequence | 10 | 8 (80.0) | 2 (20.0) |
| ECG | 10 | 6 (60.0) | 4 (40.0) |
| EEG | 7 | 4 (57.1) | 3 (42.9) |
| Speech | 6 | 1 (16.7) | 5 (83.3) |
| Video | 6 | 3 (50.0) | 3 (50.0) |
Abbreviations: ECG, electrocardiography; EEG, electroencephalography.
Clinical Conditions
| Condition | No. (%) of |
|---|---|
| Brain and mental health | 25 (15.1) |
| Oncology | 21 (12.7) |
| Cardiovascular | 19 (11.4) |
| Diabetes | 10 (6.0) |
| Pregnancy and natality | 6 (3.6) |
| Pulmonary | 6 (3.6) |
| ICU | 5 (3.0) |
| Infection | 5 (3.0) |
| Mobility and skeletal conditions | 4 (2.5) |
| Mortality | 3 (1.8) |
| Vocal disorder | 3 (1.8) |
| Quality of care or quality of life | 2 (1.2) |
| Hemorrhage | 2 (1.2) |
| Addiction, smoking, opioid | 2 (1.2) |
| Multiple | 7 (4.2) |
| Other | 12 (7.8) |
| NA | 43 (25.9) |
Abbreviations: ICU, intensive care unit; NA, not applicable.
Some manuscripts addressed more than 1 condition (percentages will not equal 100).
Figure 1. Visualization of 12-Topic Model Trained on Third Annual Machine Learning for Health (ML4H) Workshop Manuscripts
Principal component (PC) projection of 12 topics learned from ML4H manuscripts. An interactive version can be viewed online.[23]
Figure 2. Selected Topics Representing Application Domains and Methods From the Topic Model Trained on Third Annual Machine Learning for Health (ML4H) Workshop Manuscripts
An interactive version can be viewed online.[23] AI indicates artificial intelligence; ICD, International Classification of Diseases. dt, ti, xi, and yi are equation variables common in reinforcement learning algorithms.