Literature DB >> 32517778

The use of machine learning in rare diseases: a scoping review.

Julia Schaefer^1,2, Moritz Lehne³, Josef Schepers², Fabian Prasser^2,4, Sylvia Thun^2,4,5.

Abstract

BACKGROUND: Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare disease context. This scoping review aims to address this gap and explores the use of machine learning in rare diseases, investigating, for example, in which rare diseases machine learning is applied, which types of algorithms and input data are used or which medical applications (e.g., diagnosis, prognosis or treatment) are studied.
METHODS: Using a complex search string including generic search terms and 381 individual disease names, studies from the past 10 years (2010-2019) that applied machine learning in a rare disease context were identified on PubMed. To systematically map the research activity, eligible studies were categorized along different dimensions (e.g., rare disease group, type of algorithm, input data), and the number of studies within these categories was analyzed.
RESULTS: Two hundred eleven studies from 32 countries investigating 74 different rare diseases were identified. Diseases with a higher prevalence appeared more often in the studies than diseases with a lower prevalence. Moreover, some rare disease groups were investigated more frequently than to be expected (e.g., rare neurologic diseases and rare systemic or rheumatologic diseases), others less frequently (e.g., rare inborn errors of metabolism and rare skin diseases). Ensemble methods (36.0%), support vector machines (32.2%) and artificial neural networks (31.8%) were the algorithms most commonly applied in the studies. Only a small proportion of studies evaluated their algorithms on an external data set (11.8%) or against a human expert (2.4%). As input data, images (32.2%), demographic data (27.0%) and "omics" data (26.5%) were used most frequently. Most studies used machine learning for diagnosis (40.8%) or prognosis (38.4%) whereas studies aiming to improve treatment were relatively scarce (4.7%). Patient numbers in the studies were small, typically ranging from 20 to 99 (35.5%).
CONCLUSION: Our review provides an overview of the use of machine learning in rare diseases. Mapping the current research activity, it can guide future work and help to facilitate the successful application of machine learning in rare diseases.

Entities: Chemical Disease Gene Species

Keywords: Machine learning; Rare diseases; Scoping review

Mesh：

Year: 2020 PMID： 32517778 PMCID： PMC7285453 DOI： 10.1186/s13023-020-01424-6

Source DB: PubMed Journal: Orphanet J Rare Dis ISSN： 1750-1172 Impact factor: 4.123

Background

Diseases that affect fewer than 5 patients per 10,000 are defined as rare in Europe [1]. But rare diseases are only rare when considered individually. With more than 6000 known rare diseases [2], their collective global health burden is high, and recent estimates report a population prevalence of at least 3.5–5.9% [3]. (The true prevalence is probably higher, as for many rare diseases epidemiological data are scarce.) Moreover, due to their often genetic origin and early disease onset – often in infancy or childhood – most rare diseases follow patients for large parts of their lives, thus further exacerbating the disease burden. More than 80% of rare diseases affect fewer than one patient in a million [3]. This means that, for most rare diseases, even experienced physicians with a lot of patient contact never see a single patient in their lifetime. Correctly diagnosing patients is therefore difficult: According to a survey from 2013, it takes, on average, more than 5 years, eight physicians and two to three misdiagnoses until a rare disease patient receives the correct diagnosis [4]. Once correctly diagnosed, the challenges continue: Due to the small patient numbers, commercial incentives for developing medications are often low (although policies and legislations aim to raise financial incentives for developing rare disease treatments). Furthermore, the pathophysiological mechanisms underlying rare diseases are often not well understood. As a consequence, many rare diseases lack adequate treatment options. Improving the diagnosis and treatment of rare diseases is therefore an important public health concern. One valuable approach for improving medical care for rare disease patients are initiatives and networks that aim to bundle data and expertise about rare diseases so that healthcare providers can easily access and exchange relevant information. One of the most extensive knowledge bases for rare diseases is Orphanet [5], which provides information about, for example, disease epidemiology, associated genes, inheritance types, disease onsets or references to terminologies, as well as links to expert centers, patient organizations and other resources. Other European initiatives include RD-Connect, which combines registries, biobanks and genetic data with bioinformatics tools to provide a central resource for research on rare diseases [6]; the European Reference Networks (ERNs), which provide an IT infrastructure that allows healthcare professionals to collaborate on virtual panels to exchange knowledge and decide on optimal treatments [7]; and the European Joint Programme on Rare Diseases (EJP RD), a multinational cooperation aiming to create an ecosystem that facilitates research, care and medical innovation in the field of rare diseases [8]. In the US, the Undiagnosed Diseases Network (UDN) brings together experts to diagnose and treat patients with rare conditions [9]. And in Germany, a new national initiative, the Collaboration on Rare Diseases (CORD-MI), aims to improve the documentation and data exchange of rare diseases across German university hospitals [10]. In addition to these collaborative efforts and international platforms, another important factor that can improve the situation for rare disease patients are advances in information technology – particularly in the field of artificial intelligence (AI) and machine learning. AI and machine learning typically use large, multivariate datasets to “train” algorithms, which are then used to make predictions on new data (for example, by classifying tumors in radiological images as benign or malignant). Importantly, the computations by which these methods generate their output are not explicitly coded by a programmer, but instead are implicitly “learned” by the algorithm from example data (hence the term “machine learning”). AI and machine learning are increasingly applied in medicine and healthcare [11, 12] and, in some areas, are beginning to achieve (and sometimes even surpass) human-level performance [13-15]. Given the specific challenges in diagnosis and treatment discussed above, rare diseases can particularly benefit from AI and machine learning technologies: While it is virtually impossible for a physician to memorize information about thousands of rare diseases, modern computers can easily “memorize” huge quantities of digital information. If the computer can also extract and use this information in a meaningful way – for example, by classifying patients into disease groups or predicting outcomes – this has a high potential for improving diagnosis and treatment. Previous research, for example, has shown that an AI expert system that calculates disease probabilities based on patient symptoms can potentially accelerate rare disease diagnoses [16]. Using methods of computer vision and deep learning, another system, Face2Gene, can assist physicians in diagnosing rare genetic conditions based on photographs of patients’ faces [17]. Despite its potential for improving the quality of care for patients, the use of machine learning in the field of rare diseases has not been comprehensively reviewed (but see [18] for an overview with a special focus on congenital disorders of glycosylation). For example, it is unclear in which rare diseases machine learning is applied, which algorithms are typically used, which medical applications are studied (e.g., diagnosis, prognosis or treatment) and which type of input data is used. In this scoping review, we explore the scientific literature to answer these questions and investigate how machine learning is currently used in the context of rare diseases. Providing an overview of research in machine learning and rare diseases, our review can help to direct future work in this area, for example, by pointing to gaps in research or to promising fields for future study.

Methods

We opted to perform a scoping review because this type of review is best suited to map research activity in a broad and heterogeneous field such as machine learning and rare diseases (unlike typical systematic literature reviews that focus on more specific research questions) [19-22]. Where applicable, we follow the guidelines of the PRISMA extension for scoping reviews (PRISMA-ScR) [23]. No review protocol was registered for this study. To identify scientific articles that apply machine learning in the field of rare diseases, we systematically searched the literature on PubMed. The search string was constructed by concatenating general terms related to machine learning (“machine learning”, “artificial intelligence”) and rare diseases (“rare disease”, “orphan disease”), as well as names and synonyms of 381 specific rare diseases. These specific diseases comprised all rare diseases listed by Orphanet [5] with a point prevalence of 1–5 per 10,000 (146 diseases) or 1–9 per 100,000 patients (235 diseases). For many of these diseases, Orphanet provides PubMed search strings that were used to construct the search (for example, “Deletion[ti] 4p[ti] OR 4p syndrome[tw] OR wolf hirschhorn[tw] OR (chromosome deletion[mh] chromosome 4[mh])” for the Wolf-Hirschhorn syndrome). For diseases where no such search strings were available from Orphanet, the disorder name was used (for the exact search terms, see Additional file 1). The search was first conducted on January 2, 2020. During the revision process of the manuscript the search term was slightly modified and the search was conducted again on May 5, 2020. (The initial search term used in January had included some specific machine learning methods, such as “neural network” and “deep learning”, which could have biased the search results towards these methods. These search terms were omitted in the final search.) To be included in this review, the studies identified in the search had to fulfill the following eligibility criteria: rare disease topic; use of at least one machine learning method (and a description of the machine learning algorithm in sufficient detail to extract the basic information analyzed in this review); publication date between January 1, 2010, and December 31, 2019; publication as original research in a peer-reviewed journal or conference proceeding (i.e., review articles were excluded); publication in English or German; application of machine learning to human patient data or scientific texts or literature (i.e., articles using animal or simulation data were excluded). As our review does not aim to answer a specific clinical question, but instead explores the use of machine learning in rare diseases from a general perspective, we did not restrict eligibility to specific patient populations, interventions (except, of course, the use of machine learning), control groups or outcomes. For the same reason, we did not assess bias in the studies. After having selected relevant studies according to the eligibility criteria, the following data were extracted from the articles: 1) rare disease (diseases were specified using the Orphanet disorder name; studies investigating more than one disease were categorized as “Diverse”); 2) rare disease group (according to the “preferential parent” of the disease as defined in the hierarchy of the Orphanet classification [24], e.g., rare neurologic disease, rare hematologic disease etc.); 3) prevalence of rare disease (according to epidemiological information from Orphanet); 4) year of publication; 5) country where study was conducted (according to the senior author’s affiliation); 6) number of patients (if applicable / available); 7) medical application (i.e., “Diagnosis”, “Treatment”, “Prognosis” or “Basic research”); 8) type of input data; 9) type of algorithm; 10) validation of algorithm on external data or against human expert. For the variables “medical application”, “type of input data” and “type of algorithm”, categories were defined into which the studies were grouped. Categories were defined in a two-step process: First, the medical application, input data and machine learning algorithm were assessed in detail for each study (for example, a study might be described as aiming to distinguish patients from healthy controls, using a convolutional neural network on magnetic resonance imaging data of the brain). Based on these detailed data, two of the authors (JuS and ML) then defined meaningful, more general categories into which studies were grouped (for the previous example, this would be “Diagnosis” as medical application, “Images” as input data and “Artificial neural network” as type of algorithm). We did not rely on typical textbook categorizations of these variables (for example, classifying machine learning algorithms into supervised, unsupervised or reinforcement learning), as these categorizations were found not to be sufficiently informative and did not adequately reflect the studies (reinforcement learning, for example, does not play a significant role in the context of rare diseases). Instead, we defined a set of categories that aimed for a balance between sufficient detail and meaningful generalizations. This resulted in roughly ten categories for “type of input data” and “type of algorithm”. Note that a study could be grouped into more than one category when it used more than one type of input data or algorithm. Table 1 shows the variables extracted from the studies and the categories used for each variable.

Table 1

Data extracted from the studies

Variable	Categories	Definition	Example(s)
Rare disease	All rare diseases described at least once in the studies (studies investigating more than one rare disease were categorized as “Diverse”)	Orphanet disorder name	Cystic fibrosis, Sickle cell anemia, Gaucher disease
Disease group	All disease groups of the 381 specific diseases included in the search as well as disease groups of other diseases identified in the studies	Orphanet disease group as defined by the preferential parent in the classification hierarchy	Rare neurologic disease, Rare respiratory disease, Rare endocrine disease
Publication year	Years from 2010 to 2019	Year of the publication date of the article
Country of study	All countries that published at least one article	Country of institution of senior (i.e. last) author of the study
Medical application	Diagnosis	Studies aiming to correctly diagnose patients	Classification of cases and controls or different disease subtypes, Identification of biomarkers, Deep phenotyping, Decision support
	Treatment	Studies aiming to improve treatment or develop new therapies	Detection of therapeutic targets, Identification of binding proteins
	Prognosis	Prediction of a patient-relevant endpoint	Prediction of complication, disease onset, survival, disease progression, Risk estimation
	Basic research	Other basic research not classified into one of the categories above	Exploration of molecular disease mechanisms
Patient number	“< 20”, “20–99”, “100–1000”, “> 1000”, “not applicable / no information”	Number of patients included in the study
Input data^a	Clinical test score	Data from a clinical test score	Glasgow Coma Scale, ALS Functional Rating Scale
	Demographic data	General patient characteristics	Age, Sex, Ethnicity
	Functional test data	Data from physiological tests	ECG, EEG, EMG, gait pattern, pulse, blood pressure, eye movements
	Images	Data from medical imaging	MRI, PET, CT, retinal images, face photographs
	Laboratory data	Data from laboratory test	Blood glucose, platelet counts, creatinine
	Literature	Data extracted from scientific texts	Published literature, NCBI disease corpus
	Medication data	Data about medication	Use of antibiotics, medication plan
	Omics data	Molecular data	Genomics, Proteomics, Metabolomics, Epigenomics
	Patient / Family history	Data from patients’ or relatives’ past medical history	Pre-existing conditions, parental data
	Other EHR data	Other data from electronic health records	Diagnoses, procedures, other medical records
	Other	Other types of input data	Questionnaire or interview data, donors’ characteristics in HSCT
Type of algorithm^a	Artificial Neural Network		Convolutional neural network, Recurrent neural network, Multi-layer perceptron
	Bayesian Methods		Naïve Bayes
	Clustering		k-means clustering, Hierarchical clustering
	Decision Tree		Decision tree
	Discriminant Analysis		Linear discriminant analysis
	Ensemble Methods		AdaBoost, Random forest
	Instance-based Learning		k-nearest neighbor
	Regression (logistic)		Logistic regression
	Regression (other)		Linear regression
	Support Vector Machine		Support vector machine
	Other	Algorithms not classified into one of the categories above	Reinforcement learning, Graphical models
External validation	yes / no	Performance of algorithm tested on external data or against a human expert	Comparing automated scoring of chest radiographs with scoring by radiologists

aFor these variables, a study could be assigned to more than one category

Data extracted from the studies aFor these variables, a study could be assigned to more than one category Study selection and data extraction were performed by the first author (JuS). For unclear cases, the selection and data extraction were reviewed by the second author (ML) and discussed until a consensus was reached. Extracted data were saved in a spreadsheet for subsequent analysis. To get an overview of the use of machine learning in rare diseases, we then explored, for each of the variables described above, how many studies were in each category. We also explored possible gaps in research by comparing the distribution of rare disease groups investigated in the studies with the “baseline” distribution of disease groups of the 381 diseases included in our search. For this, we calculated the percentage of diseases within each disease group for the diseases from the studies as well as for the diseases from the search list and then calculated their difference (in percentage points). The magnitude of the difference then indicated which disease groups were underrepresented (or overrepresented) in the studies. All data analyses and visualizations were done with R [25] and the tidyverse packages [26].

Results

The literature search identified a total of 337 unique records. After screening and assessing the articles for eligibility, 211 articles were included in the final analysis (Fig. 1; the list of articles and extracted data is included in Additional file 1). Though not a strict inclusion criterion, all articles in the final selection were in English (no German-language articles were eligible for inclusion).

Fig. 1

Selection of sources of evidence

Selection of sources of evidence The studies originated from 32 different countries, with the largest number of publications (n = 91, 43.1%) coming from the United States (Fig. 2a, b). Over the 10-year time period considered in this review, publication numbers increased from 3 publications in 2010 to 79 publications in 2019. This increase in publication numbers appeared to parallel the increase of publications about machine learning in general (Fig. 2c).

Fig. 2

World map showing publications by country (a); countries with more than five publications (b); total number of publications per year (c; for comparison, the inset shows the publication trend for machine learning in general) Seventy-four different rare diseases were investigated in the studies. Of these 74 diseases, 71 were part of the list of the 381 rare diseases that were explicitly included in the search string (18.6%). Three diseases not explicitly listed in the search string – multiple osteochondromas, Fanconi anemia, juvenile idiopathic arthritis – were additionally described in the studies (these studies were identified via the generic search terms “rare disease” or “orphan disease”). Of the 74 diseases, 41 (55.4%) had a prevalence of 1–5 / 10,000 patients, 31 (41.9%) had a prevalence of 1–9 / 100,000, and 2 (2.8%) had a prevalence of 1–9 / 1000,000. The diseases most frequently investigated in the studies were amyotrophic lateral sclerosis, systemic lupus erythematosus, moderate and severe traumatic brain injury and cystic fibrosis (Table 2; note that some studies investigated more than one disease).

Table 2

Rare diseases most frequently investigated in the studies (all diseases appearing in five or more studies are listed)

Rare disease	Orpha number	Prevalence	Number of studies
Amyotrophic lateral sclerosis	803	1–9 / 100,000	16 (7.6%)
Systemic lupus erythematosus	536	1–5 / 10,000	14 (6.6%)
Moderate and severe traumatic brain injury	90056	1–5 / 10,000	12 (5.7%)
Cystic fibrosis	586	1–9 / 100,000	10 (4.7%)
More than one rare disease investigated	–	–	10 (4.7%)
Huntington disease	399	1–9 / 100,000	9 (4.3%)
Down syndrome	870	1–5 / 10,000	7 (3.3%)
Preeclampsia	275555	1–5 / 10,000	7 (3.3%)
Acquired aneurysmal subarachnoid hemorrhage	90065	1–5 / 10,000	6 (2.8%)
Systemic sclerosis	90291	1–5 / 10,000	6 (2.8%)
Fragile X syndrome	908	1–5 / 10,000	5 (2.4%)
Retinopathy of prematurity	90050	1–5 / 10,000	5 (2.4%)

Rare diseases most frequently investigated in the studies (all diseases appearing in five or more studies are listed) Comparing the distribution of disease groups investigated in the studies with the expected distribution (i.e., the “baseline” distribution of the diseases included in the literature search) revealed some groups that appeared to be overrepresented in the studies: Rare neurologic diseases, rare systemic or rheumatologic diseases, rare respiratory diseases, rare cardiac diseases and rare gastroenterologic diseases were investigated more frequently than to be expected (to a lesser extent, also rare hematologic and rare bone diseases). Conversely, other disease groups appeared to be underrepresented: Rare developmental defects during embryogenesis, rare inborn errors of metabolism, rare skin diseases and rare endocrine diseases were investigated less frequently than to be expected from their distribution in the search string (Fig. 3). For example, there were no studies on rare skin diseases, although the Orphanet list used in the literature search included 19 rare skin disorders (5.0%).

Fig. 3

Distribution across disease groups: The distribution of the 381 diseases included in the literature search is shown in comparison with the distribution of the 74 diseases investigated in the studies (left; disease groups smaller than 3% are not shown); differences between the percentages show disease groups that are over- or underrepresented in the studies (right) The algorithms most frequently used in the studies were ensemble methods (n = 76, 36.0%), support vector machines (n = 68, 32.2%) and artificial neural networks (n = 67, 31.8%) (Fig. 4a). Most frequent input data used by the algorithms were images (n = 68, 32.2%), demographic data (n = 57, 27.0%) and omics data (n = 56, 26.5%) (Fig. 4b). Most studies used machine learning for diagnosis (n = 86, 40.8%) or prognosis (n = 81, 38.4%), whereas studies aiming to improve treatment were relatively scarce (n = 10, 4.7%) (Fig. 4c). The number of patients investigated in the studies ranged from a few cases to several thousands, with studies typically using data from 20 to 99 patients (n = 75, 35.5%) (Fig. 4d). Twenty-five studies (11.8%) used an external data set to validate their algorithm; 5 studies (2.4%) validated their algorithm against a medical expert.

Fig. 4

Types of algorithms used in the studies (a); input data (b); medical application (c); number of patients (d). Studies using more than one type of algorithm or input data are listed in more than one category

Discussion

In this scoping review, we explored the scientific literature about machine learning methods used in the context of rare diseases. In particular, we investigated in which rare diseases and disease groups machine learning was typically applied, which types of algorithms and input data were used and which medical applications were studied. Considering the large number of known rare diseases, the number of diseases investigated in the machine learning studies identified in this review was relatively small. The majority of diseases was in the highest prevalence class (1–5 / 10,000 patients), despite the search string including more diseases in the lower prevalence class (1–9 / 100,000 patients). Moreover, a large proportion of studies investigated a few relatively “common” or well-known rare diseases, such as amyotrophic lateral sclerosis, lupus or cystic fibrosis. This shows that the pattern that applies to rare diseases in general also seems to apply within the group of rare diseases: Diseases with a comparatively high prevalence are investigated more frequently whereas diseases with a lower prevalence are “orphans” that receive less attention. (However, note that our literature search might have missed some studies about diseases with a very low prevalence of 1–9 / 1,000,000 or lower because these diseases were not explicitly included in the search string and could only be identified via the generic rare disease search strings.) Our review also revealed some rare disease groups that were investigated more frequently than to be expected from their occurrence in the search string. For example, the number of studies investigating rare neurologic diseases, rare systemic or rheumatologic diseases, rare respiratory diseases, rare cardiac diseases and rare gastroenterologic diseases was higher than to be expected. This observation can partly be explained by the prevalence of the diseases within a disease groups, i.e. disease groups containing more diseases with higher prevalence being investigated more frequently in the studies (as described in the previous paragraph). However, there were also disease groups – for example neurologic diseases – that were overrepresented in the studies, despite containing more diseases with a lower prevalence. For these disease groups the availability of data may play an important role: Many of the overrepresented disease groups work with imaging data (e.g., MRI data for neurologic diseases), which lend themselves particularly well for their use with machine learning. Some disease groups may also appear more frequently because they are part of large medical disciplines (e.g., neurology, rheumatology, cardiology etc.), which are not limited to rare conditions, and which can therefore draw on a large pool of existing research and methods. There were also disease groups underrepresented in the studies. Most interestingly, our review did not identify any machine learning studies about rare skin diseases. This is surprising, as the diagnosis of skin conditions is often cited as one of the prime examples of successful machine learning applications in medicine [13, 27]. Developing machine learning applications for the diagnosis of rare skin conditions could therefore be a highly promising field of research. Similarly, rare inborn errors of metabolism and rare developmental defects during embryogenesis were also underrepresented in the studies and could possibly benefit from machine learning research – in particular because they constitute two of the most common groups of rare diseases. Investigating typical algorithms, we identified ensemble methods, support vector machines and artificial neural networks as the algorithms most frequently used in the studies. Again, the choice of algorithms in the studies could be partly due to the data available to the algorithms. Images were identified as the most common type of input data, and the algorithms typically used in the studies (e.g., artificial neural networks) work well with this type of data. Moreover, image data (such as MRI, PET or CET) are acquired in large quantities in medical practice and can be processed in a relatively standardized way, thus providing a good data source for machine learning. The barrier of applying machine learning to other types of data, such as unstructured text data in medical records, is higher because these data are often not standardized and therefore more difficult to process. This highlights the importance of international health IT standards and medical terminologies that can improve interoperability and that can help to make medical data more accessible to machine learning [28]. In the context of rare diseases, standard vocabularies such as SNOMED CT [29], the Orphanet rare disease nomenclature [30] or the Human Phenotype Ontology (HPO) [31, 32] could particularly facilitate data interoperability. Only a relatively small proportion of the studies in this review tested their algorithms on an external validation data set or validated performance against human experts. However, to facilitate translation of machine learning methods into clinical practice, appropriate validation is crucial. Machine learning studies should therefore aim to evaluate their performance on external data so that their potential for real-world application can be more easily assessed (of course, this applies to machine learning in general, not only in the context of rare diseases). Note that our review did not evaluate the performance of the machine learning algorithms, since the studies identified in this scoping review were too heterogeneous to perform meaningful comparisons across studies. To investigate algorithm performance, more specific systematic literature reviews and meta-analyses are needed (for example, focusing on specific diseases, input data or outcome variables). Most studies identified in this review focused on diagnosis and prognosis of rare diseases. Considering that these are typical applications of machine learning (i.e., classification and prediction), this is not surprising. However, machine learning can also play an important role in improving the treatment of rare diseases, and future studies could focus more on this aspect, for example by using machine learning to accelerate drug development [33]. As to be expected in the context of rare diseases, the number of patients included in the studies was relatively small. Comparable reviews investigating machine learning in more common diseases, for example in diabetes mellitus [34], cancer [35] or coronary artery disease [36], have access to larger pools of patient data. This is important, as the performance of machine learning algorithms largely depends on the amount of data available for training the algorithms. The lack of sufficient training data could also explain why rare diseases with a higher prevalence were investigated more often than lower prevalence diseases. It is therefore important to further promote cross-institutional and international collaboration to create data sets sufficiently large for machine learning research.

Conclusion

Advances in machine learning can significantly improve diagnosis, treatment and prognosis of rare disease patients. This scoping review explored more than 200 scientific studies from a 10-year time period to assess the use of machine learning in rare diseases. Our findings provide a broad overview for researchers and healthcare professionals, which can guide future research and inspire more specific systematic literature reviews and meta-analyses. Our findings also point to promising areas of future research that are underrepresented in current studies (e.g., using machine learning to diagnose rare skin conditions). Additional file 1. Articles_included: List of articles included in the review and extracted data; Articles_all: All articles identified by the literature search and reasons for exclusion (if excluded); Search_string: Search strings used for the systematic literature search.

22 in total

1. Guidance for conducting systematic scoping reviews.

Authors: Micah D J Peters; Christina M Godfrey; Hanan Khalil; Patricia McInerney; Deborah Parker; Cassia Baldini Soares
Journal: Int J Evid Based Healthc Date: 2015-09

Review 2. Machine Learning in Medicine.

Authors: Alvin Rajkomar; Jeffrey Dean; Isaac Kohane
Journal: N Engl J Med Date: 2019-04-04 Impact factor: 91.245

3. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence.

Authors: Huiying Liang; Brian Y Tsui; Hao Ni; Carolina C S Valentim; Sally L Baxter; Guangjian Liu; Wenjia Cai; Daniel S Kermany; Xin Sun; Jiancong Chen; Liya He; Jie Zhu; Pin Tian; Hua Shao; Lianghong Zheng; Rui Hou; Sierra Hewett; Gen Li; Ping Liang; Xuan Zang; Zhiqi Zhang; Liyan Pan; Huimin Cai; Rujuan Ling; Shuhua Li; Yongwang Cui; Shusheng Tang; Hong Ye; Xiaoyan Huang; Waner He; Wenqing Liang; Qing Zhang; Jianmin Jiang; Wei Yu; Jianqun Gao; Wanxing Ou; Yingmin Deng; Qiaozhen Hou; Bei Wang; Cuichan Yao; Yan Liang; Shu Zhang; Yaou Duan; Runze Zhang; Sarah Gibson; Charlotte L Zhang; Oulan Li; Edward D Zhang; Gabriel Karin; Nathan Nguyen; Xiaokang Wu; Cindy Wen; Jie Xu; Wenqin Xu; Bochu Wang; Winston Wang; Jing Li; Bianca Pizzato; Caroline Bao; Daoman Xiang; Wanting He; Suiqin He; Yugui Zhou; Weldon Haw; Michael Goldbaum; Adriana Tremoulet; Chun-Nan Hsu; Hannah Carter; Long Zhu; Kang Zhang; Huimin Xia
Journal: Nat Med Date: 2019-02-11 Impact factor: 53.440

4. Scoping studies: advancing the methodology.

Authors: Danielle Levac; Heather Colquhoun; Kelly K O'Brien
Journal: Implement Sci Date: 2010-09-20 Impact factor: 7.327

5. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation.

Authors: Andrea C Tricco; Erin Lillie; Wasifa Zarin; Kelly K O'Brien; Heather Colquhoun; Danielle Levac; David Moher; Micah D J Peters; Tanya Horsley; Laura Weeks; Susanne Hempel; Elie A Akl; Christine Chang; Jessie McGowan; Lesley Stewart; Lisa Hartling; Adrian Aldcroft; Michael G Wilson; Chantelle Garritty; Simon Lewin; Christina M Godfrey; Marilyn T Macdonald; Etienne V Langlois; Karla Soares-Weiser; Jo Moriarty; Tammy Clifford; Özge Tunçalp; Sharon E Straus
Journal: Ann Intern Med Date: 2018-09-04 Impact factor: 25.391

6. Identifying facial phenotypes of genetic disorders using deep learning.

Authors: Yaron Gurovich; Yair Hanani; Omri Bar; Guy Nadav; Nicole Fleischer; Dekel Gelbman; Lina Basel-Salmon; Peter M Krawitz; Susanne B Kamphausen; Martin Zenker; Lynne M Bird; Karen W Gripp
Journal: Nat Med Date: 2019-01-07 Impact factor: 53.440

7. Can a decision support system accelerate rare disease diagnosis? Evaluating the potential impact of Ada DX in a retrospective study.

Authors: Simon Ronicke; Martin C Hirsch; Ewelina Türk; Katharina Larionov; Daphne Tientcheu; Annette D Wagner
Journal: Orphanet J Rare Dis Date: 2019-03-21 Impact factor: 4.123

Review 8. Why digital medicine depends on interoperability.

Authors: Moritz Lehne; Julian Sass; Andrea Essenwanger; Josef Schepers; Sylvia Thun
Journal: NPJ Digit Med Date: 2019-08-20

Review 9. Machine learning applications in cancer prognosis and prediction.

Authors: Konstantina Kourou; Themis P Exarchos; Konstantinos P Exarchos; Michalis V Karamouzis; Dimitrios I Fotiadis
Journal: Comput Struct Biotechnol J Date: 2014-11-15 Impact factor: 7.271

10. Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach.

Authors: Zachary Munn; Micah D J Peters; Cindy Stern; Catalin Tufanaru; Alexa McArthur; Edoardo Aromataris
Journal: BMC Med Res Methodol Date: 2018-11-19 Impact factor: 4.615

20 in total

Review 1. A Scoping Review of Artificial Intelligence and Machine Learning in Bariatric and Metabolic Surgery: Current Status and Future Perspectives.

Authors: Athanasios G Pantelis; Georgios K Stravodimos; Dimitris P Lapatsanis
Journal: Obes Surg Date: 2021-07-15 Impact factor: 4.129

Review 2. Artificial Intelligence in Medical Imaging and its Impact on the Rare Disease Community: Threats, Challenges and Opportunities.

Authors: Navid Hasani; Faraz Farhadi; Michael A Morris; Moozhan Nikpanah; Arman Rhamim; Yanji Xu; Anne Pariser; Michael T Collins; Ronald M Summers; Elizabeth Jones; Eliot Siegel; Babak Saboury
Journal: PET Clin Date: 2022-01

3. Development of a Prediction Model for COVID-19 Acute Respiratory Distress Syndrome in Patients With Rheumatic Diseases: Results From the Global Rheumatology Alliance Registry.

Authors: Zara Izadi; Milena A Gianfrancesco; Alfredo Aguirre; Anja Strangfeld; Elsa F Mateus; Kimme L Hyrich; Laure Gossec; Loreto Carmona; Saskia Lawson-Tovey; Lianne Kearsley-Fleet; Martin Schaefer; Andrea M Seet; Gabriela Schmajuk; Lindsay Jacobsohn; Patricia Katz; Stephanie Rush; Samar Al-Emadi; Jeffrey A Sparks; Tiffany Y-T Hsu; Naomi J Patel; Leanna Wise; Emily Gilbert; Alí Duarte-García; Maria O Valenzuela-Almada; Manuel F Ugarte-Gil; Sandra Lúcia Euzébio Ribeiro; Adriana de Oliveira Marinho; Lilian David de Azevedo Valadares; Daniela Di Giuseppe; Rebecca Hasseli; Jutta G Richter; Alexander Pfeil; Tim Schmeiser; Carolina A Isnardi; Alvaro A Reyes Torres; Gelsomina Alle; Verónica Saurit; Anna Zanetti; Greta Carrara; Julien Labreuche; Thomas Barnetche; Muriel Herasse; Samira Plassart; Maria José Santos; Ana Maria Rodrigues; Philip C Robinson; Pedro M Machado; Emily Sirotich; Jean W Liew; Jonathan S Hausmann; Paul Sufka; Rebecca Grainger; Suleman Bhana; Wendy Costello; Zachary S Wallace; Jinoos Yazdany
Journal: ACR Open Rheumatol Date: 2022-07-22

Review 4. Artificial intelligence at the national eye institute.

Authors: Noha A Sherif; Emily Y Chew; Michael F Chiang; Michelle Hribar; James Gao; Kerry E Goetz
Journal: Curr Opin Ophthalmol Date: 2022-07-16 Impact factor: 4.299

Review 5. New Developments and Possibilities in Reanalysis and Reinterpretation of Whole Exome Sequencing Datasets for Unsolved Rare Diseases Using Machine Learning Approaches.

Authors: Samarth Thonta Setty; Marie-Pier Scott-Boyer; Tania Cuppens; Arnaud Droit
Journal: Int J Mol Sci Date: 2022-06-18 Impact factor: 6.208

6. Predicting presumed serious infection among hospitalized children on central venous lines with machine learning.

Authors: Azade Tabaie; Evan W Orenstein; Shamim Nemati; Rajit K Basu; Swaminathan Kandaswamy; Gari D Clifford; Rishikesan Kamaleswaran
Journal: Comput Biol Med Date: 2021-02-20 Impact factor: 6.698

7. Artificial intelligence in nursing: Priorities and opportunities from an international invitational think-tank of the Nursing and Artificial Intelligence Leadership Collaborative.

Authors: Charlene Esteban Ronquillo; Laura-Maria Peltonen; Lisiane Pruinelli; Charlene H Chu; Suzanne Bakken; Ana Beduschi; Kenrick Cato; Nicholas Hardiker; Alain Junger; Martin Michalowski; Rune Nyrup; Samira Rahimi; Donald Nigel Reed; Tapio Salakoski; Sanna Salanterä; Nancy Walton; Patrick Weber; Thomas Wiegand; Maxim Topaz
Journal: J Adv Nurs Date: 2021-05-18 Impact factor: 3.057

8. Deep Learning-Based Microscopic Diagnosis of Odontogenic Keratocysts and Non-Keratocysts in Haematoxylin and Eosin-Stained Incisional Biopsies.

Authors: Roopa S Rao; Divya B Shivanna; Kirti S Mahadevpur; Sinchana G Shivaramegowda; Spoorthi Prakash; Surendra Lakshminarayana; Shankargouda Patil
Journal: Diagnostics (Basel) Date: 2021-11-24

9. Neural network classifiers for images of genetic conditions with cutaneous manifestations.

Authors: Dat Duong; Rebekah L Waikel; Ping Hu; Cedrik Tekendo-Ngongang; Benjamin D Solomon
Journal: HGG Adv Date: 2021-08-20

Review 10. Establishing a second-generation artificial intelligence-based system for improving diagnosis, treatment, and monitoring of patients with rare diseases.

Authors: Noa Hurvitz; Henny Azmanov; Asa Kesler; Yaron Ilan
Journal: Eur J Hum Genet Date: 2021-07-19 Impact factor: 5.351