| Literature DB >> 31413871 |
Joanna Kedra1,2, Timothy Radstake3, Aridaman Pandit3, Xenofon Baraliakos4, Francis Berenbaum5, Axel Finckh6, Bruno Fautrel1,2, Tanja A Stamm7, David Gomez-Cabrero8, Christian Pristipino9, Remy Choquet10, Hervé Servy11, Simon Stones12, Gerd Burmester13, Laure Gossec1,2.
Abstract
Objective: To assess the current use of big data and artificial intelligence (AI) in the field of rheumatic and musculoskeletal diseases (RMDs).Entities:
Keywords: artificial intelligence; big data; biostatistics; machine learning; rheumatology
Mesh:
Year: 2019 PMID: 31413871 PMCID: PMC6668041 DOI: 10.1136/rmdopen-2019-001004
Source DB: PubMed Journal: RMD Open ISSN: 2056-5933
Figure 1Flow-chart of the systematic literature review in RMDs.
Description of 55 articles on big data in RMDs, and 55 articles for comparison outside RMDs
| RMDs | Other medical fields | |
| Year of publication, mean (SD) (range) | 2014 (4.6) (1992–2018) | 2018 (0.4) (2018–2019) |
| Year of publication: last 5 years, N (%) | 40 (72) | 55 (100) |
| Impact factor, mean (SD) (range) | 3.8 (4.0) (0.35–23.3) | 5.56 (9.8) (0.56–47.7) |
| Geographic origin of the first author, N (%) | ||
| North America N=15 (34%) | 21 (38) | 17 (31) |
| Europe | 18 (33) | 18 (33) |
| Asia | 15 (27) | 19 (34) |
| Australia | 0 (0) | 1 (2) |
| South America | 1 (2) | 0 (0) |
| Africa | 0 (0) | 0 (0) |
| Clear definition of big data, N (%) | 2 (4) | 7 (13) |
| N data analysed | ||
| Units of observation, mean (SD) (range) | 1 142 000 (3 990 000) (5–25 000 000) | 5 298 000 (23 909 000) (40–140 000 000) |
| Data points, mean (SD) (range) | 746 000 000 (1 660 000 000) (2000–5 000 000 000) | 9 149 000 000 (39 000 000 000) (100 000–200 000 000 000) |
| Clinical data sources, N (%) | 26 (47) | 17 (31) |
| Registries/cohorts, N (%) | 14 (25) | 10 (18) |
| EHR | 11 (20) | 3 (6) |
| Claims databases | 1 (2) | 0 (0) |
| Trials | 0 (0) | 0 (0) |
| PGHD (sensors, etc) | 0 (0) | 4 (7) |
| Other | 0 (0) | 0 (0) |
| Biological data, N (%) | 8 (15) | 17 (31) |
| –omics | 8 (15) | 17 (31) |
| Other | 0 (0) | 0 (0) |
| Imaging, N (%) | 16 (29) | 16 (29) |
| Other data sources | 5 (9) | 5 (9) |
| Text-mining from publications | 5 (9) | 5 (9) |
| Other | 0 (0) | 0 (0) |
EHR, Electronic Health Record; PGHD, Patient Generated Health Data.
Figure 2Evolution of the number of articles on big data in the field of RMDs.
Description of the diseases in RMDs and other medical fields
| RMDs | |
|
|
|
| Gout | 3 (5) |
| Inflammatory joint diseases | 22 (40) |
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| Osteoarthritis | 16 (29) |
| |
|
| |
|
| Osteoporosis | 6 (11) |
| |
|
| |
|
| Spine pathology | 6 (11) |
| Other pathologies | 5 (9) |
The total of pathologies in RMDs and in other medical fields is above 55, as some articles were about several diseases at the same time.
Description of the statistical methods used to analyse big data in RMDs and in other medical fields
| RMDs | Other medical fields | |
|
| ||
| AI only, N (%) | 30 (55) | 35 (63) |
| Traditional statistics only N (%) | 10 (18) | 8 (15) |
| Both AI and traditional methods N (%) | 15 (27) | 12 (22) |
|
| ||
|
|
|
|
| AI methods | ||
| Machine learning, N | 44 | 47 |
| Other, N | 2 | 0 |
| Mention of supervision, N |
|
|
| Supervised | 4 | 14 |
| Unsupervised | 0 | 7 |
| Semi-supervised | 0 | 1 |
| Not reported | 41 | 29 |
|
| ||
| Not specified | 4 | 3 |
| Artificial Neural Networks | 20 | 24 |
| |
| 13 |
| Support Vector Machine | 10 | 8 |
| Random Forests | 7 | 13 |
| Natural Language Processing | 7 | 2 |
| k-Nearest Neighbors | 3 | 6 |
| Bayesian models | 3 | 5 |
| | ||
|
|
|
|
| Regression methods, N | 11 | 15 |
| Other methods, N | 16 | 21 |
Supervised learning refers to the machine learning task of learning a function that maps an input to an output based on example input–output pairs, and unsupervised learning to the ability to learn without a ‘teacher’.
Of note, one article in RMDs used both machine learning and another AI method (heuristic).
AI, artificial intelligence.