| Literature DB >> 32990634 |
Lara Kühnle1, Urs Mücke1, Werner M Lechner2, Frank Klawonn3,4, Lorenz Grigull5.
Abstract
BACKGROUND: Diagnostic delay in rare disease (RD) is common, occasionally lasting up to more than 20 years. In attempting to reduce it, diagnostic support tools have been studied extensively. However, social platforms have not yet been used for systematic diagnostic support. This paper illustrates the development and prototypic application of a social network using scientifically developed questions to match individuals without a diagnosis.Entities:
Keywords: artificial intelligence; diagnostic support tool; machine learning; prototype; rare disease; social network
Mesh:
Year: 2020 PMID: 32990634 PMCID: PMC7556379 DOI: 10.2196/21849
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Used material for finding and evaluating the matching algorithm.
Figure 2Simplified scheme of the clicking path for new and already registered users of RarePairs.
Figure 3Landing page of RarePairs where users can get information, register, or log in. Users find information by text and a short video addressing aims and scope of RarePairs. Currently, the landing page is in German, an English version is under construction.
Figure 4Visualization of the tables and possible interactions between them and the user.
Figure 5Illustration of the first part of identifying a matching method. Screening the data set for 10 "best" matches for one user (using the leave-one-out-method). This scheme only illustrates the basic principle of the simulation. The exact results are shown in Table 1. GBS: Guillain-Barré syndrome; M. Pompe: Morbus Pompe.
Figure 6Schematic illustration of the second part: identifying a matching method for a given data set of users with rare diseases. Ten matches for all 973 data sets with one calculating method, calculating the average of the matching accordance of properties. This figure illustrates the basic principle of the simulation (for the complete testing results, see Multimedia Appendix 5 and Table 1).
Different distance calculating methods simulated using the leave-one-out principle.
| Distance calculating method | Matching accordance in percent | ||||||||||||||
|
| Gendera | Ageb | Latencyc | Disease groupd | Diagnostic systeme | Exact diagnosisf |
| ||||||||
| (Pseudo)random sampling (=negative benchmark) | 59.6 | 15.7 | 11.9 | 8.3 | 31.7 | 3.8 |
| ||||||||
|
| |||||||||||||||
|
| Manhattan | 63.9g | 21.9g | 14.3 | 16.2g | 40.3g | 9.4g |
| |||||||
|
| Euclidean | 62.4g | 21.1 | 15.5g | 16.0g | 36.1g | 9.7 (=positive benchmark) |
| |||||||
|
| Minkowski | 63.8g | 21.9g | 14.4 | 16.2g | 40.3g | 9.4g |
| |||||||
|
| |||||||||||||||
|
| Sørensen | 64.8g | 22.1g | 13.2 | 15.3g | 37.8g | 8.8 |
| |||||||
|
| Gower | 63.8g | 21.9g | 14.4 | 16.2g | 40.3g | 9.4g |
| |||||||
|
| Canberra | 64.2g | 21.5g | 14.0 | 15.8g | 37.7g | 9.4g (does not differ from the positive benchmark; |
| |||||||
|
| Lorentzian | 63.5g | 21.7g | 13.9 | 15.9g | 39.4g | 9.3g |
| |||||||
|
| |||||||||||||||
|
| Wave Hedges | 64.1g | 21.5g | 14.0 | 15.9g | 38.3g | 9.3 |
| |||||||
|
| Czekanowski | 64.8g | 22.1g | 13.2 | 15.3g | 37.8g | 8.8 |
| |||||||
|
| Tanimoto | 64.8 (=positive benchmark) | 22.1 (=positive benchmark) | 13.2 (differs from negative benchmark; | 15.3g | 37.8g | 8.8 |
| |||||||
|
| Jaccard | 63.8g | 21.3 | 13.8 | 14.9 | 35.4g | 8.7 |
| |||||||
|
| Dice | 63.8g | 21.3 (differs from positive benchmark; | 13.8 | 14.9 (differs from positive benchmark; | 35.4g | 8.7 (differs from negative benchmark; |
| |||||||
|
| |||||||||||||||
|
| Cosine | 53.6h (< negative benchmark) | 12.0h (< negative benchmark) | 7.3h (< negative benchmark) | 4.7h (< negative benchmark) | 17.0h (< negative benchmark) | 1.9h (< negative benchmark) |
| |||||||
|
| |||||||||||||||
|
| Bhattacharyya | 64.8g | 19.0 (differs from negative benchmark; | 7.9h (< negative benchmark) | 10.1h (does not differ from negative benchmark; | 32.3 (differs from negative benchmark; | 2.8h (< negative benchmark) |
| |||||||
|
| Hellinger | 62.9g | 19.3 | 5.8h (< negative benchmark) | 7.4h (< negative benchmark) | 52.5 (=positive benchmark) | 0.3h (< negative benchmark) |
| |||||||
|
| Squared-Chord | 62.0g | 21.6g | 15.3g | 15.6g | 35.2g | 9.6g |
| |||||||
|
| |||||||||||||||
|
| Neyman | 59.5h (< negative benchmark) | 20.4 | 15.2 (differs from positive benchmark; | 14.9 (differs from negative benchmark; | 34.2g | 9.0 |
| |||||||
|
| Probabilistic Symmetric | 62.2g | 21.6g | 15.1 | 15.8g | 35.5g | 9.7g |
| |||||||
|
| Clark | 63.1g | 21.4g (does not differ from positive benchmark; | 14.8 | 15.3g | 35.5g | 9.3 |
| |||||||
|
| Additive symmetric | 61.3g (does not differ from positive benchmark; | 21.1 | 15.7g | 15.5g | 34.1g | 9.5g |
| |||||||
|
| |||||||||||||||
|
| Jeffreys | 62.0g | 21.5g | 15.3g | 15.6g | 35.1g | 9.6g |
| |||||||
|
| Jensen difference | 62.1g | 21.5g | 15.3g (does not differ from positive benchmark; | 15.7g | 35.3g | 9.6g |
| |||||||
|
| |||||||||||||||
|
| Kumar–Johnson | 60.8 (differs from positive benchmark; | 21.2 | 15.7 (=positive benchmark) | 15.0g (does not differ from positive benchmark; | 33.4g (does not differ from positive benchmark; | 9.3g (differs from positive benchmark; |
| |||||||
|
| Avg | 63.7 | 21.8h | 14.4 | 16.4 (=positive benchmark) | 40.3g | 9.4g |
| |||||||
aGender of the person.
bAge of the person
cTime with symptoms but no diagnosis.
dCategory of the diagnosis referring to the affected organ or pathophysiology (eg, neuromuscular disease, metabolic disease)
eGreater category the diagnosis can be assigned to (eg, RD, CD), not especially considering the affected organ
fExact name of the one diagnosis.
gFields do not differ significantly from the positive benchmark in this category; see P-value in those fields. If no P-values are mentioned, the matching values lie in between the benchmark and the furthest value, which is only just not differing significantly from this benchmark.
hFields do not differ significantly from the (pseudo)random matching (=negative benchmark); see P-value in those fields. If no P-values are mentioned, the matching values lie in between the benchmark and the furthest value, which is only just not differing significantly from this benchmark.
Figure 7A t-distributed stochastic neighbor embedding plot showing a possible clustering of the 973 test objects concerning the diagnostic system of their disease. Key: black: rare diseases; red: chronic diseases; dark blue: psychiatric diseases with somatoform part; dark green: unknown diagnosis; light green: healthy individuals; light blue: sarcoidosis; orange: idiopathic pulmonary arterial hypertonia; yellow: syringomyelia; brown: systemic lupus erythematosus.