| Literature DB >> 35893435 |
Hadas Samuels1, Malki Malov1, Trishna Saha Detroja1, Karin Ben Zaken1, Naamah Bloch1, Meital Gal-Tanamy1, Orly Avni1, Baruh Polis2, Abraham O Samson1.
Abstract
Autoimmune diseases (AIDs) are often co-associated, and about 25% of patients with one AID tend to develop other comorbid AIDs. Here, we employ the power of datamining to predict the comorbidity of AIDs based on their normalized co-citation in PubMed. First, we validate our technique in a test dataset using earlier-reported comorbidities of seven knowns AIDs. Notably, the prediction correlates well with comorbidity (R = 0.91) and validates our methodology. Then, we predict the association of 100 AIDs and classify them using principal component analysis. Our results are helpful in classifying AIDs into one of the following systems: (1) gastrointestinal, (2) neuronal, (3) eye, (4) cutaneous, (5) musculoskeletal, (6) kidneys and lungs, (7) cardiovascular, (8) hematopoietic, (9) endocrine, and (10) multiple. Our classification agrees with experimentally based taxonomy and ranks AID according to affected systems and gender. Some AIDs are unclassified and do not associate well with other AIDs. Interestingly, Alzheimer's disease correlates well with other AIDs such as multiple sclerosis. Finally, our results generate a network classification of autoimmune diseases based on PubMed text mining and help map this medical universe. Our results are expected to assist healthcare workers in diagnosing comorbidity in patients with an autoimmune disease, and to help researchers in identifying common genetic, environmental, and autoimmune mechanisms.Entities:
Keywords: Alzheimer’s disease; PubMed; autoimmune disease; classification; frequency analysis; list of autoimmune diseases; taxonomy; text mining
Year: 2022 PMID: 35893435 PMCID: PMC9369164 DOI: 10.3390/jcm11154345
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.964
Figure 1Validation of autoimmune disease association. Shown on the left is our autoimmune disease association matrix based on PubMed co-citation. Shown on the right is the same matrix based on genetic evidence (adapted from Cifuentes et al. [3]). Note that association values range from 0 (white background) to 1 (gray background). Notably, the correlation between the two matrices is high (R = 0.91), thus validating our technique. (SLE, systemic lupus erythematosus. SS, Sjögren’s syndrome. T1D, type 1 diabetes. AITD, autoimmune thyroid disease. RA, rheumatoid arthritis. MS, multiple sclerosis. SSc, systemic sclerosis).
Figure 2Autoimmune disease classification. Shown is a classification of the top co-cited 100 autoimmune diseases on PubMed. The diseases are clustered using association distances and listed in the order obtained from ClustVis [20]. Notably, the clustering follows gender, system, onset, and prevalence. For a detailed list of the autoimmune diseases, gender ratio, average onset age, prevalence in population, please see Supplementary Table S2. (NA–Not applicable, * varies in cold and warm weather).
Figure 3Autoimmune disease associations. Shown is a chord diagram of the top 80 autoimmune diseases based on co-citation in PubMed. The chord width is indicative of the number of co-citations of two AIDs and high comorbidity. The diagram was prepared using Circos software [22].