| Literature DB >> 33805313 |
Svetlana Tarbeeva1,2, Ekaterina Lyamtseva3, Andrey Lisitsa2, Anna Kozlova2,4, Elena Ponomarenko2, Ekaterina Ilgisonis2.
Abstract
We used automatic text-mining of PubMed abstracts of papers related to obesity, with the aim of revealing that the information used in abstracts reflects the current understanding and key concepts of this widely explored problem. We compared expert data from DisGeNET to the results of an automated MeSH (Medical Subject Heading) search, which was performed by the ScanBious web tool. The analysis provided an overview of the obesity field, highlighting major trends such as physiological conditions, age, and diet, as well as key well-studied genes, such as adiponectin and its receptor. By intersecting the DisGeNET knowledge with the ScanBious results, we deciphered four clusters of obesity-related genes. An initial set of 100+ thousand abstracts and 622 genes was reduced to 19 genes, distributed among just a few groups: heredity, inflammation, intercellular signaling, and cancer. Rapid profiling of articles could drive personalized medicine: if the disease signs of a particular person were superimposed on a general network, then it would be possible to understand which are non-specific (observed in cohorts and, therefore, most likely have known treatment solutions) and which are less investigated, and probably represent a personalized case.Entities:
Keywords: MeSH; data-mining; gene network; obesity; text-mining
Year: 2021 PMID: 33805313 PMCID: PMC8065449 DOI: 10.3390/jpm11040246
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1ScanBious Web-interface. (I) Query “obesity” with ScanBious Web-interface, (II) PubMed articles retrieved to the query, (III) statistics of the MeSH terms, relevant to the retrieved articles, presented as a network of MeSH terms, (IV) MeSH terms visualized as the semantic network, (V) texts of abstracts of relevant publications for the object.
Figure 2(a) Obesity at-a-glance. The compendium of PubMed abstracts from the top ten most cited authors is depicted as a network of MeSH terms. The size of the nodes reflects the occurrence of the MeSH terms in the sampled abstracts. (b) Gene network based on DisGeNET data obtained on the request “obesity.” The publications from DisGeNET were retrieved as a file in which the names of genes were assigned to the PubMed identifiers, and in this format were uploaded to ScanBious for visualization and interactive work with the texts of the abstracts of publications. The number of publications in which the name of the gene was found is indicated in parentheses after the gene name. The checkmarks indicate the notes of the network which are described in the text.
Top five genes related to the Mendelian forms of obesity, selected according to the number of gene-associated PubMed abstracts. N.Diseases and N.PMIDs were retrieved as a result of the DisGeNET search for a given gene, and denote the numbers of diseases and PubMed identifiers, respectively.
| Gene Name 1 | Number of References | Protein Name | ||
|---|---|---|---|---|
| Obesity/PubMed 2 | N.Diseases | Obesity/N.PMIDs | ||
| FTO | 26 | 286 | 426 | Alpha-ketoglutarate-dependent dioxygenase |
| POMC | 22 | 873 | 97 | Proopiomelanocortin |
| MC4R | 17 | 149 | 283 | Melanocortin receptor 4 |
| LEPR 3 | 13 | 416 | 214 | Leptin receptor |
| BDNF | 8 | 992 | 88 | Brain-derived neurotrophic factor |
1 collected from the reviews [1,20,21]. 2 sorted by this column. 3 appeared in the DisGeNET network (Figure 2b).
Figure 3Four clusters of the obesity-relevant semantic network. Clusters (a–d) were obtained as a result of comparing data from the DisGeNET expert system and the results of automatic processing of abstracts of scientific publications in ScanBious, with a threshold value of the Jaccard index > 0.2, with the condition that there were at least five articles for the object being visualized as a network node.
Figure 4The relationship among the clusters as found by using STRING (a) with consequent reproduction of the Figure 3 with graphical illustration of the links between genes (b).