| Literature DB >> 24479447 |
Juliana Tarossi Pollettini, José Augusto Baranauskas, Evandro Seron Ruiz, Maria da Graça Pimentel, Alessandra Alaniz Macedo1.
Abstract
BACKGROUND: Research on Genomic medicine has suggested that the exposure of patients to early life risk factors may induce the development of chronic diseases in adulthood, as the presence of premature risk factors can influence gene expression. The large number of scientific papers published in this research area makes it difficult for the healthcare professional to keep up with individual results and to establish association between them. Therefore, in our work we aim at building a computational system that will offer an innovative approach that alerts health professionals about human development problems such as cardiovascular disease, obesity and type 2 diabetes.Entities:
Mesh:
Year: 2014 PMID: 24479447 PMCID: PMC3938472 DOI: 10.1186/1755-8794-7-7
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1– collecting and updating the collection of papers.
Figure 2– retrieving relevant papers for a clinical record.
Figure 3CISS’s infrastructure (processes and storage).a) module for Collection Creation/Update; b) search interface; c) interactions with PubMed; d) Ontology concepts on the genetic and epigenetic risk factors domain; e) a collection of papers is retrieved from the public repository; f-g) textual processing of scientific papers; h) local database with a collection of pre-processed scientific papers to support retrieval tasks; i) user interface to submission of clinical records; g-j) textual processing of the clinical record; k) similarity processing among clinical records and scientific papers; l) similarity module accesses the pre-processed scientific paper collection and papers with the highest degrees of similarity to the clinical records are retrieved; m) the selected papers are shown to the health professional in a graphical user interface; n) the user interface has also an option to show a list of risk factors associated with the submitted clinical record; o) linguistic resources, like UMLS, that support overall textual processing; p) concept recognition module.
Number of documents retrieved by CISS and the three other search engines, based on ten queries in Portuguese composed by terms and expressions from a list of risk factors provided by specialists
| | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| a | 0 | 2 | 6 | 2 | 0 | 2 | 3 | 5 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| b | 0 | 0 | 0 | 0 | 0 | 1 | 6 | 0 | 0 | 0 | 0 | 0 | 3 | 4 | 2 | 0 |
| c | 2 | 0 | 4 | 4 | 0 | 0 | 7 | 3 | 0 | 2 | 4 | 4 | 0 | 0 | 0 | 0 |
| d | 0 | 0 | 2 | 8 | 0 | 0 | 5 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 |
| e | 3 | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 3 | 2 | 2 | 3 | 6 | 2 | 2 | 0 |
| f | 1 | 1 | 5 | 3 | 1 | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| g | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | 0 | 3 | 0 | 6 | 0 | 0 | 0 |
| h | 0 | 1 | 3 | 6 | 0 | 0 | 3 | 7 | 0 | 0 | 0 | 0 | 8 | 1 | 1 | 0 |
| i | 0 | 0 | 2 | 7 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 6 | 2 | 2 | 0 |
| j | 1 | 3 | 4 | 2 | 0 | 1 | 8 | 1 | 0 | 0 | 2 | 8 | 2 | 3 | 1 | 0 |
Translated Queries (rows). (a) Gestational background: Did she receive folic acid and iron? When did she start taking each of them and for how long has she been taking them? Has she received other drugs during this pregnancy?(b) Gestational background: Disorders during pregnancy, such as hypertension, HDP, preeclampsia, eclampsia, heart disease, diabetes mellitus, gestational diabetes, anemia, bleeding (if positive, in which trimester), maternal infections during pregnancy such as toxoplasmosis, CMV, AIDS, syphilis, UTI, Leucorrhoea, Streptococcus, reported stress and its cause.(c) Maternal background: age at menarche.(d) Child’s everyday life: hours of sleep per day, sleep disorders, playing routine.(e) Newborn background: low-weight TNB, low-weight PTNB, metabolic disease, nutritional disease.(f) Environmental conditions: housing situation in regards to water and sewer systems, and physical space. (g) Diagnostics, medical managements, treatment and referrals: eutrophia, overweight or obesity, high BMI, body mass excess, adiposity excess, or malnutrition sign, dystrophy, iron deficiency, anemia, rickets.(h) Family background: history of diseases and habits of the nuclear family (father, mother, brothers, maternal grandparents, paternal grandparents and others), consanguinity between parents.(i) Familiogram: presence of several individuals with chronic diseases, sequelae from chronic disease, environmental risk for chronic disease, teenage parents, loss of family members by chronic disease.(j) Family risks and protection related to health: mother or relatives with psychiatric problems or special needs.
Number of documents retrieved by CISS and three other search engines, based on five queries composed in English by terms and expressions from a list of risk factors provided by specialists
| | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| - | ||||||||||||||||
| a | 0 | 0 | 1 | 9 | 0 | 1 | 6 | 3 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| b | 0 | 0 | 0 | 2 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| c | 4 | 1 | 2 | 3 | 3 | 0 | 3 | 4 | 0 | 4 | 5 | 1 | 0 | 4 | 4 | 0 |
| d | 1 | 0 | 5 | 4 | 2 | 0 | 8 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| e | 0 | 0 | 7 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
Original Queries (rows): (a) Gestational background: Did she receive folic acid and iron? When did she start taking each of them and how long has she been taking them? Has she received other drugs in this pregnancy?(b) Gestational background: disorders during pregnancy such as hypertension, HDP, preeclampsia, eclampsia, heart disease, diabetes mellitus, gestational diabetes, anemia, bleeding (if so, in which trimester), maternal infections during pregnancy such as toxoplasmosis, CMV, AIDS, syphilis, UTI, Leucorrhoea, Streptococcus, reported stress and its cause.(c) Maternal background: age at menarche.(d) Child’s everyday life: sleep (hours of sleep per day, sleep disorders), playing routine.(e) Newborn background: low-weight TNB, low-weight PTNB, metabolic disease, nutritional disease.
Average number of documents retrieved by CISS and the three other approaches in Portuguese
| Google | 0.7 | 0.7 | 3.2 | 3.3 | 157529.3 | 4810.0 |
| Google scholar | 0.1 | 0.4 | 4.2 | 2.2 | 2180.6 | 218.5 |
| PubMed | 1.0 | 0.4 | 1.1 | 1.5 | 1789.8 | 0.0 |
| CISS | 3.2 | 1.2 | 1.0 | 0.0 | 7.9 | 6.0 |
§Columns relative to the 10 first results for each query at each engine.
‡Mean of all results for each query.
†Median of all results for each query.
Accuracy of the results retrieved by CISS and the 3 other approaches in Portuguese
| % queries at whichthe search engineachieved the bestresults ((+) and (+/-)) | 15% | 15% | 10% | 60% |
| Accuracy (precision) | 0.14 | 0.05 | 0.14 | 0.61 |
Average number of documents retrieved by CISS and the 3 other approaches in English
| Google | 1.0 | 0.2 | 3.0 | 4.2 | 14028687 | 255000 |
| Google scholar | 1.2 | 0.6 | 3.6 | 1.4 | 12058.8 | 1190 |
| PubMed | 0.0 | 0.8 | 1.2 | 0.2 | 18.0 | 0.0 |
| CISS | 0.4 | 1.2 | 0.8 | 0.0 | 2.4 | 1.0 |
§Columns relative to the 10 first results for each query at each engine.
‡Mean of all results for each query.
†Median of all results for each query.
Accuracy of the results retrieved by CISS and the 3 other approaches in English
| % queries at whichthe search engineachieved the bestresults ((+) and (+/-)) | 20% | 40% | 0% | 40% |
| Accuracy (precision) | 0.12 | 0.32 | 0.10 | 0.90 |
Comparison considering Friedman’s Test for CISS versus all (in Portuguese)
| + | △ | △ | △ |
| +/- | △ | △ | △ |
| -/+ | △ | △ | ▿
|
| - | ▴ | △ | △ |
Comparison considering Friedman’s Test for CISS versus All (in English)
| + | ‡ ∘ | ▿
| △ |
| +/- | △ | △ | △ |
| -/+ | △ | △ | △ |
| - | ▴ | △ | △ |
‡ ∘ means No difference whatsoever.