| Literature DB >> 32496201 |
Clément R Massonnaud1,2, Gaétan Kerdelhué1,2, Julien Grosjean1,2, Romain Lelong1,2, Nicolas Griffon1,2, Stefan J Darmoni1,2.
Abstract
BACKGROUND: With the continuous expansion of available biomedical data, efficient and effective information retrieval has become of utmost importance. Semantic expansion of queries using synonyms may improve information retrieval.Entities:
Keywords: MEDLINE; Medical Subject Headings; PubMed; bibliographic database; information retrieval; literature search; precision; recall; search strategy; thesaurus
Year: 2020 PMID: 32496201 PMCID: PMC7303830 DOI: 10.2196/12799
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Representation of the three sets of citations retrieved for each descriptor, and how they were used to compute the metrics. ATM: automatic term mapping; CISMeF: Catalogue et Index des Sites Médicaux de langue Française; MeSH: Medical Subject Headings; UMLS: Unified Medical Language System.
Summary of the syntax of the nine different queries used in this study.
| Strategy | Relevant citations (A) | Retrieved citations (B) | Relevant citations retrieved (C) |
| ATMa | ”pref. term“[MH] | ”pref. term“[TIAB] AND medline[sb] | ”pref. term“[MH] AND (”pref. term“[TIAB]) AND medline[sb] |
| MeSHb | ”pref. term“[MH] | (”MeSH synonym 1“[TIAB] OR ”MeSH synonym 2“[TIAB] OR …) AND medline[sb] | ”pref. term“[MH] AND (”MeSH synonym 1“[TIAB] OR ”MeSH synonym 2“[TIAB] OR …) AND medline[sb] |
| UMLSc | ”pref. term“[MH] | (”UMLS synonym 1“[TIAB] OR ”UMLS synonym 2“[TIAB] OR …) AND medline[sb] | ”pref. term“[MH] AND (”UMLS synonym 1“[TIAB] OR ”UMLS synonym 2“[TIAB] OR …) AND medline[sb] |
| CISMeFd | ”pref. term“[MH] | (”CISMeF synonym 1“[TIAB] OR ”CISMeF synonym 2“[TIAB] OR …) AND medline[sb] | ”pref. term“[MH] AND (”CISMeF synonym 1“[TIAB] OR ”CISMeF synonym 2“[TIAB] OR …) AND medline[sb] |
aATM: automatic term mapping.
bMeSH: Medical Subject Headings.
cUMLS: Unified Medical Language System.
dCISMeF: Catalogue et Index des Sites Médicaux de langue Française.
Mean performances of the four search strategies for the 26,636 Medical Subject Heading descriptors.
| KOSa | Precision (%), mean (SD) | Recall (%), mean (SD) | F-measure (%), mean (SD) |
| ATMb | 44 (24) | 31 (29) | 28 (23) |
| MeSHc | 51 (23) | 38 (31) | 35 (24) |
| CISMeFd | 49 (23) | 40 (31) | 35 (24) |
| UMLSe | 49 (23) | 46 (31) | 36 (24) |
aKOS: knowledge organization system.
bATM: automatic term mapping.
cMeSH: Medical Subject Headings.
dCISMeF: Catalogue et Index des Sites Médicaux de langue Française.
eUMLS: Unified Medical Language System.
Number of descriptors for which two strategies had equal precision, recall, or F-measure.
| KOSa | Precision, n | Recall, n | F-measure, n |
| ATMb and MeSHc | 3037 | 3959 | 2938 |
| ATM and CISMeFd | 2551 | 3410 | 2459 |
| ATM and UMLSe | 2409 | 3265 | 2320 |
| MeSH and CISMeF | 19,261 | 20,232 | 19,001 |
| MeSH and UMLS | 17,176 | 18,394 | 16,917 |
| CISMeF and UMLS | 18,819 | 19,956 | 18,565 |
aKOS: knowledge organization system.
bATM: automatic term mapping.
cMeSH: Medical Subject Headings.
dCISMeF: Catalogue et Index des Sites Médicaux de langue Française.
eUMLS: Unified Medical Language System.
Comparisons of the strategies for each metric.
| Strategy comparison | Metric, na | ||
| Precision | Recall | F-measure | |
| CISMeFb vs UMLSc | 1180 vs 1017 | 793 vs 1262 | 678 vs 1140 |
| MeSHd vs UMLS | 2285 vs 215 | 9 vs 2857 | 553 vs 1761 |
| CISMeF vs MeSH | 170 vs 2088 | 2372 vs 9 | 1403 vs 669 |
| MeSH vs ATMe | 9650 vs 3299 | 8198 vs 3404 | 8150 vs 2446 |
| CISMeF vs ATM | 9112 vs 4557 | 9724 vs 2949 | 8895 vs 2628 |
| ATM vs UMLS | 4682 vs 9094 | 2852 vs 10,047 | 2448 vs 9217 |
aThe numbers are the numbers of descriptors for which the metric score of a strategy was at least 5% better than another strategy.
bCISMeF: Catalogue et Index des Sites Médicaux de langue Française.
cUMLS: Unified Medical Language System.
dMeSH: Medical Subject Headings.
eATM: automatic term mapping.
Figure 2F-measure scores of the four search strategies depending on the MeSH category of the descriptor. ATM: automatic term mapping; CISMeF: Catalogue et Index des Sites Médicaux de langue Française; MeSH: Medical Subject Headings; UMLS: Unified Medical Language System.