| Literature DB >> 34943202 |
Xingsi Xue1, Pei-Wei Tsai2, Yucheng Zhuang3.
Abstract
To integrate massive amounts of heterogeneous biomedical data in biomedical ontologies and to provide more options for clinical diagnosis, this work proposes an adaptive Multi-modal Multi-Objective Evolutionary Algorithm (aMMOEA) to match two heterogeneous biomedical ontologies by finding the semantically identical concepts. In particular, we first propose two evaluation metrics on the alignment's quality, which calculate the alignment's statistical and its logical features, i.e., its f-measure and its conservativity. On this basis, we build a novel multi-objective optimization model for the biomedical ontology matching problem. By analyzing the essence of this problem, we point out that it is a large-scale Multi-modal Multi-objective Optimization Problem (MMOP) with sparse Pareto optimal solutions. Then, we propose a problem-specific aMMOEA to solve this problem, which uses the Guiding Matrix (GM) to adaptively guide the algorithm's convergence and diversity in both objective and decision spaces. The experiment uses Ontology Alignment Evaluation Initiative (OAEI)'s biomedical tracks to test aMMOEA's performance, and comparisons with two state-of-the-art MOEA-based matching techniques and OAEI's participants show that aMMOEA is able to effectively determine diverse solutions for decision makers.Entities:
Keywords: biomedical ontology matching; guiding matrix; multi-modal multi-objective evolutionary algorithm
Year: 2021 PMID: 34943202 PMCID: PMC8698300 DOI: 10.3390/biology10121287
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1An example of the biomedical ontology alignment and the corresponding 0–1 matrix.
Brief description on OAEI’s biomedical tracks.
| Track ID | Ontologies | Tasks |
|---|---|---|
| Anatomy Track ( | Adult Mouse Anatomy (MA)-2744 classes | MA-HA |
| Large Biomed Track ( | Foundation Model of Anatomy (FMA)-78,989 classes | FMA-NCI |
| Disease and Phenotype Track ( | Human Phenotype Ontology (HP)-33,205 classes | HP-MP |
| Human Disease Ontology (DOID)-24,034 classes | DOID-ORDO |
The parameters used by NSGA-II and MOEA/D.
| Population Size | Selection Rate | Crossover Rate | Mutation Rate | Maximum Generations | |
|---|---|---|---|---|---|
| NSGA-II | 201 | 0.8 | 0.98 | 0.05 | 300 |
| MOEA/D | 201 | 0.8 | 0.98 | 0.05 | 300 |
Comparison among three MOEA-based matching techniques in terms of best f-measure (best recall and best precision) and standard deviation. The symbols f, r, p, and stand for f-measure, recall, precision, and standard deviation, respectively.
| ine Testing Case | NSGA-II | MOEA/D | aMMOEA | |||
|---|---|---|---|---|---|---|
| ine |
|
|
| |||
| ine Anatomy | 0.85 (0.89, 0.76) | 0.02 (0.02, 0.02) | 0.85 (0.89, 0.84) | 0.02 (0.02, 0.01) | 0.92 (0.94, 0.96) | 0.01 (0.02, 0.01) |
| FMA-NCI | 0.87 (0.86, 0.86) | 0.02 (0.02, 0.02) | 0.84 (0.88, 0.78) | 0.02 (0.02, 0.02) | 0.93 (0.95, 0.98) | 0.02 (0.02, 0.02) |
| FMA-SNOMED | 0.71 (0.77, 0.63) | 0.02 (0.02, 0.01) | 0.65 (0.62, 0.75) | 0.01 (0.01, 0.01) | 0.84 (0.86, 0.88) | 0.01 (0.02, 0.01) |
| NCI-SNOMED | 0.68 (0.69, 0.64) | 0.01 (0.01, 0.02) | 0.68 (0.65, 0.70) | 0.02 (0.02, 0.03) | 0.77 (0.77, 0.80) | 0.01 (0.01, 0.01) |
| HP-MP | 0.55 (0.47, 0.57) | 0.02 (0.02, 0.01) | 0.71 (0.68, 0.72) | 0.02 (0.02, 0.02) | 0.85 (0.78, 0.89) | 0.01 (0.01, 0.02) |
| DOID-ORDO | 0.81 (0.83, 0.80) | 0.01 (0.01, 0.01) | 0.84 (0.83, 0.85) | 0.02 (0.03, 0.01) | 0.93 (0.93, 0.97) | 0.02 (0.02, 0.02) |
| ine | ||||||
t-Test’s t-value on the alignment’s quality.
| ine Testing Case | ||
|---|---|---|
| ine | (NSGA-II, aMMOEA) | (MOEA/D, aMMOEA) |
| f-measure (recall, precision) | f-measure (recall, precision) | |
| ine Anatomy | −17.14 (−9.68, −48.98) | −17.14 (−9.68, −46.47) |
| FMA-NCI | −11.61 (−17.42, −23.23) | −17.42 (−13.55, −38.72) |
| FMA-SNOMED | −31.84 (−17.42, −6.82) | −73.58 (−58.78, −50.34) |
| NCI-SNOMED | −34.85 (−30.98, −39.19) | −22.04 (−29.39, −17.32) |
| HP-MP | −73.48 (−75.93, −78.38) | −34.29 (−24.49, −32.92) |
| DOID-ORDO | −29.39 (−24.49, −41.64) | −17.42 (−15.19, −29.39) |
| ine |
t-Test’s p-value on the alignment’s quality.
| ine Testing Case | ||
|---|---|---|
| ine | (NSGA-II, aMMOEA) | (MOEA/D, aMMOEA) |
| f-measure (recall, precision) | f-measure (recall, precision) | |
| ine Anatomy | 0.0185 (0.0327, 0.0064) | 0.0185 (0.0327, 0.0068) |
| FMA-NCI | 0.0273 (0.0182, 0.0136) | 0.0182 (0.0234, 0.0082) |
| FMA-SNOMED | 0.0099 (0.0182, 0.0463) | 0.0043 (0.0054, 0.0063) |
| NCI-SNOMED | 0.0091 (0.0102, 0.0051) | 0.0144 (0.0108, 0.0183) |
| HP-MP | 0.0043 (0.0041, 0.0040) | 0.0092 (0.0129, 0.0096) |
| DOID-ORDO | 0.0108 (0.0129, 0.0076) | 0.0182 (0.0209, 0.0108) |
| ine |
Comparison with OAEI participants in terms of f-measure.
| ine Testing Case | AML | LogMap | XMap | DOME | POMAP++ | aMMOEA |
|---|---|---|---|---|---|---|
| ine Anatomy | 0.94 | 0.89 | 0.89 | 0.76 | 0.89 | 0.92 |
| FMA-NCI | 0.93 | 0.92 | 0.86 | 0.86 | 0.88 | 0.93 |
| FMA-SNOMED | 0.83 | 0.79 | 0.77 | 0.33 | 0.40 | 0.84 |
| NCI-SNOMED | 0.80 | 0.77 | 0.69 | 0.64 | 0.68 | 0.77 |
| HP-MP | 0.84 | 0.85 | 0.47 | 0.47 | 0.68 | 0.85 |
| DOID-ORDO | 0.64 | 0.84 | 0.70 | 0.60 | 0.83 | 0.93 |
| ine Average | 0.83 | 0.84 | 0.73 | 0.61 | 0.72 | 0.87 |
| ine |