| Literature DB >> 23667440 |
Martin Boeker1, Ludger Jansen, Niels Grewe, Johannes Röhl, Daniel Schober, Djamila Seddig-Raufie, Stefan Schulz.
Abstract
BACKGROUND: The importance of ontologies in the biomedical domain is generally recognized. However, their quality is often too poor for large-scale use in critical applications, at least partially due to insufficient training of ontology developers.Entities:
Mesh:
Year: 2013 PMID: 23667440 PMCID: PMC3646875 DOI: 10.1371/journal.pone.0061425
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The sequence of modules in the curriculum.
It follows the stepwise layout of the GoodOD guideline and the increasing complexity of the contents. Modules 10–13 and 15–16 were used in the intervention (see Table 1).
Intervention and data collection of the study.
| Intervention/data collection | Application of top-level ontology | Application of ODPs | |
|
| Module 10 (PRO) Process and participation | Module 12 (IMM) Immaterial object | Module 15 (CLO) Closure ODP |
|
| Module 11 (CME) Collectivematerial entity | Module 13 (INF) Information object | Module 16 (SPA) Spatial disjointness ODP |
|
| PRO: Photosynthesis, Medical diagnosing; CME: Proteinuria,Penicillin | IMM: Fetogenesis, Stomach anatomy; INF: Operation plan, Pneumonia diagnosis | SPA: Cell membranes, Stomach wall; CLO: Circulatory system, Teeth |
The intervention of the study consisted of the differential training of the students in certain content areas: students in group A received training in modules 10, 12 and 15 and no training in modules 11, 13 and 16, and vice versa for students in group B. Training sessions were kept balanced with regard to instructor, length, difficulty and instructional format. Data were collected in the form of ontology development exercises which were distributed evenly over all content areas with two exercises per training module.
Figure 2Modified CONSORT diagram.
Twelve students were allocated to each group and could be analyzed.
Similarity with the gold standard model.
| group | topic | ontology | fm | n-mtb | mtb | atb |
| A | CLO | tee | −0.4 | −3.8 | −0.1 | −0.3 |
| A | CLO | bud | 6.6 | 0.6 | −2.5 |
|
| A | IMM | sto |
| 1.0 |
| −0.1 |
| A | IMM | fet | −5.4 | 2.0 | 0.3 | 1.2 |
| A | PRO | pho | 1.6 | 1.6 | −0.4 | −2.8 |
| A | PRO | dia | 1.7 | 2.2 | −1.0 | 2.4 |
| B | CME | pru | −4.7 | 0.1 | 0.0 | 0.0 |
| B | CME | pen | −3.6 | −0.9 | 1.4 | 0.1 |
| B | INF | pne | −6.0 | 2.0 | −0.1 | −1.1 |
| B | INF | ope | 1.0 | 1.5 | −0.1 | 0.0 |
| B | SPA | cem | 0.5 | −1.4 | 0.0 | 1.4 |
| B | SPA | sta | 7.0 | 4.1 | −0.8 | 1.5 |
Ontology similarity metrics, displayed as absolute difference in percent between trained and untrained groups ordered by trained group, training topic and individual assessment task (for details see Table 1). The similarity/distance metrics shown are f-measure (fm), MWMGMS after normalization (n-mtb), MWMGMS without normalization (mtb) and average linkage without normalization (atb), the last three combined with triple-bases entity similarity as local measure. Significance levels of group comparisons are indicated as ∼: p0.15, : p0.1, : p0.01, : p.001.
Aggregation of effect-sizes on topic level.
| group | topic | GS-fm [%] | GS-fm d | IH-fm [%] | IH-fm d |
| A | PRO | 1.6 | 0.12 | 8.1 | 0.49 |
| A | IMM | 2.8 | 0.16 | 6.8 | 0.40 |
| A | CLO | 3.1 | 0.23 | 0.5 | 0.04 |
| B | CME | −4.1 | −0.36 | −6.6 | −0.47 |
| B | INF | −2.5 | −0.16 | −7.7 | −0.50 |
| B | SPA | 3.7 | 0.24 | −4.6 | −0.23 |
F-measure ontology similarity metrics with the gold standard (GS) and f-measure intra-group homogeneity (IH) aggregated on topic level, displayed as absolute differences in percent between trained and untrained groups and Cohens’ d effect sizes. Ordering is by trained group and training topic. For details on abbreviations and symbols see Table 2.
Intra-group similarity (homogeneity).
| group | topic | ontology | fm | n-mtb | mtb | atb |
| A | PRO | pho | 9.7 | 0.1 | −0.5 | −0.3 |
| A | PRO | dia | 6.5 | 0.1 | −0.8 | −0.9 |
| A | IMM | sto | 11.3 | −0.3 | 0.0 |
|
| A | IMM | fet | 2.3 | −1.2 | 0.0 | 0.4 |
| A | CLO | tee |
| −0.7 | 0.0 | −0.4 |
| A | CLO | bud | 4.1 | 0.4 | 1.0 | −1.0 |
| B | CME | pru | −8.0 | 0.1 | 0.0 | 0.0 |
| B | CME | pen | −5.2 | −1.4 | 0.3 |
|
| B | INF | pne | −2.3 | −0.2 | 0.0 | 0.1 |
| B | INF | ope | −13.0 | −0.7 | 0.1 |
|
| B | SPA | cem | −9.0 | 0.3 | 0.0 | −0.2 |
| B | SPA | sta | −0.2 | 1.1 | −0.7 | −0.2 |
Intra-group ontology similarity metrics (homogeneity), displayed as absolute difference in percent between trained and untrained groups ordered by trained group, training topic and individual assessment task. For details on abbreviations and symbols see Table 2.
Effect-sizes of similarity with the gold standard ontology and intra-group similarity.
| group | topic | ontology | GS-fm[%] | GS-fm d | IH-fm[%] | IH-fm d |
| A | PRO | pho | 1.6 | 0.12 | 9.7 | 0.66 |
| A | PRO | dia | 1.7 | 0.12 | 6.5 | 0.35 |
| A | IMM | sto | 11.1 |
| 11.3 | 0.59 |
| A | IMM | fet | −5.4 | −0.30 | 2.3 | 0.16 |
| A | CLO | tee | −0.4 | −0.03 | −3.0 | −0.20 |
| A | CLO | bud | 6.6 | 0.51 | 4.1 | 0.31 |
| B | CME | pru | −4.7 | −0.38 | 8.0 | −0.51 |
| B | CME | pen | −3.6 | −0.34 | −5.2 | −0.43 |
| B | INF | pne | −6.0 | −0.46 | −2.3 | −0.23 |
| B | INF | ope | 1.0 | 0.05 | −13.0 | −0.69 |
| B | SPA | cem | 0.5 | 0.04 | −9.0 | −0.80 |
| B | SPA | sta | 7.0 | 0.37 | −0.2 | −0.01 |
F-measure ontology similarity metrics with the gold standard (GS) and f-measure intra-group homogeneity (IH), displayed as absolute differences in percent between trained and untrained groups and Cohens’ d effect sizes. Ordering is by trained group, training topic and individual assessment task. For details on abbreviations and symbols see Table 2.