| Literature DB >> 29220439 |
Alba Gutiérrez-Sacristán1, Àlex Bravo1, Marta Portero-Tresserra2, Olga Valverde2, Antonio Armario3,4, M C Blanco-Gandía5, Adriana Farré6, Lierni Fernández-Ibarrondo7, Francina Fonseca6, Jesús Giraldo4,8, Angela Leis1, Anna Mané4,6, M A Mayer1, Sandra Montagud-Romero5, Roser Nadal4,9, Jordi Ortiz4,10, Francisco Javier Pavon11, Ezequiel Jesús Perez6, Marta Rodríguez-Arias5, Antonia Serrano11, Marta Torrens6, Vincent Warnault2, Ferran Sanz1, Laura I Furlong1.
Abstract
Database URL: http://www.psygenet.org. PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar.Entities:
Mesh:
Year: 2017 PMID: 29220439 PMCID: PMC5502359 DOI: 10.1093/database/bax043
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Examples of Association qualifiers. Disease and genes that have to be evaluated are highlighted in the sentence in green and orange, respectively.
| Association Type | PMID | Sentence |
|---|---|---|
| Association | 267012 | The |
| No Association | 17692928 | There was no association between |
| False | 25225167 | The findings that have gained support indicate that genetic variants of |
| Error | 21174530 |
Figure 2.The PsyGeNET annotation tool. A screenshot of the annotation tool is shown, see the text for more details.
Figure 1.The PsyGeNET curation workflow. The workflow includes: a) a Pilot phase for training of the curators and testing of the annotation tool, b) Curation Phase I and II where the curation of the text-mined data took place, and c) three Analysis phases after each curation to analyze the results and prepare the data for the next stage.
Figure 3.Psychiatric disease categories and the number of associated genes obtained by text mining in the present study, before expert curation.
Inter-annotator agreement during CP-I
| Teams | Validations | Agreement | Agreement (%) |
|---|---|---|---|
| Team 1 | 494 | 325 | 65.79 |
| Team 2 | 319 | 194 | 60.89 |
| Team 3 | 489 | 342 | 69.94 |
| Team 4 | 450 | 402 | 89.33 |
| Team 5 | 492 | 308 | 62.60 |
| Team 6 | 508 | 341 | 67.12 |
| Team 7 | 463 | 317 | 68.46 |
| Team 8 | 516 | 363 | 70.35 |
| Team 9 | 334 | 221 | 66.17 |
Figure 4.The PsyGeNET curation workflow results. The workflow includes the results in each phase according to the agreement or disagreement between experts and the final number of associations included in the new version of PsyGeNET database (PsyGeNET V.02) according to the evidence that support each annotation.
Number of validations and agreement obtained during each step of the curation process
| Curation phase | Total validations | Agreement | Disagreement |
|---|---|---|---|
| CP-I | 4065 | 2813 (69%) | 1252 (31%) |
| CP-II | 1252 | 888 (71%) | 364 (29%) |
| Whole curation workflow | 4065 | 3701 (91%) | 364 (9%) |
Figure 5.Summary of the agreement results. Each bar in the bar-plot represents the number of validations annotated as: Association, No association, False, Error and Not clear.