| Literature DB >> 25810766 |
Martin Krallinger1, Florian Leitner2, Obdulia Rabal3, Miguel Vazquez1, Julen Oyarzabal3, Alfonso Valencia1.
Abstract
Natural language processing (NLP) and text mining technologies for the chemical domain (ChemNLP or chemical text mining) are key to improve the access and integration of information from unstructured data such as patents or the scientific literature. Therefore, the BioCreative organizers posed the CHEMDNER (chemical compound and drug name recognition) community challenge, which promoted the development of novel, competitive and accessible chemical text mining systems. This task allowed a comparative assessment of the performance of various methodologies using a carefully prepared collection of manually labeled text prepared by specially trained chemists as Gold Standard data. We evaluated two important aspects: one covered the indexing of documents with chemicals (chemical document indexing - CDI task), and the other was concerned with finding the exact mentions of chemicals in text (chemical entity mention recognition - CEM task). 27 teams (23 academic and 4 commercial, a total of 87 researchers) returned results for the CHEMDNER tasks: 26 teams for CEM and 23 for the CDI task. Top scoring teams obtained an F-score of 87.39% for the CEM task and 88.20% for the CDI task, a very promising result when compared to the agreement between human annotators (91%). The strategies used to detect chemicals included machine learning methods (e.g. conditional random fields) using a variety of features, chemistry and drug lexica, and domain-specific rules. We expect that the tools and resources resulting from this effort will have an impact in future developments of chemical text mining applications and will form the basis to find related chemical information for the detected entities, such as toxicological or pharmacogenomic properties.Entities:
Keywords: BioCreative; ChemNLP; chemical entity recognition; chemical indexing; machine learning; named entity recognition; text mining
Year: 2015 PMID: 25810766 PMCID: PMC4331685 DOI: 10.1186/1758-2946-7-S1-S1
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Example team predictions for the CDI (left) and CEM (right) tasks.
| CDI | CEM | |||||
|---|---|---|---|---|---|---|
| 23380242 | TiO2 | 1 | 0.9 | T:0:16 | 1 | 0.5 |
| 23380242 | Titanium dioxide | 2 | 0.9 | A:323:331 | 2 | 0.5 |
| 23380242 | pyrrolidine dithiocarbamate | 3 | 0.9 | A:333:337 | 3 | 0.5 |
| 23380242 | SB203580 | 4 | 0.9 | A:528:532 | 4 | 0.5 |
| 23380242 | titanium | 5 | 0.9 | A:763:767 | 5 | 0.5 |
| 23380242 | LY294002 | 6 | 0.9 | A:894:898 | 6 | 0.5 |
| 23380242 | epigallocatechin gallate | 7 | 0.9 | A:945:949 | 7 | 0.5 |
| 23380242 | apocynin | 8 | 0.9 | A:1108:1112 | 8 | 0.5 |
| 23380242 | PD98059 | 9 | 0.9 | A:1118:1122 | 9 | 0.5 |
| 23380242 | SP600125 | 10 | 0.9 | A:1342:1369 | 10 | 0.5 |
Both task predictions have as first column the document identifier (PMID) and originally are formatted as tabulator-separated plain text columns. Only the top ten predictions are shown.
CHEMDNER team overview.
| Id | Team contact | Type | Affiliation | CDI | CEM | Tool URL | |
|---|---|---|---|---|---|---|---|
| 173 | Z. Lu | A | NCBI/NLM/NIH, USA | 5/3 | 5/1 | [ | [ |
| 177 | T. Can | A | Middle East Technical Univ., Ankara, Turkey | 2/16 | 2/16 | [ | [ |
| 179 | D. Lowe | C | NextMove Software | 5/4 | 5/2 | [ | [ |
| 182 | A. Klenner | A | Fraunhofer-Institute for Algor. and Sci.Comp., Germany | 4/17 | 4/18 | - | - |
| 184 | R. Rak | A | National Centre for Text Mining; Univ. Manchester, UK | 5/1 | 5/3 | [ | - |
| 185 | S.V. Ramanan | C | RelAgent Pvt Ltd | 3/6 | 3/6 | [ | [ |
| 191 | A. Usie Chimenos | A | Univ. of Lleida, Spain | 3/15 | 1/17 | [ | [ |
| 192 | H. Xu | A | Univ. of Texas Health Science Center at Houston, USA | 0/- | 5/5 | [ | - |
| 196 | F. Couto | A | LASIGE, Univ. of Lisbon, Portugal | 5/15 | 5/14 | [ | [ |
| 197 | S. Matos | A | Univ. of Aveiro, Portugal | 5/5 | 5/4 | [ | [ |
| 198 | P. Thomas | A | Humboldt-Univ. zu Berlin, Germany | 5/2 | 5/3 | [ | [ |
| 199 | M. Irmer | C | OntoChem | 1/8 | 1/7 | [ | [ |
| 207 | K. Verspoor | A | National ICT Australia, Australia | 2/10 | 2/11 | [ | - |
| 214 | D. Bonniot de Ruisselet | C | ChemAxon | 5/9 | 5/8 | - | [ |
| 217 | L. Li | A | Dalian Univ. of Technology, P.R. China | 5/11 | 5/14 | [ | - |
| 219 | M. Khabsa | A | The Pennsylvania State University, USA | 5/13 | 5/12 | [ | [ |
| 222 | S.A. Akhondi | A | Erasmus MC, Rotterdam, The Netherlands | 5/8 | 5/7 | [ | - |
| 225 | D. Sanchez-Cisneros | A | Univ. Carlos III and Univ. Autonoma Madrid, Spain | 5/18 | 5/19 | [ | [ |
| 231 | D. Ji | A | Wuhan University, China | 5/1 | 5/1 | [ | [ |
| 233 | T. Munkhdalai | A | Chungbuk National Univ., South Korea | 5/4 | 5/5 | [ | [ |
| 238 | H. Liu | A | Mayo Clinic, USA | 5/14 | 5/13 | [ | - |
| 245 | S. Zitnik | A | Univ. of Ljubljana, Slovenia | 3/7 | 3/7 | [ | - |
| 259 | S. Xu | A | Inst. of Scien. and Techn. Info. of China, P.R. China | 0/- | 5/7 | [ | [ |
| 262 | A. Ekbal | A | IIT Patna, India | 0/- | 5/9 | [ | - |
| 263 | M. Yoshioka | A | Hokkaido Univ., Sapporo, Japan | 0/- | 3/10 | [ | - |
| 265 | S. Ching-Yao | A | Yuan Ze Univ./Taipei Medical Univ., Taiwan, R.O.C. | 2/13 | 2/15 | [ | - |
| 267 | L. Li | A | Dalian Univ. of Technology, P.R. China | 1/12 | 0/- | - | - |
In the CDI and CEM columns the number or runs and the rank group of the best submission are shown. The asterisks indicate teams with articles in this special issue. Id: team identifier, A: academic, C: commercial
CDI evaluation results (best run per team only).
| Row | Team |
|
|
|
| Range | Group | Rank |
|---|---|---|---|---|---|---|---|---|
| A | 231 | 87.02% | 89.41% | 88.20% | 0.30% | A-C | 1 | 1 |
| B | 184 | 91.28% | 85.24% | 88.15% | 0.34% | A-C | 1 | 2 |
| C | 198 | 89.34% | 86.51% | 87.90% | 0.33% | A-D | 2 | 3 |
| D | 173 | 87.81% | 87.24% | 87.52% | 0.33% | C-D | 3 | 4 |
| E | 179 | 87.58% | 84.86% | 86.20% | 0.36% | E-F | 4 | 5 |
| F | 233 | 86.03% | 85.45% | 85.74% | 0.35% | E-F | 4 | 6 |
| G | 197 | 86.35% | 82.37% | 84.31% | 0.35% | G-G | 5 | 7 |
| H | 185 | 82.77% | 83.19% | 82.98% | 0.38% | H-H | 6 | 8 |
| I | 245 | 83.35% | 75.38% | 79.17% | 0.41% | I-I | 7 | 9 |
| J | 199 | 84.91% | 71.46% | 77.61% | 0.46% | J-K | 8 | 10 |
| K | 222 | 84.55% | 71.65% | 77.57% | 0.46% | J-K | 8 | 11 |
| L | 214 | 86.40% | 68.77% | 76.58% | 0.47% | L-M | 9 | 12 |
| M | 207 | 81.24% | 71.07% | 75.82% | 0.46% | L-N | 10 | 13 |
| N | 217 | 73.44% | 77.25% | 75.30% | 0.42% | M-N | 11 | 14 |
| O | 267 | 72.65% | 75.86% | 74.22% | 0.45% | O-O | 12 | 15 |
| P | 219 | 79.11% | 66.13% | 72.04% | 0.46% | P-Q | 13 | 16 |
| Q | 265 | 83.85% | 62.40% | 71.55% | 0.48% | P-Q | 13 | 17 |
| R | 238 | 76.49% | 64.96% | 70.25% | 0.56% | R-R | 14 | 18 |
| S | 191 | 80.99% | 58.28% | 67.78% | 0.49% | S-T | 15 | 19 |
| T | 196 | 57.66% | 81.48% | 67.53% | 0.38% | S-T | 15 | 20 |
| U | 177 | 62.11% | 70.20% | 65.91% | 0.47% | U-U | 16 | 21 |
| V | 182 | 60.31% | 57.36% | 58.79% | 0.49% | V-V | 17 | 22 |
| W | 225 | 60.80% | 53.09% | 56.69% | 0.50% | W-W | 18 | 23 |
P: precision; R: recall, F1: F-score, SD: standard deviation in the bootstrap samples.
CEM evaluation results (best run per team only).
| Row | Team |
|
|
|
| Range | Group | Rank |
|---|---|---|---|---|---|---|---|---|
| A | 173 | 89.09% | 85.75% | 87.39% | 0.37% | A-C | 1 | 1 |
| B | 231 | 89.10% | 85.20% | 87.11% | 0.37% | A-C | 1 | 2 |
| C | 179 | 88.73% | 85.06% | 86.86% | 0.41% | A-E | 2 | 3 |
| D | 184 | 92.67% | 81.24% | 86.58% | 0.39% | C-F | 3 | 4 |
| E | 198 | 91.08% | 82.30% | 86.47% | 0.38% | C-F | 3 | 5 |
| F | 197 | 86.50% | 85.66% | 86.08% | 0.45% | D-F | 4 | 6 |
| G | 192 | 89.42% | 81.08% | 85.05% | 0.44% | G-H | 5 | 7 |
| H | 233 | 88.67% | 81.17% | 84.75% | 0.40% | G-H | 5 | 8 |
| I | 185 | 84.45% | 80.12% | 82.23% | 0.44% | I-I | 6 | 9 |
| J | 245 | 84.82% | 72.14% | 77.97% | 0.47% | J-N | 7 | 10 |
| K | 199 | 85.20% | 71.77% | 77.91% | 0.47% | J-N | 7 | 11 |
| L | 222 | 85.83% | 71.22% | 77.84% | 0.50% | J-N | 7 | 12 |
| M | 259 | 88.79% | 69.08% | 77.70% | 0.49% | J-N | 7 | 13 |
| N | 214 | 89.26% | 68.08% | 77.24% | 0.52% | J-O | 8 | 14 |
| O | 262 | 78.28% | 74.59% | 76.39% | 0.45% | N-P | 9 | 15 |
| P | 263 | 82.14% | 70.94% | 76.13% | 0.46% | O-P | 10 | 16 |
| Q | 207 | 84.63% | 67.48% | 75.09% | 0.51% | Q-Q | 11 | 17 |
| R | 219 | 80.46% | 67.54% | 73.44% | 0.51% | R-R | 12 | 18 |
| S | 238 | 76.90% | 66.65% | 71.41% | 0.56% | S-S | 13 | 19 |
| T | 196 | 63.92% | 77.88% | 70.21% | 0.44% | T-U | 14 | 20 |
| U | 217 | 73.17% | 67.23% | 70.08% | 0.47% | T-U | 14 | 21 |
| V | 265 | 86.35% | 57.17% | 68.79% | 0.49% | V-V | 15 | 22 |
| W | 177 | 62.23% | 67.84% | 64.92% | 0.55% | W-W | 16 | 23 |
| X | 191 | 75.71% | 55.04% | 63.74% | 0.59% | X-X | 17 | 24 |
| Y | 182 | 61.46% | 60.33% | 60.89% | 0.57% | Y-Y | 18 | 25 |
| Z | 225 | 62.47% | 53.51% | 57.65% | 0.59% | Z-Z | 19 | 26 |
P: precision; R: recall, F1: F-score, SD: standard deviation in the bootstrap samples.
Figure 1Overview of the methods used by participating teams.
Figure 2Overview of used features by participating teams.