| Literature DB >> 31138159 |
Zolzaya Dashdorj1,1, Min Song2.
Abstract
BACKGROUND: Due to the advent of deep learning, the increasing number of studies in the biomedical domain has attracted much interest in feature extraction and classification tasks. In this research, we seek the best combination of feature set and hyperparameter setting of deep learning algorithms for relation classification. To this end, we incorporate an entity and relation extraction tool, PKDE4J to extract biomedical features (i.e., biomedical entities, relations) for the relation classification. We compared the chosen Convolutional Neural Networks (CNN) based classification model with the most widely used learning algorithms.Entities:
Keywords: Biomedical data analysis; Convolutional neural networks; Deep learning; Hyperparameter optimization; Relation classification
Mesh:
Year: 2019 PMID: 31138159 PMCID: PMC6538539 DOI: 10.1186/s12859-019-2808-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The overview of the PKDE4J architecture which includes our proposed relation classification module
The description of features with an example that extracted from a biomedical sentence by an entity and relation extraction tool, PKDE4J
| No | Biomedical features | Description | Example feature record |
|---|---|---|---|
| 1 | ID | Identifier number of an article | 15248468 |
| 2 | Sent ID | Sentence id of a sentence | 0 |
| in the abstract of an article | |||
| 3 | Entity-L | Entity on the left side | dehydroepiandrosterone sulfate |
| 4 | Type-L | Type of the entity on the left side | COMPOUND |
| 5 | Context-L | Context of the entity on the left side | NA; |
| 6 | Entity-R | Entity on the right side | ubiquinone-9 |
| 7 | Type-R | Type of the entity on the right side | COMPOUND |
| 8 | Context-R | Context of the entity on the right side | NA |
| 9 | Negation | Negativeness of the relation | POSITIVE |
| 10 | Tense | Tense of the relation | ACTIVE |
| 11 | Verb | Verb of the relation | increase |
| 12 | Relation | Reference word of the relation | LOCATION_OF |
| 13 | Context level | Level of the context in the relation | level=0 |
| 14 | Verb phrase | Verb phrase | increases hepatic ubiquinone-9 in |
| male F-344 | |||
| 15 | Sentence | Raw text of a sentence | Dehydroepiandrosterone sulfate |
| increases hepatic ubiquinone-9 | |||
| in male F-344 rats |
Fig. 2The distribution of the labeled dataset for binary and multi-class classification
Fig. 3The architecture of our proposed CNN classification with multiple convolutional layers
Fig. 4The hierarchy of relation classification: The right side entity is represented by R and the left side entity by L. ↑ represents a growing state and ↓ represents a declining state of an entity that affected by the relation verb
Entity-Relation Group based on pre-extracted features in sentence level and the raw text of sentences
| Group No | Type | Selected features concatenated with a order |
|---|---|---|
| Group 1 | Feature | Entity-L, Entity-R, Verb |
| Group 2 | Feature | Entity-L, Entity-R, Negation, Tense, Verb, Relation |
| Group 3 | Feature | Entity-L, Type-L, Context-L, Entity-R, Type-R, Context-R, |
| Negation,Tense, Verb, Relation, Context level, Verb phrase | ||
| Group 4 | Feature and | Entity-L, Type-L, Context-L, Entity-R, Type-R, Context-R, |
| Sentence | Negation, Tense, Verb, Relation, Context level, Verb phrase, | |
| Raw text of a sentence | ||
| Group 5 | Sentence | Raw text of a sentence |
The performance (weighted macro-average metric) of CNN based binary classification model on validation set
| Group No | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|
| Max/Ave/Min | Max/Ave/Min | Max/Ave/Min | Max/Ave/Min | |
| Group 1 | 89.81/77.19/57.27 | 72.92/56.85/45.22 | 51.85/50.65/49.62 | 94.63/86.26/71.14 |
| Group 2 | 90.28/82.42/72.29 | 77.10/72.45/69.55 | 70.93/64.45/58.61 | 94.79/88.29/76.83 |
| Group 3 | 90.97/81.45/69.05 | 80.40/72.36/66.74 | 66.90/62.11/58.32 | 95.18/87.90/75.64 |
| Group 4 | 90.28/81.22/68.13 | 79.69/71.30/66.96 | 64.66/59.67/53.13 | 94.85/88.10/76.37 |
| Group 5 | 89.58/78.4/59.35 | 87.35/65.23/57.90 | 60.08/53.81/50.92 | 94.48/86.94/72.59 |
The performance (weighted macro-average metric) of CNN based multi-class classification model on validation set
| Group No | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|
| Max/Ave/Min | Max/Ave/Min | Max/Ave/Min | Max/Ave/Min | |
| Group 1 | 77.74/68.69/60.74 | 88.54/49.78/26.35 | 11.11/10.5/10.25 | 47.70/36.3/21.44 |
| Group 2 | 76.67/63.68/44.24 | 76.67/55.67/20.23 | 11.11/10.22/10.00 | 86.80/67.19/25.18 |
| Group 3 | 76.91/63.77/44.47 | 76.91/64.50/48.15 | 11.11/10.26/10.00 | 86.95/69.33/21.82 |
| Group 4 | 76.91/63.77/44.24 | 76.91/63.77/44.24 | 11.11/10.22/10.00 | 86.95/77.26/61.34 |
| Group 5 | 76.91/63.77/44.24 | 76.91/63.77/44.24 | 11.11/10.22/10.00 | 86.95/77.26/61.34 |
Fig. 5The performance (weighted macro-average F1-score) of our proposed CNN based binary classification model
Fig. 6The performance (weighted macro-average F1-score) of our proposed CNN based multi-class classification model
Fig. 7Hyperparameters tuning on binary classification: hyperparameters effect vs loss function value
The optimization performance (weighted macro-average metric) in binary classification
| Optimization | Random search | Grid search |
|---|---|---|
| Execution time | 1150.446s | 1688.907s |
| Accuracy | 90.53% | 87.30% |
| Precision | 90.53% | 60.07% |
| Recall | 50.00% | 58.04% |
| F1-score | 90.53% | 93.06% |
| Suggested | learning rate=0.001; l2=1.0E-5; | learning rate=0.1; l2=1.0E-5; |
| hyperparameters | beta1=0.001; beta2=0.001; | beta1=0.001; beta2=0.001; |
| epsilon=0.001; activation=ReLU; | epsilon=0.001; activation=ReLU; |