| Literature DB >> 30961595 |
Boyu Chen1, Hao Jin1, Zhiwen Yang1, Yingying Qu2, Heng Weng3, Tianyong Hao4.
Abstract
BACKGROUND: Gender information frequently exists in the eligibility criteria of clinical trial text as essential information for participant population recruitment. Particularly, current eligibility criteria text contains the incompleteness and ambiguity issues in expressing transgender population, leading to difficulties or even failure of transgender population recruitment in clinical trial studies.Entities:
Keywords: Clinical trial; Gender; Information extraction; Summarization; Transgender
Mesh:
Year: 2019 PMID: 30961595 PMCID: PMC6454593 DOI: 10.1186/s12911-019-0768-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The number of clinical trials recruiting transgender population on ClincialTrial.gov
Fig. 2The virtual gender model and its mapping relations with conventional gender types
Fig. 3The framework of our approach
The gender mention types and their related gender mention features
| Gender mention types | Gender mention features |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Examples of logical judgment functions and their descriptions
| Function name | Description | Example |
|---|---|---|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
|
Examples of transformation functions and their descriptions
| Function | Description | Parameter Restriction | Example |
|---|---|---|---|
|
|
| ||
|
|
| ||
|
|
|
|
The parameter training using F1-measure with three-fold cross-validation
|
| Round 1 | Round 2 | Round 3 | Round 4 | Round 5 | Round 6 | Round 7 | Round 8 | Round 9 | Round 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.162 | 0.163 | 0.174 | 0.157 | 0.159 | 0.162 | 0.162 | 0.158 | 0.157 | 0.156 |
| 2 | 0.599 | 0.563 | 0.659 | 0.599 | 0.611 | 0.618 | 0.555 | 0.607 | 0.591 | 0.616 |
| 3 | 0.741 | 0.724 | 0.830 | 0.747 | 0.726 | 0.769 | 0.731 | 0.745 | 0.748 | 0.763 |
| 4 | 0.866 | 0.849 | 0.853 |
| 0.859 | 0.888 | 0.863 | 0.877 | 0.872 | 0.879 |
| 5 |
|
|
| 0.869 |
|
|
|
|
|
|
| 6 | 0.854 | 0.835 | 0.834 | 0.846 | 0.844 | 0.873 | 0.855 | 0.880 | 0.858 | 0.861 |
| 7 | 0.838 | 0.820 | 0.834 | 0.828 | 0.830 | 0.858 | 0.840 | 0.860 | 0.843 | 0.843 |
| 8 | 0.838 | 0.822 | 0.836 | 0.829 | 0.831 | 0.859 | 0.842 | 0.862 | 0.845 | 0.845 |
| 9 | 0.838 | 0.822 | 0.836 | 0.829 | 0.831 | 0.859 | 0.842 | 0.862 | 0.845 | 0.845 |
| 10 | 0.840 | 0.824 | 0.837 | 0.831 | 0.833 | 0.861 | 0.844 | 0.863 | 0.847 | 0.845 |
Fig. 4The performance of our approach on different datasets
The performance comparison on the datasets (A to G) using Macro-averaged F1-measure
| Method | A | B | C | D | E | F | G |
|---|---|---|---|---|---|---|---|
| Logit Boost | 0.637 | 0.674 | 0.681 | 0.639 | 0.667 | 0.636 | 0.628 |
| Logistic | 0.745 | 0.735 | 0.693 | 0.667 | 0.678 | 0.706 | 0.646 |
| Bayes Net | 0.680 | 0.662 | 0.652 | 0.624 | 0.665 | 0.665 | 0.655 |
| Simple Logistic | 0.761 | 0.668 | 0.697 | 0.684 | 0.644 | 0.685 | 0.658 |
| LMT | 0.772 | 0.668 | 0.643 | 0.686 | 0.625 | 0.686 | 0.665 |
| Random Committee | 0.728 | 0.738 | 0.696 | 0.695 | 0.688 | 0.750 | 0.673 |
| Decision Table | 0.637 | 0.609 | 0.590 | 0.599 | 0.605 | 0.617 | 0.675 |
| Random Tree | 0.674 | 0.667 | 0.661 | 0.646 | 0.652 | 0.668 | 0.718 |
| Random Forest | 0.774 | 0.739 | 0.760 | 0.698 | 0.733 | 0.747 | 0.765 |
| Our approach |
|
|
|
|
|
|
|