| Literature DB >> 32646408 |
Liangliang Liu1, Xiaojing Wu1, Hui Liu2, Xinyu Cao3, Haitao Wang3, Hongwei Zhou4, Qi Xie5.
Abstract
BACKGROUND: A semi-supervised model is proposed for extracting clinical terms of Traditional Chinese Medicine using feature words.Entities:
Keywords: Clinical terms; Deep learning; NER; Semi-supervised; TCM
Mesh:
Year: 2020 PMID: 32646408 PMCID: PMC7477860 DOI: 10.1186/s12911-020-1108-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1TCM Clinical Terms Extraction Model. This figure shows the overall TCM clinical terms extraction steps
Fig. 2The example of the combined of two kinds of character vectors. We combine the character vectors using the illustrated method in the figure
Fig. 3A part of the organized feature words set. We have sorted out different types of feature word sets, and the above are some examples of feature word sets
There are five types of TCM clinical terms label types shown below
| TCM Term Categories | Label Types |
|---|---|
| Chinese Traditional Medicine | B-MED, I-MED |
| Formulas | B-FOL, I-FOL |
| Symptoms | B-SYM, I-SYM |
| Diseases | B-DES, I-DES |
| Patterns | B-PAT, I-PAT |
| Other | O |
Extraction result for TCM clinical terms in the test dataset
| Type | Character Vector | Feature Word Categories | Corpus | Precision | Recall | F1 |
|---|---|---|---|---|---|---|
| Supervised | WIKI | 0 | Train_2w +Dev_3.5 k +Test_5.1 k | 0.6831 | 0.7053 | 0.6941 |
| Semi-Supervised | WIKI | 0 | Train_2w_10w +Dev_3.5 k +Test_5.1 k | 0.7782 | 0.7294 | 0.7530 |
| TCM | 0 | 0.7474 | 0.7262 | 0.7339 | ||
| 0.3TCM + 0.7WIKI | 0 | 0.7893 | 0.7294 | 0.7561 | ||
| 0.7TCM + 0.3WIKI | 0 | 0.7503 | 0.7335 | 0.7298 | ||
| 0.5TCM + 0.5WIKI | 0 | 0.7935 | 0.7228 | 0.7585 | ||
| 0.5TCM + 0.5WIKI | 4 | 0.7889 | 0.7525 | 0.7703 | ||
| 0.5TCM + 0.5WIKI | 39 | 0.7932 | 0.7723 | 07826 | ||
| 0.5TCM + 0.5WIKI | 8 | 0.7756 | 0.7987 |
Extraction result for TCM clinical terms in the test dataset
| Precision | Recall | F1 | |
|---|---|---|---|
| DES | 0.7419 | 0.7667 | 0.7541 |
| FOL | 0.6429 | 0.5000 | 0.5625 |
| MED | 0.8082 | 0.8489 | 0.8281 |
| PAT | 0.6250 | 0.8333 | 0.7143 |
| SYM | 0.7699 | 0.7909 | 0.7803 |
| Total | 0.7756 | 0.7987 | 0.7870 |