| Literature DB >> 32566676 |
Kai-Po Chang1,2, John Wang2, Chi-Chang Chang3,4, Yen-Wei Chu5,6,7,8,9,10.
Abstract
Information about the expression status of hormone receptors such as estrogen receptor (ER), progesterone receptor (PR), and Her-2 is crucial in the management and prognosis of breast cancer. Therefore, the retrieval and analysis of hormone receptor expression characteristics in metastatic breast cancer may be valuable in breast cancer study. Herein, we report a text mining tool based on word/phrase matching that retrieves hormone receptor expression data of regional or distant metastatic breast cancer from pathology reports. It was tested on pathology reports at the China Medical University Hospital from 2013 to 2018. The tool showed specificities of 91.6% and 63.3% for the detection of regional lymph node metastasis and distant metastasis, respectively. Sensitivity in immunohistochemical study result extraction in these cases was 98.6% for distant metastasis and 78.3% for regional lymph node metastasis. Statistical analysis on these retrieved data showed significant difference s in PR and Her-2 expressions between regional and metastatic breast cancer, which is compatible with previous studies. In conclusion, our study shows that metastatic breast cancer hormone receptor expression characteristics can be retrieved by text mining. The algorithm designed in this study may be useful in future studies about text mining in pathology reports.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32566676 PMCID: PMC7273481 DOI: 10.1155/2020/2654815
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Data retrieval and preprocessing steps.
Figure 2Protocol for searching metastatic breast cancer cases.
Figure 3Reporting immunohistochemical study results as a sentence in the microscopic description.
Figure 4Reporting immunohistochemical study results as a solitary paragraph with multiple rows.
Figure 5Reporting immunohistochemical study results as a solitary paragraph, with different studies separated by commas.
Summary of the results of metastatic breast cancer detection.
| Metastatic site | Cases labeled as metastatic carcinoma | Label correct | Specificity |
|---|---|---|---|
| Regional | 359 | 329 | 91.6% |
| Distant | 131 | 83 | 63.3% |
Summary of metastatic sites.
| Metastatic site | Case number |
|---|---|
| Nonregional lymph node | 22 |
| Bone | 20 |
| Brain | 12 |
| Liver | 8 |
| GI tract | 8 |
| Lung | 7 |
| Others | 6 |
Summary of results of the extraction of immunohistochemical study result data.
| Metastatic site | Case number | Result detected | Result correct | Sensitivity | Specificity |
|---|---|---|---|---|---|
| Regional | 83 | 65 | 64 | 78.3% | 98.4% |
| Distant | 329 | 322 | 322 | 98.6% | 100% |
Summary of immunohistochemical study results of distant metastatic tumors.
| Marker | Positive | Equivocal | Negative | Not tested |
|---|---|---|---|---|
| ER | 36 (62.0%) | 28 (38.0%) | 0 | |
| PR | 12 (23.0%) | 40 (67.0%) | 12 | |
| Her-2 | 23 (39.6%) | 11 (19.0%) | 24 (41.4%) | 8 |
Summary of immunohistochemical study results of regional metastatic tumors.
| Marker | Positive | Equivocal | Negative | Not tested |
|---|---|---|---|---|
| ER | 198 (64.3%) | 110 (35.7%) | 14 | |
| PR | 52 (57.1%) | 29 (42.9%) | 231 | |
| Her-2 | 103 (34.0%) | 95 (31.3%) | 112 (37.0%) | 8 |
Difference of ER expression between distant and regionally metastatic breast cancers.
| ER result | Distant metastasis | Regional metastasis |
|---|---|---|
| Positive | 36 | 198 |
| Negative | 28 | 110 |
χ 2 = 1.1422, df = 1, p = 0.2852.
Difference of PR expression between distant and regionally metastatic breast cancers.
| PR result | Distant metastasis | Regional metastasis |
|---|---|---|
| Positive | 12 | 52 |
| Negative | 40 | 29 |
χ 2 = 19.835, df = 1, p = 8.444e − 06.
Difference of Her-2 expression between expression between distant and regionally metastatic breast cancers.
| Her-2 result | Distant metastasis | Regional metastasis |
|---|---|---|
| Positive | 23 | 103 |
| Equivocal | 11 | 95 |
| Negative | 24 | 112 |
χ 2 = 37.556, df = 2, p = 6.995e − 09.
ER expression status of major metastatic sites.
| ER result | Bone | Liver | Lung |
|---|---|---|---|
| Positive | 8 | 4 | 1 |
| Negative | 4 | 4 | 1 |
χ 2 = 3.5011, df = 2, p = 0.1737.
PR expression status of major metastatic sites.
| PR result | Bone | Liver | Lung |
|---|---|---|---|
| Positive | 4 | 2 | 1 |
| Negative | 7 | 5 | 5 |
χ 2 = 4.6286, df = 2, p = 0.09884.
Her-2 expression status of major metastatic sites.
| Her-2 result | Bone | Liver | Lung |
|---|---|---|---|
| Positive | 3 | 3 | 6 |
| Equivocal | 3 | 0 | 5 |
| Negative | 4 | 1 | 0 |
χ 2 = 7.5455, df = 4, p = 0.1097.