| Literature DB >> 30932000 |
Jung Hun Lee1, Seon-Young Kwon2, Jiho Chang3, Jin-Sung Yuk4.
Abstract
The exact mechanism of endometriosis is unknown. The recommendation system (RS) based on item similarities of machine learning has never been applied to the relationship between diseases. The study aim was to identify diseases associated with endometriosis by applying RS based on item similarities to insurance data in South Korea. Women aged 15 to 45 years extracted from the Korean Health Insurance Review & Assessment Service National Inpatient Sample (HIRA-NIS) 2009-2015. We used the RS model to extract diseases that were correlated with an endometriosis diagnosis. Among women aged 15 to 45 years, endometriosis was defined as a diagnostic code of N80.x and a concurrent treatment code. A control group was defined as women who did not have the N80.x code. Benign breast diseases, cystitis, and non-toxic goitre were extracted by the RS. A total of 1,730,562 women were selected as the control group, and 11,273 women were selected as the endometriosis group. In logistic regression analysis adjusted for age per 5 years, data year, and socioeconomic status, benign neoplasm of breast (odds ratio (OR): 2.58; 95% confidence interval (CI): 1.90-3.50), other cystitis (OR: 2.63; 95% CI: 1.56-4.44), and non-toxic single thyroid nodule (OR: 1.62; 95% CI: 1.14-2.32) were statistically significant. Endometriosis was associated with benign breast disease, cystitis, and non-toxic goitre.Entities:
Mesh:
Year: 2019 PMID: 30932000 PMCID: PMC6443655 DOI: 10.1038/s41598-019-41973-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Similarity matrix between items. Background colours of cells indicate the similarity between the two items. A stronger blue colour indicates a higher similarity between the two items.
Figure 2Flowchart creating a recommender model using HIRA-NIS data. HIRA-NIS: Health Insurance Review & Assessment Service National Inpatient Sample.
Characteristics of endometriosis and control groups.
| Control | Endometriosis | P-value | |
|---|---|---|---|
| Number of patients | 1,730,562 | 11,273 | |
| Mean age, year | 30.8 ± 0.0 | 34.1 ± 0.1 | <0.01a |
| Low SES | 47,873 (2.8%) | 144 (1.3%) | <0.01 |
| Data year | 0.01 | ||
| 2009 | 249,171 (14.4%) | 1,564 (13.9%) | |
| 2010 | 249,849 (14.4%) | 1,684 (14.9%) | |
| 2011 | 250,509 (14.5%) | 1,622 (14.4%) | |
| 2012 | 247,925 (14.3%) | 1,682 (14.9%) | |
| 2013 | 240,694 (13.9%) | 1,460 (13.0%) | |
| 2014 | 245,698 (14.2%) | 1,653 (14.7%) | |
| 2015 | 246,716 (14.3%) | 1,604 (14.4%) | |
| Benign neoplasm of the breast | 10,368 (0.6%) | 137 (1.2%) | <0.01 |
| Benign mammary dysplasia | 6,255 (0.4%) | 82 (0.7%) | <0.01 |
| Other disorders of the breast | 10798 (0.6%) | 138 (1.2%) | <0.01 |
| Cystitis | 80,222 (4.6%) | 742 (6.6%) | <0.01 |
| Other non-toxic goitre | 13,702 (0.8%) | 119 (1.1%) | <0.01 |
| Iron deficiency anaemia | 20230 (1.2%) | 328 (2.9%) | <0.01 |
| Other anaemias | 8947 (0.5%) | 105 (0.9%) | <0.01 |
SES, socioeconomic status.
Diseases with a prevalence of less than 0.1% in both groups are not shown in the table.
aA weighted t-test was used.
Logistic regression analysis of endometriosis-related candidate diseases using middle-class diagnostic codes.
| Unadjusteda | Adjusted Modelb | |||
|---|---|---|---|---|
| OR (95% CI) | P-value | OR (95% CI) | P-value | |
| Age per 5 years | 1.26 (1.24–1.28) | <0.01 | ||
| Data year | 1.01 (0.99–1.02) | 0.38 | ||
| Low SES | 0.58 (0.44–0.77) | <0.01 | ||
| Benign neoplasm of the breast | 3.48 (2.57–4.72) | <0.01 | 2.58 (1.90–3.51) | <0.01 |
| Benign mammary dysplasia | 2.68 (1.83–3.92) | <0.01 | 1.92 (1.31–2.82) | <0.01 |
| Other disorders of the breast | 2.23 (1.71–2.92) | <0.01 | 1.76 (1.35–2.30) | <0.01 |
| Cystitis | 1.66 (1.46–1.88) | <0.01 | 1.51 (1.33–1.71) | <0.01 |
| Other non-toxic goitre | 1.95 (1.45–2.62) | <0.01 | 1.54 (1.15–2.08) | <0.01 |
| Iron deficiency anaemia | 3.48 (2.87–4.22) | <0.01 | 3.05 (2.51–3.72) | <0.01 |
| Other anaemias | 2.52 (1.82–3.47) | <0.01 | 2.08 (1.49–2.89) | <0.01 |
CI, confidence interval; OR, odds ratio; SES, socioeconomic status.
aORs were analysed for endometriosis and each disease without other adjustments.
bAnalysis was adjusted for all variables in the table (endometriosis ~ age per 5 years + data year + low SES + benign neoplasm of breast + benign mammary dysplasia + other disorders of the breast + cystitis + other non-toxic goitre + iron deficiency anaemia + other anaemias).
Logistic regression analysis of endometriosis-related candidate diseases using full diagnostic codes.
| Unadjusteda | Adjusted Modelb | |||
|---|---|---|---|---|
| OR (95% CI) | P | OR (95% CI) | P | |
| Age per 5 years | 1.26 (1.24–1.28) | <0.01 | ||
| Data year | 1.01 (0.99–1.02) | 0.38 | ||
| Low SES | 0.58 (0.44–0.77) | <0.01 | ||
| Benign neoplasm of the breast | 3.48 (2.57–4.72) | <0.01 | 2.58 (1.90–3.50) | <0.01 |
| Diffuse cystic mastopathy | 3.17 (1.45–6.89) | <0.01 | 2.24 (1.03–4.88) | 0.04 |
| Benign mammary dysplasia | 3.76 (1.92–7.4) | <0.01 | 2.66 (1.35–5.24) | <0.01 |
| Other symptoms in the breast | 3.62 (2.01–6.52) | <0.01 | 2.91 (1.61–5.24) | <0.01 |
| Other disorders of the breast | 2.65 (1.45–4.85) | <0.01 | 1.96 (1.07–3.60) | 0.03 |
| Unspecified disorder of the breast | 2.83 (1.98–4.03) | <0.01 | 2.03 (1.42–2.91) | <0.01 |
| Acute cystitis | 1.51 (1.31–1.74) | <0.01 | 1.34 (1.16–1.54) | <0.01 |
| Other cystitis | 3.13 (1.87–5.26) | <0.01 | 2.63 (1.56–4.44) | <0.01 |
| Unspecified cystitis | 2.02 (1.55–2.64) | <0.01 | 1.70 (1.31–2.22) | <0.01 |
| Non-toxic single thyroid nodule | 2.01 (1.47–2.99) | <0.01 | 1.62 (1.14–2.32) | <0.01 |
| Non-toxic multinodular goitre | 2.15 (1.36–3.4) | <0.01 | 1.60 (1.01–2.53) | 0.05 |
| IDA secondary to blood loss | 6.87 (3.88–12.15) | <0.01 | 5.30 (2.96–9.50) | <0.01 |
| Other IDA | 2.96 (1.96–4.46) | <0.01 | 2.31 (1.52–3.52) | <0.01 |
| Unspecified IDA | 3.37 (2.68–4.23) | <0.01 | 2.82 (2.23–3.56) | <0.01 |
| Other specified anaemias | 3.16 (1.88–5.31) | <0.01 | 2.50 (1.48–4.24) | <0.01 |
| Unspecified anaemia | 2.48 (1.74–3.53) | <0.01 | 2.03 (1.41–2.93) | <0.01 |
CI, confidence interval; OR, odds ratio; SES, socioeconomic status; IDA, iron deficiency anaemias.
aORs were analysed for endometriosis and each disease without other adjustments.
bAnalysis was adjusted for all variables in the table (endometriosis ~ age per 5 years + data year + low SES + benign neoplasm of breast + diffuse cystic mastopathy + unspecified benign mammary dysplasia + other signs and symptoms in the breast + other specified disorders of the breast + unspecified disorder of the breast + acute cystitis + other cystitis + unspecified cystitis + non-toxic single thyroid nodule + non-toxic multinodular goitre + iron deficiency anaemia secondary to blood loss + other iron deficiency anaemias + unspecified iron deficiency anaemia + other specified anaemias + unspecified anaemia).