| Literature DB >> 35923619 |
Feng Liu1, Xinyu Zhang2.
Abstract
Thyroid disease instances have rapidly increased in the past few decades; however, the cause of the disease remains unclear. Understanding the pathogenesis of thyroid disease will potentially reduce morbidity and mortality rates. Currently, the identified risk factors from existing studies are controversial as they were determined through qualitative analysis and were not further confirmed by quantitative implementations. Association rule mining, as a subset of data mining techniques, is dedicated to revealing underlying correlations among multiple attributes from a complex heterogeneous dataset, making it suitable for thyroid disease pathogenesis identification. This study adopts two association rule mining algorithms (i.e., Apriori and FP-Growth Tree) to identify risk factors correlated with thyroid disease. Extensive experiments were conducted to reach impartial findings with respect to knowledge discovery through two independent digital health datasets. The findings confirmed that gender, hypertension, and obesity are positively related to thyroid disease development. The history of I131 treatment and Triiodothyronine level can be potential factors for evaluating subsequent thyroid disease.Entities:
Keywords: association rule mining; data mining; machine learning; risk factors; thyroid disease pathogenesis
Mesh:
Year: 2022 PMID: 35923619 PMCID: PMC9339634 DOI: 10.3389/fendo.2022.939367
Source DB: PubMed Journal: Front Endocrinol (Lausanne) ISSN: 1664-2392 Impact factor: 6.055
Figure 1Thyroid disease pathogenesis knowledge extraction framework.
Dataset-I Thyroid Disease Dataset Selected Attributes.
| Attributes | Descriptions | Values |
|---|---|---|
| Age | Age group intervals | 20–30 (Age between 20 to 30) |
| 30–40 (Age between 30 to 40) | ||
| 40–50 (Age between 40 to 50) | ||
| 50–60 (Age between 50 to 60) | ||
| 60–70 (Age between 60 to 70) | ||
| 70–80 (Age between 70 to 80) | ||
| Sex | Gender groups | M=Male and F=female |
| On Thyroxine | On thyroxine status | f=False (Not on thyroxine) |
| t=True (On thyroxine) | ||
| On anti-thyroid med | Anti-thyroid medication | f=False (Currently not on anti-thyroid med) |
| t=True (Currently on anti-thyroid med) | ||
| Sick | Current sick status | f=False (Currently not sick) |
| t=True (Currently sick) | ||
| Pregnant | Pregnant status | f=False (Currently not pregnant) |
| t=True (Currently pregnant) | ||
| Thyroid surgery | Had thyroid surgery | f=False (Did not have thyroid surgery) |
| t=True (Had thyroid surgery) | ||
| I131 | Had I131 treatment | f=False (Did not have I131 treatment) |
| t=True (Had I131 treatment) | ||
| Query hypothyroid | Hypothyroidism statue | f=False (Do not have hypothyroidism) |
| t=True (Have hypothyroidism) | ||
| Query hyperthyroid | Hyperthyroidism status | f=False (Do not have hyperthyroidism) |
| t=True (Have hyperthyroidism) | ||
| Lithium | Lithium status | f=False (Do not have Lithium) |
| t=True (Have lithium) | ||
| Goiter | Goiter status | f=False (Do not have goiter) |
| t=True (Have goiter) | ||
| Tumor | Tumor status | f=False (Do not have tumor) |
| t=True (Have tumor) | ||
| Hypopituitary | Hypopituitary status | f=False (Do not have hypopituitary) |
| t=True (Have hypopituitary) | ||
| Psych | Psych status | f=False (Do not have psych) |
| t=True (Have psych) | ||
| TSH | TSH level intervals | TSH=Normal (0.27≤ TSH ≤4.2) |
| TSH=Abnormal (Not within the normal range) | ||
| T3 | T3 level intervals | T3=Normal (1.3≤ T3 ≤ 3.1) |
| T3=Abnormal (Not within the normal range) | ||
| TT4 | TT4 level intervals | TT4=Normal (62≤ TT4 ≤ 164) |
| TT4=Abnormal (Not within the normal range) | ||
| T4U | T4U level intervals | T4U=Normal (0.7≤ T4U ≤ 1.8) |
| T4U=Abnormal (Not within the normal range) | ||
| FTI | FTI level intervals | FTI=Normal (53≤ FTI ≤142) |
| FTI=Abnormal (Not within the normal range) | ||
| Class | Thyroid disease status | Negative (Does not have thyroid disease) |
| Positive (Has thyroid disease) |
Dataset-II Z-Alizadeh Sani Dataset Selected Attributes.
| Attributes | Descriptions | Values |
|---|---|---|
| Age | Age group intervals | ≤=50 (Less than or equal to 50) |
| >50 (Larger than 50) | ||
| Gender | Patient gender groups | M=Male, F=Female |
| DM | Diabetes Mellitus | 0=No (Currently not have diabetes mellitus) |
| 1=Yes (Currently has diabetes mellitus) | ||
| HTN | Hypertension | 0=No (Currently not have hypertension) |
| 1=Yes (Currently has hypertension) | ||
| Current Smoker | Current smoking status | 0=No (Currently does not smoke) |
| 1=Yes (Currently smokes) | ||
| Previous Smoker | Previous smoking status | 0=No (Did not smoke in the past) |
| 1=Yes (Used to smoke) | ||
| Obesity | Obesity disease status | N=No (Does not have obesity) |
| Y=Yes (Has obesity) | ||
| CRF | Chronic Renal Failure | N=No (Does not have chronic renal failure) |
| Y=Yes (Has chronic renal failure) | ||
| CVA | Cerebrovascular Accident | N=No (Did not have cerebrovascular accident) |
| Y=Yes (Had cerebrovascular accident) | ||
| Airway Disease | Airway disease status | N=No (Does not have airway disease) |
| Y=Yes (Has airway disease) | ||
| Edema | Edema status | 0=No (Without edema) |
| 1=Yes (With edema) | ||
| Lung Rales | Lung rales status | N=No (Without lung rales) |
| Y=Yes (With lung rales) | ||
| Dyspnea | Dyspnea status | N=No (Does not have dyspnea) |
| Y=Yes (Has dyspnea) | ||
| Cardiovascular Disease | Cardiovascular disease status | N=No (Does not have CAD) |
| Y=Yes (Has CAD) | ||
| Thyroid Disease | Thyroid disease status | N=Negative (Does not have thyroid disease) |
| Y=Positive (Has thyroid disease) |
Dataset-I Thyroid Disease Dataset Results.
| Groups | Antecedents | Consequent | Support | Confidence | |
|---|---|---|---|---|---|
| Healthy | I131=False, Hypopituitary = False | ⇒ | Negative | 0.98 | 1.00 |
| I131= False, Lithuim = False | ⇒ | Negative | 0.98 | 1.00 | |
| I131= False, On_antithyroid_med=False | ⇒ | Negative | 0.97 | 1.00 | |
| I131=False, Query_hyperthyroid=False | ⇒ | Negative | 0.92 | 1.00 | |
| I131= False, On_thyroxine=False | ⇒ | Negative | 0.86 | 1.00 | |
| Sick | T3=Abnormal | ⇒ | Positive | 0.88 | 1.00 |
| T3=Abnormal, Goiter=False | ⇒ | Positive | 0.87 | 1.00 | |
| T3=Abnormal, Query_hyperthyroid=False | ⇒ | Positive | 0.85 | 1.00 | |
| Female, T3=Abnormal | ⇒ | Positive | 0.80 | 1.00 | |
| Female, Thyroid_surgery=False | ⇒ | Positive | 0.61 | 1.00 |
Dataset-II Z-Alizadeh Dataset Results.
| Groups | Antecedents | Consequent | Support | Confidence | |
|---|---|---|---|---|---|
| Healthy | CRF=False,CVA=False | ⇒ | Negative | 0.97 | 1.00 |
| CRF=False,Ex_smoker=False | ⇒ | Negative | 0.95 | 1.00 | |
| CRF=False,Age>50,AD=False | ⇒ | Negative | 0.71 | 1.00 | |
| Age>50,Edema=False,LR=False | ⇒ | Negative | 0.70 | 1.00 | |
| AD=False,Diabetes_mellitus=False | ⇒ | Negative | 0.68 | 1.00 | |
| Sick | Hypertension=True | ⇒ | Positive | 0.71 | 1.00 |
| Obesity=True | ⇒ | Positive | 0.71 | 1.00 | |
| Hypertension=True,CRF=False | ⇒ | Positive | 0.71 | 1.00 | |
| Obesity=True,Ex_smoker=False | ⇒ | Positive | 0.71 | 1.00 |