| Literature DB >> 34828379 |
Niyazi Senturk1,2, Gulten Tuncel2, Berkcan Dogan3,4, Lamiya Aliyeva3, Mehmet Sait Dundar5,6, Sebnem Ozemri Sag3, Gamze Mocan7, Sehime Gulsun Temel3,4,8, Munis Dundar9, Mahmut Cerkez Ergoren2,10.
Abstract
Artificial intelligence provides modelling on machines by simulating the human brain using learning and decision-making abilities. Early diagnosis is highly effective in reducing mortality in cancer. This study aimed to combine cancer-associated risk factors including genetic variations and design an artificial intelligence system for risk assessment. Data from a total of 268 breast cancer patients have been analysed for 16 different risk factors including genetic variant classifications. In total, 61 BRCA1, 128 BRCA2 and 11 both BRCA1 and BRCA2 genes associated breast cancer patients' data were used to train the system using Mamdani's Fuzzy Inference Method and Feed-Forward Neural Network Method as the model softwares on MATLAB. Sixteen different tests were performed on twelve different subjects who had not been introduced to the system before. The rates for neural network were 99.9% for training success, 99.6% for validation success and 99.7% for test success. Despite neural network's overall success was slightly higher than fuzzy logic accuracy, the results from developed systems were similar (99.9% and 95.5%, respectively). The developed models make predictions from a wider perspective using more risk factors including genetic variation data compared with similar studies in the literature. Overall, this artificial intelligence models present promising results for BRCA variations' risk assessment in breast cancers as well as a unique tool for personalized medicine software.Entities:
Keywords: BRCA1; BRCA2; artificial intelligence; breast cancer; translational fuzzy logic; variation
Mesh:
Substances:
Year: 2021 PMID: 34828379 PMCID: PMC8623958 DOI: 10.3390/genes12111774
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The flowchart of the Fuzzy logic system.
Values of membership functions for each input cluster. * VUS: Variant of unknown significance.
| Input Clusters (Risk Factors) | Membership Functions | Values [0,1] |
|---|---|---|
| Age | <15 | 0 |
| 15–19 | 0.25 | |
| 20–39 | 0.5 | |
| 40–59 | 0.75 | |
| ≥60 | 1 | |
| Sex | Male | 0 |
| Female | 1 | |
| Consanguinuty | No | 0 |
| Yes | 1 | |
| Family History | No | 0 |
| Yes | 1 | |
| Number of Family Member | 0 | 0 |
| 1 and 2 | 0.5 | |
| ≥3 | 1 | |
| Tumor Size | 0–19 cm | 0 |
| 20–39 cm | 0.5 | |
| ≥40cm | 1 | |
| Lymph Node | Negative | 0 |
| Positive | 1 | |
| Degree of Malignancy | Grade 1 | 0 |
| Grade 2 | 0.5 | |
| Grade 3 | 1 | |
| Position | Other | 0.25 |
| Right Breast | 0.5 | |
| Left Breast | 0.75 | |
| Both Breast | 1 | |
| Estrogen Receptor | Negative | 0 |
| Positive | 1 | |
| Progesterone | Negative | 0 |
| Positive | 1 | |
|
| Negative | 0 |
| Positive | 1 | |
|
| Negative | 0 |
| Positive | 1 | |
| Other Genes | Negative | 0 |
| Positive | 1 | |
| Diagnosis | No | 0 |
| Yes | 1 | |
| Classification | Benign | 0 |
| Likely Benign | 0.25 | |
| VUS * | 0.5 | |
| Likely Pathogenic | 0.75 | |
| Pathogenic | 1 |
The distribution of the genes among suiTable 268 patients.
| Gene | Number |
|---|---|
|
| 61 |
|
| 128 |
| 11 | |
| Other genes * | 68 |
| Total | 268 |
* Other genes: BLM, BARD1, RAD50, PALB2, MSH2, ATM, MLH1, MRE11A, PMS2, MUTHY, XRCC2, ATN, CDH1, BARD, FAM175A, EPCAM, PKD1, STK11, NBN, MSH2, CHEK2, MSH6, CDH2, BRIP1, PTEN, PIK3CA, MEN1, TP53 and RAD51D.
Figure 2The figure illustrates generated rules section within the Fuzzy Logic system. The upper rectangle box presents an example of the data from 268 patients used that used to train the system. Lower small square boxes show example the parameters (age, sex, consanguineous marriage, family history and number of family members) which were defined as input and membership functions within rule section.
The table shows the created output cluster for given variant classificiation values. * VUS: Variant of unknown significance.
| Membership Functions of Output Cluster | Values of Membership Functions |
|---|---|
| Benign | 0 |
| Likely Benign | 0.25 |
| VUS * | 0.5 |
| Likely Pathogenic | 0.75 |
| Pathogenic | 1 |
Figure 3The generated appearance of the output cluster using fuzzy logic interface on the MATLAB. Small-merged yellow boxes illustrate sixteen parameters that were introduced as inputs. The blue box shows the output part and determines five different variant classifications as membership functions. The Y-axis presents membership functions of output which can be determine according to the output score. The X-axis presents values of membership function between 0–1.
Figure 4Neural Network regression results of 268 patients. (a) The train success of the system using 160 patients (99.9%). (b) The test success of the system using 54 patient (99.7%). (c) The validation success of the system using remain 54 patients (~99.6%). (d) The overall success rate of the system (~99.9%). X-axis represented as output explain regressions data. Y-axis represented as target meaning success ratio between 0–1.
a. Obtained results from testing the system. b. Obtained results from testing the system.
|
| ||||||
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
| ||||
| Age | 43 | 36 | 44 | 42 | 34 | 33 |
| Sex | Female | Female | Female | Female | Female | Female |
| Consanguineous Marriage | Unknown | Unknown | Yes | Unknown | Yes | Unknown |
| Family History | Unknown | Unknown | Yes | Unknown | Yes | Yes |
| Number of Affected Family Member | Unknown | Unknown | 1 | Unknown | 1 | 3 |
| Tumor Size | 17.5 cm | 0–1 cm | Unknown | 6.6 cm | Unknown | Unknown |
| Lymp Node | No | No | Unknown | No | Unknown | Unknown |
| Degree of Malignancy | Unknown | Grade 2 | Unknown | Grade 3 | Unknown | Unknown |
| Tumor Location | Right Breast | Right Breast | Right Breast | Right Breast | Right Breast | Unknown |
| Estrogen Receptor Hormone | Unknown | Unknown | Unknown | Positive | Unknown | Unknown |
| Progesterone Hormone | Positive | Positive | Unknown | Negative | Unknown | Unknown |
|
| No | No | Yes | Yes | Yes | Yes |
|
| Yes | Yes | No | No | No | No |
| Other Genes | No | No | No | No | No | No |
| Diagnosis | Yes | Yes | Yes | Yes | Unknown | No |
| Fuzzy Logic Result | 90% (0.900) | 89% (0.890) | 90% (0.900) | 90% (0.900) | 66.1% (0.661) | 66.1% (0.661) |
| Neural Network Result | 99.9% (0.999) | 99.9% (0.999) | 99.9% (0.999) | 99.9% (0.999) | 77.8% (0.778) | 75.1% (0.751) |
|
| ||||||
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
| ||||
| Age | 38 | 42 | 58 | 58 | 32 | 40 |
| Sex | Female | Female | Female | Female | Female | Female |
| Consanguineous Marriage | Unknown | No | Unknown | Unknown | Yes | No |
| Family History | No | No | Yes | No | No | No |
| Number of Affected Family Member | 0 | 0 | Unknown | 0 | 0 | 0 |
| Tumor Size | 3–4 cm | 0.5 cm | Unknown | Unknown | Unknown | 30 cm |
| Lymp Node | No | No | Unknown | Yes | Yes | Yes |
| Degree of Malignancy | Grade 3 | Grade 2 | Grade 2 | Grade 2 | Unknown | Grade 2 |
| Tumor Location | Right Breast | Right Breast | Right Breast | Right Breast | Right Breast | Both |
| Estrogen Receptor Hormone | Positive | Positive | Positive | Positive | Positive | Positive |
| Progesterone Hormone | Positive | Positive | Positive | Positive | Positive | Positive |
|
| Yes | No | Yes | Yes | No | Yes |
|
| Yes | Yes | No | No | No | No |
| Other Genes | No | No | No | No | Yes | Yes |
| Diagnosis | Yes | Yes | Yes | Yes | Yes | Yes |
| Fuzzy Logic Result | 42.5% (0.425) | 48.9% (0.489) | 48.9% (0.489) | 57.1% (0.571) | 42.5% (0.425) | 42.5% (0.425) |
| Neural Network Result | 50.2% (0.502) | 49.9% (0.499) | 50.5% (0.505) | 50.3% (0.503) | 49.9% (0.499) | 51.0% (0.510) |