| Literature DB >> 25894468 |
H Altay Güvenir1, Gizem Misirli2, Serdar Dilbaz3, Ozlem Ozdegirmenci4, Berfu Demir5, Berna Dilbaz5.
Abstract
In medicine, estimating the chance of success for treatment is important in deciding whether to begin the treatment or not. This paper focuses on the domain of in vitro fertilization (IVF), where estimating the outcome of a treatment is very crucial in the decision to proceed with treatment for both the clinicians and the infertile couples. IVF treatment is a stressful and costly process. It is very stressful for couples who want to have a baby. If an initial evaluation indicates a low pregnancy rate, decision of the couple may change not to start the IVF treatment. The aim of this study is twofold, firstly, to develop a technique that can be used to estimate the chance of success for a couple who wants to have a baby and secondly, to determine the attributes and their particular values affecting the outcome in IVF treatment. We propose a new technique, called success estimation using a ranking algorithm (SERA), for estimating the success of a treatment using a ranking-based algorithm. The particular ranking algorithm used here is RIMARC. The performance of the new algorithm is compared with two well-known algorithms that assign class probabilities to query instances. The algorithms used in the comparison are Naïve Bayes Classifier and Random Forest. The comparison is done in terms of area under the ROC curve, accuracy and execution time, using tenfold stratified cross-validation. The results indicate that the proposed SERA algorithm has a potential to be used successfully to estimate the probability of success in medical treatment.Entities:
Keywords: Classification; Clinical decision support system; Estimation of success; In vitro fertilization; Ranking
Mesh:
Year: 2015 PMID: 25894468 PMCID: PMC4768241 DOI: 10.1007/s11517-015-1299-2
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 2.602
Features in IVF dataset (N: numeric, C: Categorical, B: Binary)
| Variables related to female | Variables related to male | |
|---|---|---|
| Female_Age (N) | Laparoscopy (C) | Male_Factor (B) |
| Female_Blood_Type (C) | Hysteroscopy (C) | Male_Age (N) |
| Height (N) | Laparoscopic_Surgery (C) | Male_Blood_Type (C) |
| Weight (N) | Hysteroscopic_Surgery (C) | Male_Genital_Surgery (C) |
| BMI* (N) | Abdominal_Surgery (C) | Semen_Analysis_Category (C) |
| Tubal_Factor (B) | Abdominal_Surgery_Category (C) | Male_FSH (N) |
| Age_Related_Infertility (B) | Gynecologic_Surgery (C) | Sperm_Count (N) |
| Ovulatory_Dysfunction (B) | Ovarian_Surgery (C) | Sperm_Motility(N) |
| Unexplained_Infertility (B) | Tubal_Surgery (C) | Total_Progressive_Sperm_Count (N) |
| Severe_Pelvic_Adhesion (B) | Uterine_Surgery (C) | Sperm_Morphology (N) |
| Endometriosis (B) | Duration_Infertility (N) | Testicular_Biopsy (C) |
| Cycle_No. (N) | PCOS* (B) | TESE*_Outcome (C) |
| Baseline_FSH* (N) | HSG*_Cavity (C) | Male_Karyotype (C) |
| Baseline_LH* (N) | HSG*_Tubes* (C) | |
| Baseline_E2* (N) | Hydrosalpinx (C) | |
| G* (N) | Office_Hysteroscopy(C) | |
| A* (N) | Office_Hysteroscopic_Incision (B) | |
| Y* (N) | Office_Hysteroscopic_Procedure (C) | |
| DM* (C) | Total_Antral_Follicle_Count (N) | |
| HT* (B) | Right_Ovarian_Antral_Follicle_Count (N) | |
| Thyroid_Disease (C) | Left_Ovarian_Antral_Follicle_Count (N) | |
| Anemia (B) | Hyperprolactinemia (B) | |
| Laparotomy (C) | Hepatitis (C) | |
| Cyst_Aspiration (B) | Endometrioma_Surgery (C) | |
| Embryocryo (B) | Localization_Myoma_Uteri (C) |
* BMI body mass index, FSH follicle-stimulating hormone, LH luteinizing hormone, E2 estradiol, G gravida, A abortus, Y living children, DM diabetes mellitus, HT hypertension, PCOS polycystic ovary syndrome, HSG hysterosalpingography, TESE testicular sperm extraction
An example of computing score using RIMARC
| Feature | Feature weight | Feature value | Score value | Weighted score |
|---|---|---|---|---|
| Female_Age | 0.1753 | 25 | 0.2375 | 0.0416 |
| BMI | 0.1443 | 25.7 | 0.2169 | 0.0313 |
| Semen_Analysis_Category | 0.1407 | Astheno | 0.3571 | 0.0503 |
| Age_Related_Infertility | 0.1178 | No | 0.2245 | 0.0264 |
|
|
|
| ||
| Score( | ||||
Bold values indicate the results
Fig. 1Block diagram of the SERA algorithm
Results of AUC, accuracy and execution time on the IVF dataset
| Algorithm | AUC | Area under ROC curve | Accuracy | Execution | ||
|---|---|---|---|---|---|---|
| SE | 95 % Confidence interval | |||||
| Lower bound | Upper bound | |||||
| SERA | 0.833 (±0.003) | 0.012 | 0.809 (±0.003) | 0.857 (±0.004) | 0.844 (±0.004) | 1.4 (±0.2) |
| NBC | 0.794 (±0.002) | 0.014 | 0.767 (±0.002) | 0.822 (±0.002) | 0.783 (±0.002) | 0.8 (±0.1) |
| Random forest | 0.769 (±0.009) | 0.014 | 0.741 (±0.010) | 0.797 (±0.008) | 0.792 (±0.008) | 2.0 (±0.1) |
The values are mean (±SD) over 200 runs with random shuffling of the dataset (p < 0.0005 for all algorithms)
Fig. 2Effect of k on AUC and accuracy
Feature weights learned by RIMARC
| Feature | Weight | Feature | Weight |
|---|---|---|---|
| Laparoscopic_Surgery | 0.6455 | Laparotomy | 0.0909 |
| Total_Antral_Follicle_Count | 0.5498 | Male_Karyotype | 0.0834 |
| Right_Ovarian_Antral_Follicle_Count | 0.5163 | HSG_Tubes | 0.0783 |
| Left_Ovarian_Antral_Follicle_Count | 0.4934 | Myoma_Uteri | 0.0737 |
| Hysteroscopic_Surgery | 0.457 | Uterine_Surgery | 0.0711 |
| TESE_Outcome | 0.4254 | Sperm_Morphology | 0.0708 |
| Female_Age | 0.3957 | Abdominal_Surgery | 0.053 |
| Male_FSH | 0.3484 | Cycle_No | 0.0508 |
| Male_Blood_Type | 0.3118 | Tubal_Factor | 0.0496 |
| Male_Age | 0.2777 | Cyst_Aspiration | 0.0384 |
| Baseline_FSH | 0.2764 | Ovarian_Surgery | 0.0354 |
| PCOS | 0.2258 | Male_Factor | 0.0332 |
| Total_Progressive_Sperm_Count | 0.2151 | Endometrioma_Surgery | 0.0276 |
| Sperm_Count | 0.2098 | Abdominal_Surgery_Category | 0.0274 |
| Localization_Myoma_Uteri | 0.2021 | Thyroid_Disease | 0.0273 |
| Age_Related_Infertility | 0.1959 | Testicular_Biopsy | 0.0256 |
| Ovulatory_Dysfunction | 0.1791 | Laparoscopy | 0.0231 |
| Gynecologic_Surgery | 0.1779 | Hysteroscopy | 0.0197 |
| Semen_Analysis_Category | 0.1777 | DM | 0.0141 |
| Unexplained_Infertility | 0.1775 | Tubal_Surgery | 0.0125 |
| Duration_Infertility | 0.1567 | HT | 0.0122 |
| BMI | 0.1534 | Y | 0.012 |
| Height | 0.1339 | Endometriosis | 0.0118 |
| Weight | 0.1333 | Embryocryo | 0.0117 |
| Female_Blood_Type | 0.127 | Hydrosalpinx | 0.0104 |
| Office_Hysteroscopic_Procedure | 0.1245 | G | 0.0101 |
| Office_Hysteroscopy | 0.1238 | Office_Hysteroscopic_Incision | 0.01 |
| Baseline_LH | 0.1196 | A | 0.0093 |
| HSG_Cavity | 0.1048 | Hyperprolactinemia | 0.0085 |
| Male_Genital_Surgery | 0.1039 | Hepatitis | 0.0079 |
| Sperm_Motility | 0.096 | Severe_Pelvic_Adhesion | 0.0047 |
| Baseline _E2 | 0.095 | Anemia | 0.0004 |
Fig. 3Some rules learned by RIMARC