| Literature DB >> 32409705 |
N Rosato1, E Piccione2, V Bruno3, M D'Orazio4, C Ticconi2, P Abundo5, S Riccio2, E Martinelli4, E Zupi6, A Pietropolli2.
Abstract
RPL is a very debated condition, in which many issues concerning definition, etiological factors to investigate or therapies to apply are still controversial. ML could help clinicians to reach an objectiveness in RPL classification and access to care. Our aim was to stratify RPL patients in different risk classes by applying an ML algorithm, through a diagnostic work-up to validate it for the appropriate prognosis and potential therapeutic approach. 734 patients were enrolled and divided into 4 risk classes, according to the numbers of miscarriages. ML method, called Support Vector Machine (SVM), was used to analyze data. Using the whole set of 43 features and the set of the most informative 18 features we obtained comparable results: respectively 81.86 ± 0.35% and 81.71 ± 0.37% Unbalanced Accuracy. Applying the same method, introducing the only features recommended by ESHRE, a correct classification was obtained only in 58.52 ± 0.58%. ML approach could provide a Support Decision System tool to stratify RPL patients and address them objectively to the proper clinical management.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32409705 PMCID: PMC7224066 DOI: 10.1038/s41598-020-64512-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Clinical characteristics of studied population.
| Categories | n° patients | % | |
|---|---|---|---|
| 18–27 | 67 | 9.13 | |
| 28–32 | 149 | 20.30 | |
| 33–37 | 250 | 34.06 | |
| 38–42 | 209 | 28.47 | |
| ≥43 | 45 | 6.13 | |
| NaN | 14 | 1.91 | |
| 0 | 19 | 2.59 | |
| 1 | 79 | 10.76 | |
| 2 | 314 | 42.78 | |
| 3 | 194 | 26.43 | |
| 4 | 81 | 11.04 | |
| ≥5 | 47 | 6.40 | |
| <16.5 | 2 | 0.27 | |
| 16.5–18.49 | 27 | 3.69 | |
| 18.5–24.99 | 368 | 50.13 | |
| 25–30 | 132 | 17.99 | |
| >30 | 62 | 8.44 | |
| NaN | 143 | 19.48 | |
| Bicorporeal uterus- class U3 | 3 | 0.41 | |
| Septate uterus- class U2 | 53 | 7.22 | |
| Hemi uterus - class U4 | 4 | 0.54 | |
| Endometriosis | 17 | 2.31 | |
| Uterine Fibroids | 112 | 15.26 | |
| Intrauterine Synechiae | 19 | 2.59 | |
| Endometrial polyps | 45 | 6.13 | |
| Cervical-isthmic incontinence | 5 | 0.68 | |
| APCR | 215 | 29.29 | |
| Protein C | 58 | 7.9 | |
| Protein S | 108 | 14.7 | |
| AT III | 35 | 4.77 | |
| Factor V Leiden | |||
| • Eterozygosis | 22 | 3 | |
| • omozygosis | 13 | 1.77 | |
| Factor II | |||
| • Eterozygosis | 29 | 3.95 | |
| • omozygosis | 3 | 0.41 | |
| MTHFR + PAI-I | |||
| Score | |||
| • 1 | 101 | 13.76 | |
| • 2 | 191 | 26.02 | |
| • 3 | 129 | 17.57 | |
| • 4 | 46 | 6.27 | |
| • 5 | 1 | 0.14 | |
| omocysteinemia | 61 | 8.31 | |
| Diabetes | 30 | 4.09 | |
| PCOS | 28 | 3.81 | |
| Dysthiroidisms | 148 | 20.16 | |
| Smoking | 128 | 17.44 | |
| Maternal chromosomal abnormalities | 14 | 1.91 | |
| Paternal chromosomal abnormalities | 7 | 0.95 | |
| Infections | 102 | 13.9 | |
| LAC, ACI, Ab anti-β2 GP1 | |||
| Score: | |||
| • I Ab | 49 | 6.67 | |
| • 2 Abs | 6 | 0.82 | |
| • 3 Abs | 1 | 0.14 | |
| Ab anti-annexin V | 19 | 2.59 | |
| Ab anti-TPO, Ab anti-Tg | |||
| Score: | |||
| • I Ab | 84 | 11.44 | |
| • 2 Abs | 24 | 3.27 | |
| • NaN | 10 | 1.36 | |
| Ab anti-endomysium, Ab anti-transglutaminase, Ab anti-gliadin | |||
| Score: | |||
| • I Ab | 7 | 0.95 | |
| • 2 Abs | 2 | 0.27 | |
| • 3 Abs | 4 | 0.54 | |
| Score: | |||
| • Negative | 285 | 38.83 | |
| • 1:80 | 148 | 20.16 | |
| • 1:160 | 50 | 6.82 | |
| • 1:320 | 10 | 1.36 | |
| • 1:640 | 10 | 1.36 | |
| • 1:>640 | 2 | 0.27 | |
| • NaN | 229 | 31.2 | |
| ENA, AMA, ASMA | |||
| Score: | |||
| • 1 Ab | 66 | 8.99 | |
| • 2 Abs | 10 | 1.36 | |
For each clinical and etiological feature the incidence in our population is described (number and and percentage of patients). For coagulation abnormalities the incidence, in terms of numbers of patients with values which are outside the physiological ranges, is shown. In MTHFR + PAI-1 feature, a score which is the result of each single exam score (0 = negative for mutation, 1 = mutation in eterozygosis, 2 = mutation in omozygosis). Regarding APS, Abs anti-thyroid, Celiac disease, and systemic autoimmunity, the incidence for the presence in a single patient of only 1, or 2 or 3 antibodies of that one included in each category, is indicated.
Number of pregnancy losses and maternal age.
| Number of pregnancy losses | 0 | 1 | 2 | 3 | 4 | ≥5 |
|---|---|---|---|---|---|---|
| 18–27 (age) | 2 | 14 | 31 | 12 | 6 | 2 |
| 28–32 (age) | 8 | 15 | 71 | 33 | 16 | 6 |
| 33–37 (age) | 4 | 30 | 105 | 67 | 27 | 17 |
| 38–42 (age) | 1 | 14 | 90 | 63 | 26 | 15 |
| ≥43 | 0 | 4 | 15 | 17 | 3 | 6 |
Two-classes analysis overall results. Model Trained on all the available features selected in leave 200 out cross validation over 1000 iterations: Confusion matrix.
| 2-classes analysis | Predicted class | ||
|---|---|---|---|
| Healthy | Affected (RPL) | ||
| Real class | 83.71 ± 0.72% | 19.29 ± 0.72% | |
| 3.23 ± 0.16% | 96.76 ± 0.16% | ||
ACCub (unbalanced accuracy) = 93.51 ± 0.20%;
ACCb (balanced accuracy) =90.24 ± 0.36%.
Two-classes analysis overall results. Model Trained on the 18 features selected in leave 200 out cross validation over 1000 iterations: Confusion matrix.
| 2-classes analysis | Predicted class | ||
|---|---|---|---|
| Real class | 88.98 ± 0.66% | 11.01 ± 0.66% | |
| 1.29 ± 0.13% | 98.71 ± 0.13% | ||
ACCub (unbalanced accuracy) = 96.28 ± 0.19%;
ACCb (balanced accuracy) = 93.85 ± 0.34%.
Four-classes analysis overall results. Model Trained on the all the available features selected in leave 200 out cross validation over 1000 iterations: Confusion matrix.
| Predicted class | |||||
|---|---|---|---|---|---|
| 0 abortion | 1 abortion | 2–3 abortions | ≥4 abortions | ||
| Real class | 0 abortion | 91.46 ± 0.33% | 8.05 ± 0.21% | 0.47 ± 0.25% | 0.02 ± 0.06% |
| 1 abortion | 0.93 ± 0.23% | 89.39 ± 0.95% | 7.59 ± 0.83% | 2.08 ± 0.45% | |
| 2–3 abortions | 3.26 ± 0.22% | 11.82 ± 0.50% | 68.86 ± 0.76% | 16.06 ± 0.69% | |
| ≥4 abortions | 2.55 ± 0.07% | 8.67 ± 0.36% | 11.17 ± 0.85% | 77.62 ± 0.91% | |
ACCub (unbalanced accuracy) = 81.86 ± 0.35%;
ACCb (balanced accuracy) = 81.83 ± 0.35%.
Four-classes analysis. Model Trained on the 18 features selected in leave 200 out cross validation over 1000 iterations: Confusion matrix.
| Predicted class | |||||
|---|---|---|---|---|---|
| 0 abortion | 1 abortion | 2–3 abortions | ≥4 abortions | ||
| Real class | 0 abortion | 91.28 ± 0.34% | 8.06 ± 0.21% | 0.66 ± 0.27% | 0.003 ± 0.02% |
| 1 abortion | 0.91 ± 0.24% | 88.97 ± 0.95% | 8.15 ± 0.81% | 1.96 ± 0.41% | |
| 2–3 abortions | 4.16 ± 0.26% | 12.98 ± 0.45% | 70.12 ± 0.72% | 12.75 ± 0.63% | |
| ≥4 abortions | 2.63 ± 0.17% | 9.94 ± 0.41% | 11.09 ± 0.89% | 76.35 ± 0.94% | |
ACCub (unbalanced accuracy) = 81.71 ± 0.37%;
ACCb (balanced accuracy) = 81.55 ± 0.37%.
Comparison between the two different diagnostic work-up: ESHRE guidelines versus our RPL Unit recommendations.
| ESHRE guidelines | Our diagnostic work-up |
|---|---|
| Medical, obstetric and family anamnesis & Clinical characteristics (e.g. maternal age, BMI) and lifestyle | |
| Gynecologic clinical examination | |
| High resolution maternal and paternal karyotypes | |
| Factor V Leiden and factor II mutations, MTHFR A 1298 C and C 677 T polymorphisms, PAI-I mutation, APCR, protein C and S, ATIII, omocysteinemia | |
| Lupus anticoagulant (LAC), anti-cardiolipin Abs (IgM, IgG, IgA) | |
| Anti-β2 glycoprotein I and anti-annexin V Abs | |
| ANA | |
| TSH assay, anti-TPO and anti-Tg Abs | |
| FT3 and FT4 assays | |
| 3D transvaginal ultrasound | |
| Diagnostic hysteroscopy ± endometrial biopsy | |
| Vaginal-cervical swabs | |
| Oral glucose tolerance test and glycated haemoglobolin | |
| Anti-gliadin, anti-transglutaminase and anti-endomysium Abs | |
| Systemic autoimmunity: ASMA, ENA, anti-dsDNA Abs, AMA |
Four-classes analysis overall results: confusion matrix from ESHRE guidelines diagnostic work-up.
| Predicted class | |||||
|---|---|---|---|---|---|
| 1 st | 2 nd | 3 th | 4th | ||
| Real Class | 76.20 ± 1.04% | 13.32 ± 0.70% | 3.89 ± 0.56% | 6.59 ± 0.82% | |
| 15.92 ± 0.83% | 52.19 ± 1.29% | 10.40 ± 0.92% | 20.48 ± 1.10% | ||
| 11.00 ± 0.37% | 18.66 ± 0.61% | 44.43 ± 2.17% | 25.91 ± 2.16% | ||
| 6.74 ± 0.38% | 13.53 ± 0.56% | 19.50 ± 1.25% | 60.22 ± 1.29% | ||
ACCub (unbalanced accuracy) = 58.52 ± 0.58%;
ACCb (balanced accuracy) = 58.51 ± 0.58%.
Figure 1Method steps flow chart.
Discretization of continuous features.
| Risk Classes | 0 | 1 | 2 | 3 | 4 | NaN for each feature |
|---|---|---|---|---|---|---|
| 18–27 | 28–32 | 33–37 | 38–42 | >42 | 14 | |
| 2.4–3.5 | 1.84–2.39 & 3.51–4.05 | 1.28–1.83 & 4.06–4.61 | 0.16–1.27 & 4.62–5.73 | <0.16 & ≥ 5.74 | 393 | |
| 70–130 | 55–69.99 & 130.01–144.99 | 40–54.99 & 145–159.99 | 25–39.99 & 160–175 | <25 & ≥ 175 | 271 | |
| 53–109 | 38–52.99 & 109.01–123.99 | 23–37.99 & 124–138.99 | 8–22.99 & 139–154 | <8 & ≥ 154 | 263 | |
| 80–120 | 65–79.99 & 120.01–134.99 | 50–64.99 & 135–149.99 | 35–49.99 & 150–165 | <35 & ≥ 165 | 364 | |
| 5–12 | 4–4.99 & 12.01–14.99 | 3–3.99 & 15–17.99 | 2–2.99 & 18–21 | <2 & ≥ 21 | 272 | |
| 0.5–3.8 | 0.4–0.49 & 3.81–4.79 | 0.3–0.39 & 4.8–7.79 | 0.1–0.29 & 7.8–17.99 | <0.1 & ≥ 18 | 200 | |
| 18.5–24.99 | 16.5–18.49 | <16.5 | 25–30 | >30 | 143 |
Continuous features were discretized in 5 risk classes ranging from normal (risk class 0) up to high risk (risk class 4). The last column shows the number of NaN (Not a Number) for each feature present in the dataset. The NaN were set to 0 because they corresponded to the case in which the doctor found it was not necessary that exam for that patient and so the feature was in the normal range (see Materials and Methods, Discretization of continuous features).
The first and the second column contain the number and the name of the features. The third column reports which features are suggest from the guidelines. The third column reports which features are taken in our work-up. The last column reports which feature are selected by the feature selection algorithm.
| FEATURES | Work-up Guidelines | Our Work-up | Features selected | |
|---|---|---|---|---|
| 1 | Age | ✓ | ✓ | ✓ |
| 2 | BMI | ✓ | ✓ | ✓ |
| 3 | Smoke | ✓ | ✓ | ✓ |
| 4 | Voluntary termination of pregnancy | ✗ | ✓ | ✓ |
| 5 | Cytomegalovirus infection | ✗ | ✓ | ✓ |
| 6 | Escherichia Coli infection | ✗ | ✓ | ✗ |
| 7 | Staphylococcus infection | ✗ | ✓ | ✗ |
| 8 | Mycoplasma infection | ✗ | ✓ | ✗ |
| 9 | Ureaplasma infection | ✗ | ✓ | ✗ |
| 10 | Herpes virus infection | ✗ | ✓ | ✗ |
| 11 | Toxoplasma infection | ✗ | ✓ | ✓ |
| 12 | Steptococcus infection | ✗ | ✓ | ✗ |
| 13 | Gardnerella infection | ✗ | ✓ | ✗ |
| 14 | Candida infection | ✗ | ✓ | ✗ |
| 15 | Bicorporeal uterus-class U3 | ✓ | ✓ | ✗ |
| 16 | Bicorporeal uterus-classe U3bC2 | ✓ | ✓ | ✗ |
| 17 | Septate uterus-class U4 | ✓ | ✓ | ✗ |
| 18 | Hemi uterus-class U4 | ✓ | ✓ | ✗ |
| 19 | Endometriosis | ✗ | ✓ | ✗ |
| 20 | Uterine Fibroids | ✓ | ✓ | ✓ |
| 21 | Intrauterine Synechiae | ✓ | ✓ | ✗ |
| 22 | Cervical-isthmic incontinence | ✓ | ✓ | ✗ |
| 23 | Endometrial Polyps | ✓ | ✓ | ✗ |
| 24 | Diabetes | ✗ | ✓ | ✓ |
| 25 | PCOS (Polycystic Ovary Syndrome) | ✗ | ✓ | ✗ |
| 26 | Thyroid disorders | ✓ | ✓ | ✓ |
| 27 | APCR (Activated Protein C Resistence) | ✗ | ✓ | ✓ |
| 28 | Protein C | ✗ | ✓ | ✓ |
| 29 | Protein S | ✗ | ✓ | ✓ |
| 30 | Antithrombin III | ✗ | ✓ | ✗ |
| 31 | Antiphospholipid antibodies: | ✓ | ✓ | ✓ |
| - LAC (Lupus Anti Coagulant) | ✓ | |||
| - ACL (anti-cardiolipin Antibodies) | ✓ | |||
| - Ab anti-β2-GPI (anti-β2glycoprotein 1 Antibodies) | ✗ | |||
| 32 | ANA (Anti-nuclear antibodies) | ✗ | ✓ | ✓ |
| 33 | Systemic autoimmunity: - ENA (Extractable Nuclear Antigens) - ASMA (Anti-Smooth Muscle Antibodies) - AMA (Anti-mitochondrial Antibodies) - Ab anti-dsDNA (Anti-double stranded DNA Antibodies) | ✗ | ✓ | ✓ |
| 34 | Factor V Leiden Mutation | ✗ | ✓ | ✗ |
| 35 | Factor II Mutation | ✗ | ✓ | ✗ |
| 36 | Trombophilic polymorphisms: - MTHFR (Metilen Tetra-HydroFolate Reductase) C677T - MTHFR A1298C - PAI 1 (Plasminogen Activator Inhibitor 1) | ✗ | ✓ | ✓ |
| 37 | Homocysteine | ✗ | ✓ | ✓ |
| 38 | Anti-annexin V Antibodies | ✗ | ✓ | ✗ |
| 39 | Suspected celiac disease: - Anti-gliadin Antibodies - Anti-endomysium Antibodies - Anti-transglutaminase Antibodies | ✗ | ✓ | ✗ |
| 40 | Thyroid autoimmunity: | ✓ | ✓ | ✓ |
| - Ab anti-TPO (Anti-Thyroid Peroxidase Antibodies) | ✓ | |||
| - Ab anti-TG (Anti-Thyroglobulin Antibodies) | ✗ | |||
| 41 | Maternal chromosomal abnormalities | ✗ | ✓ | ✗ |
| 42 | Paternal chromosomal abnormalities | ✗ | ✓ | ✗ |
| 43 | TSH (Thyroid-Stimulating Hormone) | ✓ | ✓ | ✗ |
The outcome of SVM model, the number of preceding pregnancy losses, is considered as a fundamental feature for the prognosis by the guidelines.