| Literature DB >> 29467498 |
Liron Yoffe1, Avital Gilam1, Orly Yaron1, Avital Polsky1, Luba Farberov1, Argyro Syngelaki2, Kypros Nicolaides2, Moshe Hod1,3, Noam Shomron4.
Abstract
Preeclampsia is one of the most dangerous pregnancy complications, and the leading cause of maternal and perinatal mortality and morbidity. Although the clinical symptoms appear late, its origin is early, and hence detection is feasible already at the first trimester. In the current study, we investigated the abundance of circulating small non-coding RNAs in the plasma of pregnant women in their first trimester, seeking transcripts that best separate the preeclampsia samples from those of healthy pregnant women. To this end, we performed small non-coding RNAs sequencing of 75 preeclampsia and control samples, and identified 25 transcripts that were differentially expressed between preeclampsia and the control groups. Furthermore, we utilized those transcripts and created a pipeline for a supervised classification of preeclampsia. Our pipeline generates a logistic regression model using a 5-fold cross validation on numerous random partitions into training and blind test sets. Using this classification procedure, we achieved an average AUC value of 0.86. These findings suggest the predictive value of circulating small non-coding RNA in the first trimester, warranting further examination, and lay the foundation for producing a novel early non-invasive diagnostic tool for preeclampsia, which could reduce the life-threatening risk for both the mother and fetus.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29467498 PMCID: PMC5821867 DOI: 10.1038/s41598-018-21604-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinical characteristics of PE and control groups.
| Characteristic | Control (n = 40) | PE (n = 35) | p-value |
|---|---|---|---|
| Maternal age, years (IQR) | 31.3 (25.9–34.6) | 29.9 (28.1–34.5) | 0.9200 |
| Body Mass Index, kg/m2 (IQR) | 24.1 (22.6–28.7) | 28.4 (24.4–31.7) | 0.0125 |
| Gestational age, weeks (IQR) | 12.8 (12.3–13.2) | 12.7 (12.2–13.1) | 0.3916 |
| Crown-rump length, mm (IQR) | 63.9 (57.5–70.2) | 62.9 (56.2–67.9) | 0.3029 |
| Mean arterial pressure (MAP), mm Hg (IQR) | 83.2 (79.4–87.7) | 97.1 (90.1–108.7) | <0.0001 |
| Uterine artery pulsatility index (UT PI) (IQR) | 1.5 (1.3–1.7) | 2.5 (2–2.8) | <0.0001 |
| Gestational age at delivery, weeks (IQR) | 39.8 (39.4–40.7) | 31.4 (29.4–33.2) | <0.0001 |
| Birth weight, g (IQR) | 3,420 (3,198–3,578) | 1,222 (951–1,565) | <0.0001 |
| Fetal Gender, n(%) | 0.65 | ||
| female | 22 (55) | 17 (42.5) | |
| male | 18 (45) | 18 (45) | |
| Ethnicity, n(%) | 0.2848 | ||
| Afro-Caribbean | 15 (37.5) | 19 (54.3) | |
| South Asian | 1 (2.5) | 1 (2.9) | |
| Caucasian | 24 (60) | 15 (42.9) | |
| Cigarette smokers, n(%) | 0.11 | ||
| No smoker | 34 (85) | 34 (97.1) | |
| Smoker | 6 (15) | 1 (2.9) | |
| Family history of preeclampsia, n(%) | 0.41 | ||
| Yes | 2 (5) | 4 (11.4) | |
| No | 38 (95) | 31 (88.6) | |
| Parity, n(%) | 0.0018 | ||
| Multiparous with no previous PE | 15 (37.5) | 6 (17.1) | |
| Multiparous with previous PE | 0 (0) | 8 (22.9) | |
| Nulliparous | 25 (62.5) | 21 (60) | |
| Chronic hypertension, n(%) | 0.001 | ||
| Yes | 0 (0) | 8 (22.9) | |
| No | 40 (100) | 27 (77.1) |
A comparison of maternal and pregnancy characteristics between the two groups: pregnant women that have developed preeclampsia (PE) and pregnant women with uncomplicated pregnancies (control). P-values were calculated using Fisher exact test for categorical variables, and using Mann-Whitney U test for continuous variables.
Differentially expressed small ncRNAs in PE vs. control sample.
| Transcript | Transcript ID | Transcript Biotype | Mean Counts | Fold Change | Adjusted | |
|---|---|---|---|---|---|---|
| mitochondrially encoded tRNA proline | ENST00000387461 | Mitochondrial tRNA | 490 | 4.25 | 1.65 × 10−16 | 1.57 × 10−14 |
| mitochondrially encoded tRNA lysine | ENST00000387421 | Mitochondrial tRNA | 433 | 2.27 | 3.43 × 10−6 | 1.63 × 10−4 |
| microRNA 182 | ENST00000385255 | miRNA | 1,325 | 0.54 | 5.45 × 10−6 | 1.73 × 10−4 |
| microRNA 10b | ENST00000385011 | miRNA | 7115 | 0.50 | 8.96 × 10−6 | 2.13 × 10−4 |
| mucin 2, oligomeric mucus/gel−forming | ENST00000361558 | processed transcript | 901 | 2.34 | 1.68 × 10−5 | 3.19 × 10−4 |
| microRNA 25 | ENST00000384816 | miRNA | 5,585 | 0.61 | 5.38 × 10−5 | 6.39 × 10−4 |
| RP11–259O2.3-001 | ENST00000514519 | lincRNA | 409 | 2.97 | 4.92 × 10−5 | 6.39 × 10−4 |
| microRNA 4433b | ENST00000581329 | miRNA | 473 | 1.71 | 4.98 × 10−5 | 6.39 × 10−4 |
| mitochondrially encoded tRNA histidine | ENST00000387441 | Mitochondrial tRNA | 247 | 1.95 | 9.21 × 10−5 | 9.72 × 10−4 |
| HELLP associated long non-coding RNA | ENST00000626826 | macro lncRNA | 729 | 2.02 | 1.08 × 10−4 | 1.03 × 10−3 |
| microRNA 99b | ENST00000384819 | miRNA | 344 | 0.65 | 1.57 × 10−4 | 1.31 × 10−3 |
| microRNA 143 | ENST00000385300 | miRNA | 1,632 | 0.62 | 1.66 × 10−4 | 1.31 × 10−3 |
| mitochondrially encoded tRNA valine | ENST00000387342 | Mitochondrial tRNA | 664 | 1.99 | 2.11 × 10−4 | 1.54 × 10−3 |
| microRNA 151a | ENST00000521276 | miRNA | 10,021 | 0.75 | 5.68 × 10−4 | 3.85 × 10−3 |
| microRNA 191 | ENST00000384873 | miRNA | 31,187 | 0.75 | 6.26 × 10−4 | 3.97 × 10−3 |
| RNA, 5.8 S ribosomal pseudogene 4 | ENST00000365096 | rRNA | 1,652 | 1.68 | 1.65 × 10−3 | 9.21 × 10−3 |
| mitochondrially encoded tRNA serine 2 (AGU/C) | ENST00000387449 | Mitochondrial tRNA | 1,250 | 1.72 | 1.61 × 10−3 | 9.21 × 10−3 |
| microRNA 146b | ENST00000365699 | miRNA | 1,323 | 0.75 | 2.67 × 10−3 | 1.41 × 10−2 |
| microRNA 221 | ENST00000385135 | miRNA | 587 | 1.44 | 3.97 × 10−3 | 1.98 × 10−2 |
| mitochondrially encoded tRNA tyrosine | ENST00000387409 | Mitochondrial tRNA | 173 | 1.53 | 4.41 × 10−3 | 2.09 × 10−2 |
| mitochondrially encoded 16 S RNA | ENST00000387347 | Mitochondrial tRNA | 6,218 | 1.63 | 4.82 × 10−3 | 2.18 × 10−2 |
| microRNA let-7g | ENST00000362280 | miRNA | 1,073 | 1.27 | 9.85 × 10−3 | 4.26 × 10−2 |
| long intergenic non-protein coding RNA 324 | ENST00000315707 | lincRNA | 1,087 | 1.50 | 1.03 × 10−2 | 4.27 × 10−2 |
| AC113133.1-201 (microRNA-486) | ENST00000612171 | miRNA | 17,569 | 0.70 | 1.11 × 10−2 | 4.38 × 10−2 |
| AC020956.3-001 | ENST00000614316 | lincRNA | 902 | 1.71 | 1.18 × 10−2 | 4.47 × 10−2 |
Each of the most highly abundant transcripts in the women’s plasma was tested for differential expression in 35 PE vs. 40 control samples, and 25 transcripts were found to be differentially expressed (FDR adjusted p-value < 0.05).
Figure 1Normalized counts for differentially expressed transcripts in PE vs. control samples. Each of the most highly abundant transcripts in 35 PE vs. 40 control samples was tested for differential expression, and 25 transcripts were found to be differentially expressed (adjusted p-value < 0.05). Normalized counts are presented as violin and box plots. The upper and lower limits of the boxes represent the 75th and 25th percentiles. The upper and lower whiskers represent maximum and minimum values. The median is indicated by the line in each box. Outliers are indicated by circles.
Figure 2Schematic diagram of the workflow for PE/control samples classification. Data is randomly divided into a training set and a test set. 5-fold cross validation procedure is used on the training set to obtain a logistic regression model that best classifies training-set samples, and then it is tested on the blind test set. This process is repeated a 100 times, each time with a random partition to training and test set, in order to increase the stability and generalization of the results, and to estimate the goodness of the procedure on a new blind data set. The classification accuracy in the blind test set and related statistics are calculated in each of the iterations, and are summarized for overall evaluation of the pipeline.
Figure 3Classification results on real and permutated data sets. Density plots of statistical measures obtained by 100 iterations of our classification procedure on real (blue) and permutated (red) data sets. Real dataset included 35 PE and 40 control samples. Permutated dataset included the same samples after random shuffling of their conditions (i.e., PE/control). Means are indicated as well. Sensitivity: true positives out of all positives; Specificity: true negatives out of all negatives; Accuracy: true classifications out of all classifications; Matthews’s correlation coefficient (MCC): a correlation coefficient between the observed and predicted binary classifications; AUC: area under the ROC curve; F1 Score: the harmonic mean of precision and sensitivity; Positive Likelihood Ratio: sensitivity/(1-specificity); Negative Likelihood Ratio: (1-sensitivity)/specificity.