Chuanwu Zhang, Lili Garrard, John Keighley, Susan Carlson, Byron Gajewski.
Abstract
BACKGROUND: Although the association between the severity of early preterm birth (ePTB) and related severe diseases is widely recognized, little is known about the potential risk factors for ePTB or about the subpopulations at high risk of ePTB. Motivated by a future confirmatory clinical trial that will test whether supplementing pregnant women with docosahexaenoic acid (DHA) affects ePTB prevalence differently in a high-risk subgroup, this study aims to identify potential risk subgroups and risk factors for ePTB, defined as birth before 34 weeks of gestation.
Keywords: Classification and regression tree; Early preterm birth; Enrichment trial design; Interaction; Logistic regression; Risk factor
Year: 2017 PMID: 28068927 PMCID: PMC5223445 DOI: 10.1186/s12884-016-1189-0
Source DB: PubMed Journal: BMC Pregnancy Childbirth ISSN: 1471-2393 Impact factor: 3.007
Subject demography information
| Variable | Newborn Gestational Age: < 34 weeks (ePTB) | Newborn Gestational Age: ≥ 34 weeks |
|---|---|---|
| Mothers’ Age (%) | ||
| ≤ 24 Years | 40711 (30.38) | 1094793 (28.36) |
| 25-29 Years | 34831 (25.99) | 1112643 (28.82) |
| 30-34 Years | 33578 (25.06) | 1049775 (27.19) |
| ≥ 35 Years | 24889 (18.57) | 603652 (15.64) |
| Mothers’ Nativity (%) | ||
| Born in U.S. | 107578 (80.28) | 2996531 (77.61) |
| Born Outside U.S. /Unknown/Not Stated | 26431 (19.72) | 864332 (22.39) |
| Mothers’ Race (%) | ||
| White | 88185 (65.81) | 2938466 (76.11) |
| Black | 36554 (27.28) | 603921 (15.64) |
| American Indian/Alaskan Native/Asian or Pacific Islander | 9270 (6.92) | 318476 (8.25) |
| Mothers’ Hispanic Origin (%) | ||
| Non-Hispanic/Hispanic Origin Not Stated | 105011 (78.36) | 2968422 (76.88) |
| Hispanic | 28998 (21.64) | 892441 (23.12) |
| Marital Status (%) | ||
| Married | 65594 (48.95) | 2323620 (60.18) |
| Unmarried | 68415 (51.05) | 1537243 (39.82) |
| Mothers’ Education (%) | ||
| ≤ High School or GED/Unknown | 62819 (46.88) | 1512489 (39.17) |
| Associate/Some College Credit | 37338 (27.86) | 1086153 (28.13) |
| ≥ Bachelor's | 29145 (21.75) | 1124077 (29.11) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Pre-pregnancy Smoking Status (%) | ||
| Nonsmoker | 108663 (81.09) | 3258557 (84.40) |
| Smoker/Unknown/Not Stated | 20639 (15.40) | 464162 (12.02) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Pre-pregnancy BMI (%) | ||
| Under Weight-Normal ≤ 24.9 | 55824 (41.66) | 1785913 (46.26) |
| Overweight 25.0-29.9 | 30288 (22.60) | 918380 (23.79) |
| Obesity ≥ 30.0/Unknown/Not Stated | 43190 (32.23) | 1018426 (26.38) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Pre-pregnancy Diabetes Status (%) | ||
| No/Unknown/Not Stated | 126901 (94.70) | 3694967 (95.70) |
| Yes | 2401 (1.79) | 27752 (0.72) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Pre-pregnancy Hypertension Status (%) | ||
| No/Unknown/Not Stated | 123932 (92.48) | 3667289 (94.99) |
| Yes | 5370 (4.01) | 55430 (1.44) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Previous Preterm Birth Status (%) | ||
| No/Unknown/Not Stated | 118468 (88.40) | 3626879 (93.94) |
| Yes | 10834 (8.08) | 95840 (2.48) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Infertility Treatment Usage Status (%) | ||
| No/Unknown/Not Stated | 122859 (91.68) | 3669850 (95.05) |
| Yes | 6443 (4.81) | 52869 (1.37) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Fertility Enhancing Drug Usage Status (%) | ||
| No/Not Applicable/Unknown/Not Stated | 126582 (94.46) | 3697856 (95.78) |
| Yes | 2720 (2.03) | 24863 (0.64) |
| Missing | 4707 (3.51) | 138144 (3.58) |
| Delivery Payment Source (%) | ||
| Medicaid | 65048 (48.54) | 1598851 (41.41) |
| Private Insurance | 51753 (38.62) | 1771814 (45.89) |
| Self-pay/Other/Unknown | 12501 (9.33) | 352054 (9.12) |
| Missing | 4707 (3.51) | 138144 (3.58) |
Univariate difference between training sample and validation sample
| Variables | Training Cohort | Validation Cohort |
|---|---|---|
| Mothers’ Age (%) | ||
| ≤ 24 Years | 794486 (28.41) | 341018 (28.45) |
| 25-29 Years | 803113 (28.72) | 344361 (28.73) |
| 30-34 Years | 758087 (27.11) | 325266 (27.14) |
| ≥ 35 Years | 440725 (15.76) | 187816 (15.67) |
| Mothers’ Nativity (%) | ||
| Born in U.S. | 2172903 (77.70) | 931206 (77.70) |
| Born Outside U.S. /Unknown/Not Stated | 623508 (22.30) | 267255 (22.30) |
| Mothers’ Race (%) | ||
| White | 2119115 (75.78) | 907536 (75.73) |
| Black | 447972 (16.02) | 192503 (16.06) |
| American Indian/Alaskan Native/Asian or Pacific Islander | 229324 (8.20) | 98422 (8.21) |
| Mothers’ Hispanic Origin (%) | ||
| Non-Hispanic/Hispanic Origin Not Stated | 2151766 (76.95) | 921667 (76.90) |
| Hispanic | 644645 (23.05) | 276794 (23.10) |
| Marital Status (%) | ||
| Married | 1672583 (59.81) | 716631 (59.80) |
| Unmarried | 1123828 (40.19) | 481830 (40.20) |
| Mothers’ Education (%) | ||
| ≤ High School or GED/Unknown | 1102757 (39.43) | 472551 (39.43) |
| Associate/Some College Credit | 786618 (28.13) | 336873 (28.11) |
| ≥ Bachelor's | 806822 (28.85) | 346400 (28.90) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Pre-pregnancy Smoking Status (%) | ||
| Nonsmoker | 2357285 (84.30) | 1009935 (84.27) |
| Smoker/Unknown/Not Stated | 338912 (12.12) | 145889 (12.17) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Pre-pregnancy BMI (%) | ||
| Under Weight-Normal ≤ 24.9 | 1288811 (46.09) | 552926 (46.14) |
| Overweight 25.0-29.9 | 664673 (23.77) | 283995 (23.70) |
| Obesity ≥ 30.0/Unknown/Not Stated | 742713 (26.56) | 318903 (26.61) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Pre-pregnancy Diabetes Status (%) | ||
| No/Unknown/Not Stated | 2675048 (95.66) | 1146820 (95.69) |
| Yes | 21149 (0.76) | 9004 (0.75) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Pre-pregnancy Hypertension Status (%) | ||
| No/Unknown/Not Stated | 2653410 (94.89) | 1137811 (94.94) |
| Yes | 42787 (1.53) | 18013 (1.50) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Previous Preterm Birth Status (%) | ||
| No/Unknown/Not Stated | 2621496 (93.75) | 1123851 (93.77) |
| Yes | 74701 (2.67) | 31973 (2.67) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Infertility Treatment Usage Status (%) | ||
| No/Unknown/Not Stated | 2654757 (94.93) | 1137952 (94.95) |
| Yes | 41440 (1.48) | 17872 (1.49) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Fertility Enhancing Drug Usage Status (%) | ||
| No/Not Applicable/Unknown/Not Stated | 2676910 (95.73) | 1147528 (95.75) |
| Yes | 19287 (0.69) | 8296 (0.69) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Delivery Payment Source (%) | ||
| Medicaid | 1164617 (41.65) | 499282 (41.66) |
| Private Insurance | 1276362 (45.64) | 547205 (45.66) |
| Self-pay/Other/Unknown | 255218 (9.13) | 109337 (9.12) |
| Missing | 100214 (3.58) | 42637 (3.56) |
| Newborn Gestational Age (%) | ||
| < 34 weeks: ePTB | 93751 (3.35) | 40258 (3.36) |
| ≥ 34 weeks | 2702660 (96.65) | 1158203 (96.64) |
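The comparability of the two cohorts can be checked with simple arithmetic on the counts in the last two rows. The sketch below is illustrative only: the variable and function names are assumptions, and the split proportion is derived from the table counts rather than stated in the source.

```python
# Counts copied from the "Newborn Gestational Age" rows of the table above.
training = {"lt_34_weeks": 93751, "ge_34_weeks": 2702660}
validation = {"lt_34_weeks": 40258, "ge_34_weeks": 1158203}


def prevalence(cohort):
    """Share of births before 34 weeks (ePTB) in a cohort."""
    total = cohort["lt_34_weeks"] + cohort["ge_34_weeks"]
    return cohort["lt_34_weeks"] / total


n_train = sum(training.values())
n_valid = sum(validation.values())

# Fraction of all records that fall in the training cohort.
train_fraction = n_train / (n_train + n_valid)

print(f"training n = {n_train}, ePTB = {100 * prevalence(training):.2f}%")
print(f"validation n = {n_valid}, ePTB = {100 * prevalence(validation):.2f}%")
print(f"training share of all records = {train_fraction:.3f}")
```

The near-identical ePTB prevalence in both cohorts is consistent with the percentages reported in the table.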
The estimate and adjusted OR of logistic regression analysis on the training cohort
| Parameter | Estimate | Adjusted OR (95% CI) | P value |
|---|---|---|---|
| Intercept | -3.7154 | - | <.0001 |
| Mothers’ Age (%) | |||
| ≤ 24 Years | - | 1.0 (1.0–1.0) | - |
| 25-29 Years | 0.0129 | 1.013 (0.995, 1.032) | 0.169 |
| 30-34 Years | 0.1221 | 1.130 (1.108, 1.152) | <.0001 |
| ≥ 35 Years | 0.3034 | 1.354 (1.325, 1.385) | <.0001 |
| Mothers’ Nativity (%) | |||
| Born in U.S. | - | 1.0 (1.0–1.0) | - |
| Born Outside U.S. /Unknown/Not Stated | -0.1274 | 0.880 (0.863, 0.898) | <.0001 |
| Mothers’ Race (%) | |||
| White | - | 1.0 (1.0–1.0) | - |
| Black | 0.5727 | 1.773 (1.743, 1.803) | <.0001 |
| American Indian/Alaskan Native/Asian or Pacific Islander | 0.0917 | 1.096 (1.066, 1.127) | <.0001 |
| Mothers’ Hispanic Origin (%) | |||
| Non-Hispanic/Hispanic Origin Not Stated | - | 1.0 (1.0–1.0) | - |
| Hispanic | 0.0323 | 1.033 (1.013, 1.053) | 0.009 |
| Marital Status (%) | |||
| Married | - | 1.0 (1.0–1.0) | - |
| Unmarried | 0.2819 | 1.326 (1.304, 1.347) | <.0001 |
| Mothers’ Education (%) | |||
| ≤ High School or GED/Unknown | - | 1.0 (1.0–1.0) | - |
| Associate/Some College Credit | -0.1725 | 0.842 (0.828, 0.856) | <.0001 |
| ≥ Bachelor's | -0.3382 | 0.713 (0.698, 0.729) | <.0001 |
| Missing | 0.0031 | 1.003 (0.966, 1.042) | 0.8727 |
| Pre-pregnancy Smoking Status (%) a | |||
| Nonsmoker | - | 1.0 (1.0–1.0) | - |
| Smoker/Unknown/Not Stated | 0.1677 | 1.183 (1.160, 1.206) | <.0001 |
| Pre-pregnancy BMI (%) a | |||
| Under Weight-Normal ≤24.9 | - | 1.0 (1.0–1.0) | - |
| Overweight 25.0-29.9 | -0.0174 | 0.983 (0.966, 1.000) | 0.0472 |
| Obesity ≥30.0/Unknown/Not Stated | 0.1195 | 1.127 (1.109, 1.145) | <.0001 |
| Pre-pregnancy Diabetes Status (%) a | |||
| No/Unknown/Not Stated | - | 1.0 (1.0–1.0) | - |
| Yes | 0.5741 | 1.776 (1.685, 1.871) | <.0001 |
| Pre-pregnancy Hypertension Status (%) a | |||
| No/Unknown/Not Stated | - | 1.0 (1.0–1.0) | - |
| Yes | 0.6849 | 1.984 (1.913, 2.056) | <.0001 |
| Previous Preterm Birth Status (%) a | |||
| No/Unknown/Not Stated | - | 1.0 (1.0–1.0) | - |
| Yes | 1.0999 | 3.004 (2.929, 3.081) | <.0001 |
| Infertility Treatment Usage Status (%) a | |||
| No/Unknown/Not Stated | - | 1.0 (1.0–1.0) | - |
| Yes | 1.6299 | 5.103 (4.888, 5.328) | <.0001 |
| Fertility Enhancing Drug Usage Status (%) a | |||
| No/Not Applicable/Unknown/Not Stated | - | 1.0 (1.0–1.0) | - |
| Yes | -0.1988 | 0.820 (0.769, 0.873) | <.0001 |
| Delivery Payment Source (%) a | |||
| Medicaid | - | 1.0 (1.0–1.0) | - |
| Private Insurance | -0.0352 | 0.965 (0.948, 0.983) | <.0001 |
| Self-pay/Other/Unknown | 0.0762 | 1.079 (1.054, 1.105) | <.0001 |
a: For the parameters listed after mothers’ education, observations with missing values were automatically excluded from the analysis, and the corresponding parameters were automatically set to 0 because the missing values all come from the same subset of observations.
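The Estimate column holds log-odds coefficients, and each adjusted OR is the exponential of its estimate. The following is a minimal sketch of that relationship and of how a predicted probability could be formed from the coefficients. It assumes a standard main-effects logistic model with no interaction terms; the dictionary keys and function names are hypothetical, and only a few coefficients from the table are included.

```python
import math

# Log-odds estimates copied from the regression table above.
# Key names are illustrative, not taken from the source.
estimates = {
    "intercept": -3.7154,
    "age_35_plus": 0.3034,
    "race_black": 0.5727,
    "previous_preterm_birth": 1.0999,
}


def odds_ratio(beta):
    """Adjusted odds ratio implied by a log-odds coefficient."""
    return math.exp(beta)


def predicted_probability(linear_predictor):
    """Inverse logit: maps a sum of coefficients to a probability."""
    return 1.0 / (1.0 + math.exp(-linear_predictor))


# Probability for a mother in every reference category (intercept only).
baseline = predicted_probability(estimates["intercept"])

# Assumed profile: Black mother with a previous preterm birth,
# reference category for every other covariate.
profile = predicted_probability(
    estimates["intercept"]
    + estimates["race_black"]
    + estimates["previous_preterm_birth"]
)

for name in ("age_35_plus", "race_black", "previous_preterm_birth"):
    print(f"{name}: OR = {odds_ratio(estimates[name]):.3f}")
print(f"baseline probability = {baseline:.4f}")
print(f"profile probability  = {profile:.4f}")
```

The printed odds ratios can be compared directly with the Adjusted OR column as a consistency check on the table.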
Fig. 1 ROC curve from logistic regression on the training dataset (area under the curve = 0.646)
Fig. 2 Calibration plot from the validation sample. Observed vs. predicted probability across the quartiles
Predicted and observed ePTB probability and maternal characteristics by subgroup in the validation cohort, via logistic regression
| Variable | 1st Quartile | 2nd Quartile | 3rd Quartile | 4th Quartile |
|---|---|---|---|---|
| Probability (%) | ||||
| Average Predicted | 1.92 | 2.46 | 3.22 | 6.02 |
| Range Predicted | 0.55 | 0.52 | 0.95 | 60.6 |
| Average Observed | 1.83 | 2.33 | 3.24 | 6.07 |
| Mothers’ Age (%) | ||||
| ≤ 24 Years | 36603 (12.22) | 70681 (23.63) | 127739 (42.58) | 105995 (35.35) |
| 25-29 Years | 120779 (40.32) | 83600 (27.95) | 68003 (22.67) | 71979 (24.00) |
| 30-34 Years | 129538 (43.25) | 78439 (26.23) | 56362 (18.79) | 60927 (20.32) |
| ≥ 35 Years | 12609 (4.21) | 66358 (22.19) | 47889 (15.96) | 60960 (20.33) |
| Mothers’ Race (%) | ||||
| White | 259978 (86.80) | 273311 (91.38) | 260128 (86.71) | 114119 (38.06) |
| Black | 0 (0.00) | 872 (0.29) | 18661 (6.22) | 172970 (57.68) |
| American Indian/Alaskan Native/Asian or Pacific Islander | 39551 (13.20) | 24895 (8.32) | 21204 (7.07) | 12772 (4.26) |
| Marital Status (%) | ||||
| Married | 296804 (99.09) | 246717 (82.49) | 92320 (30.77) | 80790 (26.94) |
| Unmarried | 2725 (0.91) | 52361 (17.51) | 207673 (69.23) | 219071 (73.06) |
| Mothers’ Education (%) | ||||
| ≤ High School or GED/Unknown | 10988 (3.67) | 93778 (31.36) | 192086 (64.03) | 175699 (58.59) |
| Associate/Some College Credit | 69843 (23.32) | 117843 (39.40) | 69455 (23.15) | 79732 (26.59) |
| ≥ Bachelor’s | 217614 (72.65) | 71541 (23.92) | 21886 (7.30) | 35359 (11.79) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Pre-pregnancy Smoking Status (%) | ||||
| Nonsmoker | 295313 (98.59) | 262159 (87.66) | 234907 (78.30) | 217556 (72.55) |
| Smoker/Unknown/Not Stated | 3132 (1.05) | 21003 (7.02) | 48520 (16.17) | 73234 (24.42) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Pre-pregnancy BMI (%) | ||||
| Under Weight-Normal ≤ 24.9 | 183032 (61.11) | 142007 (47.48) | 119757 (39.92) | 108130 (36.06) |
| Overweight 25.0-29.9 | 82956 (27.70) | 67818 (22.68) | 70451 (23.48) | 62770 (20.93) |
| Obesity ≥ 30.0/Unknown/Not Stated | 32457 (10.84) | 73337 (24.52) | 93219 (31.07) | 119890 (39.98) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Pre-pregnancy Diabetes Status (%) | ||||
| No/Unknown/Not Stated | 298445 (99.64) | 283149 (94.67) | 282480 (94.16) | 282746 (94.29) |
| Yes | 0 (0.00) | 13 (0.00) | 947 (0.32) | 8044 (2.68) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Pre-pregnancy Hypertension Status (%) | ||||
| No/Unknown/Not Stated | 298445 (99.64) | 283162 (94.68) | 282293 (94.10) | 273911 (91.35) |
| Yes | 0 (0.00) | 0 (0.00) | 1134 (0.38) | 16879 (5.63) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Previous Preterm Birth Status (%) | ||||
| No/Unknown/Not Stated | 298445 (99.64) | 283162 (94.68) | 283427 (94.48) | 258817 (86.31) |
| Yes | 0 (0.00) | 0 (0.00) | 0 (0.00) | 31973 (10.66) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
| Infertility Treatment Usage Status (%) | ||||
| No/Unknown/Not Stated | 298445 (99.64) | 283162 (94.68) | 283427 (94.48) | 272918 (91.01) |
| Yes | 0 (0.00) | 0 (0.00) | 0 (0.00) | 17872 (5.96) |
| Missing | 1084 (0.36) | 15916 (5.32) | 16566 (5.52) | 9071 (3.03) |
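The calibration of the model across quartiles can be summarized by the gap between the average observed and average predicted probability in each subgroup. The sketch below uses only the two probability rows of the table above; the variable names are assumptions made for illustration.

```python
# Average predicted and observed ePTB probability (%) per quartile,
# copied from the "Probability (%)" rows of the table above.
quartiles = ["1st", "2nd", "3rd", "4th"]
predicted = [1.92, 2.46, 3.22, 6.02]
observed = [1.83, 2.33, 3.24, 6.07]

# Observed minus predicted, in percentage points, rounded to table precision.
gaps = [round(o - p, 2) for p, o in zip(predicted, observed)]

# Largest absolute miscalibration across the four subgroups.
max_abs_gap = max(abs(g) for g in gaps)

for label, gap in zip(quartiles, gaps):
    print(f"{label} quartile: observed - predicted = {gap:+.2f} points")
print(f"largest absolute gap = {max_abs_gap:.2f} points")
```

A negative gap means the model slightly overpredicts ePTB in that quartile, and a positive gap means it underpredicts.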
Fig. 3 Classification and Regression Tree model for predicting ePTB. Legend: The probability of ePTB (P) and the number of subjects (N) are given inside each node for both the training and validation cohorts. In each end node, the subgroup birth prevalence (SBP) is also calculated. AI = American Indian; AN = Alaskan Native; PI = Pacific Islander