| Literature DB >> 20667136 |
Richard Charnigo1, Lorie W Chesnut, Tony Lobianco, Russell S Kirby.
Abstract
BACKGROUND: Greater epidemiologic understanding of the relationships among fetal-infant mortality and its prognostic factors, including birthweight, could have vast public health implications. A key step toward that understanding is a realistic and tractable framework for analyzing birthweight distributions and fetal-infant mortality. The present paper is the first of a two-part series that introduces such a framework.Entities:
Mesh:
Year: 2010 PMID: 20667136 PMCID: PMC2927479 DOI: 10.1186/1471-2393-10-37
Source DB: PubMed Journal: BMC Pregnancy Childbirth ISSN: 1471-2393 Impact factor: 3.007
Figure 1Four-Component Mixture Models for Birthweight Distribution. (a) A 4-component normal mixture model for birthweight distribution, with parameters estimated using a single sample of size 50000 from the population of white singletons born to heavily smoking mothers, is shown superimposed against a histogram of the 50000 birthweights. (b) A 4-component normal mixture model, with parameters estimated by combining the results for 25 samples of size 50000, is shown.
Figure 2Competing Models for Birthweight Distribution. (a) A contaminated normal model, a 2-component normal mixture model, and a 4-component normal mixture model are compared. The results are based on a single sample of size 50000 from the population of white singletons born to heavily smoking mothers. (b) to (d) Close-up views of the competing models are displayed.
Preferences of Model Selection Criteria on Real Data
| Number of Components | FLIC | BIC | AIC | FLIC | BIC | AIC |
|---|---|---|---|---|---|---|
| 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 20 | 21 | 1 | 8 | 11 | 0 |
| 3 | 5 | 4 | 4 | 5 | 5 | 0 |
| 4 | 0 | 0 | 14 | 12 | 9 | 15 |
| 5 | 0 | 0 | 1 | 0 | 0 | 1 |
| 6 | 0 | 0 | 4 | 0 | 0 | 8 |
| 7 | 0 | 0 | 1 | 0 | 0 | 1 |
| 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 | 25 | 25 | 5 | 22 | 22 | 1 |
| 5 | 0 | 0 | 1 | 0 | 0 | 0 |
| 6 | 0 | 0 | 10 | 3 | 3 | 6 |
| 7 | 0 | 0 | 9 | 0 | 0 | 18 |
The columns "FLIC 5000", "BIC 5000", and "AIC 5000" contain the preferences of the three model selection criteria for the number of components in a normal mixture model for birthweight distribution, based on 25 samples of size 5000 from the population of white singletons born to heavily smoking mothers. The next nine columns correspond to sample sizes of 10000, 25000, and 50000.
Estimating Parameters in a Four-Component Mixture Model
| Quantity | p1 | p2 | p3 | p4 |
|---|---|---|---|---|
| .007 | .182 | .758 | .052 | |
| .001 | .039 | .037 | .008 | |
| .001 | .041 | .032 | .009 | |
| Confidence interval | (.005, .010) | (.092, .272) | (.681, .836) | (.033, .071) |
| Quantity | μ1 | μ2 | μ3 | μ4 |
| 832 | 2772 | 3170 | 3804 | |
| 46 | 103 | 7 | 25 | |
| 34 | 80 | 9 | 38 | |
| Confidence interval | (741, 924) | (2565, 2979) | (3152, 3187) | (3735, 3873) |
| Quantity | σ1 | σ2 | σ3 | σ4 |
| 210 | 740 | 417 | 413 | |
| 28 | 23 | 10 | 38 | |
| 30 | 23 | 7 | 46 | |
| Confidence interval | (146, 274) | (688, 792) | (398, 436) | (321, 506) |
Parameters in a 4-component normal mixture model for birthweight distribution are estimated, based on 25 samples of size 50000 from the population of white singletons born to heavily smoking mothers. Interval estimates are constructed using Equations (7) and (8) with C0 = 2.5 and φ = .2465.
Mixture Models in Simulation Studies
| Design | Description | Mixture Density |
|---|---|---|
| A | 2 components | .120 |
| B | 3 components | .041 |
| C | 4 components | .007 |
| D | 5 components | .007 |
| E | 6 components | .006 |
Probability densities for normal mixture models used in our simulation studies are specified.
Figure 3Mixture Models in Simulation Studies. (a) Probability densities for the normal mixture models used in our simulation studies are compared. (b) to (d) Close-up views of the probability densities are displayed.
Preferences of Model Selection Criteria in Simulation Studies
| True Model | Sample size | FLIC preferences | BIC preferences | AIC preferences | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 3 | 4 | 5 | 6 | 7 | 2 | 3 | 4 | 5 | 6 | 7 | 2 | 3 | 4 | 5 | 6 | 7 | ||
| 2 | 5000 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 7 | 9 | 1 | 3 | 4 | 1 |
| 10000 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 2 | 12 | 7 | 2 | 2 | 0 | |
| 25000 | 20 | 5 | 0 | 0 | 0 | 0 | 21 | 4 | 0 | 0 | 0 | 0 | 0 | 5 | 4 | 8 | 4 | 4 | |
| 50000 | 5 | 19 | 1 | 0 | 0 | 0 | 5 | 19 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | 6 | 10 | 7 | |
| 100000 | 0 | 20 | 3 | 1 | 1 | 0 | 0 | 20 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 | 19 | |
| 3 | 5000 | 19 | 6 | 0 | 0 | 0 | 0 | 19 | 6 | 0 | 0 | 0 | 0 | 1 | 18 | 5 | 1 | 0 | 0 |
| 10000 | 1 | 24 | 0 | 0 | 0 | 0 | 1 | 24 | 0 | 0 | 0 | 0 | 0 | 16 | 6 | 1 | 2 | 0 | |
| 25000 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 9 | 4 | 3 | 6 | 3 | |
| 50000 | 0 | 24 | 1 | 0 | 0 | 0 | 0 | 24 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 15 | 7 | |
| 100000 | 0 | 17 | 3 | 4 | 1 | 0 | 0 | 17 | 3 | 4 | 1 | 0 | 0 | 0 | 0 | 3 | 7 | 15 | |
| 4 | 5000 | 18 | 2 | 5 | 0 | 0 | 0 | 20 | 1 | 4 | 0 | 0 | 0 | 0 | 1 | 20 | 1 | 2 | 1 |
| 10000 | 10 | 1 | 14 | 0 | 0 | 0 | 10 | 1 | 14 | 0 | 0 | 0 | 0 | 1 | 24 | 0 | 0 | 0 | |
| 25000 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 24 | 0 | 1 | 0 | |
| 50000 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 6 | 14 | 3 | 2 | |
| 100000 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 8 | 10 | 7 | |
| 5 | 5000 | 21 | 2 | 2 | 0 | 0 | 0 | 22 | 1 | 2 | 0 | 0 | 0 | 0 | 2 | 12 | 9 | 2 | 0 |
| 10000 | 9 | 3 | 12 | 1 | 0 | 0 | 9 | 3 | 12 | 1 | 0 | 0 | 0 | 0 | 15 | 9 | 1 | 0 | |
| 25000 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 8 | 10 | 6 | 1 | |
| 50000 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 10 | 13 | 2 | |
| 100000 | 0 | 0 | 15 | 10 | 0 | 0 | 0 | 0 | 15 | 10 | 0 | 0 | 0 | 0 | 0 | 6 | 10 | 9 | |
| 6 | 5000 | 24 | 1 | 0 | 0 | 0 | 0 | 24 | 1 | 0 | 0 | 0 | 0 | 0 | 3 | 7 | 10 | 2 | 3 |
| 10000 | 9 | 2 | 13 | 1 | 0 | 0 | 10 | 2 | 13 | 0 | 0 | 0 | 0 | 0 | 12 | 4 | 8 | 1 | |
| 25000 | 0 | 0 | 23 | 2 | 0 | 0 | 0 | 0 | 23 | 2 | 0 | 0 | 0 | 0 | 0 | 2 | 19 | 4 | |
| 50000 | 0 | 0 | 24 | 0 | 1 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 10 | |
| 100000 | 0 | 0 | 10 | 7 | 8 | 0 | 0 | 0 | 11 | 6 | 8 | 0 | 0 | 0 | 0 | 0 | 10 | 15 | |
The row with "True Model" = 2 and "Sample size" = 5000 contains the preferences of the three model selection criteria for the number of components, based on 25 samples of size 5000 simulated from a 2-component normal mixture model. Other rows correspond to a different sample size and/or underlying number of components.
Confidence Interval Coverage Probabilities in Simulation Studies
| Population | Bias adjustment included | Bias adjustment omitted | |
|---|---|---|---|
| Number & Percentage of Intervals | Number & Percentage of Intervals | ||
| 2.0 | 200,000 | 106 (88.3) | 50 (41.7) |
| 1,000,000 | 110 (91.7) | 66 (55.0) | |
| Infinite | 105 (87.5) | 64 (53.3) | |
| 2.5 | 200,000 | 110 (91.7) | 61 (50.8) |
| 1,000,000 | 112 (93.3) | 75 (62.5) | |
| Infinite | 114 (95.0) | 74 (61.7) | |
| 3.0 | 200,000 | 110 (91.7) | 67 (55.8) |
| 1,000,000 | 112 (93.3) | 80 (66.7) | |
| Infinite | 115 (95.8) | 83 (69.2) | |
| 3.5 | 200,000 | 111 (92.5) | 68 (56.7) |
| 1,000,000 | 115 (95.8) | 83 (69.2) | |
| Infinite | 118 (98.3) | 88 (73.3) | |
| 4.0 | 200,000 | 112 (93.3) | 69 (57.5) |
| 1,000,000 | 118 (98.3) | 85 (70.8) | |
| Infinite | 118 (98.3) | 91 (75.8) | |
| 4.5 | 200,000 | 113 (94.2) | 74 (61.7) |
| 1,000,000 | 118 (98.3) | 87 (72.5) | |
| Infinite | 118 (98.3) | 96 (80.0) | |
| 5.0 | 200,000 | 116 (96.7) | 78 (65.0) |
| 1,000,000 | 118 (98.3) | 89 (74.2) | |
| Infinite | 118 (98.3) | 98 (81.7) | |
The row with "C" = 2 and "Population size" = 200,000 identifies the numbers and percentages of confidence intervals containing their targets of mixture parameters, based on 10 repetitions in each of which 25 samples of size 50000 were simulated from a 4-component normal mixture with 12 parameters; results under the heading of "Bias adjustment included" are based on Equation (7) with C = 2, results under the heading of "Bias adjustment omitted" are based on Equation (6) with C = 2, and the 25 samples of size 50000 had overlap consistent with a population size of 200,000. Other rows correspond to different choices of C and/or population sizes.
Another Example with Real Data
| Model | Number of | Fitted Mixture Density |
|---|---|---|
| 2 components | 0 | .144 |
| 3 components | 0 | .040 |
| 4 components | 1 | .012 |
| 5 components | 1 | .010 |
| 6 components | 22 | .007 |
| 7 components | 1 | .007 |
Parameters in 2-component through 7-component normal mixture models for birthweight distribution are estimated, based on 25 samples of size 50000 from the population of black singletons in general. The numbers of samples for which the FLIC preferred the various models are also recorded.
Figure 4Four- and Six-Component Mixture Models for Birthweight Distribution. (a) A 4-component normal mixture model for birthweight distribution, with parameters estimated by combining the results for 25 samples of size 50000 from the population of black singletons in general, is shown. (b) A 6-component normal mixture model, with parameters estimated in the same manner, is depicted.