| Literature DB >> 26977131 |
Hongyue Wang1, Guanqing Chen1, Xiang Lu1, Hui Zhang2, Changyong Feng3.
Abstract
The sample geometric mean has been widely used in biomedical and psychosocial research to estimate and compare population geometric means. However, due to the detection limit of measurement instruments, the actual value of the measurement is not always observable. A common practice to deal with this problem is to replace missing values by small positive constants and make inferences based on the imputed data. However, no work has been carried out to study the effect of this naïve imputation method on inference. In this report, we show that this simple imputation method may dramatically change the reported outcomes of a study and, thus, make the results uninterpretable, even if the detection limit is very small.Entities:
Keywords: population geometric mean; sample geometric mean; two-sample test
Year: 2015 PMID: 26977131 PMCID: PMC4764008 DOI: 10.11919/j.issn.1002-0829.215121
Source DB: PubMed Journal: Shanghai Arch Psychiatry ISSN: 1002-0829

Figure 1. Cumulative distribution functions of X and Y in formula (1) with c0=0.1
Table 1: A random sample from a distribution with geometric mean 0 (sample size n=100)
| 166.95 | 70.67 | 75.68 | 2.61 | 264.39 | 55.30 | 129.93 | 87.55 | 172.43 | 59.95 |
| 211.11 | 127.63 | 71.91 | 362.70 | 73.12 | 293.65 | 292.67 | 369.40 | 139.59 | 304.00 |
| 155.42 | 16.80 | 109.80 | 18.34 | 190.47 | 29.37 | 53.43 | 62.93 | 137.79 | 44.72 |
| 152.19 | 84.66 | 172.02 | 45.94 | 437.89 | 110.13 | 53.51 | 152.44 | 75.92 | 60.48 |
| 151.47 | 513.60 | 34.72 | 69.70 | 492.94 | 42.03 | 4.48 | 82.01 | 445.03 | 35.22 |
| 2.67 | 41.08 | 205.55 | 73.19 | 713.21 | 182.35 | 43.62 | 67.32 | 37.21 | 65.01 |
| 108.44 | 747.98 | 15.69 | 59.55 | 122.46 | 475.55 | 0.95 | 261.28 | 96.82 | 168.29 |
| 44.53 | 191.05 | 74.81 | 143.88 | 194.59 | 26.63 | 90.69 | 141.91 | 25.92 | 251.09 |
| 55.08 | 154.57 | 53.82 | 66.33 | 53.58 | 17.57 | 115.23 | 6.69 | 49.44 | 303.29 |
| 118.96 | 48.13 | 39.11 | 690.46 | 170.17 | 217.58 | 62.74 | 79.84 | 26.43 | 106.79 |
Table 2: Means and standard deviations of sample means and sample geometric means
| sample size |
| |||||||
| mean | sd | mean | sd | mean | sd | mean | sd | |
| 10 | 1.6459 | 0.6936 | 1.6485 | 0.6930 | 0.9169 | 0.4460 | 1.0326 | 0.3445 |
| 50 | 1.6406 | 0.3068 | 1.6433 | 0.3065 | 0.7573 | 0.3079 | 0.9887 | 0.1450 |
| 100 | 1.6415 | 0.2167 | 1.6442 | 0.2165 | 0.7027 | 0.2797 | 0.9834 | 0.1017 |
| 500 | 1.6418 | 0.0969 | 1.6445 | 0.0968 | 0.5944 | 0.2389 | 0.9795 | 0.0453 |
| 1, 000 | 1.6413 | 0.0688 | 1.6440 | 0.0688 | 0.5399 | 0.2464 | 0.9789 | 0.0321 |
| 5, 000 | 1.6411 | 0.0306 | 1.6438 | 0.0306 | 0.3083 | 0.3056 | 0.9783 | 0.0143 |
| 10, 000 | 1.6413 | 0.0217 | 1.6440 | 0.0216 | 0.1571 | 0.2651 | 0.9783 | 0.0101 |

Figure 2. Histogram of sample geometric means from the distributions of X (part A) and X* (part B) in formula (2) for different sample sizes

Histograms of p-values of the test statistic in formula (3) for different sample sizes