| Literature DB >> 26527366 |
Abstract
Computer-intensive resampling/bootstrap methods are feasible when calculating reference intervals from non-Gaussian or small reference samples. Microsoft Excel® in version 2010 or later includes natural functions, which lend themselves well to this purpose including recommended interpolation procedures for estimating 2.5 and 97.5 percentiles. The purpose of this paper is to introduce the reader to resampling estimation techniques in general and in using Microsoft Excel® 2010 for the purpose of estimating reference intervals in particular. Parametric methods are preferable to resampling methods when the distributions of observations in the reference samples is Gaussian or can transformed to that distribution even when the number of reference samples is less than 120. Resampling methods are appropriate when the distribution of data from the reference samples is non-Gaussian and in case the number of reference individuals and corresponding samples are in the order of 40. At least 500-1000 random samples with replacement should be taken from the results of measurement of the reference samples.Entities:
Keywords: Microsoft Excel; biostatistics; bootstrap method; reference interval; resampling method
Mesh:
Year: 2015 PMID: 26527366 PMCID: PMC4622197 DOI: 10.11613/BM.2015.031
Source DB: PubMed Journal: Biochem Med (Zagreb) ISSN: 1330-0962 Impact factor: 2.313
Figure 1General principles when calculating reference intervals.
Figure 2Simplified illustration of the principles of resampling techniques as employed by resampling methods with replacement. This figure illustrates the use of 83 raw data observations obtained from Geffré et al. () resampled 1000 times in this case. One thousand copies of each observation are created and thoroughly mixed giving each of them identical possibility of being selected. A multitude of random samples are then taken from the mixture, calculating the 0.025 and 0.975 percentiles for each sample in order to create an estimate of these percentiles in the intended population.
Detailed instructions to create the calculation sheet.
| Input the data from the reference persons | In column A. |
| Selecting the random samples from the data from the reference persons | In column C, row 1 enter =INDEX($A$1:$A$10000;RANDBETWEEN(1;COUNT($A$1:$A$10000))) |
| Calculate the 2.5 percentile | |
| Calculate the 97.5 percentile | |
| Create several random samples | Copy the column C you have created to the right to make as many random samples from the reference sample as you wish: |
| Calculate the median of the 2.5 percentiles | If the mean is preferred instead of the median, the function AVERAGE should be used instead of MEDIAN |
| Calculate the median of the 97.5 percentiles | If the mean is preferred instead of the median, the function AVERAGE should be used instead of MEDIAN |
Figure 3An example showing 30 random samples drawn from the 83 reference values shown in Figure 2. Only 30 random samples are shown here in order to be able to show the principle in a single figure/screen. The 83 reference values are shown in the column on the far left (column A) marked with a blue box.