| Literature DB >> 23228338 |
Li Yan1, Changxing Ma, Dan Wang, Qiang Hu, Maochun Qin, Jeffrey M Conroy, Lara E Sucheston, Christine B Ambrosone, Candace S Johnson, Jianmin Wang, Song Liu.
Abstract
BACKGROUND: Batch effect is one type of variability that is not of primary interest but ubiquitous in sizable genomic experiments. To minimize the impact of batch effects, an ideal experiment design should ensure the even distribution of biological groups and confounding factors across batches. However, due to the practical complications, the availability of the final collection of samples in genomics study might be unbalanced and incomplete, which, without appropriate attention in sample-to-batch allocation, could lead to drastic batch effects. Therefore, it is necessary to develop effective and handy tool to assign collected samples across batches in an appropriate way in order to minimize the impact of batch effects.Entities:
Mesh:
Year: 2012 PMID: 23228338 PMCID: PMC3548766 DOI: 10.1186/1471-2164-13-689
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Summary of final setup produced by the default algorithm. a) the distribution of SampleType characteristic across the plates; b) the distribution of Race characteristic across the plates; c) the distribution of AgeGrp characteristic across the plates; d) the index of optimization steps versus value of the objective function. The blue diamond indicates the starting point, and the red diamond marks the final optimized setup.
Comparison of sample assignment by two algorithms implemented in OSAT and an undesired sample assignment through complete randomization
| | | ||||||
|---|---|---|---|---|---|---|---|
| 5 | 0.2034518 | 0.9990763 | 0.03507789 | 0.9999879 | 13.25243 | 0.021124664 | |
| 5 | 0.2380335 | 0.9986490 | 3.68541503 | 0.5955359 | 14.22455 | 0.014244218 | |
| 20 | 0.8138166 | 1.0000000 | 5.08147313 | 0.9996856 | 39.75020 | 0.005371387 | |
Figure 2Summary of final setup produced by the alternative algorithm. a) the distribution of SampleType characteristic across the plates; b) the distribution of Race characteristic across the plates; c) the distribution of AgeGrp characteristic across the plates; d) the index of generated setups versus value of the objective function. The blue diamond indicates the first setup generated, and the red diamond marks the final selected setup.
Figure 3Summary of an undesired setup produced by complete randomization. a) the distribution of SampleType characteristic across the plates; b) the distribution of Race characteristic across the plates; c) the distribution of AgeGrp characteristic across the plates.