| Literature DB >> 27596149 |
Stefan Enroth1, Göran Hallmans2, Kjell Grankvist3, Ulf Gyllensten4.
Abstract
The quality of clinical biobank samples is crucial to their value for life sciences research. A number of factors related to the collection and storage of samples may affect the biomolecular composition. We have studied the effect of long-time freezer storage, chronological age at sampling, season and month of the year and on the abundance levels of 108 proteins in 380 plasma samples collected from 106 Swedish women. Storage time affected 18 proteins and explained 4.8-34.9% of the observed variance. Chronological age at sample collection after adjustment for storage-time affected 70 proteins and explained 1.1-33.5% of the variance. Seasonal variation had an effect on 15 proteins and month (number of sun hours) affected 36 proteins and explained up to 4.5% of the variance after adjustment for storage-time and age. The results show that freezer storage time and collection date (month and season) exerted similar effect sizes as age on the protein abundance levels. This implies that information on the sample handling history, in particular storage time, should be regarded as equally prominent covariates as age or gender and need to be included in epidemiological studies involving protein levels.Entities:
Keywords: Biobank; Covariate; Plasma proteins; Proximity extension assay; Sampling month; Storage time
Mesh:
Substances:
Year: 2016 PMID: 27596149 PMCID: PMC5078583 DOI: 10.1016/j.ebiom.2016.08.038
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Study design. (a) Distribution of samples over time and individual age at time of sampling. Blue dots correspond to samples from the Västerbotten Intervention Program (VIP) and red dots to samples from the Mammography Screening Program (MA). (b) The VIP-study has been collecting samples since 1985 and the vast majority of the individuals in this study have donated samples at multiple occasions, some over a period of over 3 decades. (c) With the exception of July, the samples have been collected throughout the whole year.
Fig. 2Associations and variance explained by different variables. (a) Decreasing protein abundance levels (NPX) with longer storage time, (b) Increasing abundance levels (NPX) with longer storage time. (c) Fraction of variance explained by storage time (all proteins with nominally significant associations, Supplementary Table 1). (d) Increasing protein abundance levels (NPX) for CUB domain-containing protein 1 (CDCP1) with age after storage-time component removed (linear model). (e) Same as (d), but decreasing with increased age of individual for Neurotrophin-3 (NT-3). (f) Fraction of variance explained by age in the residuals after correcting for storage time (all proteins with nominally significant associations, Supplementary Table 2).
Fig. 3Seasonal and monthly variation in protein abundance levels. (a) Mean sun hours per season (Winter, Spring, Summer, Autumn) (b–d) proteins abundance levels (NPX) over seasons for Latency-associated peptide transforming growth factor beta-1 (LAP-TGF-beta-1), Heat shock 27 kDa protein (HSP 27) and Interleukin-20 receptor subunit alpha (IL-20RA). Red connections indicate nominally significant (p < 0.05, two-sided Wilcox test) differences between pair of seasons. (e) Sun hours per month. (f–g) Monthly differences in protein abundance levels for two proteins (LAP-TGF-beta-1 and HSP-27). Solid lines represent responses from linear models fitted with 0–11 months offset between sunlight and protein abundance levels (NPX). Blue lines indicate nominally significant contribution of sunlight hours to protein abundance levels for a specific offset. The best model in (f) was found for an offset of 8 months delay, and for (g) with a 1 month delay. (h) Fraction of variance explained by sunlight (as a proxy variable for a number of possible factors) on protein abundance levels after adjusting for storage time and age of the individual.