| Literature DB >> 29490628 |
Amna Klich1,2,3,4, Catherine Mercier5,6,7,8, Laurent Gerfault9,10, Pierre Grangeat9,10, Corinne Beaulieu11, Elodie Degout-Charmette11, Tanguy Fortin11,12, Pierre Mahé13, Jean-François Giovannelli14, Jean-Philippe Charrier11, Audrey Giremus14, Delphine Maucort-Boulch5,6,7,8, Pascal Roy5,6,7,8.
Abstract
BACKGROUND: In the field of biomarker validation with mass spectrometry, controlling the technical variability is a critical issue. In selected reaction monitoring (SRM) measurements, this issue provides the opportunity of using variance component analysis to distinguish various sources of variability. However, in case of unbalanced data (unequal number of observations in all factor combinations), the classical methods cannot correctly estimate the various sources of variability, particularly in presence of interaction. The present paper proposes an extension of the variance component analysis to estimate the various components of the variance, including an interaction component in case of unbalanced data.Entities:
Keywords: Experimental design; Mass spectrometry; SRM; Technical variability; Validation biomarkers; Variance component analysis
Mesh:
Substances:
Year: 2018 PMID: 29490628 PMCID: PMC5831836 DOI: 10.1186/s12859-018-2075-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1HS: pool of human serum - SRM: selected reaction monitoring - Inj: injection - The triangles indicate the samples destined for SRM and ELISA - The circle indicates the samples destined solely for estimating the digestion yield by SRM - The squares indicate the samples destined for reading by SRM readings - The diamond indicates the samples destined for ELISA readings
Fig. 2SRM: selected reaction monitoring- BHI algorithm: Bayesian Hierarchical algorithm- NLP algorithm: the classical algorithm- AQUA : Absolute QUAntification (labelled internal standard) - QC: quality control- θtech the set of latent technical parameters
Parameters and variables involved in the SRM analytical chain model
| Notation | Description | Range |
|---|---|---|
| t | Time | |
|
| Discrete time sample | |
|
| Digestion factor defined by the number of peptides | |
|
| Digestion yield defined by the correction factor to apply to the digestion factor | |
|
| Peptide to fragment gain | |
|
| Peptide to fragment gain correction factor for AQUA peptide | |
| Normalized chromatography peak response of peptide | ||
|
| Chromatography peak position | |
|
| Chromatography peak width | |
| Transition signal at time | ||
|
| Protein | |
|
| Peptide | |
|
| Concentration of selected ion | |
|
| Concentration of selected fragment |
Hierarchical model equations of the SRM analytical chain for the native transition signals I and labeled transition signals I*
| Quantity | Targeted protein | AQUA peptide standard | |
|---|---|---|---|
| Protein concentration |
| No labeled protein introduced | |
| Peptide concentration before chromatography |
|
| |
| Selected ion concentration before fragmentation |
|
| |
| Signal of transition at time |
|
| |
| Resulting signals of selected children of targeted peptidea |
|
|
aBold notation stands for vectors
Distribution type for each variable of the SRM acquisition chain
| Hierarchical level | Variable | Analytic expression distributiona | Distribution type |
|---|---|---|---|
| Transition | Noise |
| Normal |
| Peptide | Peptide to fragment gain |
| Normal |
| Peptide to fragment gain correction factor |
| Normal | |
| Noise inverse variance |
| Gamma | |
| Peak retention time |
| Uniform | |
| Peak width |
| Uniform | |
| Peptide concentration |
| Normal | |
| Protein | Protein concentration |
| Normal |
| Digestion yield |
| Normal |
aBold notation stands for vectors
Variance decomposition of Model 1S
| Source of variation | DF | Adjusted sum of squares |
|---|---|---|
| Theoretical concentration and interaction | J | |
| Two-day measurement and interaction | 2(J-1) | |
| Interaction | (J-1) | |
| Residual variation | IJR-2 J |
|
| Measurement error | (R-1)*I*J |
|
| Lack of fit | IJ-2 J |
|
DF degrees of freedom, I number of samples, J number of couples of days, R number of digestion-injections -: mean of digestion-injection replicate measurements of each sample and each couple of days - : predicted measurements - RSS: residual sum of squares - SS: sum of squares
Fig. 3Venn Diagrams showing the variance components
Estimations of the mean slope and results of variance decomposition
| Protein and algorithm | Peptide number | Mean slope | Theoretical concentration + interactiona | Interaction | Two-day process+ interactionb | Measurement errorb | Totalb | Lack of fit |
|---|---|---|---|---|---|---|---|---|
| L-FABP | 3 | |||||||
| NLP | 0.64 | 27.2 | 14.4 | 24.9 | 45.7 | 70.6 | 16.9 | |
| BHI | 0.72 | 54.8 | 3.9 | 5.7 | 30.5 | 36.2 | 12.9 | |
| ELISAc | 0.84 | 98.1 | 0.1 | 0.6 | 0.3 | 0.8 | 1.1 | |
| Villin | 3 | |||||||
| NLP | 1.14 | 51.1 | 5.6 | 9.9 | 14.4 | 24.3 | 28.8 | |
| BHI | 0.98 | 74.7 | 1.7 | 1.8 | 16.8 | 18.6 | 8.3 | |
| 14.3.3 sigma | 1 | |||||||
| NLP | 1.09 | 69.8 | 5.8 | 16.9 | 16.4 | 33.3 | 9.5 | |
| BHI | 0.77 | 87.2 | 0.7 | 2.1 | 10.1 | 12.2 | 1.4 | |
| Calgi | 2 | |||||||
| NLP | 1.02 | 86.2 | 1.8 | 6 | 6.7 | 12.7 | 2.9 | |
| BHI | 0.81 | 93.5 | 0.1 | 1 | 4.3 | 5.3 | 1.3 | |
| Def.A6 | 1 | |||||||
| NLP | 0.97 | 97.6 | 0.1 | 0.1 | 2. | 2.1 | 0.2 | |
| BHI | 0.95 | 97.3 | 0.0 | 0.3 | 2.2 | 2.5 | 0.2 | |
| Calmo | 1 | |||||||
| NLP | 0.55 | 87.1 | 0.6 | 8 | 4.7 | 12.7 | 2.4 | |
| BHI | 0.32 | 19.8 | 0.8 | 16 | 52. | 68.1 | 12.9 | |
| I-FABP | 1 | |||||||
| NLP | 0.86 | 89.3 | 0.4 | 2 | 5.1 | 7.1 | 2.5 | |
| BHI | 0.22 | 2.8 | 2.1 | 5.9 | 86. | 91.9 | 7.4 | |
| Peroxi-5 | 1 | |||||||
| NLP | 0.69 | 80.4 | 1.5 | 2.5 | 15.2 | 17.6 | 2.2 | |
| BHI | 0.30 | 27.3 | 15.2 | 24 | 51.5 | 75.5 | 12.4 | |
| S100A14 | 2 | |||||||
| NLP | 0.88 | 85.9 | 15.9 | 21.4 | 4.5 | 25.9 | 7.8 | |
| BHI | 0.34 | 30.2 | 20.3 | 41.4 | 34.6 | 76 | 14.1 |
The results of variance decomposition (columns 4 to 9) are expressed as percentages, a Reflects the dilution variance. b Reflects the technical variance. c Results stemming from the reading order (not the two-day readings)
Fig. 4Two-day reproducibility of the linear model slopes with the NLP algorithm on the log2-log2 scale. In each panel, the solid line represents the diagonal regression line
Fig. 5Two-day reproducibility of the linear model slopes with the BHI algorithm on the log2-log2 scale. In each panel, the solid line represents the diagonal regression line
Fig. 6Reading-order reproducibility of the linear model slopes with L-FABP protein quantification by ELISA on the log2-log2 scale. In each panel, the solid line represents the diagonal regression line