| Literature DB >> 27342544 |
P Scott Pine1, Sarah A Munro2, Jerod R Parsons2, Jennifer McDaniel2, Anne Bergstrom Lucas3, Jean Lozach4, Timothy G Myers5, Qin Su5, Sarah M Jacobs-Helber6,7, Marc Salit2.
Abstract
BACKGROUND: Highly multiplexed assays for quantitation of RNA transcripts are being used in many areas of biology and medicine. Using data generated by these transcriptomic assays requires measurement assurance with appropriate controls. Methods to prototype and evaluate multiple RNA controls were developed as part of the External RNA Controls Consortium (ERCC) assessment process. These approaches included a modified Latin square design to provide a broad dynamic range of relative abundance with known differences between four complex pools of ERCC RNA transcripts spiked into a human liver total RNA background.Entities:
Keywords: ERCC; Gene expression; Microarray; RNA controls; RNA sequencing; RNA-Seq; Spike-in controls
Mesh:
Substances:
Year: 2016 PMID: 27342544 PMCID: PMC4921035 DOI: 10.1186/s12896-016-0281-x
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Fig. 1Modified Latin square pool design. Five subpools of ERCC controls, each containing a unique set of transcripts at different target relative abundance (see Table 1), were combined in varying proportions. Subpools A, B, C, D, and E are shaded, black, white, light grey, medium grey, and dark grey, respectively. Subpool A is present as a constant 10 % component of each pool, the modification. Subpools B-E are mixed using a Latin square of proportions 10, 15, 25, and 40 %
Distribution of ERCCs among subpools and pools
| Target Relative Abundance | ||||||
|---|---|---|---|---|---|---|
| Control ID | Subpool | Pool 12 | Pool 13 | Pool 14 | Pool 15 | Note |
| ERCC-00073 | N/A | 0 | 0 | 0 | 0 | Omitted |
| ERCC-00162 | A | 1 | 1 | 1 | 1 | Non-SRM |
| ERCC-00163 | B | 1 | 1.5 | 2.5 | 4 | Antisense |
| ERCC-00164 | C | 1.5 | 2.5 | 4 | 1 | Antisense |
| ERCC-00165 | D | 2.5 | 4 | 1 | 1.5 | Antisense |
| ERCC-00123 | E | 4 | 1 | 1.5 | 2.5 | Reassigned to Pool C |
Dynamic range coverage
| Platform | Units | Subset | ERCC-00073 | Minimum | Maximum | Range |
|---|---|---|---|---|---|---|
| Illumina | log2 signal | ERCC | 5.81 ± 0.08 | 5.72 ± 0.07 | 13.88 ± 0.11 | 8.16 ± 0.13 |
| Bead | BKGDa | 5.35 ± 0.07 | 14.34 ± 0.08 | 8.99 ± 0.10 | ||
| NIAID | log2 signal | ERCC | 5.53 ± 0.02 | 5.53 ± 0.03 | 15.83 ± 0.38 | 10.30 ± 0.39 |
| In-house | BKGD | 5.20 ± 0.01 | 15.26 ± 0.27 | 10.06 ± 0.27 | ||
| Agilent | log2 signal | ERCC | 2.62 ± 0.14 | 2.57 ± 0.16 | 20.73 ± 0.18 | 18.16 ± 0.24 |
| One-color | BKGD | 2.41 ± 0.10 | 20.66 ± 0.06 | 18.25 ± 0.12 | ||
| Agilent | log2 signal | ERCC | 2.57 ± 0.06 | 2.40 ± 0.06 | 18.37 ± 0.10 | 15.98 ± 0.12 |
| Two-color | BKGD | 4.40 ± 0.15 | 20.00 ± 0.10 | 15.60 ± 0.19 | ||
| Illumina | log2 FPKM | ERCC | Undetected | −4.98 ± 0.67 | 14.58 ± 0.27 | 19.56 ± 0.72 |
| HiSeq | BKGD | −6.34 ± 0.40 | 18.27 ± 0.05 | 24.61 ± 0.40 | ||
| LifeTech | log2 RPKMb | ERCC | Undetected | −3.26 ± 0.38 | 16.47 ± 0.34 | 19.73 ± 0.51 |
| SOLiD | BKGD | −6.64 ± 0.00 | 17.30 ± 0.35 | 23.94 ± 0.35 | ||
aAll transcripts measured in the total human liver RNA background
bMinimum RPKM value reported is truncated at 0.01 for all replicates
Fig. 2ERCC signal response as a function of relative abundance in each of the four pools on the Illumina microarray platform. In Panel a, each line represents an individual ERCC control, where grey = titrated, black = 1-to-1, red = outlier, and dashed-line = background (average ERCC-00073). In Panel b, the centroid of each ERCC control is plotted, where the red line corresponds to the fitted Langmuir model, open circles = within 99 % CI, red circles = outliers, and dashed-line = background. In Panel c, the slope of each ERCC is plotted, where the red line corresponds to expected slope (first derivative of the Langmuir model), the vertical dotted lines correspond to the margins of the linear region (inflection points of the first derivative of the Langmuir model), the open circles = monotonic ERCC controls (ρ = 1), grey squares = non-monotonic, and red = outliers. Numbers adjacent to outliers in Panels b and c correspond to the last three digits of the Control ID in Table 3. In Panel d, each ERCC control is represented on the Bland-Altman plot of Pool 12 vs Pool 14, where the red line corresponds to the ratio versus average intensity derived from the fitted Langmuir model, with outliers coded as in Panels b and c above
Fig. 3ERCC signal response as a function of relative abundance in each of the four pools on the NIAID microarray platform. See Fig. 2 legend
Fig. 4ERCC signal response as a function of relative abundance in each of the four pools on the Agilent 1-color microarray platform. See Fig. 2 legend
Fig. 5ERCC signal response as a function of relative abundance in each of the four pools on the Agilent 2-color microarray platform. See Fig. 2 legend
Fig. 6ERCC signal response as a function of relative abundance in each of the four pools on the LifeTech NGS platform. In Panel a, each line represents an individual ERCC control, where grey = titrated, black = 1-to-1, and red = outlier. Partially detected and undetected ERCC controls are included at the bottom to indicate their targeted relative abundance. In Panel b, the centroid of each ERCC control is plotted, where the red line corresponds to the linear fitted model, open circles = within 99 % CI, and red circles = outliers. In Panel c, the slope of each ERCC is plotted, where the open circles = monotonic ERCC controls (ρ = 1), grey squares = non-monotonic, and red = outliers. Numbers adjacent to outliers in Panels b and c correspond to the last three digits of the Control ID in Table 3. In Panel d, each ERCC control is represented on the Bland-Altman plot of Pool 12 vs Pool 14, with outliers coded as in Panels b and c above
Fig. 7ERCC signal response as a function of relative abundance in each of the four pools on the Illumina NGS platform. See Fig. 6 legend
ERCC outliers grouped by performance criteria
| Log2 Target | Response Curve Outliers | Non-monotonic | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Controls | Relative Abundance | Subpool | AGLa | ILM | NIAID | LifeTech | ILM HiSeq | AGLa | ILM | NIAID | LifeTech | ILM HiSeq |
| ERCC-00156b | 1 | B | ◯ | ◯ | ||||||||
| ERCC-00147 | 2 | C | ◯ | ◯ | ◯ | |||||||
| ERCC-00148b | 2 | D | ◯[1,2] | ◯ | ◯[1] | ◯ | ◯ | |||||
| ERCC-00137 | 3 | B | ◯[2] | ◯[1] | ◯ | |||||||
| ERCC-00143b | 3 | E | ◯[1] | ◯[1] | ||||||||
| ERCC-00131b | 4 | D | ◯[1] | ◯ | ◯[1,2] | |||||||
| ERCC-00134b | 4 | E | ◯[1] | ◯[1,2] | ||||||||
| ERCC-00168b | 6 | E | ◯[1,2] | ◯ | ◯[2] | ◯ | ◯ | |||||
| ERCC-00095 | 8 | E | ◯ | |||||||||
| ERCC-00157 | 1 | C | ◯ | |||||||||
| ERCC-00158 | 1 | D | ◯[1] | |||||||||
| ERCC-00160 | 1 | E | ◯[1] | |||||||||
| ERCC-00145 | 2 | B | ◯[1] | |||||||||
| ERCC-00150 | 2 | E | ◯[1] | |||||||||
| ERCC-00138b | 3 | C | ◯[1,2] | ◯ | ◯ | |||||||
| ERCC-00142 | 3 | D | ◯[2] | ◯ | ||||||||
| ERCC-00128 | 4 | B | ◯[2] | ◯ | ||||||||
| ERCC-00111 | 6 | C | ◯ | |||||||||
| ERCC-00097 | 7 | B | ◯ | |||||||||
| ERCC-00098 | 7 | C | ◯ | ◯ | ||||||||
| ERCC-00104 | 7 | E | ◯ | |||||||||
| ERCC-00086 | 8 | C | ◯ | ◯ | ||||||||
| ERCC-00062b | 12 | C | ◯ | |||||||||
| ERCC-00046b | 13 | C | ◯ | |||||||||
| ERCC-00162 | 0 | A | ◯[1] | |||||||||
| ERCC-00126 | 4 | A | ◯ | |||||||||
| ERCC-00113b | 5 | C | ◯[2] | ◯ | ◯ | |||||||
| ERCC-00117 | 5 | C | ◯ | |||||||||
| ERCC-00120 | 5 | D | ◯ | |||||||||
| ERCC-00109 | 6 | B | ◯ | ◯ | ◯ | |||||||
| ERCC-00077 | 8 | A | ◯ | |||||||||
| ERCC-00081 | 9 | D | ◯ | |||||||||
| ERCC-00060 | 10 | A | ◯ | |||||||||
| ERCC-00075 | 10 | D | ◯ | |||||||||
| ERCC-00171 | 10 | B | ◯ | |||||||||
| ERCC-00054 | 11 | B | ◯ | |||||||||
| ERCC-00058b | 11 | D | ◯[1,2] | ◯ | ◯ | ◯ | ||||||
| ERCC-00069 | 11 | E | ◯ | ◯ | ||||||||
| ERCC-00044 | 12 | B | ◯ | ◯ | ||||||||
| ERCC-00025 | 14 | A | ◯ | |||||||||
| ERCC-00028 | 14 | B | ◯ | |||||||||
| ERCC-00040 | 14 | C | ◯ | |||||||||
| ERCC-00042 | 14 | E | ◯ | ◯ | ||||||||
| ERCC-00007 | 16 | D | ◯ | |||||||||
| ERCC-00022 | 16 | C | ◯[1,2] | |||||||||
| ERCC-00170b | 16 | A | ◯[2] | ◯ | ◯ | ◯ | ||||||
| ERCC-00023 | 18 | D | ◯[1,2] | |||||||||
| ERCC-00002 | 20 | B | ◯ | |||||||||
| ERCC-00012 | 20 | A | ◯ | |||||||||
| ERCC-00024 | 20 | E | ◯[1,2] | |||||||||
The following analytes were incorrectly prepared as their antisense sequence and omitted from the data anlysis: ERCC-00009, ERCC-00014, ERCC-00057, ERCC-00059, ERCC-00099, ERCC-00108, and ERCC-00116
aNumbers in bracket indicate array type: 1 = Agilent 1-color, 2 = Agilent 2-color
bDiscussed further in main text