| Literature DB >> 23079396 |
Kenneth J Livak1, Quin F Wills, Alex J Tipping, Krishnalekha Datta, Rowena Mittal, Andrew J Goldson, Darren W Sexton, Chris C Holmes.
Abstract
The stochastic nature of generating eukaryotic transcripts challenges conventional methods for obtaining and analyzing single-cell gene expression data. In order to address the inherent noise, detailed methods are described on how to collect data on multiple genes in a large number of single cells using microfluidic arrays. As part of a study exploring the effect of genotype on Wnt pathway activation, data were collected for 96 qPCR assays on 1440 lymphoblastoid cells. The description of methods includes preliminary data processing steps. The methods used in the collection and analysis of single-cell qPCR data are contrasted with those used in conventional qPCR.Entities:
Mesh:
Year: 2012 PMID: 23079396 PMCID: PMC3562442 DOI: 10.1016/j.ymeth.2012.10.004
Source DB: PubMed Journal: Methods ISSN: 1046-2023 Impact factor: 3.608
Fig. 1Distribution of estimated efficiencies for 95 qPCR assays detecting human transcripts. Panel A is a histogram displaying the efficiencies estimated from the slopes of standard curve plots. The average efficiency of this distribution is 0.98 with a standard deviation of 0.042. Panel B is a Q–Q plot with the experimental estimated efficiencies plotted on the y-axis and the values expected for a normal distribution with mean efficiency = 0.98 and standard deviation = 0.042 plotted on the x-axis. The black line indicates the values expected for a normal distribution (y = x). For the 85 efficiency values determined using GM12802 RNA, the data points are depicted as light blue (derived from plots with 3 points in the standard curve), dark blue (4 points in the standard curve), or red (⩾5 points in the standard curve). The 10 efficiency values determined using Universal Human cDNA are depicted in gray and all of these values are derived from standard curves with at least 5 points. It can be seen that the points that deviate the most from a normal distribution are all derived from standard curves with only 3 or 4 points. Such determinations are probably more prone to error than those derived from standard curves with 5 or more points.
Genes and primer pairs for the assays used in this study.
| Gene | Ensembl Gene ID | Forward primer | Reverse primer | Comment |
|---|---|---|---|---|
| ENSG00000075624 | CGACCAACCGCGAGAAGATGAC | CGTTAGCACAGCCTGGATAGCAA | ||
| ENSG00000160710 | CGAGCACTGTTGACCCACTTCC | CGTCAGATGCCCTTGGCTGAAAA | ||
| ENSG00000134982 | CGAGCACTTGTGGCCCAACTAAA | CGTCGCCAAGACAAATTCCTCAAAAC | ||
| ENSG00000103126 | CGACAAGGAGCTGCTGACCAAAA | CGTCACCACCCCACAGTCAAAC | ||
| ENSG00000168646 | CGAGTCCACGGAAACTGTTGACA | CGTGTGGCTGGTGCAAAGACATA | ||
| ENSG00000116128 | CGAACTCCAGCCAAAGTGGTGTA | CGTCAACCTGGCCCTTCAAAACA | ||
| ENSG00000166167 | CGATAAGCGGCCTTCGAGACAA | CGTAACCTGTATGGCCTGTGAGAA | ||
| ENSG00000106144 | CGAAACTGCCCAAGCCTACAGAA | CGTTTGGTCAACCCCACGATCA | ||
| ENSG00000110092 | CGAAGAGGCGGAGGAGAACAAA | CGTAGGGCGGATTGGAAATGAAC | ||
| ENSG00000118971 | CGAGCAGAAGGACATCCAACCCTA | CGTTCTTCGCACTTCTGTTCCTCA | ||
| ENSG00000112576 | CGACCGACAGGCCTTGGTCAA | CGTTGGCGGGTACATGGCAAA | ||
| ENSG00000039068 | CGAAGTGCCAACTGGACCATTCA | CGTTCTAAGGCCATCTTTGGCTTCA | ||
| ENSG00000062038 | CGAGAAGATGACACCCGTGACAA | CGTTGGAGCTGGGTGATGTCATA | ||
| ENSG00000124762 | CGATGGAGACTCTCAGGGTCGAAAA | CGTCGGCGTTTGGAGTGGTAGAA | ||
| ENSG00000169118 | CGATTGACCTCTGTGACCGAACA | CGTGTGCACGTATTCCATTCGAGAA | ||
| ENSG00000101266 | CGATCCGAGTTGCTTCCCGATAC | CGTCAACCCAAACTCCACATATCCAAA | ||
| ENSG00000159692 | CGAGAGCACAACCACCACCTCA | CGTGGGCTGTGTTCACCAGGAA | ||
| ENSG00000168036 | CGAAGCTCTTACACCCACCATCC | CGTTGCATGATTTGCGGGACAAA | ||
| ENSG00000100592 | CGAAGCCCACAAATGCCCTGAAA | CGTTCGGTCCATACTGTTCCTTCCA | ||
| ENSG00000153071 | CGACCCACCTCCACAAAGTACCA | CGTGATGTCTGATGCAAGCAAGTCA | ||
| ENSG00000165617 | CGAATCTGAAGAGCACCTGGAGAC | CGTGCCCCATCACTCAGCTCATA | ||
| ENSG00000107984 | CGACGGGCGGGAATAAGTACCA | CGTGGACTAGCGCAGTACTCATCA | ||
| ENSG00000050165 | CGAAATGGGACCATCTGTGACAACC | CGTGCAAAGCTCGCCCTCCA | ||
| ENSG00000004975 | CGATGCCTCCCGCCTCCTTAA | CGTTGACGCTGCTGAAGGATGAC | ||
| ENSG00000151247 | CGAACTTCTTATTGCAAGGCAGTCTCTA | CGTGTGCTCCAAACTTATGCTGTTCA | ||
| ENSG00000141642 | CGAGGAAAAGAAACGCCCAGGTAA | CGTAGCTTCCCATAGGCAGGAC | ||
| ENSG00000102678 | CGAGGTGTGGACAGTGGTCTCTA | CGTCCCTAAAGATGCATTCGGAAGTA | ||
| ENSG00000078579 | CGACCAGGGAACCAGGAAAGACC | CGTCCTTCTCATTCATCCCGAGGTA | ||
| ENSG00000109101 | CGAACCTGGATGCCATCAATCCC | CGTGGGCCAAGCTATCATCCTTCA | ||
| ENSG00000150907 | CGAGGTGTCAGGCTGAGGGTTA | CGTTTCTCTCAGTTCCTGCTGTCA | ||
| ENSG00000118689 | CGACACTGAGGAAGGGGAAGTGG | CGTGAGAGCAGATTTGGCAAAGGG | ||
| ENSG00000162998 | CGACCTCTGCCCTCCACTTAATGTTA | CGTCAGCTATAGAGCCTTCCACCAA | ||
| ENSG00000157240 | CGAGGCAACCTTGCCTTTGAGAA | CGTCCAGGTGACCTCAACATTTCC | ||
| ENSG00000180340 | CGACTGCGCTTCCACCTTCTTCA | CGTAATGATAGGCCGCTCTGGGTA | ||
| ENSG00000163251 | CGATGGGGACTGTCTGCTCTTCT | CGTTGGGGAGAGACGGTTAGGG | ||
| ENSG00000177283 | CGACGTGGTCTTCTTGCTGGTCTA | CGTAGGAACCATGTGAGCGACAA | ||
| ENSG00000116717 | CGAGCGACCTGCAGTTTGCAATA | CGTTTTGCTGAGCACTTCCTCCA | ||
| ENSG00000111640 | CGAACACCATGGGGAAGGTGAAG | CGTGTGACCAGGCGCCCAATA | ||
| ENSG00000105723 | CGACGCCATCAAGAAGGTTCTCC | CGTTTGCAGTGGTCCAGCTTAC | ||
| ENSG00000082701 | CGAACTACCAAATGGGCGAGACA | CGTATGGTAGCCAGAGGTGGATTAC | ||
| ENSG00000075218 | CGAGGGCGATCCCTGTTCCA | CGTTCCTTGCGAGATTGCTGGTA | ||
| ENSG00000048052 | CGAGGGCCAACTGGAAGTGTTAC | CGTATGCGTTGCTGTGAAACCA | ||
| ENSG00000101076 | CGAGTGCGGAAGAACCACATGTAC | CGTAGTAGCGGCACTGGTTCC | ||
| ENSG00000167862 | CGAAAAGCAAGCCGACAGTGAC | CGTCAGGACCACTACTCCGACAATA | ||
| ENSG00000115738 | CGAAGACCCGGGCAGAACCA | CGTCACACAGTGCTTTGCTGTCA | ||
| ENSG00000101384 | CGAAACAAAGGCTTCACGGGAAC | CGTCAAGTGCCACCGTTTCTACAA | ||
| ENSG00000177606 | CGAAAGAACTCGGACCTCCTCAC | CGTTGGATTATCAGGCGCTCCA | ||
| ENSG00000183762 | CGAAGAGCACGAGGATGGTGTCTA | CGTTTGTAGCAGCCAAGGTTTCCA | ||
| ENSG00000130164 | CGACACCACGGTGGAGATAGTGAC | CGTTTCTCATTTCCTCTGCCAGCAA | ||
| ENSG00000138795 | CGAAAGAAAGTGCAGCTATCAACCA | CGTGCTGTCTTTCTTTCCGTGCTA | ||
| ENSG00000162337 | CGACTGCGCCTCACACTACAC | CGTGGCAGATTTCTGGCTGAACA | ||
| ENSG00000070018 | CGAGACAGACCTGGACACCAACTTA | CGTGGATGAGGCAAGTCATCTGCTA | ||
| ENSG00000135341 | CGACGAATCATGTGGGCTGTTCA | CGTACGAGTCATCAGGCTCTCAA | ||
| ENSG00000109339 | CGATCATCCTGGGGATGGGCTA | CGTTTTGTGGCGAACCATTTCTCC | ||
| ENSG00000105976 | CGACAGAGACTTGGCTGCAAGAA | CGTCATGTCTCTGGCAAGACCAAA | ||
| ENSG00000107789 | CGATCCTCCAGTTTGGTCATGCA | CGTTGTACGCTGTTAGGGGTTCC | ||
| ENSG00000137673 | CGAGTGAGCTACAGTGGGAACA | CGTTCTCCTTGAGTTTGGCTTCTAA | ||
| ENSG00000136997 | CGACTCCTTGCAGCTGCTTAGAC | CGTCGAGTCGTAGTCGAGGTCATA | ||
| ENSG00000140807 | CGAGGCTCCAAGAAGCAGCTGAA | CGTTACAGGGTGAAGGTCCACTCC | ||
| ENSG00000087095 | CGAAGACATTAAGCCAGGGAATCTCC | CGTCTTCCACTCTGGCCAATCCA | ||
| ENSG00000163273 | CGACCAACGCGCGCAAATACAAA | CGTCAGCTTGAGGCCGAAGCA | ||
| ENSG00000181222 | CGACTCGCCTCTTCTACTCCAACA | CGTATGGAGTCCCCAATGCCAATA | ||
| ENSG00000112033 | CGAGGCAAAGCCAGCCACAC | CGTGCCATTCACCAACTGCTTCC | ||
| ENSG00000196262 | CGATCTGGTTCCTTCTGCGTGAA | CGTCACCCAGGGAATACGTAACCA | ||
| ENSG00000113575 | CGAGTGGTAACCAAGCTGCAATCA | CGTCTACGAGGTGCTGGGTCAA | ||
| ENSG00000105568 | CGAGTTGCCAATGTCCGCTTCAA | CGTTCTAGGATGGGCTTGACTTCAC | ||
| ENSG00000154001 | CGACAACCCAGCATTGCCAAAA | CGTGAGGGTCTTCGCTGTCAAA | ||
| ENSG00000154229 | CGAACCATCCGCTCCACACTAAA | CGTAGTCGTCGGTCTTTGTCTGAA | ||
| ENSG00000171132 | CGATATCTTCGGCAGCCCACCTA | CGTGACACTGGTATCCCTGCTTTCC | ||
| ENSG00000171016 | CGATATCCTGGCTTTGGAGGCTA | CGTACCACAGTATGGGGAAGACA | ||
| ENSG00000136238 | CGACTCCTGTAGTCGCTTTGCCTA | CGTAGAACATCGTCAGCACTAGCA | ||
| ENSG00000113643 | CGAAGCTGCTACTGTGTGGAGAA | CGTCAGCATACGCCACATGTTCA | ||
| ENSG00000164736 | CGACACAACGCCGAGTTGAGCAA | CGTGCTCTGCCTCCTCCACGAA | ||
| ENSG00000164458 | CGACGCTTCAAGGAGCTCACCAA | CGTGCCAGACACGTTCACCTTCA | ||
| ENSG00000112592 | CGATGCCCGAAACGCCGAATATA | CGTCGTGGTTCGTGGCTCTCTTA | ||
| ENSG00000196628 | CGAAGCCTGCATCCACATGAAC | CGTACATCGGAGGAAGACTGGAA | ||
| ENSG00000081059 | CGATAAGGAGAGCGCTGCCATCA | CGTTTGCGGGCCAGCTCATAGTA | ||
| ENSG00000152284 | CGATCTCCCCAGAGATCGATCCA | CGTGAGAGTGGGTAATACGGTGACA | ||
| ENSG00000148737 | CGACGCTTTGGCCTTGATCAACA | CGTCCTTCACCTTGTATGTAGCGAAC | Whole gene assay | |
| ENSG00000148737 | CGAATCATGATCCCCGACCTGAC | CGTGTGCTGCCGGACTGAAAA | RefSeq NM_001146274.1 | |
| ENSG00000148737 | CGACCCCTCAGACTTCACTGTCA | CGTGCACCACTGGCACTTTGTTA | RefSeq | |
| ENSG00000148737 | CGACATGTCTTTGAATTTGGAATATTACAATG | CGTCCTTCACCTTGTATGTAGCGAAC | RefSeq | |
| ENSG00000148737 | CGAAGCTTCATATGCAACTGTACCC | CGTGGCTGCTTGTCCCTTTTCC | RefSeq | |
| ENSG00000141655 | CGACTTCTCTGCCAGCTAGAAAACC | CGTAGACGCGAAGAGAAGCAGAA | ||
| ENSG00000077097 | CGAGATGCTGCAAGCCCTCGTTA | CGTGGTTGTCATCCACAGCAGGAA | ||
| ENSG00000173915 | CGAACTGGCCACATATGGAAGCA | CGTCAGATGAGGTTAAGAACCGTAGACA | ||
| ENSG00000150630 | CGAGCCAACCTCAACTCAAGGAC | CGTGCATGCATTGAGTCTTTCTCCA | ||
| ENSG00000156076 | CGACATCTGCCCACCTGGATTCTA | CGTACAGGTCCCTCCATTAAAGCA | ||
| ENSG00000125084 | CGACGCTTCCTCATGAACCTTCAC | CGTCGTGGCACTTGCACTCC | ||
| ENSG00000135925 | CGAGACTCGCAACAAGATCCCCTA | CGTGCGATGGCGTAGGCAAAA | ||
| ENSG00000085741 | CGAGGCGTGTGCTATGGCATCAA | CGTGCAGTGTTGCGTCTGGTTCA | ||
| ENSG00000002745 | CGACACCACGGGCAAAGAAAACAA | CGTTGGCAGCGGCAGTCTAC | ||
| ENSG00000134245 | CGACCGGGCCCTCATGAACTTA | CGTACTCACGCCATGGCACTTA | ||
| ENSG00000154342 | CGAGCCCCACTCGGATACTTCTTA | CGTGAGGAATACTGTGGCCCAAC | ||
| ENSG00000162552 | CGAAGAGCCCTCATGAACCTCCA | CGTCCGTGGCACTTGCATTCC | ||
| ENSG00000111186 | CGACTTCTGACAGACGCCAACTCC | CGTGCTGGGCACCGATGATAAACA |
Fig. 2Correlation of qPCR and digital PCR single-cell results for transcripts from two genes. Plots show qPCR results plotted on the y-axis and digital PCR results plotted on the x-axis for CTNNB1 (A) and WNT10A (B). The results are for individual cells from 9 of the cell lines (GM07029, GM07019, GM12239, GM12864, GM12865, GM12752, GM12753, GM06991, GM11881). The qPCR values are Log2Ex values determined as described in Section 2.8.3. The digital PCR values are the log base 2 values of the number of target molecules estimated to be present in each panel of the 48.770 IFCs.
Fig. 7Results of a simulation testing the robustness of using data from 96 assays to estimate the median expression level per cell. The details of the simulation are described in Section 2.8.4. The x-axis plots the number of randomly selected assays (out of 96 assays) that were used to determine a median Log2Ex value for each cell. These median Log2Ex values were used to normalize the results for two batches of 96 single cells. The y-axis is a measure of how well the results from the two batches of cells correlate after normalization. The blue horizontal line is at a Pearson correlation value of 0.95.
Fig. 6Distributions of standard deviations for the 73 assays expressed in at least 10% of the cells. Results are shown separately for pooled baseline (blue) and perturbed (red) cells.
Fig. 3Distribution of Detection Failure Scores for one group of 48 cells. Detection Failure Scores were calculated for the 48 GM11881 baseline cells as described in Section 2.8.1. The vertical line at Detection Failure Score = 24 is the 3× median threshold used to cull low expressing cells from the data set.
Fig. 4Effect of sampling error on qPCR results. Panel A depicts Poisson distributions at low concentrations of target molecules per qPCR reaction volume, ranging from 1 to 5 molecules per reaction volume. Panel B shows the calculated effect (see Section 2.8.2) of the Poisson distribution on Cq standard deviation values at various low concentrations of target molecules per qPCR reaction volume.
Theoretical number of molecules generated by preamplification from one single-stranded cDNA molecule as a function of cycles and efficiency.
| Cycles | Efficiency (%) | Number of molecules generated | Concentration (number of molecules/μL) | |
|---|---|---|---|---|
| In 75 μL | After dilution with qPCR reagents | |||
| 18 | 100 | 131,072 | 1748 | 749 |
| 19 | 95 | 166,197 | 2216 | 950 |
| 20 | 90 | 197,842 | 2638 | 1131 |
| 21 | 85 | 220,513 | 2940 | 1260 |
Volume of sample at the end of the preamplification step.
Effect of efficiency on ΔΔCq after 14 cycles of preamplification.a
| Efficiency | 100% | 95% | 90% | 85% | 80% |
|---|---|---|---|---|---|
| ΔΔ | 0.0 | 0.5 | 1.0 | 1.6 | 2.1 |
Calculated assuming one assay at 100% and the other at the indicated efficiency.
Fig. 5Histograms showing CTNNB1 transcript levels in lymphoblastoid cells in logarithmic (A) and linear (B) scale. Conversion of Cq values to Log2Ex values and conversion from logarithmic to linear scale are described in Section 2.8.3. For these distributions, the data for baseline (blue) and perturbed (red) cells for all 15 cell lines were pooled. The zero bins show the number of cells in which CTNNB1 transcript was not detected.