Literature DB >> 27920993

Determining lower limits of detection of digital PCR assays for cancer-related gene mutations.

Coren A Milbury1, Qun Zhong1, Jesse Lin1, Miguel Williams1, Jeff Olson1, Darren R Link1, Brian Hutchison1.   

Abstract

Digital PCR offers very high sensitivity compared to many other technologies for processing molecular detection assays. Herein, a process is outlined for determining the lower limit of detection (LoD) of two droplet-based digital PCR assays for point mutations of the epidermal growth factor receptor (EGFR) gene. Hydrolysis probe mutation-detection assays for EGFR p.L858R and p.T790M mutations were characterized in detail. Furthermore, sixteen additional cancer-related mutation assays were explored by the same approach. For the EGFR L8585R assay, the assay sensitivity is extremely good, and thus, the LoD is limited by the amount of amplifiable DNA that is analyzed. With 95% confidence limits, the LoD is one mutant in 180,000 wild-type molecules for the evaluation of 3.3 μg of genomic DNA, and detection of one mutant molecule in over 4 million wild-type molecules was achieved when 70 million copies of DNA were processed. The measured false-positive rate for the EGFR L8585R assay is one in 14 million, which indicates the theoretical LoD if an unlimited amount of DNA is evaluated. For the EFGR T790M assay, the LoD is one mutant in 13,000 for analysis of a 3.3 μg sample of genomic DNA, and the dPCR assay limit sensitivity approaches one mutant in 22,000 wild-type molecules.

Entities:  

Keywords:  Assay sensitivity; Digital PCR; EGFR L858R; EGFR T790M; EGFR, epidermal growth factor receptor; Limit of detection; LoB, limit of blank; LoD, limit of detection; Mutation detection; N, total number of droplet events counted; NMut, number of droplets with only mutated DNA; NWT, number of droplets with only wild-type DNA; PCR, Polymerase Chain Reaction; R, ratio of mutant to wild-type molecules; RFP, average false positive rate (ΛFP/#WT); p, fraction of PCR-positive droplets; ΛFP, average number of false-positive events; λ, average number of targets “loaded” per droplet

Year:  2014        PMID: 27920993      PMCID: PMC5129438          DOI: 10.1016/j.bdq.2014.08.001

Source DB:  PubMed          Journal:  Biomol Detect Quantif


Introduction

Digital PCR

Invention of the Polymerase Chain Reaction (PCR) [1] changed life science research and molecular diagnostics. Now, digital PCR (dPCR) is changing the field of PCR and re-defining expectations for mutation detection. The power of digital PCR arises from evaluating individual molecules as positive or negative for a particular parameter, such as mutation status. For many assays, dPCR sensitivity is significantly higher than traditional PCR analysis, and the accuracy and precision of the assay improves by counting larger numbers of molecules individually. The sensitivity of digital PCR will facilitate detection limits that redefine our understanding of disease onset, progression, and recurrence. One particularly attractive application of digital PCR is the quantitative detection of a small number of mutated DNA molecules among a large number of wild-type molecules, which is relevant to cancer research, and especially for the detection of minor alleles. Cancerous tissue is often highly heterogeneous and cancer biomarkers vary across types of disease and stages of disease progression which complicates cancer detection and identification at early stages. Subclonal populations of cells within a tumor may contain a mutation that differs from the primary mutation, but the subclonal mutation could be correlated to a prognosis and/or a response to specific therapy regimens. Similarly, detecting mutations in circulating tumor DNA (i.e., a Fluid Biopsy™ sample) is a relatively simple and non-invasive approach to monitoring disease recurrence, which requires a high sensitivity of mutation detection to provide effective therapies at the earliest stage of progression [2], [3], [4]. These examples reflect the need for mutation detection tools that are qualitatively more sensitive than existing tools, enabling sensitivity better than 1 in 10,000 and maximized to enable transformational advances in cancer research. Digital PCR technologies can be deeply sensitive and highly precise for detection of low abundance minority alleles, such as those observed in progressive cancer and metastatic samples. The epidermal growth factor receptor (EGFR) is frequently impacted by mutations that arise in cancer. Mutations in the EGFR gene (chromosomal locus 7p12.3-p12.1) are common across several cancer types, and the mutations often result in altered expression and activity. Upregulation and overexpression of EGFR can lead to uncontrolled cellular division, resulting in rampant tissue growth and lead to cancer. A common example is a point mutation (c.2573T>G) in exon 21 of EGFR, commonly known as variant L858R. This type of activating mutation occurs frequently in epithelial cancers, particularly lung cancer and glioblastomas. Non-small cell lung cancers (NSCLC) are some of the most common cancers in the world and are frequently associated with mutations in EGFR. Two tyrosine kinase inhibitors (TKIs), Gefitinib (Iressa®) and Erlotinib (Tarceva®), are common treatments for NSCLC. Nearly all responders to treatment possess somatic alterations in exons 18–21 of the EGFR gene. Approximately 90% of EGFR mutations in NSCLC are highly variable deletions located in exon 19 or point mutations in exon 21[5]. The L858R and T790M alterations are two of the most frequent EGFR mutations. The EGFR T790M mutation is particularly important because it is linked to known drug resistance, reinforcing the value of early detection [6]. Herein, the process of determining lower limits of detection (LoD) for assays performed using the RainDance RainDrop® digital PCR system for the detection of mutations in EGFR and other common cancer-related mutations is reported. The data demonstrate that digital PCR supports the potential for detecting mutated DNA in highly heterogeneous tumor samples or body fluids, which provides a broad view of all biomarkers arising from heterogeneous and highly localized tumor(s).

Methods

EGFR assays for digital PCR analysis

Assays comprised of primers and hydrolysis probes were developed to screen for EGFR T790M and L858R point mutations via digital PCR. The digital PCR system utilized for this evaluation was the RainDrop® Digital PCR System (RainDance Technologies). Table 1 presents the assay details for the EGFR T790M and L858R assays. Probes were labeled with 6-carboxyfluorescein (FAM, ex 494 nm/em 522 nm), VIC (from ABI, ex 528 nm/em 554 nm), or TET (from Integrated DNA Technologies, ex 522 nm/em 539 nm). The EGFR L858R TaqMan® MGB probes were designed using the Life Technologies online TaqMan design tool, whereas the PrimeTime® LNA-ZEN probes were designed with the assistance of Integrated DNA Technologies, as these are custom probes (non-cataloged item). The EGFR T790M probe sequence contains LNA nucleotides that are denoted with “+”.
Table 1

EGFR assay details.

Protein Variantp.T790Mp.L858R
Nucleotide Variantc.2369C>Tc.2573T>G
Amplicon Locationchr7: 55249045-55249095chr7:55259486-55259563
Amplicon Length51 base pairs78 base pairs
Forward Primer5′-CCTCACCTCCACCGTGCA-3′5′-GCAGCATGTCAAGATCACAGATT-3′
Reverse Primer5′-AGGCAGCCGAAGGGCA-3′5′-CCTCCTTCTGCATGGTATTCTTTCT-3′
Wild-type Probe/5TET/T+CATC+A+C+GC/ZEN/A+GCTC/3IABkFQ//VIC/-AGTTTGGCCAGCCCAA-MGBNFQ
Mutant Probe/6FAM/T+CATC+A+T+GC/ZEN/A+GC+TC/3IABkFQ//6FAM/-AGTTTGGCCCGCCCAA-MGBNFQ
Probe Type (Vendor)PrimeTime® LNA-ZEN (Integrated DNA Technologies)TaqMan® MGB (Life Technologies)

Samples and reagent components

A series of experimental samples were processed to determine the LoD for each EGFR assay. Table 2 summarizes the replicate samples that were analyzed in this study. A two-part design was executed; one subset of samples is composed of wild-type genomic DNA samples to assess the false positive rate of the assay, and the second subset included a mutation titration series to verify the linearity and sensitivity of the assay.
Table 2

Samples evaluated to measure and verify the sensitivity of each EGFR assay.

Sample (Mut/WT)EGFR T790MEGFR L858R
0% (wild-type only)N = 58N = 71
∼0.5 to 1.0%N = 4N = 4
∼0.05 to 0.1%N = 4N = 4
∼0.005 to 0.01%N = 4N = 4
∼0.0005 to 0.001%N = 4N = 4
The EGFR T790M and L858R mutation templates were synthetic GeneArt® plasmid templates (Life Technologies Inc.). Each plasmid was linearized using restriction enzyme digestion. Wild-type genomic DNA (G3041, Promega Inc.) was nebulized to approximately 3 kb in length according to manufacturer's protocols (K7025-05, Life Technologies). DNA was quantified and qualified using a NanoDrop 2000 spectrophotometer. Fragmentation length was confirmed via gel electrophoresis. Following DNA preparation, PCR reagent components were prepared as presented in Table 3 in a pre-PCR room to limit the risk of reagent contamination. All sample PCR reactions were prepared to 50 μl volumes, containing approximately ∼20,000 copies/μl of genomic DNA (∼3.3 μg DNA per each 50 μl reaction). The mutant template was only incorporated into the second subset of samples, following a titration of the DNA sample, according to Table 2.
Table 3

Digital PCR reagent components.

ReagentVendorItem #Final concentration
TaqMan® Genotyping Master MixLife Technologies4371355
TaqMan® MGB ProbesLife TechnologiesCustom0.2 μM
PrimeTime® LNA-ZEN qPCR ProbesIntegrated DNA TechnologiesCustom0.2 μM
Oligonucleotide primersIntegrated DNA TechnologiesCustom0.9 μM
Droplet StabilizerRainDance Technologies20-00803
WT genomic DNA controlPromegaG3041∼20,000 copies/μl
Mutant plasmid DNA control (GeneArt® Gene Synthesis)Life TechnologiesCustomVariable by sample type
DNase/RNase-free waterSigma–AldrichW4502-1LTo volume
Outside of the samples defined within this evaluation, preliminary assessment, verification, and optimization of both assays was performed. The false positive mutant count is zero for both assays, though some false-positive wild-type counts have been observed for the EGFR T790M assay, possibly due to non-specific probe hydrolysis.

Emulsion generation, thermal cycling, and sample reading

Samples were loaded onto the RainDrop® Source instrument (RainDance Technologies), following operating guidelines, to emulsify the sample. The RainDrop® Source instrument uses real-time closed-loop image control to ensure uniformity of droplet creation both within and across runs. Each 50 μl sample was emulsified into 5 picoliter droplet volumes, partitioning single molecule of DNA into approximately 10 million droplets. Following emulsion generation on the RainDrop® Source instrument, the samples were thermal cycled following the protocol outlined in Table 4. A Bioer GenePro thermal cycler was used. Thermal cycled samples were loaded onto the RainDrop® Sense instrument (RainDance Technologies). The RainDrop® Sense instrument uses a 488 nm laser to read the FAM and VIC/TET fluorescence intensity per droplet.
Table 4

Thermal cycling protocols.

A. EGFR T790M
B. EGFR L858R
StepTempTimeCyclesStepTempTimeCycles
Polymerase activation95 °C10 min1Polymerase activation95 °C10 min1
Denaturation95 °C15 sDenaturation95 °C15 s
Annealing58 °Ca15 s}50Annealing & Extension60 °Ca60 s}45
Extension60 °C45 s
Incubation98 °C10 min1Incubation98 °C10 min1
Final hold12 °CIndefiniteFinal hold12 °CIndefinite

A slow ramping speed (0.5 °C) was used during cooling from the denaturation step to the annealing step.

Assessment of additional cancer assays

The approach to determine assay sensitivity was employed to assess sixteen cancer-related assays in addition to EGFR. The assay designs are listed in Appendix Table A1. Analysis of wild-type DNA, as well as a mutation-titration series, was evaluated for each of these assays. Assay designs included TaqMan® MGB probes as well as custom PrimeTime® LNA-ZEN qPCR probes (similar to those used for the two EGFR assays). Reagent composition and sample layout followed the methods presented above, and limits of detection were determined by the process outlined in the following sections.
Table A1

Sixteen cancer assay sequences and probe chemistries evaluated in addition to EGFR L858R and T790M.

TargetOligonucleotideSequenceProbe typeVendor
Epidermal growth factor receptor (EGFR) p.E746_A750del, c.2235del15WT PROBEVIC-ATTAAGAGAAGCAACATCTCCGA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TCGCTATCAAAACATCTCCGA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGGATCCCAGAAGGTGAGA-3′
REVERSE PRIMER5′-CCACACAGCAAAGCAGAAAC-3′



Phosphoinositide-3-kinase, catalytic, alpha polypeptide (PIK3CA)p.H1047R, c.3140A>GWT PROBE/5TET/C+CATG+A+T+GT/ZEN/G+CAT/3IABkFQ/LNA-ZENIntegrated DNA Technologies
MUTANT PROBE/56-FAM/C+CATG+A+C+GT/ZEN/GCAT/3IABkFQ/LNA-ZENIntegrated DNA Technologies
FORWARD PRIMER5′-GAGCAAGAGGCTTTGGAGTA-3′
REVERSE PRIMER5′-ATGCTGTTTAATTGTGTGGAAGA-3′



Phosphoinositide-3-kinase, catalytic, alpha polypeptide (PIK3CA)p.E545K, c.1633G>AWT PROBE/5TET/CCT +GC+T +C+AG/ZEN/TG+AT/3IABkFQ/LNA-ZENIntegrated DNA Technologies
MUTANT PROBE/56-FAM/TC+C TGC +T+T+A/ZEN/+GT+G/3IABkFQ/LNA-ZENIntegrated DNA Technologies
FORWARD PRIMER5′-GACAAAGAACAGCTCAAAGCA-3′
REVERSE PRIMER5′-GCACTTACCTGTGACTCCAT-3′



Murine sarcoma viral oncogene homolog B1 (BRAF)p.V600E, c.1799T>AWT PROBE/5TET/TCGAGAT+TTC+ACT+GTAGCT/3IABkFQ/LNA-ZENIntegrated DNA Technologies
MUTANT PROBE/56-FAM/TCGAGAT+TTC+TCT+GTAGCT/3IABkFQ/LNA-ZENIntegrated DNA Technologies
FORWARD PRIMER5′-ACCTCAGATATATTTCTTCATG-3′
REVERSE PRIMER5′-CCAGACAACTGTTCAAAC-3′



Anaplastic lymphoma receptor tyrosine kinase (ALK)p.R1275Q, c.3824G>AWT PROBEVIC-TGGCCCGAGACATC-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TGGCCCAAGACATC-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-GGAAGAGTGGCCAAGATTGGA-3′
REVERSE PRIMER5′-TGAGGCAGTCTTTACTCACCTGTA-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12D, c.35G>AWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTGATGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12C, c.34G>TWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTTGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12V, c.35G>TWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTGTTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12S, c.34G>AWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTAGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12A, c.35G>CWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTGCTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G12R, c.34G>CWT PROBEVIC-TTGGAGCTGGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-TTGGAGCTCGTGGCGTA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Kirsten rat sarcoma viral oncogene homolog (KRAS)p.G13D, c.38G>AWT PROBEVIC-CTGGTGGCGTAGGC-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-CTGGTGACGTAGGC-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CTGAAAATGACTGAATATAAACTTGTGG-3′
REVERSE PRIMER5′-TAGCTGTATCGTCAAGGCACTC-3′



Human Epidermal Growth Factor Receptor 2 (HER2)p.L755S, c.2264T>CWT PROBE6FAM-TCCCTCAACACTTTG-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBEVIC-TCCCTCGACACTTTG-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-CCTGATGGGGAGAATGTGAA-3′
REVERSE PRIMER5′-GTGGAGGGGCTTACGTCTAA-3′



Guanine Nucleotide Binding Protein (G Protein), Alpha Stimulating Activity Polypeptide (GNAS)p.R201C, c.601C>TWT PROBE/5TET/tcgc+Tgc+Cgt+Gtcc/3IABkFQ/LNA-ZENIntegrated DNA Technologies
MUTANT PROBE/56-FAM/tcgc+Tgc+Tgt+Gtcc/3IABkFQ/LNA-ZENIntegrated DNA Technologies
FORWARD PRIMER5′-ACCTCAGATATATTTCTTCATG-3′
REVERSE PRIMER5′-CCAGACAACTGTTCAAAC-3′



Guanine Nucleotide Binding Protein (G Protein), Alpha Stimulating Activity Polypeptide (GNAS)p.R201H, c.602G>AWT PROBE/5TET/tcgc+Tgc+Cgt+Gtcc/3IABkFQ/LNA-ZENIntegrated DNA Technologies
MUTANT PROBE/56-FAM/cgct+Gcc+Atg+Tcct/3IABkFQ/LNA-ZENIntegrated DNA Technologies
FORWARD PRIMER5′-ACCTCAGATATATTTCTTCATG-3′
REVERSE PRIMER5′-CCAGACAACTGTTCAAAC-3′



Tumor Suppressor Protein P53 (TP53)p.R273H, c.818G>AWT PROBEVIC-ACAAACACGCACCTCA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
MUTANT PROBE6FAM-ACAAACATGCACCTCA-MGBNFQTAQMAN MGBLIFE TECHNOLOGIES
FORWARD PRIMER5′-TGGTAATCTACTGGGACGGAACAGC-3′
REVERSE PRIMER5′-GGAGATTCTCTTCCTCTGTG-3′

Data analysis with the RainDrop Analyst data analysis software

After evaluating all samples using the RainDrop® digital PCR System, data from cluster plots were spectrally-compensated and analyzed using the RainDrop Analyst data analysis software, following standard procedures. The sample containing the highest mutant titration was used as a control sample to define gates around the cluster of droplet events displaying signal from only wild-type copies (identified by VIC/TET fluorescence) and the mutant cluster (identified by FAM signal). These gates were applied across all samples evaluated within each assay. The same mutant gate was set within all wild-type only samples that were evaluated. For all wild-type only samples, any droplets with mutant signal (droplet events that are counted within the mutant gate) are considered mutant-positive, and are therefore false-positive events. Examples of gated dPCR cluster plots for the EGFR T790M and L858R assays are presented in Fig. 1, Fig. 2, respectively. For the EGFR T790M data in this report, the red target gates (Fig. 1) were relatively conservative (e.g., larger), whereas the gates for the EGFR L858R data relatively stringent (e.g., narrowly defined regions; shown in red, Fig. 2). Fig. 3 demonstrates the difference between smaller elliptical gates (stringent) and larger rectangular (more conservative) gates. Setting larger gates provides a more inclusive count of droplet events, but also leads to a higher probability for counting false-positive events due to gates overlapping with neighboring clusters or ambiguous single events being captured within the gated region. Though the gates for EGFR L858R are more stringent gates, they still capture approximately 98% or more of events associated with each cluster. The difference in droplet count between the stringent and conservative gates is typically quite small. Establishing a balance between stringent and conservative gate setting is defined by the user, based on tolerance for false-positives (leading to lower sensitivity) versus risk of missing a small fraction of true-positive events.
Fig. 1

EGFR T790M. The two dPCR plots are from a sample containing wild-type and mutant DNA (left) and a wild-type only sample (right) with the maximum recommended DNA loading. A small number of false-positive samples are routinely observed with this assay. Rectangular gates were applied for EGFR T790M, as this assay can exhibit some sample-to-sample variability.

Fig. 2

EGFR L858R. The two dPCR plots are from a sample containing wild-type and mutant DNA (left) and a wild-type only sample (right) with the maximum recommended DNA loading. 67 of 71 wild-type only samples analyzed with this assay yielded no false-positives, and the other 4 samples each yielded 1 false-positive droplet. Stringent gates were applied for the EGFR L858R assay, as clusters are typically tight and exhibit minimal sample-to-sample variability.

Fig. 3

Comparison of stringent (small elliptical) gates to conservative (rectangular) gates for one of the EGFR L858R samples. The number of droplet events counted in the elliptical gate is at least 98% of the population in rectangular gates. A smaller gate will miss a small percentage of true-positives but will minimize false-positives.

Statistical calculations

Assessing sensitivity of dPCR assays

The sensitivity of a mutation detection assay is defined by a threshold for making positive calls about mutational status on individual samples with a known level of confidence. For quantitative real-time PCR (qPCR), the baseline for mutation detection is generated by evaluating negative controls (i.e., samples that do not contain target DNA). The quantification cycle (C) is the cut-off for qPCR that defines a concentration below which a mutation-containing sample cannot be called with confidence. Similarly, for dPCR, the baseline level of false-positive molecule counts determines what concentration of true mutant-positives can be detected with statistical confidence. The following sections describe the analysis and process for evaluating the limit of blank (LoB) and limit of detection (LoD) for mutation detection with digital PCR. This approach is not limited to mutation detection, and should be applicable for other minor allele detection approaches (i.e., pathogen and viral detection).

Conversion of droplet event counts to molecule counts

When a bulk solution of homogeneously-distributed target molecules, with an average concentration of less than one molecule per partition volume, is partitioned into droplets, some droplets will contain more than one target molecule. However, a droplet partition that initially contains more than one identical target amplified by PCR is not distinct from a droplet initially containing only one target molecule. Therefore, the actual number of target molecules must be calculated from the measured number of target-containing partitions (i.e., droplets). The partitioning of molecular targets into droplets is described by a Poisson distribution [7], [8], [9], where P(k;λ) is the probability for there to be k targets in one droplet when the population of all droplets contains an average “loading” of λ targets per droplet. For a system in which 50 molecules are loaded into 100 droplets, λ is ½. Alternatively, if 500 molecules are loaded into 100 droplets, then λ is 5. The fraction of PCR-negative drops (p) is:and the fraction of PCR-positive drops (p), which is inclusive of all droplets with one or more molecules of target DNA is: A duplex assay enables detection of two independent molecular targets, e.g., wild-type DNA (WT) and mutant DNA (Mut). When a sample containing both mutant and wild-type DNA is analyzed, four clusters of dPCR data are evident, as depicted in the two-dimensional histogram of the fluorescence intensities in Fig. 4.
Fig. 4

Two-dimensional dPCR cluster plot landscape illustrating a heterogeneous sample analyzed with two probes (i.e., a duplex assay).

The partitioning of fragments of homoduplex DNA into droplets is independent of mutational status,2 and the joint probability of two independent events is equal to the product of the individual probabilities. Therefore, the fraction of wild-type-only droplets is:where λ is the average “loading” of mutant targets per droplet, and λ is the average “loading” of wild-type targets per droplet. The number of wild-type only droplets is: Similarly, the numbers of droplet events corresponding to other combinations of mutant and wild-type DNA are: This series of four equations (Eqs. (5), (6), (7), (8)) contains only two unknown quantities (λ and λ) and these values can be calculated by measurement of at least three quantities from the set of (N, N, N, N, and N). In most cases, it is recommended that N, N, and N are used to calculate λ and λ following Eqs. (9), (10). Alternative methods for selecting and processing information from gated clusters of droplet events are discussed in the Appendix. Finally, the ratio of mutant to wild-type molecules in the sample is given by: Appendix Table A3, Table A4, Table A5, Table A6 contain the droplet event counts and calculated wild-type and mutant DNA molecule counts for the EGFR T790M and L858R samples evaluated.
Table A3

Poisson correction for 58 negative samples for EGFR T790M assay. N is the total measured droplet count, N is the count of wild-type only drops; N is the count of mutant only drops; λ is the measured wild-type loading, λ the mutant loading, calculated following Eq. (9), (10); R is the ratio of mutant to wild type in the sample; R is the average R of the 58 negative controls, R = 4.5e−5.

SampleDroplet count
Loading
RTarget count
NNMutNWTλmλw#WT#Mut
19,292,32039857,2464.6E−069.7E−020.0048%899,40643
28,751,56038792,9094.8E−069.5E−020.0050%831,16142
39,661,96045870,5605.1E−069.4E−020.0054%912,31249
48,520,88024762,9833.1E−069.4E−020.0033%799,33226
58,272,48032733,2294.2E−069.3E−020.0046%767,78535
68,697,58026762,8663.3E−069.2E−020.0036%798,41928
78,765,59037763,3514.6E−069.1E−020.0051%798,65841
87,788,45028683,6663.9E−069.2E−020.0043%715,55531
99,399,53043862,7345.0E−069.6E−020.0052%904,93447
109,857,77032886,8333.6E−069.4E−020.0038%929,29435
118,913,88028809,8593.5E−069.5E−020.0036%849,04331
127,878,07035688,0374.9E−069.1E−020.0053%719,95838
138,629,56022764,7662.8E−069.3E−020.0030%800,80124
148,733,07033778,8054.1E−069.3E−020.0044%815,74836
158,223,94036750,8314.8E−069.6E−020.0050%787,35040
167,189,28028683,0114.3E−061.0E−010.0043%717,67231
178,628,23032778,4544.1E−069.5E−020.0043%815,84135
188,682,56029806,3953.7E−069.7E−020.0038%846,33832
199,258,96043808,4175.1E−069.1E−020.0056%845,91347
209,074,19044793,7535.3E−069.2E−020.0058%830,64148
218,833,63023778,3972.9E−069.2E−020.0031%814,85325
228,650,79032775,3014.1E−069.4E−020.0043%812,27335
239,359,87034867,2544.0E−069.7E−020.0041%910,10437
248,749,98049807,0346.2E−069.7E−020.0064%846,71654
259,924,81036904,0484.0E−069.6E−020.0042%947,91140
269,273,75044852,7825.2E−069.6E−020.0054%894,57948
279,238,32033828,8523.9E−069.4E−020.0042%868,42336
2810,142,20040891,1114.3E−069.2E−020.0047%932,71844
299,452,31049831,4025.7E−069.2E−020.0062%870,26754
3010,307,20046901,2864.9E−069.2E−020.0053%943,15550
318,824,91045836,5575.6E−061.0E−010.0057%878,91250
3210,015,80038935,1564.2E−069.8E−020.0043%981,74042
339,594,88038881,7454.4E−069.6E−020.0045%924,93142
349,626,79042906,7684.8E−069.9E−020.0049%952,36546
358,497,39040767,3345.2E−069.5E−020.0055%804,22244
369,338,54046834,2945.4E−069.4E−020.0058%873,94651
378,948,33027802,3493.3E−069.4E−020.0035%840,62930
389,194,42029824,6323.5E−069.4E−020.0037%863,98632
399,665,90033909,8503.8E−069.9E−020.0038%955,56836
408,631,75041813,7785.2E−069.9E−020.0053%854,73945
418,451,16031784,8774.0E−069.7E−020.0041%823,75434
4210,825,000411,020,9604.2E−069.9E−020.0042%1,072,37045
4310,332,80045950,1644.8E−069.6E−020.0050%996,73350
449,414,42029857,4673.4E−069.5E−020.0035%899,06532
459,909,55032900,6583.6E−069.5E−020.0037%944,25335
469,023,92028832,1703.4E−069.7E−020.0035%873,07931
479,110,15036823,0094.3E−069.5E−020.0046%862,59140
488,620,50025784,6793.2E−069.5E−020.0033%822,72128
498,238,16031749,6704.1E−069.5E−020.0043%786,00534
508,225,37029734,5773.9E−069.4E−020.0041%769,47532
518,585,66029772,5463.7E−069.4E−020.0039%809,54332
529,058,93043859,6345.2E−061.0E−010.0053%903,20548
5310,044,10030957,6333.3E−061.0E−010.0033%1,006,41533
549,342,81040868,6844.7E−069.8E−020.0048%911,76544
558,217,48043757,6985.8E−069.7E−020.0060%794,94247
569,048,17044823,1295.3E−069.5E−020.0056%863,01248
578,718,73029798,6983.7E−069.6E−020.0038%837,68432
589,225,92035829,9714.2E−069.4E−020.0044%869,70938
Table A4

Poisson correction for 71 negative samples for EGFR L858R assay. R = 7.1e−8.

SampleDroplet count
Loading
RTarget count
NNMutNWTλmλw#WT#Mut
17,221,3900680,1700.0E+009.9E−020.0E+00714,3670
28,099,9200774,8470.0E+001.0E−010.0E+00814,4560
38,498,8800815,9680.0E+001.0E−010.0E+00857,8410
48,436,8000830,4300.0E+001.0E−010.0E+00874,1960
57,943,1800729,8510.0E+009.6E−020.0E+00765,5890
67,454,7400685,9250.0E+009.7E−020.0E+00719,5620
79,147,6601809,3881.2E−079.3E−021.3E−06847,4591.1
89,288,8600834,0790.0E+009.4E−020.0E+00873,9310
98,437,4300828,2800.0E+001.0E−010.0E+00871,8080
106,545,5800636,3780.0E+001.0E−010.0E+00669,4770
118,117,2500810,6590.0E+001.1E−010.0E+00854,0530
128,710,9900854,7990.0E+001.0E−010.0E+00899,7020
137,509,4600719,8800.0E+001.0E−010.0E+00756,7620
148,328,4200798,5320.0E+001.0E−010.0E+00839,4510
158,602,6100781,6590.0E+009.5E−020.0E+00819,4800
169,974,8200894,4980.0E+009.4E−020.0E+00937,1770
178,236,1800787,7410.0E+001.0E−010.0E+00828,0010
187,667,5600748,5260.0E+001.0E−010.0E+00787,6290
198,703,5500854,2630.0E+001.0E−010.0E+00899,1490
208,519,8000836,9640.0E+001.0E−010.0E+00880,9820
217,512,5800695,5740.0E+009.7E−020.0E+00729,9120
228,167,9400772,2410.0E+009.9E−020.0E+00811,2240
238,137,7001739,7861.4E−079.5E−021.4E−06775,6001.1
246,794,8000766,0880.0E+001.2E−010.0E+00812,8230
258,053,8200856,0690.0E+001.1E−010.0E+00905,0710
268,350,2200943,0070.0E+001.2E−010.0E+001,000,6370
278,712,6600969,4560.0E+001.2E−010.0E+001,027,7590
286,968,8800739,9340.0E+001.1E−010.0E+00782,2390
298,043,3700834,4800.0E+001.1E−010.0E+00881,0160
308,311,2300878,6180.0E+001.1E−010.0E+00928,6160
319,628,4600999,3250.0E+001.1E−010.0E+001,055,0770
328,712,9600930,8130.0E+001.1E−010.0E+00984,3840
339,316,3000930,8920.0E+001.1E−010.0E+00980,7500
348,620,2900928,5100.0E+001.1E−010.0E+00982,4240
358,928,0800963,6370.0E+001.1E−010.0E+001,019,7150
368,603,1200890,0800.0E+001.1E−010.0E+00939,5680
378,377,5400875,3730.0E+001.1E−010.0E+00924,5650
387,993,7100834,6340.0E+001.1E−010.0E+00881,4990
399,129,1500975,7210.0E+001.1E−010.0E+001,031,9040
408,453,9600897,8370.0E+001.1E−010.0E+00949,1830
419,034,0900908,8220.0E+001.1E−010.0E+00957,8530
428,289,4200896,5660.0E+001.1E−010.0E+00948,8580
438,441,6500921,6430.0E+001.2E−010.0E+00975,9450
447,925,7800814,8540.0E+001.1E−010.0E+00859,8540
458,879,9100919,8980.0E+001.1E−010.0E+00971,1150
467,887,0100819,6080.0E+001.1E−010.0E+00865,3960
479,008,6300942,6610.0E+001.1E−010.0E+00995,7160
487,895,1600852,8310.0E+001.1E−010.0E+00902,5030
498,621,5400875,8330.0E+001.1E−010.0E+00923,5820
508,792,0800937,4300.0E+001.1E−010.0E+00991,2680
518,656,1301941,0221.3E−071.2E−011.1E−06996,2101.1
528,269,5400852,2380.0E+001.1E−010.0E+00899,4240
538,578,7700876,4700.0E+001.1E−010.0E+00924,5470
548,464,5800857,6870.0E+001.1E−010.0E+00904,3180
558,835,7400911,0900.0E+001.1E−010.0E+00961,5640
568,568,8600936,1260.0E+001.2E−010.0E+00991,3190
578,713,7400890,7860.0E+001.1E−010.0E+00939,6800
588,449,8100921,7280.0E+001.2E−010.0E+00975,9840
598,726,2900941,1350.0E+001.1E−010.0E+00995,8580
607,648,5400812,5000.0E+001.1E−010.0E+00858,9780
618,378,4601881,8571.3E−071.1E−011.2E−06931,8031.1
627,780,1200823,3540.0E+001.1E−010.0E+00870,2610
638,732,8200911,8190.0E+001.1E−010.0E+00963,0190
648,778,1200977,7320.0E+001.2E−010.0E+001,036,5980
658,157,0800863,7530.0E+001.1E−010.0E+00912,9930
668,359,8800928,3190.0E+001.2E−010.0E+00984,0260
679,002,2000999,6170.0E+001.2E−010.0E+001,059,6010
688,851,7800941,8240.0E+001.1E−010.0E+00995,7930
699,687,2400977,0810.0E+001.1E−010.0E+001,029,9430
708,771,7500954,0420.0E+001.2E−010.0E+001,010,0220
718,368,7700924,2060.0E+001.2E−010.0E+00979,3370
Table A5

Poisson correction for 16 mutant titration samples for EGFR T790M assay. [WT] is the measured concentration of wild type DNA and [Mut] the concentration of mutant DNA in the sample. #Mut target is the measured count of mutant DNA molecules. Λ = R*N*λ; p-value is calculated following Eq. (14). R = 4.5e−5.

SampleDroplet count
Loading
Mut/WT[Target] (copies/μl)
p-Value calculation
N#WT#MutλwλmR[WT][Mut]#Mut targetλFPp-Value
19,142,560814,578419.3E−024.9E−060.0053%1.9E+049.8E−0145391.7E−01
29,838,690903,463449.6E−024.9E−060.0051%1.9E+049.8E−0148432.4E−01
39,573,960870,985449.5E−025.1E−060.0053%1.9E+041.0E+0048411.7E−01
48,994,840789,122449.2E−025.4E−060.0058%1.8E+041.1E+0048375.4E−02
59,447,200843,816749.4E−028.6E−060.0092%1.9E+041.7E+0081408.7E−09
69,121,840810,395809.3E−029.6E−060.010%1.9E+041.9E+0088381.3E−11
79,583,900839,461829.2E−029.4E−060.010%1.8E+041.9E+0090401.4E−11
89,861,590897,2661089.5E−021.2E−050.013%1.9E+042.4E+00119430.0E+00
98,282,410745,1924709.4E−026.2E−050.066%1.9E+041.2E+01516350.0E+00
109,669,580845,0555249.1E−025.9E−050.065%1.8E+041.2E+01574400.0E+00
119,470,450863,7185499.6E−026.4E−050.067%1.9E+041.3E+01604410.0E+00
1210,087,300917,2735669.5E−026.2E−050.065%1.9E+041.2E+01623440.0E+00
139,257,860826,62352119.4E−026.2E−040.66%1.9E+041.2E+025724390.0E+00
149,830,510871,28253629.3E−026.0E−040.64%1.9E+041.2E+025886410.0E+00
159,690,850867,21453629.4E−026.1E−040.65%1.9E+041.2E+025891410.0E+00
1610,503,100920,32857109.2E−026.0E−040.65%1.8E+041.2E+026261440.0E+00
Table A6

Poisson correction for 16 mutation titration samples for EGFR L858R. R = 7.1e−8.

SampleDroplet Count
Loading
Mut/WT[target] (copies/μl)
p-Value calculation
N#WT#MutλwλmR[WT][Mut]#Mut targetλFPp-Value
18,368,910884,43821.1E−012.7E−072.4E−062.2E+045.3E−022.20.072.1E−03
28,402,990836,01351.0E−016.6E−076.3E−062.1E+041.3E−015.60.067.3E−09
38,680,540918,60731.1E−013.9E−073.5E−062.2E+047.7E−023.40.075.1E−05
48,916,470906,36021.1E−012.5E−072.3E−062.1E+045.0E−022.20.072.2E−03
58,349,320910,776461.2E−016.2E−065.4E−052.3E+041.2E+00520.070.0E+00
68,428,520890,651491.1E−016.5E−065.8E−052.2E+041.3E+00550.070.0E+00
78,641,460928,568451.1E−015.8E−065.1E−052.3E+041.2E+00500.070.0E+00
88,393,990923,295411.2E−015.5E−064.7E−052.3E+041.1E+00460.070.0E+00
98,213,490851,8974601.1E−016.2E−055.7E−042.2E+041.2E+015130.060.0E+00
108,832,850916,3474951.1E−016.3E−055.7E−042.2E+041.3E+015520.070.0E+00
118,079,670848,0264831.1E−016.7E−056.0E−042.2E+041.3E+015400.060.0E+00
128,715,960903,6184401.1E−015.6E−055.1E−042.2E+041.1E+014910.070.0E+00
137,638,330797,68939851.1E−015.8E−045.3E−032.2E+041.2E+0244510.060.0E+00
148,604,530886,04043461.1E−015.6E−045.2E−032.2E+041.1E+0248470.070.0E+00
158,386,260851,84842561.1E−015.7E−045.3E−032.1E+041.1E+0247390.060.0E+00
167,999,900844,01542931.1E−016.0E−045.4E−032.2E+041.2E+0248010.060.0E+00

Estimation of 95% confidence intervals

Confidence intervals (CI) define a range of values which is likely to include the true value of the sample. Confidence intervals are constructed at a confidence level (e.g., 95%), which defines the likelihood that the true value is within the range. When a DNA sample is partitioned, the probability of a droplet being either PCR-positive or PCR-negative follows a binomial distribution [8], [10]. When the number of PCR-positive droplets (Np) is fairly large (>20), the binomial distribution is approximated with a normal distribution with a mean of Np and a variance of Np(1 − p) [10], [11] so the 95% confidence interval for Np is approximated by . In the case of rare mutation detection, p is small enough that the confidence interval for Np simplifies to . The normal approximation is not appropriate for very small values of Np (<20). For such a case, a Poisson distribution with Np as the expected value is a better approximation [12]. Table 5 presents Excel function (Microsoft) formulas for Np < 20 and Np > 20, such that one can calculate the confidence limits in an Excel spreadsheet. Alternatively, web-based calculators that use binomial distributions are available. Online tools include uCountSM from The Wittwer Lab at the University of Utah (https://dna.utah.edu/ucount/uc.php) [13] or the Exact Binomial and Poisson Confidence Intervals calculator by John C. Pezzullo (http://statpages.org/confint.html) [14].
Table 5

Confidence intervals for various mutant-positive droplet event counts (Np). Np is the count of mutant-positive droplets, which includes droplets that contain only mutant molecules and droplets that contain both mutant and wild-type molecules.

Confidence limitsNp = 0 and upNp > 20
99% CI_lower limitCHIINV(0.995, 2Np)/2Np − NORMSINV(0.995)*sqrt(Np)
99% CI_upper limitCHIINV(0.005, 2Np + 2)/2Np + NORMSINV(0.995)*sqrt(Np)
95% CI_lower limitCHIINV(0.975, 2Np)/2Np − NORMSINV(0.975)*sqrt(Np)
95% CI_upper limitCHIINV(0.025, 2Np + 2)/2Np + NORMSINV(0.975)*sqrt(Np)
90% CI_lower limitCHIINV(0.95, 2Np)/2Np − NORMSINV(0.95)*sqrt(Np)
90% CI_upper limitCHIINV(0.05, 2Np + 2)/2Np + NORMSINV(0.95)*sqrt(Np)
With the RainDrop® dPCR System, 25–50 μl samples are partitioned into 5-picoliter droplets, which yields 5–10 million individual droplets. The recommended maximum loading of total human genomic DNA is 10% (i.e., one genome equivalent per 10 droplets), and for most digital PCR applications, the amount of mutant DNA within the sample is normally present at a small fraction of the wild-type DNA. At this low fraction, very few mutant-positive droplets are expected to contain more than one copy of the mutated DNA fragment, so the number of mutant-positive droplets is no different from the number of mutant molecules (λ = p). For example, in the mutant titration series, wild-type DNA was loaded into ∼10% of the drops, and the highest mutant loading was present in only ∼0.06% of the drops, so λ = 0.0006, and p = 1 − e = 0.9997λ, so the difference between the two is within 0.03%. Therefore, the quantity, Np, is used interchangeably to refer to mutant positive droplets and mutant DNA molecules in this paper. For mutation detection applications, the number of wild-type targets is usually much greater than the number of mutant targets, and the uncertainty in the ratio of the two will be dominated by the mutant counting. The 95% CI of the ratio (R) of mutant to wild type are given as below:

Assessing the frequency of false-positives

To determine application and assay sensitivity using any molecular detection technology, one must account for signal-to-noise limitations. Similarly, for mutation detection assays using digital PCR, false-positives might arise from fluorescence detection (“system”) noise and molecular biology (“assay”) noise. Comparing samples to negative controls is typically necessary to classify mutational status as positive or negative. The Clinical and Laboratory Standards Institute (CLSI) guidelines recommend measurement of at least 60 negative samples to assess the probability distribution function of false-positive events detected in negative control samples.[15] In this work, collections of wild-type only samples were analyzed with the EGFR T790M and L858R assays. The data from EGFR T790M in Fig. 5 reveals an average false-positive count (Λ) of 39 mutant molecules per sample, which was measured from 58 negative control samples that each contained, on average, 862,147 wild-type molecules. For purposes of assessing the false-positive distribution displayed in Fig. 5, the mutant count from each individual wild-type sample was normalized to the average number of wild-type counts using the average false positive ratio (R, the average false-positive ratio of all negative control samples) of 4.5E−5 to compensate for differences in the absolute number of molecules analyzed in each sample.
Fig. 5

False-positives measured in EGFR T790M (left, Λ = 39) and L858R (right, Λ = 0.06) negative controls.

In contrast to the EGFR T790M assay, the data from L858R yields only 4 samples out of 71 with one false-positive event detected (Λ = 0.06) and a false positive ratio of 7.1E−8. Fig. 5 reveals that the number of false-positive mutant counts follows a Poisson distribution in both cases, confirming that the Poisson model is a valid approximation of the measured data. (Note that applying the Poisson model fit to the false-positive data is not related to the Poisson loading correction described in previous sections.) A dPCR practitioner often evaluates and reports the confidence level when assigning mutational status to a sample containing a number of positive events that is very close to the false-positive baseline. For this purpose, a hypothesis (significance) test is executed and a p-value is calculated. The p-value is the probability that the observed mutant count or higher falls within the distribution of false-positive counts measured from negative control samples. In this case, the null hypothesis is: “there is no difference between the sample and negative controls.” The null hypothesis is rejected when the p-value is less than a specified significance level. The choice of significance level at which to reject the null hypothesis depends on the application and user preference. For clinical molecular analysis, common choices for p-value thresholds are 0.05 (95%) and 0.01 (99%); the analysis described in this report is based on a 95% confidence threshold. Two EGFR T790M mutant-positive samples are highlighted below to demonstrate the calculation of p-values. P(k;Λ) represents the Poisson distribution that estimates the frequency of false-positives. Because these positive samples have very similar wild-type load to the negative control samples, it is assumed that the false positive ratio is consistent with negative samples. From the ratio, Λ is calculated for each sample, followed by calculating the p-value. Sample 1 (#1 in Appendix Table A5): measured mutants = 45, Λ = 39, Sample 2 (#5 in Appendix Table A5): measured mutant events = 81, Λ = 40, In the first sample, a measurement of 45 mutant molecules results in a p-value of 0.17, which is greater than 0.05 (i.e., 5% significance level for p-value), and therefore, the sample is not significantly different than the wild-type only controls. In contrast, the second sample has a p-value of 8.7E−9, which indicates that this sample is significantly resolved from the negative controls. Appendix Table A5, Table A6 list the p-values calculated for the rest of the mutant-positive titration samples for the EGFR T790M and L858R assays. An Excel equation could be used to calculate the p-value as shown below:

Estimating assay sensitivity and the limit of detection (LoD)

The limit of detection (LoD) and limit of blank (LoB) are metrics that describe the sensitivity of an analytical procedure [15], [16], [17], [18]. Fig. 6 depicts how the LoB and LoD are defined at stated error rates. The α error rate indicates the fraction of all measurements of true negative samples that will be greater than the LoB (i.e., false-positive mutation call). The β error rate indicates the probability of a false-negative mutation call (i.e., measurement less than LoB) from a sample with a true value equal to the LoD. For the analyses in this paper, α error (false positive) and β error (false negative) rates are 5%, which is a common approach. In practice, the selection of α and β values is largely dictated by the application. For example, there may be applications in which both false positive and false negative mutation calls are unacceptable, and one would decide to lower the α and β rates (resulting in a more conservative LoD). Some applications may dictate a relatively high mutational abundance threshold (i.e., well-separated from analytical performance) to avoid unnecessary and expensive validation tests. Conversely, the occurrence of false negatives is not acceptable in some applications. In such cases, the β-error value would be conservative to reflect the needs of the application.
Fig. 6

Relationship between the limit of blank (LoB) and the limit of detection (LoD). (a) The blue curve is the distribution of measurements from negative samples; α-error refers to false-positive mutation calls. (b) The red curve is the expected distribution of measurements from positive samples at the LoD concentration; β error refers to false-negative mutation calls. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

In this report, false-positive droplet event probability (i.e., data shown in Fig. 5 and represented as the blue curve in Fig. 6 panel A) was determined from a series of samples that each contained approximately the same level of wild-type DNA. The false-positive rate is likely dependent (but not necessarily linearly proportional) with respect to wild-type load. Unless all samples are expected to contain the same approximate level of DNA, evaluating false positives with varying wild-type loads is recommended for assays of interest. Minimally, one should consider the desired LoD, and how much DNA should be evaluated to achieve that sensitivity. For example, detection of 1 mutant in 1000 wild-type molecules will require a substantially smaller amount of total DNA than what would be required to attempt detection of 1 mutant in 1 million wild-type molecules. The total amount of sample DNA does not guarantee a level of sensitivity, but sufficient input DNA is necessary to support the desired LoD. In addition to characterizing the false-positive distribution by processing wild-type only samples, CLSI guidelines [15] recommend characterization of the distribution of measured droplet events for positive samples with concentrations at the limit of detection. However, in this report, it has been demonstrated that the distribution of false-positive counts follows a distribution that arises from counting uncertainty (i.e., a Poisson distribution) and it has been assumed that the dominant variability in true positive counting also arises from counting uncertainty. Therefore, “true positive” distribution (i.e., the red curve in Fig. 6 panel B) is considered a Poisson function as well, for the purpose of determining the LoD. Alternatively, one could estimate the LoB and LoD following a non-parametric approach with a sufficient number of negative and low-level positive control samples to adequately represent the probability distributions with empirical data [17]. Table 6 reports the LoB and LoD at various average false-positive levels, Λ, assuming a Poisson probability model fit to empirical data. In this analysis, α-error and β-error are set at 5%. For Λ greater than 0.05, a normal approximation with a correction is used to describe the Poisson distribution to simplify the calculation of LoB and LoD with an analytical expression.
Table 6

Calculation of LoB and LoD from the average false-positive counts.

ΛFPLoBLoD
003
0–0.0515
>0.05ΛFP+1.645ΛFP+0.8(1.645+1.6452+4LoB)2/4

Note: non-integers should be rounded up to the nearest integer.

Results and discussion

EGFR T790M

EGFR T790M limit of detection

Appendix Table A3 contains the raw data from 58 wild-type only samples that were processed to calculate the LoB and LoD for the EGFR T790M assay; Table 7 contains the summarized data from negative control samples. Based on the wild-type only controls, and the equations described in Table 6, the LoB for EGFR T790M mutations in a single sample with an average measured count of 862,147 copies of genomic DNA is 1 mutant in 17,000 wild-type molecules, and the LoD is 1 in 13,000. This represents the highest sensitivity that can be achieved with a single sample processed with one assay reaction. One might split a single sample across multiple dPCR reactions if an extremely sensitive LoD is desired and the DNA is plentiful. For cases with enough sample DNA to split across 8 samples at maximum loading (i.e., 26 μg of human genomic DNA processed in an 8-channel microfluidic chip), then the analysis described above results in a demonstrated LoD of 1 in 18,000. Furthermore, if data from all 58 wild-type samples are pooled together (representing a single replicate of 200 μg of DNA negative control DNA), then the LoD is 1 in 20,000. Finally, we extrapolate to determine that the LoD would continue to improve as a function of the wild-type load, and reach a plateau at the measured false-positive rate of 0.0045% (or 1 in 22,000). Fig. 7 displays the LoD assessed from collections of empirical data (blue points) as well as the extrapolated sensitivity that approaches a limit equal to the measured false-positive rate. We do not envision practical applications for human genomic DNA analysis that would yield sufficient DNA and justify splitting a sample over many chips or channels. However, the demonstration and extrapolation highlight the transformational sensitivity that the assay and technology can deliver as well as emphasizing that the practical sensitivity is likely to be limited by the amount of available sample DNA. Furthermore, applications probing non-human genomes (e.g., viral load or pathogen detection and microbial genomics) are more likely to take advantage of ultra-high sensitivity workflows due to the possibility for large quantities of input DNA with relatively small genomes.
Table 7

Droplet event and molecule counts from 58 EGFR T790M wild-type samples.

AverageStandard deviationTotal
Total droplets processed (N)9,049,729683,030524,884,260
Mutant only droplets35.57.12059
Mutant molecules (false-positives)397.82265
Wild-type only droplets822,32568,46547,694,859
Wild-type molecules862,14772,26150,004,523
Fig. 7

LoD performance model for EGFR T790M. Blue points represent the empirical data collected from 58 negative samples pooled to different levels. The red curve is an extrapolation of the data to represent evaluation of more DNA. The green dashed line is the LoD if the amount of DNA available for analysis is unlimited. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

The data from the wild-type samples that is reported in Fig. 5 presents a clearly-defined probability function related to the false-positives. No-template control samples do not yield false-positives, and multiple lots of wild-type control DNA (i.e., negative controls) yield the same level of false-positives. This evidence suggests that the false-positive droplet events are not due to contamination of the reagents or wild-type samples, and the false-positive background is expected to be present in all samples with this assay. For the purposes of demonstrating this analysis approach, the average background level of false-positive mutant molecules can be considered as a systematic offset, and the offset can be subtracted from the number of mutant molecules detected in any unknown samples. The result is a more accurate measurement of the true level of mutant molecules present in the sample, albeit with no improvement to the precision of the measurement. The limit of detection, which is defined as the true mutant concentration within a sample that will be detected within stated α and β error limits, can be recalculated after removal of the systematic false-positive offset; see Appendix Table A7 for the associated calculations. The LoD for the EGFR T790M assay, after correction for the false-positive background, is 1 in 34,000 when one 50ul sample at ∼10% wild-type loading is processed.
Table A7

Calculation of LoB and LoD from EGFR T790M assay data. R is the average false positive ratios of negative control samples, #WT_avg is the average of the wild-type load per sample, Λ is the average of the false mutant counts. LoB and LoD are limit of blank and lower limit of detection of mutant counts at #W_avg load, R_LoB and R_LoD are the limit of blank and lower limit of detection expressed in R.

RFP#WT_avgΛFPLoBLoDR_LoBR_LoD
4.5E−058.6E+053950.164.00.0058%0.0074%
The key to this analysis approach is establishing confidence that the false-positive background is universal for all samples. For the EGFR T790M assay, this approach is justified by processing commercially-available blended human cell-line DNA and no-template controls. Additional testing, outside the scope of this document, has evaluated several individual DNA samples (Coriell Specimen Collections), as well as synthetic plasmid templates, wherein comparable numbers of false-positive events have been observed. The approach would be solidified by sound explanation of the reason (other than contamination) for the level of false-positives observed with this assay. A subsequent section of this report presents some possible reasons for false-positive events and recommends further exploration of the root cause of systematic false-positive events. Further development of this approach, understanding reasons for false-positives, and establishing metrics for justification to subtract a background of false-positives will emerge from the community of dPCR practitioners.

EGFR T790M – verification of linearity and sensitivity

A mutation titration series was evaluated in quadruplicate, and the data are displayed in Fig. 8. The expected mutant to wild-type ratio (solid red line) and 95% confidence interval (dotted red lines) are plotted along with the measured data. The data is congruent with the expected concentration at the high values; it deviates from the expected value at low mutant concentrations due to the contribution of false-positive counts. Notice that the last titration sample (R = 0.00065%) is a mutant concentration with an expected value that is below the LoB. Indeed, none of the four replicates at this concentration are significantly different from the negative controls.
Fig. 8

EGFR T790M mutant titration series with four replicates of each concentration.

EGFR L858R assay characterization

EGFR L858R limit of detection

Following the same analysis described in the previous sections, data from 71 wild-type samples and a mutant titration series was processed to assess the sensitivity of the EGFR L858R assay. Table 8 summarizes the data collected from wild-type samples (data for each individual sample are reported in Appendix Table A4).
Table 8

Droplet event and molecule counts from 71 EGFR L858R wild-type samples.

AverageStandard deviationTotal
Total droplets processed (N)8,405,878622,029596,817,340
Mutant only droplets0.060.234
Mutant molecules (false positives)0.060.264.4
Wild-type only droplets861,26082,95661,149,444
Wild-type molecules908,85189,51364,528,410
Even though there are very few false-positives detected by this assay, the LoD is modulated by the conservative β-error of 5%. For example, as displayed in Table 6, if an assay yields zero false-positives (Λ = 0), at least three true mutants must be present to limit the probability of making a false negative call to be below 5%. Similarly, a single false-positive (Λ = 1) among 1 million wild-type alleles dictates that a minimum of 9 true mutants must be present to maintain α and β errors of 5% or better, and the resulting LoD of this hypothetical example is 1 in 111,000. Fig. 9 presents the assay LoD relative to the amount of DNA that is evaluated. The blue data points display the demonstrated LoD that is achieved for a single sample (1 in 180,000), a sample with enough DNA to be processed with eight assay reactions (1 in 1 million), or by dividing a very large amount of DNA (230 μg) into 71 individual samples for analysis, then recombination of data (1 in 4 million) (see Appendix Table A8 for associated calculations). Similar to the T790M assay, more DNA yields higher detection sensitivity. In contrast to the T790M assay, the false-positive rate is extremely low, so the LoD is almost exclusively limited by the amount of DNA processed and the LoD does not plateau until 1 in 14 million if one were to process hundreds of billions of wild-type molecules.
Fig. 9

LoD performance model for EGFR L858R. Blue points represent the empirical data collected from 71 negative samples pooled to different levels. The red curve is an extrapolation of the data to represent evaluation of more DNA. The green dashed line is the LoD if the amount of DNA available for analysis is unlimited. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Table A8

Calculation of LoB and LoD for EGFR L858R.

RFP#WT_avgΛFPLoBLoDR_LoBR_LoD
7.1E−089.1E+050.061.35.00.00014%0.00054%

EGFR L858R – verification of linearity and sensitivity

A mutant titration series was evaluated in quadruplicate and the data are displayed in Fig. 10. Unlike the EGFR T790M assay, the data is linear throughout the whole titration range. Notice that the last titration sample (R = 0.00053%) has a mutant concentration above the LoB but below the LoD. For this experiment, all four replicates are significantly different than the negative controls, but if this sample were to be processed numerous times, it is anticipated from the negative control data that more than 5% of the measurements would fall below the LoB, leading to false-negative calls. This ambiguity, which arises from measuring a single or small number of replicates, demonstrates the importance of calculating and understanding the LoB and LoD parameters, even for extremely sensitive assays.
Fig. 10

EGFR L858R mutant titration series with four replicates of each concentration.

Comparing and contrasting EGFR assay performance

The results reveal a significantly greater sensitivity associated with the EGFR L858R assay in contrast to the EGFR T790M assay. Many factors possibly contribute to the performance of an assay. For example, the complexity of the sequence composition within the targeted region, the primer and probe oligonucleotide design and quality of synthesis, reagent quality, polymerase efficiency and fidelity, and sample preparation might influence the measured assay sensitivity. Many commercially-available quantitative PCR (qPCR) assays fail to enable highly sensitive mutation detection of EGFR T790M due to a number of compounding factors that reduce the specificity of primers and probes leading to a relatively high level of signal background (i.e., a high Cq). It is difficult to design sensitive assays for the EGFR T790M (c.2369C>T) point mutation. This region of EGFR (chr7:55259486-55259563) is relatively rich in guanine and cytosine and exhibits sequence homology (∼83%) with the proto-oncogene receptor tyrosine-protein kinase erbB-2 (HER2/neu). In addition to the high sequence similarities with HER2, this region of EGFR T790M contains a repetitive sequence element adjacent to the mutation, which creates a challenge to design robust probes with high specificity. Further adding to this challenge, there is a single nucleotide polymorphism located eight nucleotides upstream from the mutation, which limits primer design options. Some or all of the challenges associated with qPCR assay design might be reflected in the lower sensitivity of the EGFR T790M dPCR assay. Another hypothesis for the source of false-positives is that mutant DNA fragments are generated during the PCR reaction from wild-type DNA fragments as a result of polymerase error. In principle, one of the advantages of single-molecule partitioning with the RainDrop™ dPCR System is that the sensitivity of mutation detection is better than the polymerase error rate [9]. Polymerase errors typically occur as purine/purine (A ↔ G) or pyrimidinepyrimidine (C ↔ T) transitions. Pyrimidinepurine transversions (A/G ↔ C/T) are less common [19]. If a polymerase error occurs at or near the nucleotide of interest, and results in an amplifiable fragment that hybridizes with the mutant probe, then, due to the presence of the initial two complementary strands of wild type target, the resulting droplet will produce fluorescence signal from both wild-type and mutant probes. The level of polymerase error resulting in droplet events counted in the “dual” cluster can be determined by comparing the actual count of events in the dual cluster to the expected count based on the Poisson loading calculations. Evidence of polymerase error with specific sequence targets has been observed, which supports this hypothesis. However, for the EGFR T790M assay, the false-positive background arises from droplet events that are characterized as mutant-only (“Mut” cluster). To satisfy the hypothesis of polymerase error, another process or reaction must eliminate the amplification and/or generation of fluorescence signal from the original two wild-type template strands, and this factor is not yet explained.

Additional cancer-based assays

Following the same methodologies presented herein, each of the additional sixteen assays was evaluated. Data from these assays are presented in Appendix Table A2. As with EGFR T790M, some assays present an apparent level of low-level positives in the presumably wild-type controls. As with EGFR T790M, it is difficult to assign the source of these positives that reside within the mutant control gates. There are a couple assays that exhibit very few false-positives, as observed with EGFR L858R; namely PIK3CA H1047R and KRAS G12C. For these assays, data indicate that the sensitivity of the assays is largely dependent upon the amount of DNA that is evaluated. For both PIK3CA H1047R and KRAS G12C, fewer molecules of wild-type DNA were evaluated than those evaluated in the EGFR L858R assay. Similar to EGFR T790M, however, several assays demonstrated assay LoDs of around 1 mutant in 20,000 wild-type molecules, as they too exhibited false positives in the mutant clusters gates. This screening of an additional sixteen assays emphasizes the importance of evaluating assays with controls prior to initiating a full analysis of critical samples.
Table A2

Sixteen cancer-based assays evaluated, in addition to the EGFR L858R and T790M assays, following the approach described herein. The best empirical LoD was calculated for using the number of wild-type DNA molecules evaluated.

GeneProtein variantNucleotide variantBest empirical LoD# WT DNA molecules evaluatedEstimated LoD with unlimited DNA
EGFRp.E746_A750delc.2235del151:648,0001.7E+071:1.5 million (0.00006%)
BRAFp.V600Ec.1799T>A1:340,0001.6E+071:615,000 (0.00016%)
PIK3CAp.H1047Rc.3140A>G1:1.3 million2.0E+071:5 million (0.00009%)
PIK3CAp.E545Kc.1633G>A1:66,0002.1E+071:80,000 (0.0012%)
ALKp.R1275Qc.3824G>A1:26,0002.1E+071:30,000 (0.003%)
KRASp.G12Cc.34G>T1:940,0002.5E+071:2.2 million (0.00004%)
KRASp.G12Dc.35G>A1:45,0003.5E+071:46,500 (0.002%)
KRASp.G12Sc.34G>A1:59,0002.3E+071:70,000 (0.0014%)
KRASp.G12Vc.35G>T1:425,0002.4E+071:720,000 (0.00014%)
KRASp.G12Ac.35G>C1:290,0002.2E+071:455,000 (0.0002%)
KRASp.G12Rc.34G>C1:290,0002.2E+071:450,000 (0.0002%)
KRASp.G13Dc.38G>A1:19,0002.0E+071:21,000 (0.005%)
HER2p.L755Sc.2264C>T1:185,0002.4E+071:255,000 (0.0004%)
GNASp.R201Cc.601C>T1:19,0002.0E+071:21,000 (0.0048%)
GNASp.R201Hc.602G>A1:17,0002.0E+071:18,000 (0.0055%)
TP53p.R273Hc.818G>A1:19,6002.0E+071:22,000 (0.0045%)
As discussed in the section above, polymerase errors are more likely to present as transitions than transversions. Within these sixteen assays (as well as the EGFR assays), less sensitive LoDs were observed for those assays whose targets are transitions. Out of 17 assays tested herein, all eight assays with G/C >A/T mutations have much lower LoDs than the other mutations assed. Though further evaluation is warranted, it is likely that the false positive rate and assay LoD are influenced by polymerase errors.

Conclusion

The performance of the EGFR assays in the RainDrop® Digital PCR System demonstrates unprecedented sensitivity. The data presented demonstrate that the sensitivity of digital PCR assays is often limited by the amount of DNA that is evaluated, and it is possible to achieve sensitivity for detection of mutant alleles that is orders of magnitude better than most alternative technologies. A LoD for the EGFR T790M assay has been determined at 1 in 13,000 for a single sample at maximum loading, and up to 1 in 22,000 with unlimited DNA (or 1 in 34,000 for a single sample if false-positives are qualified to be a systematic offset and subtracted). For this assay, the primer and probe design challenges or polymerase errors are factors that impact the ultimate sensitivity but single molecule dPCR overcomes many sensitivity limitations that arise from environments containing many target molecules. Finally, for the EGFR L858R assay, we have empirically demonstrated sensitivity of 1 in 180,000 for a single sample. Moreover, by extrapolation, we have determined that the EGFR L858R assay can reach a LoD that far surpasses empirical data (i.e., 1 in 14 million), and, in practice, the sensitivity is limited only by the amount of sample DNA that is processed. This report provides guidance on the analysis of digital PCR data and it reveals some topics worthy of deeper exploration. Firm understanding of the molecular biology and technology contributions to false-positive droplet events is crucial for improved sensitivity. Also, validation of objective automated gating of droplet event clusters is likely necessary for dPCR practitioners to take advantage of the full potential sensitivity of the technology for routine applications. Finally, availability of DNA reference materials will enable validation within and across platforms.
  12 in total

1.  Partly nonparametric approach for determining the limit of detection.

Authors:  Kristian Linnet; Marina Kondratovich
Journal:  Clin Chem       Date:  2004-02-05       Impact factor: 8.327

Review 2.  Principle and applications of digital PCR.

Authors:  Gudrun Pohl; Ie-Ming Shih
Journal:  Expert Rev Mol Diagn       Date:  2004-01       Impact factor: 5.225

Review 3.  Detecting biomarkers with microdroplet technology.

Authors:  Valérie Taly; Deniz Pekin; Abdel El Abed; Pierre Laurent-Puig
Journal:  Trends Mol Med       Date:  2012-06-22       Impact factor: 11.951

4.  Inherited susceptibility to lung cancer may be associated with the T790M drug resistance mutation in EGFR.

Authors:  Daphne W Bell; Ira Gore; Ross A Okimoto; Nadia Godin-Heymann; Raffaella Sordella; Roseann Mulloy; Sreenath V Sharma; Brian W Brannigan; Gayatry Mohapatra; Jeff Settleman; Daniel A Haber
Journal:  Nat Genet       Date:  2005-10-30       Impact factor: 38.330

5.  Limit of blank, limit of detection and limit of quantitation.

Authors:  David A Armbruster; Terry Pry
Journal:  Clin Biochem Rev       Date:  2008-08

6.  Specific enzymatic amplification of DNA in vitro: the polymerase chain reaction.

Authors:  K Mullis; F Faloona; S Scharf; R Saiki; G Horn; H Erlich
Journal:  Cold Spring Harb Symp Quant Biol       Date:  1986

Review 7.  Lung adenocarcinoma: guiding EGFR-targeted therapy and beyond.

Authors:  Marc Ladanyi; William Pao
Journal:  Mod Pathol       Date:  2008-05       Impact factor: 7.842

8.  Multiplex picodroplet digital PCR to detect KRAS mutations in circulating DNA from the plasma of colorectal cancer patients.

Authors:  Valerie Taly; Deniz Pekin; Leonor Benhaim; Steve K Kotsopoulos; Delphine Le Corre; Xinyu Li; Ivan Atochin; Darren R Link; Andrew D Griffiths; Karine Pallier; Hélène Blons; Olivier Bouché; Bruno Landi; J Brian Hutchison; Pierre Laurent-Puig
Journal:  Clin Chem       Date:  2013-08-12       Impact factor: 8.327

9.  BEAMing and Droplet Digital PCR Analysis of Mutant IDH1 mRNA in Glioma Patient Serum and Cerebrospinal Fluid Extracellular Vesicles.

Authors:  Walter W Chen; Leonora Balaj; Linda M Liau; Michael L Samuels; Steve K Kotsopoulos; Casey A Maguire; Lori Loguidice; Horacio Soto; Matthew Garrett; Lin Dan Zhu; Sarada Sivaraman; Clark Chen; Eric T Wong; Bob S Carter; Fred H Hochberg; Xandra O Breakefield; Johan Skog
Journal:  Mol Ther Nucleic Acids       Date:  2013-07-23       Impact factor: 10.183

10.  Mathematical analysis of copy number variation in a DNA sample using digital PCR on a nanofluidic device.

Authors:  Simant Dube; Jian Qin; Ramesh Ramakrishnan
Journal:  PLoS One       Date:  2008-08-06       Impact factor: 3.240

View more
  48 in total

1.  Employing Digital Droplet PCR to Detect BRAF V600E Mutations in Formalin-fixed Paraffin-embedded Reference Standard Cell Lines.

Authors:  Nirmal Rajasekaran; Myung Ryurl Oh; Sung-Su Kim; Si Eun Kim; Young Deug Kim; Hyun-Jeung Choi; Bohyun Byun; Young Kee Shin
Journal:  J Vis Exp       Date:  2015-10-08       Impact factor: 1.355

2.  Clinical Application of Picodroplet Digital PCR Technology for Rapid Detection of EGFR T790M in Next-Generation Sequencing Libraries and DNA from Limited Tumor Samples.

Authors:  Laetitia Borsu; Julie Intrieri; Linta Thampi; Helena Yu; Gregory Riely; Khedoudja Nafa; Raghu Chandramohan; Marc Ladanyi; Maria E Arcila
Journal:  J Mol Diagn       Date:  2016-09-12       Impact factor: 5.568

3.  Validation Strategy for Ultrasensitive Mutation Detection.

Authors:  Marija Debeljak; Michael Noë; Stacy L Riel; Lisa M Haley; Alexis L Norris; Derek A Anderson; Emily M Adams; Masaya Suenaga; Katie F Beierl; Ming-Tseh Lin; Michael G Goggins; Christopher D Gocke; James R Eshleman
Journal:  Mol Diagn Ther       Date:  2018-10       Impact factor: 4.074

4.  Isolation and Quantification of Plasma Circulating Tumor DNA from Melanoma Patients.

Authors:  Gabriela Marsavela; Anna Reid; Elin S Gray; Leslie Calapre
Journal:  Methods Mol Biol       Date:  2021

5.  An ultrasensitive test for profiling circulating tumor DNA using integrated comprehensive droplet digital detection.

Authors:  Chen-Yin Ou; Tam Vu; Jonathan T Grunwald; Michael Toledano; Jan Zimak; Melody Toosky; Byron Shen; Jason A Zell; Enrico Gratton; Timothy J Abram; Weian Zhao
Journal:  Lab Chip       Date:  2019-03-13       Impact factor: 6.799

Review 6.  Current, Emerging, and Future Applications of Digital PCR in Non-Invasive Prenatal Diagnosis.

Authors:  Juliette Nectoux
Journal:  Mol Diagn Ther       Date:  2018-04       Impact factor: 4.074

Review 7.  Current and Emerging Applications of Droplet Digital PCR in Oncology.

Authors:  Susana Olmedillas-López; Mariano García-Arranz; Damián García-Olmo
Journal:  Mol Diagn Ther       Date:  2017-10       Impact factor: 4.074

8.  Circulating Tumor DNA Measurement by Picoliter Droplet-Based Digital PCR and Vemurafenib Plasma Concentrations in Patients with Advanced BRAF-Mutated Melanoma.

Authors:  Fanny Garlan; Benoit Blanchet; Nora Kramkimel; Alicja Puszkiel; Jean-Louis Golmard; Gaelle Noe; Nicolas Dupin; Pierre Laurent-Puig; Michel Vidal; Valerie Taly; Audrey Thomas-Schoemann
Journal:  Target Oncol       Date:  2017-06       Impact factor: 4.493

9.  Ultraspecific and Amplification-Free Quantification of Mutant DNA by Single-Molecule Kinetic Fingerprinting.

Authors:  Stephen L Hayward; Paul E Lund; Qing Kang; Alexander Johnson-Buck; Muneesh Tewari; Nils G Walter
Journal:  J Am Chem Soc       Date:  2018-09-05       Impact factor: 15.419

10.  Rational "Error Elimination" Approach to Evaluating Molecular Barcoded Next-Generation Sequencing Data Identifies Low-Frequency Mutations in Hematologic Malignancies.

Authors:  Saradhi Mallampati; Dzifa Y Duose; Michael A Harmon; Meenakshi Mehrotra; Rashmi Kanagal-Shamanna; Stephanie Zalles; Ignacio I Wistuba; Xiaoping Sun; Rajyalakshmi Luthra
Journal:  J Mol Diagn       Date:  2019-02-20       Impact factor: 5.568

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.