Literature DB >> 28475662

twoddpcr: an R/Bioconductor package and Shiny app for Droplet Digital PCR analysis.

Anthony Chiu1, Mahmood Ayub1, Caroline Dive1, Ged Brady1, Crispin J Miller2.   

Abstract

SUMMARY: Droplet Digital PCR (ddPCR) is a sensitive platform used to quantify specific nucleic acid molecules amplified by polymerase chain reactions. Its sensitivity makes it particularly useful for the detection of rare mutant molecules, such as those present in a sample of circulating free tumour DNA obtained from cancer patients. ddPCR works by partitioning a sample into individual droplets for which the majority contain only zero or one target molecule. Each droplet then becomes a reaction chamber for PCR, which through the use of fluorochrome labelled probes allows the target molecules to be detected by measuring the fluorescence intensity of each droplet. The technology supports two channels, allowing, for example, mutant and wild type molecules to be detected simultaneously in the same sample. As yet, no open source software is available for the automatic gating of two channel ddPCR experiments in the case where the droplets can be grouped into four clusters. Here, we present an open source R package 'twoddpcr', which uses Poisson statistics to estimate the number of molecules in such two channel ddPCR data. Using the Shiny framework, an accompanying graphical user interface (GUI) is also included for the package, allowing users to adjust parameters and see the results in real-time.
AVAILABILITY AND IMPLEMENTATION: twoddpcr is available from Bioconductor (3.5) at https://bioconductor.org/packages/twoddpcr/ . A Shiny-based GUI suitable for non-R users is available as a standalone application from within the package and also as a web application at http://shiny.cruk.manchester.ac.uk/twoddpcr/ . CONTACT: ged.brady@cruk.manchester.ac.uk or crispin.miller@cruk.manchester.ac.uk. PACKAGE MAINTAINER: anthony.chiu@cruk.manchester.ac.uk.
© The Author(s) 2017. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28475662      PMCID: PMC5860069          DOI: 10.1093/bioinformatics/btx308

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


Bio-Rad’s QX100 and QX200 Droplet Digital PCR (ddPCR) are platforms for quantifying the number of copies of a given nucleic acid molecule in a sample. Starting material is mixed with the necessary reagents to perform PCR and then partitioned into ~20 000 droplets that each become individual reaction chambers for PCR. By distributing the target sequence across ~20 000 droplets the platform gains in sensitivity, making it well-suited for tasks such as the detection of small amounts of tumour DNA against a larger background of wild-type DNA in circulating free DNA (cfDNA) extracted from patient blood (Hindson ). Conditions are set up on the assumption that target molecules are Poisson distributed across the droplets, and designed to maximize the number of droplets that each contain only a single copy. The mixture is then thermocycled, resulting in amplification. TaqMan reagents are included in the reaction mixture and upon annealing of the TaqMan probe to the PCR product, the quenching component of the probe is removed by Taq polymerase, resulting in an increase in fluorescence. The number of droplets that fluoresce can then be used to estimate the concentration and allelic fraction of the target molecule in the sample. The Bio-Rad platform supports two channels, allowing different fluorochromes to be used to detect two different target sequences in the sample at once. The goal of data analysis is therefore to use the two channel fluorescence data to classify droplets into four sets: positive in both channels (PP), positive only in Channel 1 (PN), positive only in Channel 2 (NP), or negative in both channels (NN). Having established the number of positive droplets in each channel, Poisson statistics can then be used to estimate the starting concentrations and hence the absolute copy numbers. Droplets can vary in brightness. This is a particular problem for cfDNA samples such as in (Wang ; Zhu ), where the droplet amplitude plots may be noisy and the PN cluster can ‘lean’ and the NP cluster can ‘lift’. In a study by (Whale ), the reasons for these phenomena may be due to the cross-hybridization of probes and the presence of multiple target molecules in the same droplet. Therefore, classifying droplets on absolute fluorescence intensity can be problematic, resulting in a number of alternative gating strategies being proposed. For example, the Javascript-based web application ‘definetherain’ uses k-means clustering to classify droplets in one channel data only (Jones ). It removes droplets between the two clusters that are more than three times the standard deviation away from their respective cluster means. These droplets are colloquially known as ‘rain’. The ‘ddpcRquant’ R package estimates the parameters for a generalized extreme value distribution, and uses 0.995 percentiles to set a threshold for one channel data (Trypsteen ). This is repeated 100 times and the mean threshold taken as the final cut-off. A modified version of this approach is included in the R package ‘dpcR’, which is a toolset for analysing data from various digital PCR platforms (Burdukiewicz ). Another R package, ‘ddpcr’, gates droplets in a specific class of experiments, where the droplets cluster into three distinct classes: NN, PP, and either NP or PN, but not both. The package also supports manual gating of more general two channel experiments (Attali ). Here, we present ‘twoddpcr’, an open source package for the R programming language (R Core Team, 2016) designed for gating general two channel ddPCR experiments, allowing for the integration of ddPCR analyses of such experiments into bioinformatics pipelines. Unlike existing packages, it supports manual and automatic gating of experiments where droplets fall into all four classes (NN, NP, PN and PP), making it suitable for the detection of mutant molecules in challenging samples such as cfDNA. In addition, the ‘twoddpcr’ package adopts a novel approach for further refining such classifications; this approach uses Mahalanobis distance and will be discussed shortly. An accompanying web application built using the Shiny framework (Chang ) provides a graphical user interface (GUI), enabling non-R users to use the package and means that parameters can be adjusted easily in order to view the results in real-time. The package accepts raw data in the form of droplet amplitude files exported from Bio-Rad’s QuantaSoft. Software is provided as a Bioconductor (Huber ) package ‘twoddpcr’, making particular use of the ‘class’ (Venables and Ripley, 2002) and ‘stats’ packages for classifying droplets. The package revolves around the use of ddpcrPlate S4 objects, which are extended from the SimpleList S4 class from the ‘S4Vectors’ package (Pagès ). A number of other R packages were used for visualization purposes (Carr ; Neuwirth, 2014; Wickham, 2009). The Shiny-based GUI is included in the package as a standalone application and is also available at http://shiny.cruk.manchester.ac.uk/twoddpcr/. The package uses k-means clustering to classify droplets by identifying four candidate groups. This approach is particularly effective when these groups are not separated into four non-overlapping quadrants; an example of this can be seen in Figure 1.
Fig. 1

Various classifications of the droplet amplitudes from the KRASdata dataset. (a) Setting thresholds with the Channel 1 threshold at 6789 and Channel 2 threshold at 3000. (b) The k-means classification. (c) The k-means classification with rain defined by standard deviations of clusters. (d) The k-means classification with rain defined by Mahalanobis distance

Various classifications of the droplet amplitudes from the KRASdata dataset. (a) Setting thresholds with the Channel 1 threshold at 6789 and Channel 2 threshold at 3000. (b) The k-means classification. (c) The k-means classification with rain defined by standard deviations of clusters. (d) The k-means classification with rain defined by Mahalanobis distance In addition, the package provides a Mahalanobis distance-based approach to refine the classifications by defining ellipses around each candidate group, taking their rotations into account. For each cluster, c, the mean and covariance Σ is calculated. This allows the squared Mahalanobis distance (Mahalanobis, 1936) between a point and a distribution M that has mean and covariance Σ to be calculated by . A maximum distance m can be fixed by the user so that all droplets where will be considered as part of the class; remaining droplets will be treated as ‘rain’. The k-nearest neighbour (k-NN) algorithm is also provided as an option. Once the number of positive and negative droplets in each channel has been determined, the actual number of fragments in each sample can be estimated. The package assumes that the molecules of both targets are Poisson distributed: Suppose that the volume of each droplet is V (0.85nl by default), and that there are P droplets containing the amplified target type T, and R total accepted droplets in the well. The concentration of targets per microlitre can be estimated by , which is derived from the probability mass function of the Poisson distribution. We illustrate the use of the package using sample data as follows: A549 KRAS mutant cell lines and H1048 KRAS wild type cells were used to provide triplicate samples with 5% mutant DNA at increasing amounts of starting material (at 1 ng, 4 ng, 16 ng, 64 ng). This dataset is included as KRASdata in the package. Example data are shown in Figure 1a, using independent linear gating on each channel. With these data it is not possible to choose a threshold that correctly defines the NP and PP clusters. While k-means classification performs better, as shown in Figure 1b, the lack of separation between clusters NP and PP make it difficult to accurately define ambiguous droplets (‘rain’) that lie between the main clusters. As with the (Jones ) and (Attali ) publications, twoddpcr’s sdRain method identifies droplets that lie further than n (by default 5) multiples of the standard deviation from the mean of each cluster. This leads to improved classification but since it still relies on linear gates, separation between NP and PP clusters is still unsatisfactory (Fig. 1c). In contrast, the Mahalanobis-based mahalanobisRain refinement better separates the groups because it takes the cluster rotations into account (Fig. 1d). This method requires that the parameters are manually adjusted. This ensures that the ellipses are large enough to include most droplets of each class but small enough to remove ambiguous droplets. In conclusion, twoddpcr uniquely provides open source software in R and Bioconductor for the analysis of two channel ddPCR data. The included Shiny app provides a GUI that allows for parameters in the analysis to be adjusted and visualized in real-time.
  8 in total

1.  ddpcRquant: threshold determination for single channel droplet digital PCR experiments.

Authors:  Wim Trypsteen; Matthijs Vynck; Jan De Neve; Pawel Bonczkowski; Maja Kiselinova; Eva Malatinkova; Karen Vervisch; Olivier Thas; Linos Vandekerckhove; Ward De Spiegelaere
Journal:  Anal Bioanal Chem       Date:  2015-05-29       Impact factor: 4.142

Review 2.  Orchestrating high-throughput genomic analysis with Bioconductor.

Authors:  Wolfgang Huber; Vincent J Carey; Robert Gentleman; Simon Anders; Marc Carlson; Benilton S Carvalho; Hector Corrada Bravo; Sean Davis; Laurent Gatto; Thomas Girke; Raphael Gottardo; Florian Hahne; Kasper D Hansen; Rafael A Irizarry; Michael Lawrence; Michael I Love; James MacDonald; Valerie Obenchain; Andrzej K Oleś; Hervé Pagès; Alejandro Reyes; Paul Shannon; Gordon K Smyth; Dan Tenenbaum; Levi Waldron; Martin Morgan
Journal:  Nat Methods       Date:  2015-02       Impact factor: 28.547

3.  Highly Sensitive Droplet Digital PCR Method for Detection of EGFR-Activating Mutations in Plasma Cell-Free DNA from Patients with Advanced Non-Small Cell Lung Cancer.

Authors:  Guanshan Zhu; Xin Ye; Zhengwei Dong; Ya Chao Lu; Yun Sun; Yi Liu; Rose McCormack; Yi Gu; Xiaoqing Liu
Journal:  J Mol Diagn       Date:  2015-03-11       Impact factor: 5.568

4.  Sensitive Detection of Mono- and Polyclonal ESR1 Mutations in Primary Tumors, Metastatic Lesions, and Cell-Free DNA of Breast Cancer Patients.

Authors:  Peilu Wang; Amir Bahreini; Rekha Gyanchandani; Peter C Lucas; Ryan J Hartmaier; Rebecca J Watters; Amruth R Jonnalagadda; Humberto E Trejo Bittar; Aaron Berg; Ronald L Hamilton; Brenda F Kurland; Kurt R Weiss; Aju Mathew; Jose Pablo Leone; Nancy E Davidson; Marina N Nikiforova; Adam M Brufsky; Tadeu F Ambros; Andrew M Stern; Shannon L Puhalla; Adrian V Lee; Steffi Oesterreich
Journal:  Clin Cancer Res       Date:  2015-10-23       Impact factor: 12.531

5.  High-throughput droplet digital PCR system for absolute quantitation of DNA copy number.

Authors:  Benjamin J Hindson; Kevin D Ness; Donald A Masquelier; Phillip Belgrader; Nicholas J Heredia; Anthony J Makarewicz; Isaac J Bright; Michael Y Lucero; Amy L Hiddessen; Tina C Legler; Tyler K Kitano; Michael R Hodel; Jonathan F Petersen; Paul W Wyatt; Erin R Steenblock; Pallavi H Shah; Luc J Bousse; Camille B Troup; Jeffrey C Mellen; Dean K Wittmann; Nicholas G Erndt; Thomas H Cauley; Ryan T Koehler; Austin P So; Simant Dube; Klint A Rose; Luz Montesclaros; Shenglong Wang; David P Stumbo; Shawn P Hodges; Steven Romine; Fred P Milanovich; Helen E White; John F Regan; George A Karlin-Neumann; Christopher M Hindson; Serge Saxonov; Bill W Colston
Journal:  Anal Chem       Date:  2011-10-28       Impact factor: 6.986

6.  ddpcr: an R package and web application for analysis of droplet digital PCR data.

Authors:  Dean Attali; Roza Bidshahri; Charles Haynes; Jennifer Bryan
Journal:  F1000Res       Date:  2016-06-17

Review 7.  Fundamentals of multiplexing with digital PCR.

Authors:  Alexandra S Whale; Jim F Huggett; Svilen Tzonev
Journal:  Biomol Detect Quantif       Date:  2016-05-27

8.  Low copy target detection by Droplet Digital PCR through application of a novel open access bioinformatic pipeline, 'definetherain'.

Authors:  Mathew Jones; James Williams; Kathleen Gärtner; Rodney Phillips; Jacob Hurst; John Frater
Journal:  J Virol Methods       Date:  2014-03-02       Impact factor: 2.014

  8 in total
  11 in total

1.  Quantitative Measurement of Transposon Copy Number Using the Droplet Digital PCR.

Authors:  Wenwen Fan; Jungnam Cho
Journal:  Methods Mol Biol       Date:  2021

2.  Mitigation of the effect of variability in digital PCR assays through use of duplexed reference assays for normalization.

Authors:  Sherry J Coulter
Journal:  Biotechniques       Date:  2018-08       Impact factor: 1.993

3.  Advances in Detecting Low Prevalence Somatic TERT Promoter Mutations in Papillary Thyroid Carcinoma.

Authors:  Vitor Rodrigues da Costa; Larissa Valdemarin Bim; Luiza Dornelles Penteado Pacheco E Silva; Gabriel Avelar Colloza-Gama; André Uchimura Bastos; Rosana Delcelo; Gisele Oler; Janete Maria Cerutti
Journal:  Front Endocrinol (Lausanne)       Date:  2021-03-12       Impact factor: 5.555

4.  Quantification of HIV DNA Using Droplet Digital PCR Techniques.

Authors:  Elizabeth M Anderson; Frank Maldarelli
Journal:  Curr Protoc Microbiol       Date:  2018-09-25

5.  ddPCRclust: an R package and Shiny app for automated analysis of multiplexed ddPCR data.

Authors:  Benedikt G Brink; Justin Meskas; Ryan R Brinkman
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

6.  Simultaneous detection of EGFR amplification and EGFRvIII variant using digital PCR-based method in glioblastoma.

Authors:  Maxime Fontanilles; Florent Marguet; Philippe Ruminy; Carole Basset; Adrien Noel; Ludivine Beaussire; Mathieu Viennot; Pierre-Julien Viailly; Kevin Cassinari; Pascal Chambon; Doriane Richard; Cristina Alexandru; Isabelle Tennevet; Olivier Langlois; Frédéric Di Fiore; Annie Laquerrière; Florian Clatot; Nasrin Sarafan-Vasseur
Journal:  Acta Neuropathol Commun       Date:  2020-04-17       Impact factor: 7.801

7.  Chromatin maturation of the HIV-1 provirus in primary resting CD4+ T cells.

Authors:  Birgitta Lindqvist; Sara Svensson Akusjärvi; Anders Sönnerborg; Marios Dimitriou; J Peter Svensson
Journal:  PLoS Pathog       Date:  2020-01-30       Impact factor: 6.823

Review 8.  Computational challenges in detection of cancer using cell-free DNA methylation.

Authors:  Madhu Sharma; Rohit Kumar Verma; Sunil Kumar; Vibhor Kumar
Journal:  Comput Struct Biotechnol J       Date:  2021-12-07       Impact factor: 7.271

9.  Cell-free DNA and circulating TERT promoter mutation for disease monitoring in newly-diagnosed glioblastoma.

Authors:  Maxime Fontanilles; Florent Marguet; Ludivine Beaussire; Nicolas Magne; Louis-Ferdinand Pépin; Cristina Alexandru; Isabelle Tennevet; Chantal Hanzen; Olivier Langlois; Fabrice Jardin; Annie Laquerrière; Nasrin Sarafan-Vasseur; Fréderic Di Fiore; Florian Clatot
Journal:  Acta Neuropathol Commun       Date:  2020-11-04       Impact factor: 7.801

10.  Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples.

Authors:  Gabriel A Colozza-Gama; Fabiano Callegari; Nikola Bešič; Ana Carolina de J Paniza; Janete M Cerutti
Journal:  Sci Rep       Date:  2021-06-16       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.