Literature DB >> 36176301

qPCRtools: An R package for qPCR data processing and visualization.

Xiang Li1,2, Yingmin Wang1,2, Jingyu Li1,2, Xinyue Mei1,2, Yixiang Liu1,2, Huichuan Huang1,2.   

Abstract

In biological research, qPCR is a technique that is frequently used to measure gene expression levels. The calculation of gene amplification efficiency is a critical step in the processing of qPCR data since it helps to decide which method to employ to compute gene expression levels. Here, we introduce the R package qPCRtools, which enables users to analyze the efficiency of gene amplification. Additionally, this software can determine gene expression levels using one of three approaches: the conventional curve-based method, the 2-ΔΔCt method, and the SATQPCR method. The qPCRtools package produces a table with the statistical data of each method as well as a figure with a box or bar plot illustrating the results. The R package qPCRtools is freely available at CRAN (https://CRAN.R-project.org/package=qPCRtools) or GitHub (https://github.com/lixiang117423/qPCRtools/tree/main/CRAN/qPCRtools).
Copyright © 2022 Li, Wang, Li, Mei, Liu and Huang.

Entities:  

Keywords:  R package; bioinformatics; gene expression; qPCR; visualization

Year:  2022        PMID: 36176301      PMCID: PMC9513427          DOI: 10.3389/fgene.2022.1002704

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.772


1 Introduction

Quantitative PCR (qPCR) is a method for the precise quantification of gene expression. The experimental procedure for qPCR includes sample/template preparation, assay optimization/validation, RT‒PCR, and data analysis (Bustin et al., 2010). For data analysis, several methods can be chosen, such as the most widely used 2−ΔΔCT method. The 2−ΔΔCt method is a convenient way to analyze the relative changes in gene expression from qPCR experiments. However, it assumes an almost equal amplification efficiency for both the internal target gene and the reference gene (Livak and Schmittgen, 2001). If the amplification efficiency does not meet the prerequisites of the 2−ΔΔCT method, absolute quantification should be used (Whelan et al., 2003; Dhanasekaran et al., 2010). Due to the limitations of absolute quantification, such as its time-consuming nature, relative quantification is preferred. In relative quantification, in which the expression of a target gene is measured in relation to one or multiple reference genes, the results can be considerably influenced by various factors. A major point that must be considered when using a relative quantification approach is the amplification efficiency. It has been shown that even minor variations in amplification efficiency can lead to considerable variation in the calculated gene expression value (Pfaffl, 2001). One way of calculating the amplification efficiency is to produce serial dilutions of the target genes. Then, the Ct values are plotted on a logarithmic scale along with the corresponding concentrations. Next, a linear regression curve based on the data points is generated, and the slope of the trend line is calculated. Finally, efficiency is calculated using the following equation: E = −1+10(−1/slope) (the dilution factor is 10). The amplification efficiency value can help us choose the best method for processing qPCR data. On the other hand, the relative standard curve can be used to handle qPCR data. The data analyses required in qPCR experiments, including the calculation of primer amplification efficiency, gene expression levels, and final statistics, are relatively difficult for novices. Many tools have been developed to process qPCR data (Pabinger et al., 2014). However, data wrangling is required to apply some of these tools (Mar et al., 2009; Lievens et al., 2012; Rödiger et al., 2015), and the web services of some tools, such as CampER (Blom et al., 2020) and PCR-Miner (Zhao and Fernald, 2005), are no longer available. To address these limitations, we have developed qPCRtools, a package developed based on the statistical computing language R and the widely used R visualization package ggplot2 (Wickham, 2016), for processing qPCR data. The main features of qPCRtools include the following: 1) construction of relative standard curves from gradient dilution qPCR data and calculation of the amplification efficiency (for any dilution factor); 2) calculation of the gene expression based on several methods, including the relative standard curve method, 2−ΔΔCT method (Livak and Schmittgen, 2001), and RqPCR method (Rancurel et al., 2019); and 3) statistical calculations based on the t test or ANOVA with Tukey’s test. All results generated from all functions except CalRTable include a table, detailed results with statistical information, and a figure in which the table results can be visualized.

2 Description

2.1 Overview of the qPCRtools package

The qPCRtools package was created based on the R language (R Core Team, 2013). Five functions are included in the qPCRtools package for the processing and display of qPCR data (Table 1). The amount of RNA needed for reverse transcription can be determined using the CalRTable function. The CalCurve function can determine the amplification efficiency of each gene, in addition to calculating the relative standard curve for each gene based on equal dilution. The expression levels of each gene can be determined using the other three functions in various ways.
TABLE 1

This table shows all features of qPCRtools.

FunctionDescription
CalRTableCalculates volume for reverse transcription
CalCurveCalculates a relative standard curve
CalExpCurveCalculates expression values using the relative standard curve method
CalExp2ddCtCalculates expression using the 2−ΔΔCt method
CalExpRqPCRCalculates expression values using the RqPCR method
This table shows all features of qPCRtools.

2.2 Case study

2.2.1 Example 1: Calculation of RNA volume for reverse transcription

Reverse transcription will be necessary to generate cDNA for qPCR after RNA extraction. The user can obtain the reagent volume table from the protocol provided in the kit. However, when the average RNA concentration is being determined for a large number of samples, it will take some time to generate the final table. Reverse transcription can be performed by creating a table of each component’s volume using the CalRTable function. As an example, we consider the EasyScript All-in-One First-Strand cDNA Synthesis SuperMix for qPCR (One-Step gDNA Removal) (TransGen Biotech, AE341) reverse transcription reagent kit, which only comprises two components: SuperMix and gDNA Remover. In this case, we will require 4 μl of SuperMix and 1 μl of gDNA remover for 1 μg of total RNA. To determine the volume of total RNA, its concentration must be known. To obtain the average concentration, three or more replications of each sample are needed. The volumes of RNA and other components can be estimated using the CalRTable function (Table 2). There must be a column “all” in the second input file as shown in the example data.
TABLE 2

Example of CalRTable output.

SampleAverage concentrationRNA (μl)SuperMix (μl)gDNARemover (μl)H2O (μl)Total (μl)
1160.4012.478.002.0017.5340.00
2163.3312.248.002.0017.7640.00
3182.5710.958.002.0019.0540.00
4203.809.818.002.0020.1940.00
5180.1311.108.002.0018.9040.00
6171.8311.648.002.0018.3640.00
Example of CalRTable output.

2.2.2 Example 2: Calculation of the relative standard curve and amplification efficiency

For qPCR studies as well as for final data processing and analysis, it is essential to known the primer amplification efficiency (Livak and Schmittgen, 2001; Pfaffl, 2001), since it determines which technique should be applied to analyze qPCR data. For novel primers, the initial stage is the evaluation of amplification efficiency. The CalCurve function can be used to determine the amplification efficiency based on the concentration gradient dilution approach (Supplementary Table S1). Additionally, the function provides a regression equation for relative concentrations that have been log-transformed and the accompanying Cq values (Figure 1 and Supplementary Table S1). The relative expression levels of genes under various treatments can be determined using the regression equation.
FIGURE 1

The relative standard curve of example genes.

The relative standard curve of example genes.

2.2.3 Example 3: Calculation of gene expression levels using the relative standard curve method

After the above step, if the amplification efficiencies of reference genes and target genes are not equal, the 2−ΔΔCt method cannot be used (Livak and Schmittgen, 2001). Some methods without a reference gene can also be chosen, such as the method based on the Markov chain Monte Carlo algorithm (Matz et al., 2013). One way to determine the expression level that disregards amplification efficiency is the relative standard curve approach. Based on the relative standard curve depicted above, the CalExpCurve function can calculate the expression levels of individual genes. A table (Supplementary Table S2) and a figure are included in the results (Figure 2).
FIGURE 2

Box plot of each gene in different treatments based on the relative standard curve method.

Box plot of each gene in different treatments based on the relative standard curve method.

2.2.4 Example 4: Calculation of expression levels using the 2−ΔΔCt method

If the amplification efficiency of the target genes is consistent with that of the reference gene, the most widely used2−ΔΔCt method is available for expression level calculation (Livak and Schmittgen, 2001). There are many software packages that can apply this method, such as the R package pcr (Ahmed and Kim, 2018). However, most of these packages require some data wrangling before their application. For users who have large samples and data, this approach is time-consuming and is not reproducible. The CalExp2ddCt function can process raw Cq data from the qPCR machine and then produces a table (Supplementary Table S3) and figure (Figure 3) illustrating the expression levels. More importantly, the templates can be reused.
FIGURE 3

Bar plot of the relative expression of each gene under different treatments calculated via the 2−ΔΔCt method.

Bar plot of the relative expression of each gene under different treatments calculated via the 2−ΔΔCt method.

2.2.5 Example 5: Calculation of expression levels using the RqPCR method

For most methods used in qPCR data processing, a reference is required to correct the expression values of the target genes (Livak and Schmittgen, 2001). Some methods do not require reference genes; such methods are based on the Markov chain Monte Carlo algorithm (Matz et al., 2013) or SATQPCR (Rancurel et al., 2019) according to the MIQE guidelines (dMIQE Group and Huggett, 2020). The CalExpRqPCR function can implement this function from SATQPCR (Supplementary Table S4 and Figure 4), and the required amplification efficiency can be calculated using the CalCurve function.
FIGURE 4

Bar plot of the relative expression of each gene under different treatments calculated via the RqPCR method.

Bar plot of the relative expression of each gene under different treatments calculated via the RqPCR method.

3 Conclusion

An important step in qPCR is calculating the efficiency of gene primer amplification, but previous R packages cannot perform this step effectively. Here, we offer qPCRtools, a fully functional tool that can not only quickly determine the amplification efficiency of gene primers but also calculate gene expression levels from a Cq table based on several methods that return statistical results and visualization figures. The resulting figures are ggplot objects that can be further customized by applying scale and theme settings and superimposed annotation layers.
  14 in total

1.  A new mathematical model for relative quantification in real-time RT-PCR.

Authors:  M W Pfaffl
Journal:  Nucleic Acids Res       Date:  2001-05-01       Impact factor: 16.971

2.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

3.  Comprehensive algorithm for quantitative real-time polymerase chain reaction.

Authors:  Sheng Zhao; Russell D Fernald
Journal:  J Comput Biol       Date:  2005-10       Impact factor: 1.479

4.  Comparison of different standards for real-time PCR-based absolute quantification.

Authors:  S Dhanasekaran; T Mark Doherty; John Kenneth
Journal:  J Immunol Methods       Date:  2010-01-25       Impact factor: 2.303

5.  SATQPCR: Website for statistical analysis of real-time quantitative PCR data.

Authors:  Corinne Rancurel; Trang van Tran; Céline Elie; Frédérique Hilliou
Journal:  Mol Cell Probes       Date:  2019-07-05       Impact factor: 2.365

6.  The Digital MIQE Guidelines Update: Minimum Information for Publication of Quantitative Digital PCR Experiments for 2020.

Authors:  Jim F Huggett
Journal:  Clin Chem       Date:  2020-08-01       Impact factor: 8.327

7.  MIQE précis: Practical implementation of minimum standard guidelines for fluorescence-based quantitative real-time PCR experiments.

Authors:  Stephen A Bustin; Jean-François Beaulieu; Jim Huggett; Rolf Jaggi; Frederick S B Kibenge; Pål A Olsvik; Louis C Penning; Stefan Toegel
Journal:  BMC Mol Biol       Date:  2010-09-21       Impact factor: 2.946

8.  Enhanced analysis of real-time PCR data by using a variable efficiency model: FPK-PCR.

Authors:  Antoon Lievens; S Van Aelst; M Van den Bulcke; E Goetghebeur
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

9.  pcr: an R package for quality assessment, analysis and testing of qPCR data.

Authors:  Mahmoud Ahmed; Deok Ryong Kim
Journal:  PeerJ       Date:  2018-03-16       Impact factor: 2.984

10.  Data-driven normalization strategies for high-throughput quantitative RT-PCR.

Authors:  Jessica C Mar; Yasumasa Kimura; Kate Schroder; Katharine M Irvine; Yoshihide Hayashizaki; Harukazu Suzuki; David Hume; John Quackenbush
Journal:  BMC Bioinformatics       Date:  2009-04-19       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.