Literature DB >> 30607091

Normalization of Gene Expression by Quantitative RT-PCR in Human Cell Line: comparison of 12 Endogenous Reference Genes.

Fowotade Adeola1.   

Abstract

BACKGROUND: Polymerase Chain Reaction (PCR) has become an important diagnostic and research tool of modern molecular biology globally. Real-time PCR allows for rapid and reliable quantification of mRNA transcription. Reference genes are used as internal reaction control to normalise mRNA levels between different samples in order to allow for an exact comparison of mRNA transcription level.
METHODS: In this study, twelve commonly used human reference genes were investigated in Human Embryonic Kidney Cell Lines (HEK293) using real-time qPCR with SYBR green. The genes included beta-2-microglobulin (B2M), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), succinate dehydrogenase complex subunit A (SDHA), and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein zeta polypeptide (YWHAZ). The stability of these reference genes was investigated using the geNorm application.
RESULTS: The range of expression stability in the genes analysed was (from the most stable to the least stable): UBC, TOP1, ATP5B, CYC1, GAPDH, SDHA, YWHAZ, CTB, 18S, EIFA-2, B2M and RPL13A. The optimal number of reference targets in the experiment was calculated to be 2 (geNorm V<0.15) when comparing a normalization factor based on the 2 or 3 most stable targets).
CONCLUSION: The expression stability varied greatly between the 12 candidate reference genes. UBC, TOP1, ATP5B, CYC1 and GAPDH respectively showed the highest stability in HEK293 cells based on both expression stability and expression level. Overall, our data suggest that UBC and TOP1show the least variation and the highest expression stability. This report validates the need for rational selection of reference genes for data normalization to ensure accuracy of quantitative PCR assays.

Entities:  

Keywords:  Gene expression; Human cell lines; Nrmalization; PCR; Reference genes

Mesh:

Substances:

Year:  2018        PMID: 30607091      PMCID: PMC6308757          DOI: 10.4314/ejhs.v28i6.9

Source DB:  PubMed          Journal:  Ethiop J Health Sci        ISSN: 1029-1857


Introduction

The polymerase chain reaction (PCR) is a molecular biology based method used for amplifying DNA; for RNA-based PCR the RNA sample is first reverse-transcribed to complementary DNA (cDNA) using the reverse transcriptase enzyme (1). Real-time polymerase chain reaction (RT-qPCR) is a polymerase chain reaction that monitors the amplification of a targeted DNA molecule through monitoring of the fluorescence of dyes or probes introduced into the reaction, which is proportional to the amount of product formed during the cycling phase of the PCR (1). Gene expression is regulated by cells in all organisms through the turnover of gene and PCR allows for robust detection and quantification of gene expression from small amounts of RNA (2). Accurate gene expression in RT-qPCR requires data normalisation as a further step in gene quantification analysis. The inclusion of endogenous controls in the assay serves to correct for sample to sample variations. Additionally, it improves the reliability of relative RT-qPCR experiments (3). A reference gene is important for relative quantification of gene expression, and it is important to choose a suitable reference gene for each experiment (4). An ideal reference gene for RT-qPCR should not be affected by the experimental conditions and the expression level of the reference gene should also be approximately similar to the target gene under study (4). Recent studies have also shown that no single reference gene is ideal for all experiments hence the determination of the appropriate reference gene(s) is very important for the interpretation of RT-qPCR data. Common genes for relative quantification include; β-actin, β-2-microglobulin, GAPDH mRNAs, and also 18S rRNA (5,6). Both β-Actin, a cytoskeletal protein, and GAPDH, an enzyme of glycolysis, are the two most commonly used reference genes for relative quantification in gene expression assay (7). The β-actin mRNA was one of the earliest RNAs to be used as a reference target because it is ubiquitously expressed; however, its limitation includes the variability in its transcription levels and the presence of pseudogenes that results in the erroneous detection of genomic DNA during real-time qPCR, thus leading to inaccuracies in quantification (8). GAPDH is also a commonly used reference gene for quantification assays. However, it is also limited as a result of its variation from one individual to the other during different stages of the cell cycle. Additionally, it is unsuitable in some systems, because its abundance also varies following treatment with different drugs (8). Furthermore, all genes used for normalization have different limitations hence reference genes should be validated for one's condition of interest (9–11). Previous studies have demonstrated that reference gene choice for qPCR data analysis has a significant impact on the study outcome. Hence, it is necessary to choose a suitable reference for reliable expression data (3). The geNorm system allows for selection of the best candidate reference gene for a given experimental assay by measuring the expression of 6 or 12 reference genes in a representative set of their own samples. The reference (house-keeping) genes are ranked in the order of stability of expression. The aim of this research was to select and evaluate the stability of 12 reference genes for the purpose of normalization in studying gene expression in a specific human cell line.

Materials and Methods

Study site: The experiment detailed below was conducted in the Department of Microbial Sciences at the University of Surrey, United Kingdom. Cell line: Human embryonic kidney (HEK293) cell lines were grown in Dulbecco's modified Eagle's medium (DMEM; Gibco) supplemented with 10% heat-treated (56°C, 1 h) fetal bovine serum (FBS; Gibco) and penicillin-streptomycin (50 µg/ml). RNA extraction: RNA from HEK293 grown in 6-well plates, was extracted using the ZR MiniPrep TM kit (ZymoResearch, USA) according to the manufacturer's instruction. The RNA was DNAse treated using the Ambion TURBO DNA-free™ Kit (Life Technologies). The concentration and purity of the RNA was measured on a Nanodrop spectrophotometer (Thermo Scientific, Wilmington, DE) and also analysed on an Agilent 2100 Bioanalyzer (Agilent Technologies) to check the RNA integrity (Fig 1).
Fig 1

The results of RNA analysis by Agilent bioanalyzer (A) Lane 1 of A shows the molecular marker while lanes 2–9 contain RNA from HEK293 cells.

The results of RNA analysis by Agilent bioanalyzer (A) Lane 1 of A shows the molecular marker while lanes 2–9 contain RNA from HEK293 cells. Quantitative RT-PCR: One µg of total RNA was reverse transcribed to cDNA using the Transcriptor First strand cDNA synthesis kit (Roche) according to the manufacturer's instruction. A cDNA library was created from 10 samples and primers for the 12 reference genes were obtained from PrimerDesign, UK. The reference genes were selected from commonly used reference genes (9,12–14) and the gene symbols, their full names and functions are listed in Table 1.
Table 1

Reference genes selected with their mean Ct Values and biological functions

Gene symbolGene NameCT ValuesFunction
ATP5BATP synthase subunit beta, mitochondrial20.12ATP synthesis, Hydrogen ion transport, Ion transport, Transport
β2MBeta-2-microglobulin20.87cytoskeletal protein involved in cell locomotion
CTBCytochrome B21.02Electron transport, Respiratory chain, Transport
CYC1Cytochrome C122.50Accepts electrons from Rieske protein and transfers electrons to cytochrome c in the mitochondrial respiratory chain.
EIF4A-2Eukaryotic initiation factor 4A-II22.89Required for mRNA binding to ribosome
18S18S ribosomal RNA23.09RNA component of the 40S ribosome
GAPDHGlyceraldehyde-3-phosphate dehydrogenase23.78carbohydrate metabolism
RPL13A60S ribosomal protein L13a24.62Mediates interferon-gamma-induced transcript-selective translation inhibition in inflammation processes
UBCPolyubiquitin-C24.97Ubiquitn conjugation
SDHASuccinate dehydrogenase complex, subunit A25.86Tricarboxylic acid cycle
TOP1DNA topoisomerase 128.56 DNA binding
YWHAZTyrosine 3-monooxygenase/tryptophan 5- monooxygenase activation protein, zeta polypeptide30.22Protein domain specific binding
Reference genes selected with their mean Ct Values and biological functions Genes with different functions were chosen in order to avoid genes belonging to the same biological pathways that may be co-regulated. Sequences of reference gene primers (UBC, TOP1, ATP5B, CYC1, CTB, 18S, EIFA-2 and PPC13A, β2M, GAPDH, SDHA, and YWHAZ) are proprietary property of PrimerDesign (15). All qPCR assays were performed on the Quantstudio 7 FLEX Real-Time PCR system (Life Technologies) in 20µL reaction volume containing 10µL of 2x PrecisionPLUS Mastermix (PrimerDesign), 5µL of diluted cDNA (25 ng) and 1µL (300 nmol) each of a gene-specific forward and reverse primer and 3µL of RNAse/DNAse free water. The following standard PCR reaction conditions were used for all transcripts: 95 °C 2 min; 40 cycles of 95 °C 10 s, 60 °C 1 min; 1 cycle of 95 °C 15 s, 60 °C 15 s, 95 °C 15 s. The last cycle provided the Post PCR run melt curve, for assessment of the specificity of amplification. Each of the 10 sample tests and each PCR reaction had three replicates. The experiment was repeated three times to ensure reproducibility. geNorm analysis: The gene expression stability (M) of the 12 reference genes was derived using the software geNorm (15,16). The geNorm program is a Visual Basic application tool for Microsoft Excel and is based on the principle of keeping the expression ratio of two perfect reference genes in a constant state throughout the different experimental conditions (17). The average pair-wise variation of a certain gene with all other tested reference genes is referred to as the M value, whereas the variation of this certain reference gene to another is determined as the standard deviation of the log2-transformed expression level ratios. The gene expression stability M values of the 12 reference genes are shown in Figure 2. The gene with the lowest M value is considered as the most stable expression, while the highest M value has the least stable expression. Prior to the entry of Ct values from the quantitative real-time PCR into geNorm, all Ct values were transformed into relative quantification data, by subtraction of the highest Ct value from all other Ct values for each gene measured. The levels of variation in average reference gene stability with the sequential addition of each reference gene to the equation was also assessed (Figure 2).
Fig. 2

Gene expression stability of candidate reference genes in HEK293 analyzed by the geNorm program.

Gene expression stability of candidate reference genes in HEK293 analyzed by the geNorm program. Threshold for eliminating a gene as unstable was M ≥ 1.5. The chart generated indicates the average expression stability value M of reference genes at each step during stepwise exclusion of the least stable expressed reference gene. Starting from the least stable gene at the left, the genes are ranked according to increasing expression stability, ending with the two most stable genes on the right. In this example TOP1 and UBC are the two most stable genes.

Results

Analysis of the raw expression levels across all samples identified some variation among reference genes. Quantification cycle (Ct) values for the 12 genes studied ranged from 15 to 30.9, while the majority of these values were between 18.6 and 24.6. The gene encoding UBC was highly expressed compared to the protein coding genes, reaching threshold fluorescence after only 8.2 amplification cycles, whereas the Ct average of all reference genes within the datasets was approximately 20.7 cycles. The range of expression stability in the genes analysed was (from the most stable to the least stable): UBC, TOP1, ATP5B, CYC1, CTB, 18S, EIFA-2 and RPL13A, β2M, GAPDH, SDHA, and YWHAZ. The optimal number of reference targets in the experiment was calculated to be 2 (geNorm V<0.15) when comparing a normalization factor based on the 2 or 3 most stable targets). Based on both the expression stability and expression level, our data suggested that UBC and TOP1 can be used as a reference gene for high abundance gene transcripts, CTB and 18S for medium abundance transcripts, and YWHAZ for low abundance transcripts in gene expression studies (Figure 3).
Fig. 3

The relative expression level of the reference candidate genes in HEK293 cell lines. The chart illustrates the levels of variation in average reference gene stability with the sequential addition of each reference gene to the equation (for calculation of the normalization factor). Starting with the two most stably expressed genes on the left, with the inclusion of a 3rd, 4th 5th gene etc. moving to the right. This measure is known as the “pairwise variation V”. A V score of below 0.15 is recommended as the ideal for this test system. In the example above the two most stable genes from Figure 2, would give high quality data.

The relative expression level of the reference candidate genes in HEK293 cell lines. The chart illustrates the levels of variation in average reference gene stability with the sequential addition of each reference gene to the equation (for calculation of the normalization factor). Starting with the two most stably expressed genes on the left, with the inclusion of a 3rd, 4th 5th gene etc. moving to the right. This measure is known as the “pairwise variation V”. A V score of below 0.15 is recommended as the ideal for this test system. In the example above the two most stable genes from Figure 2, would give high quality data.

Discussion

Quantitative PCR is one of the most sensitive and flexible quantification methods, and it provides simultaneous measurement of gene expression in many different samples for a number of genes. A major limitation of real-time PCR, particularly for relative quantification assays, is that several factors, including the selection of ideal reference genes, may significantly affect the results generated. The use of reference gene helps to control for differences in the amount of starting material, efficiency of amplification, and differences in expression from cells as well as overall level of transcription (5). For accurate gene expression measurements, it is essential to normalise results from qPCR experiments to a fixed reference gene. The ideal reference gene should be unaffected by the experimental treatment and should be expressed at a constant level among different tissues of an organism. Expectedly, no one single gene is expressed at such a constant level in all situations (4,14). Although Ribonucleic acid (RNA) is a thermodynamically stable molecule, it is highly susceptible to rapid digestion in the presence of RNase enzymes. Hence, a number of electrophoretic methods have been applied to evaluate the integrity of RNA in samples through separation according to the size of the comprised molecule (18,19). RNA quantity and integrity have also been shown to be critical for successful gene expression analysis as misleading qPCR results are obtained following the use of degraded or inaccurately quantified RNA. In this study, the total RNA was extracted from lysates from HEK293 cell lines and careful RNA analysis was performed using an Agilent 2100 Bioanalyzer (Agilent Technologies) prior to the gene expression study. The results suggested that all our RNA samples were of good quality (Fig 1). The geNorm software provides a useful algorithm for determining the most stable reference genes from a set of tested candidate reference genes in a given sample panel. Other software for ranking of reference genes include the BestKeeper and NormFinder, all three programs are based on different algorithms and analytical procedures. In the current study, only the geNorm software was used for determining overall gene stability, as previous comparative studies have shown that all three software packages, geNorm, BestKeeper, and NormFinder generated similar results when used to analyse data from the same tissue (20). The geNorm software is based on the prediction that none of the genes being analysed are co-regulated as this would lead to an erroneous choice of optimum normaliser pair, hence the use of a wide array of genes with diverse regulation (Table 1). The optimal number of reference targets in the experiment was calculated to be 2 (geNorm V<0.15) when comparing a normalization factor based on the 2 or 3 most stable targets). Both UBC and TOP1 were considered as useful reference genes for high abundance gene transcripts based on their observed expression stability and expression level. Additionally, our data suggested that both genes CTB and 18S were useful for medium abundance transcripts, while YWHAZ could be considered for low abundance transcripts in gene expression studies. Although a number of studies have reported that only one single gene is required as an internal control for normalization, it has also been widely suggested that the use of two or more reference genes for RT-qPCR studies might generate more reliable results (9–11). Based on the cut-off value of 0.15 proposed by geNorm program, below which the inclusion of an additional reference gene is not required, the two most stable reference genes of each series subset would be sufficient for accurate normalization. The optimal number of reference targets in this experimental situation is 2 (geNorm V<0.15 when comparing a normalization factor based on the 2 or 3 most stable targets). As such, the optimal normalization factor can be calculated as the geometric mean of reference targets TOP1 and UBC (Figure 3). In conclusion, the current study has shown that UBC, TOP1 and ATP5B were the most reliable internal controls for accurate normalization when looking at the expression data set as a whole, because these three genes were always classified among the 3 best performing reference genes analyzed by geNorm in all the sample pools. On the other hand, β2M, EIFA2 and RPL13A ranked poorly based on the geNorm software program, indicating that these three genes were not consistently expressed and should be avoided as internal controls when doing gene expression studies in our experimental setup. This study details the validation of a procedure to select the most stable and the optimal number of control genes for normalization of RT-PCR in a human cell line. Both UBC and TOP1genes can be used for highabundance mRNA, and also allows for more accurate normalization. Our data suggest that UBC and TOP1 may be suitable reference genes in gene expression studies of HEK293 cell lines. Additionally, there is a need to evaluate reference genes in each experimental setting to allow for rational basis for choosing reference genes needed for quantitative expression studies. A number of molecular biology based studies from developing countries are conducted without rational selection of reference target genes. This is because of financial constraints due to sparse funding for research and lack of technical expertise. This study therefore provided a panel of tested reference targets that could be utilized for RTPCR studies in mammalian cell lines.
  19 in total

1.  Chipping away at the chip bias: RNA degradation in microarray analysis.

Authors:  Herbert Auer; Sandya Lyianarachchi; David Newsom; Marko I Klisovic; Guido Marcucci; Uido Marcucci; Karl Kornacker
Journal:  Nat Genet       Date:  2003-12       Impact factor: 38.330

2.  Guideline to reference gene selection for quantitative real-time PCR.

Authors:  Aleksandar Radonić; Stefanie Thulke; Ian M Mackay; Olfert Landt; Wolfgang Siegert; Andreas Nitsche
Journal:  Biochem Biophys Res Commun       Date:  2004-01-23       Impact factor: 3.575

3.  Housekeeping genes as internal standards: use and limits.

Authors:  O Thellin; W Zorzi; B Lakaye; B De Borman; B Coumans; G Hennen; T Grisar; A Igout; E Heinen
Journal:  J Biotechnol       Date:  1999-10-08       Impact factor: 3.307

4.  GAPDH as a housekeeping gene: analysis of GAPDH mRNA expression in a panel of 72 human tissues.

Authors:  Robert D Barber; Dan W Harmer; Robert A Coleman; Brian J Clark
Journal:  Physiol Genomics       Date:  2005-03-15       Impact factor: 3.107

Review 5.  Absolute quantification of mRNA using real-time reverse transcription polymerase chain reaction assays.

Authors:  S A Bustin
Journal:  J Mol Endocrinol       Date:  2000-10       Impact factor: 5.098

Review 6.  Control selection for RNA quantitation.

Authors:  T Suzuki; P J Higgins; D R Crawford
Journal:  Biotechniques       Date:  2000-08       Impact factor: 1.993

7.  beta-Actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels.

Authors:  E M Glare; M Divjak; M J Bailey; E H Walters
Journal:  Thorax       Date:  2002-09       Impact factor: 9.139

8.  Beta-actin cannot be used as a control for gene expression in ovine interstitial cells derived from heart valves.

Authors:  Jessa Yperman; Geofrey De Visscher; Paul Holvoet; Willem Flameng
Journal:  J Heart Valve Dis       Date:  2004-09

9.  Utility of the housekeeping genes 18S rRNA, beta-actin and glyceraldehyde-3-phosphate-dehydrogenase for normalization in real-time quantitative reverse transcriptase-polymerase chain reaction analysis of gene expression in human T lymphocytes.

Authors:  A Bas; G Forsberg; S Hammarström; M-L Hammarström
Journal:  Scand J Immunol       Date:  2004-06       Impact factor: 3.487

10.  Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes.

Authors:  Jo Vandesompele; Katleen De Preter; Filip Pattyn; Bruce Poppe; Nadine Van Roy; Anne De Paepe; Frank Speleman
Journal:  Genome Biol       Date:  2002-06-18       Impact factor: 13.583

View more
  6 in total

1.  Screening and stability analysis of reference genes in fasting caecotrophy model in rabbits.

Authors:  Hui He; Zhichao Li; Mengke Ni; Shanshan Xing; Lei Yu; Huifen Xu; Dehu Zhuo; Ming Li
Journal:  Mol Biol Rep       Date:  2021-11-06       Impact factor: 2.316

2.  Field Evaluation of a Duplex RT-RAA Assay for Rapid Detection of SARS-CoV-2 - Hebei Province, China, January 2021.

Authors:  Ruiqing Zhang; Shunxiang Qi; Jiandong Li; Shiyou Liu; Guangyue Han; Xinxin Shen; Qi Li; Wenbo Xu; Xuejun Ma
Journal:  China CDC Wkly       Date:  2022-06-10

3.  NR4A1 inhibits the epithelial-mesenchymal transition of hepatic stellate cells: Involvement of TGF-β-Smad2/3/4-ZEB signaling.

Authors:  Qian Huang; Jingying Xu; Yanyan Ge; Yue Shi; Fei Wang; Mingli Zhu
Journal:  Open Life Sci       Date:  2022-05-04       Impact factor: 1.311

4.  Selection of optimal reference genes for gene expression studies in chronically hypoxic rat heart.

Authors:  Daniel Benak; Dita Sotakova-Kasparova; Jan Neckar; Frantisek Kolar; Marketa Hlavackova
Journal:  Mol Cell Biochem       Date:  2019-07-12       Impact factor: 3.842

5.  The GLP-1R Agonist Exendin-4 Attenuates Hyperglycemia-Induced Chemoresistance in Human Endometrial Cancer Cells Through ROS-Mediated Mitochondrial Pathway.

Authors:  Yu Zhang; Juan Cheng; Jing Li; Junxian He; Xiaomao Li; Fen Xu
Journal:  Front Oncol       Date:  2021-12-20       Impact factor: 6.244

6.  Reference gene validation for the relative quantification of cannabinoid receptor expression in human odontoblasts via quantitative polymerase chain reaction.

Authors:  Laura M Navarro-Saiz; Lilia J Bernal-Cepeda; Felipe García-Jiménez; Deisy Abril; Jaime E Castellanos
Journal:  J Oral Biol Craniofac Res       Date:  2022-09-10
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.