| Literature DB >> 34593877 |
Gergely Attila Rácz1,2, Nikolett Nagy3,4, József Tóvári5, Ágota Apáti3, Beáta G Vértessy6,7.
Abstract
Reverse transcription-quantitative real-time PCR (RT-qPCR) is a ubiquitously used method in biological research, however, finding appropriate reference genes for normalization is challenging. We aimed to identify genes characterized with low expression variability among human cancer and normal cell lines. For this purpose, we investigated the expression of 12 candidate reference genes in 13 widely used human cancer cell lines (HeLa, MCF-7, A-549, K-562, HL-60(TB), HT-29, MDA-MB-231, HCT 116, U-937, SH-SY5Y, U-251MG, MOLT-4 and RPMI-8226) and, in addition, 7 normal cell lines (HEK293, MRC-5, HUVEC/TERT2, HMEC, HFF-1, HUES 9, XCL-1). In our set of genes, we included SNW1 and CNOT4 as novel candidate reference genes based on the RNA HPA cell line gene data from The Human Protein Atlas. HNRNPL and PCBP1 were also included along with the "classical" reference genes ACTB, GAPDH, IPO8, PPIA, PUM1, RPL30, TBP and UBC. Results were evaluated using GeNorm, NormFiner, BestKeeper and the Comparative ΔCt methods. In conclusion, we propose IPO8, PUM1, HNRNPL, SNW1 and CNOT4 as stable reference genes for comparing gene expression between different cell lines. CNOT4 was also the most stable gene upon serum starvation.Entities:
Mesh:
Year: 2021 PMID: 34593877 PMCID: PMC8484624 DOI: 10.1038/s41598-021-98869-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Cell lines used in this study.
| Accession (RRID) | Cell line | Category | Disease | Cell type |
|---|---|---|---|---|
| CVCL_0023 | A-549 | Cancer cell line | Lung adenocarcinoma | Alveolar basal epithelial cell |
| CVCL_0291 | HCT 116 | Cancer cell line | Colon carcinoma | Intestinal epithelial cell |
| CVCL_0030 | HeLa | Cancer cell line | Human papillomavirus-related endocervical adenocarcinoma | Epithelial cell |
| CVCL_A794 | HL-60(TB) | Cancer cell line | Adult acute myeloid leukemia | Promyelocyte |
| CVCL_0320 | HT-29 | Cancer cell line | Colon adenocarcinoma | Intestinal epithelial cell |
| CVCL_0004 | K-562 | Cancer cell line | Chronic myelogenous leukemia, BCR-ABL1 positive | Highly undifferentiated myeloid cell |
| CVCL_0031 | MCF-7 | Cancer cell line | Invasive breast carcinoma | Mammary gland luminal A epithelial cell |
| CVCL_0062 | MDA-MB-231 | Cancer cell line | Breast adenocarcinoma | Mammary gland basal B epithelial cell |
| CVCL_0013 | MOLT-4 | Cancer cell line | Adult T acute lymphoblastic leukemia | Precursor T-cell |
| CVCL_0014 | RPMI-8226 | Cancer cell line | Plasma cell myeloma | B lymphocyte |
| CVCL_0019 | SH-SY5Y | Cancer cell line | Neuroblastoma | Neuron (dopaminergic/adrenergic) |
| CVCL_0021 | U-251MG | Cancer cell line | Astrocytoma | Astrocyte |
| CVCL_0007 | U-937 | Cancer cell line | Adult acute monocytic leukemia | Monocyte |
| CVCL_0045 | HEK293 | Transformed cell line | Normal—transformed with Ad5 | Adrenal precursor cell |
| CVCL_3285 | HFF-1 | Finite cell line | Normal | Foreskin fibroblast |
| CVCL_UW69 | HMEC | Telomerase immortalized cell line | Normal—immortalized with TERT | Mammary epithelial cell |
| CVCL_0057 | HUES 9 | Embryonic stem cell | Normal | Embryonic stem cell from blastocyst |
| CVCL_9Q53 | HUVEC/TERT2 | Telomerase immortalized cell line | Normal—immortalized with TERT | Umbilical vascular endothelial cell |
| CVCL_0440 | MRC-5 | Finite cell line | Normal | Embryo lung fibroblast |
| CVCL_WM82 | XCL-1 | Induced pluripotent stem cell | Normal | Induced pluripotent stem cell |
List of potential reference genes.
| Gene symbol | Gene name | Ensembl gene ID | Function | CV | Rank | No. of articles |
|---|---|---|---|---|---|---|
| RPL13A | Ribosomal protein L13a | ENSG00000142541 | Translation—component of the 60S ribosomal subunit | 0.438 | 2715 | 35 |
| YWHAZ | Tyrosine 3-monooxygenase/ tryptophan 5-monooxygenase activation protein zeta | ENSG00000164924 | Signal transduction—central hub protein for many signal transduction pathways | 0.462 | 3258 | 37 |
| PGK1 | Phosphoglycerate kinase 1 | ENSG00000102144 | Metabolism—kinase in glycolysis and gluconeogenesis | 0.497 | 4014 | 29 |
| HMBS | Hydroxymethylbilane synthase | ENSG00000256269 | Metabolism—heme biosynthesis | 0.551 | 4966 | 22 |
| HPRT1 | Hypoxanthine phosphoribosyltransferase 1 | ENSG00000165704 | Metabolism—purine synthesis in salvage pathway | 0.551 | 4973 | 65 |
| EEF1A1 | Eukaryotic translation elongation factor 1 alpha 1 | ENSG00000156508 | Translation—aminoacyl-trna delivery to the ribosome | 0.557 | 5060 | 12 |
| ALAS1 | 5'-aminolevulinate synthase 1 | ENSG00000023330 | Metabolism—heme biosynthesis | 0.578 | 5397 | 10 |
| SDHA | Succinate dehydrogenase complex flavoprotein subunit A | ENSG00000073578 | Metabolism—part of the mitochondrial respiratory chain | 0.590 | 5579 | 35 |
| GUSB | Glucuronidase beta | ENSG00000169919 | Metabolism—degradation of dermatan and keratan sulfates | 0.713 | 7074 | 24 |
| TFRC | Transferrin receptor | ENSG00000072274 | Metabolism—cellular iron uptake | 0.738 | 7316 | 10 |
| B2M | Beta-2-microglobulin | ENSG00000166710 | Immunity—β-chain of major Histocompatibility complex class I molecules | 0.906 | 8802 | 78 |
| POLR2A | RNA polymerase II subunit A | ENSG00000181222 | Transcription—RNA polymerase | 1.143 | 10,376 | 12 |
Coefficient of variation (CV) and the corresponding rank is calculated from the relative expression data in the RNA HPA cell line gene dataset from The Human Protein Atlas[39,40]. Genes are arranged in ascending order of the rank. The genes used in the present study are shown in bold. Genes suggested by Jo et al. are shown in italic[34]. For details regarding the No. of articles, refer to “Methods”.
List of candidate reference genes and the corresponding primer parameters used in this study.
| Gene symbol | Primer sequences (5'–3') | PCR product length (bp) | Tm of PCR products (°C) | Primer design | PCR amplification efficiency (%) | Regression Coefficient |
|---|---|---|---|---|---|---|
| SNW1 | Fw: GCAGCTCCTGATAAGAGGTCG Rev: CCGAGGATTAGGAACACCGAG | 87 | 78.0 | Intron-spanning | 95.8 | 0.9996 |
| CNOT4 | Fw: GTCCAAAACCTGACTGCATGTATC Rev: GGTGTTTACCCGCCTGCAT | 87 | 80.8 | Intron-spanning | 96.3 | 0.9999 |
| PUM1 | Fw: TGCGGGAGATTGCTGGACAT Rev: GTGTGGCACGCTCCAGTTTC | 87 | 80.4 | Intron-flanking | 98.4 | 0.9999 |
| PCBP1 | Fw: ATTCGCCGGAATTGACTCCA Rev: TGCCCAATAGCCTTTCACCT | 49 | 86.4 | Exonic | 99.8 | 0.9998 |
| IPO8 | Fw: GGCATACAGTTTAACCTGCCAC Rev: CAGGAGAGGCATCATGTCTGTAA | 118 | 78.6 | Intron-spanning | 92.5 | 0.9995 |
| HNRNPL | Fw: CCAAGGCCTCTCTCAATGGG Rev: TTCAAGCGTGTAGGCTTTGC | 82 | 80.0 | Intron-spanning | 97.9 | 0.9998 |
| TBP | Fw: ATATAATCCCAAGCGGTTTGCTG Rev: AAAATCAGTGCCGTGGTTCG | 66 | 79.8 | Intron-spanning | 97.4 | 0.9989 |
| UBC | Fw: GGTCGCAGTTCTTGTTTGTGG Rev: TTCACGAAGATCTGCATTGTCAAG | 60 | 78.4 | Exonic | 100.7 | 0.9998 |
| PPIA | Fw: TGGGTTACTTCTGAAACATCACTTGT Rev: TTGACACTTCCTGGGACTGGA | 85 | 75.1 | Exonic | 98.2 | 0.9999 |
| RPL30 | Fw: TTCTCGCTAACAACTGCCCA Rev: TGCCACTGTAGTGATGGACAC | 90 | 78.4 | Intron-flanking | 95.9 | 0.9991 |
| ACTB | Fw: ACAGAGCCTCGCCTTTGC Rev: CGCGGCGATATCATCATCCA | 76 | 86.9 | Intron-flanking | 95.2 | 0.9998 |
| GAPDH | Fw: GAGAAGGCTGGGGCTCATTT Rev: TGATGACCCTTTTGGCTCCC | 46 | 79.4 | Intron-spanning | 97.8 | 0.9999 |
Regression coefficients were determined by performing least squares linear regression to the average Cq values of technical replicates. Base pairs, bp.
Figure 1Optimization of the reverse transcription conditions. The graph shows Cq values from performing qPCR measurements of RNA dilution series comparing the Maxima First Strand cDNA Synthesis Kit for RT-qPCR (red lines) and the High-Capacity cDNA Reverse Transcription Kit (blue lines). Three technical replicates for both kits at each concentration point are marked as hollow circles. Least squares linear regression was performed to the average of the technical replicates in the range of RNA amount from 100 to 800 ng per reaction. Individual graphs were created with OriginPro 2018 (OriginLab Corp.) and the figure was assembled using CorelDRAW Graphics Suite 2020 (Corel Corporation).
Figure 2Cq values of twelve candidate reference genes in human cancer (red) and normal (blue) cell lines. Each dot represents the average Cq value of three biological replicates for each cell line. Candidate reference genes are arranged according to the comprehensive ranking in a decreasing order of expression stability. The boxes represent the data between the 25th and 75th percentile, while whiskers show the data range from the 10th to the 90th percentile. The minimum and maximum values are depicted as ‘x’. The average values are represented as squares and the median values are illustrated as lines. This graph was created with OriginPro 2018 (OriginLab Corp.).
Ranking of candidate reference genes.
| Comprehensive rank | Gene symbol | GeNorm | Comparative ΔCt | NormFinder | BestKeeper | Geometric mean | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Rank | M value | Rank | Mean StdDev | Rank | Stability value | Rank | Stability value | |||
| 1 | IPO8 | 1 | 0.381 | 3 | 0.555 | 1 | 0.205 | 3 | 0.336 | 1.73 |
| 2 | PUM1 | 2 | 0.384 | 4 | 0.568 | 4 | 0.222 | 1 | 0.326 | 2.38 |
| 3 | HNRNPL | 3 | 0.386 | 1 | 0.538 | 3 | 0.215 | 6 | 0.353 | 2.71 |
| 4 | SNW1 | 4 | 0.407 | 2 | 0.545 | 2 | 0.206 | 4 | 0.344 | 2.83 |
| 5 | CNOT4 | 5 | 0.428 | 5 | 0.597 | 5 | 0.266 | 5 | 0.345 | 5.00 |
| 6 | GAPDH | 10 | 0.547 | 7 | 0.652 | 2 | 0.328 | 5.38 | ||
| 7 | PCBP1 | 6 | 0.45 | 6 | 0.612 | 7 | 0.273 | 7 | 0.394 | 6.48 |
| 8 | TBP | 7 | 0.477 | 9 | 0.678 | 11 | 0.483 | 8.89 | ||
| 9 | PPIA | 8 | 0.504 | 8 | 0.657 | 8 | 0.308 | 10 | 0.471 | 8.46 |
| 10 | RPL30 | 9 | 0.523 | 10 | 0.701 | 11 | 0.366 | 8 | 0.406 | 9.43 |
| 11 | UBC | 11 | 0.587 | 11 | 0.777 | 10 | 0.362 | 9 | 0.457 | 10.22 |
| 12 | ACTB | 12 | 0.636 | 12 | 0.911 | 12 | 0.503 | 12 | 0.679 | 12.00 |
| 1 | HNRNPL | 1 | 0.333 | 1 | 0.475 | 1 | 0.160 | 1 | 0.275 | 1.00 |
| 2 | IPO8 | 2 | 0.346 | 3 | 0.535 | 2 | 0.212 | 5 | 0.322 | 2.78 |
| 3 | PUM1 | 3 | 0.363 | 4 | 0.539 | 4 | 0.215 | 4 | 0.32 | 3.72 |
| 4 | SNW1 | 5 | 0.416 | 2 | 0.533 | 3 | 0.213 | 8 | 0.368 | 3.94 |
| 5 | CNOT4 | 4 | 0.393 | 5 | 0.564 | 6 | 0.257 | 6 | 0.332 | 5.18 |
| 6 | GAPDH | 10 | 0.523 | 7 | 0.616 | 3 | 0.31 | 5.69 | ||
| 7 | RPL30 | 7 | 0.467 | 9 | 0.627 | 11 | 0.321 | 2 | 0.308 | 6.10 |
| 8 | UBC | 6 | 0.437 | 6 | 0.599 | 7 | 0.260 | 7 | 0.355 | 6.48 |
| 9 | PCBP1 | 9 | 0.503 | 8 | 0.623 | 8 | 0.299 | 9 | 0.394 | 8.49 |
| 10 | TBP | 8 | 0.487 | 10 | 0.640 | 10 | 0.447 | 9.46 | ||
| 11 | PPIA | 11 | 0.539 | 11 | 0.644 | 9 | 0.308 | 11 | 0.458 | 10.46 |
| 12 | ACTB | 12 | 0.595 | 12 | 0.904 | 12 | 0.510 | 12 | 0.716 | 12.00 |
| 1 | IPO8 | 1 | 0.223 | 1 | 0.495 | 1 | 0.149 | 3 | 0.318 | 1.32 |
| 2 | SNW1 | 2 | 0.236 | 3 | 0.515 | 2 | 0.187 | 1 | 0.29 | 1.86 |
| 3 | PUM1 | 3 | 0.26 | 2 | 0.508 | 4 | 0.323 | 3.31 | ||
| 4 | CNOT4 | 9 | 0.433 | 5 | 0.552 | 2 | 0.301 | 4.05 | ||
| 5 | PCBP1 | 6 | 0.368 | 4 | 0.544 | 4 | 0.214 | 6 | 0.386 | 4.90 |
| 6 | PPIA | 5 | 0.349 | 7 | 0.570 | 6 | 0.282 | 7 | 0.422 | 6.19 |
| 7 | HNRNPL | 4 | 0.327 | 6 | 0.568 | 7 | 0.286 | 9 | 0.455 | 6.24 |
| 8 | GAPDH | 10 | 0.481 | 9 | 0.678 | 8 | 0.306 | 5 | 0.368 | 7.75 |
| 9 | RPL30 | 7 | 0.387 | 8 | 0.632 | 9 | 0.318 | 8 | 0.437 | 7.97 |
| 10 | TBP | 8 | 0.407 | 10 | 0.685 | 10 | 0.374 | 10 | 0.533 | 9.46 |
| 11 | ACTB | 11 | 0.554 | 11 | 0.884 | 11 | 0.464 | 11 | 0.553 | 11.00 |
| 12 | UBC | 12 | 0.622 | 12 | 0.967 | 12 | 0.554 | 12 | 0.617 | 12.00 |
The comprehensive rank is generated based on the geometric mean of ranks. The best combinations of reference genes as suggested by NormFinder are shown in bold.
Figure 3Relative expression of the candidate reference genes upon serum starvation. Relative expression values of serum starved cell lines can be compared to the average relative expression of the non-treated biological replicates selected as control and set to 1, which is represented by a dotted line. Error bars show standard deviation of three biological replicates (n = 3) for each cell line. The number of asterisks indicate increasing possibility that gene expression remains constant upon serum starvation. *p > 0.8, **p > 0.95 as calculated by the CFX Maestro software. This graph was created with OriginPro 2018 (OriginLab Corp.).
Figure 4Schematic figure of the overall experimental design and the results of this study. In the upper left corner the investigated candidate reference genes are shown in grey except for the novel reference genes SNW1 and CNOT4, which are shown in yellow. The investigated cell lines are depicted in the lower left corner. Cancer cell lines are colored red, while normal cell lines are colored blue. On the right side the top-ranking reference genes are illustrated for cancer cell lines in red circle, for normal cell lines in blue circle and for all cell lines in green circle. The most stable gene upon serum starvation is shown in brown circle. The methods used for performing the experiments and the software used for the evaluation of the results are depicted as a yellow arrow. This graph was created with CorelDRAW Graphics Suite 2020 (Corel Corporation).