Literature DB >> 19014639

Identification of importin 8 (IPO8) as the most accurate reference gene for the clinicopathological analysis of lung specimens.

Paul A Nguewa1, Jackeline Agorreta, David Blanco, Maria Dolores Lozano, Javier Gomez-Roman, Blas A Sanchez, Iñaki Valles, Maria J Pajares, Ruben Pio, Maria Jose Rodriguez, Luis M Montuenga, Alfonso Calvo.   

Abstract

BACKGROUND: The accurate normalization of differentially expressed genes in lung cancer is essential for the identification of novel therapeutic targets and biomarkers by real time RT-PCR and microarrays. Although classical "housekeeping" genes, such as GAPDH, HPRT1, and beta-actin have been widely used in the past, their accuracy as reference genes for lung tissues has not been proven.
RESULTS: We have conducted a thorough analysis of a panel of 16 candidate reference genes for lung specimens and lung cell lines. Gene expression was measured by quantitative real time RT-PCR and expression stability was analyzed with the softwares GeNorm and NormFinder, mean of |Delta Ct| (= |Ct Normal-Ct tumor|) +/- SEM, and correlation coefficients among genes. Systematic comparison between candidates led us to the identification of a subset of suitable reference genes for clinical samples: IPO8, ACTB, POLR2A, 18S, and PPIA. Further analysis showed that IPO8 had a very low mean of |Delta Ct| (0.70 +/- 0.09), with no statistically significant differences between normal and malignant samples and with excellent expression stability.
CONCLUSION: Our data show that IPO8 is the most accurate reference gene for clinical lung specimens. In addition, we demonstrate that the commonly used genes GAPDH and HPRT1 are inappropriate to normalize data derived from lung biopsies, although they are suitable as reference genes for lung cell lines. We thus propose IPO8 as a novel reference gene for lung cancer samples.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19014639      PMCID: PMC2612021          DOI: 10.1186/1471-2199-9-103

Source DB:  PubMed          Journal:  BMC Mol Biol        ISSN: 1471-2199            Impact factor:   2.946


Background

Lung cancer is one of the most fatal types of cancer in the world. The overall 5-yr survival rate remains at 15%, as most patients present with advanced disease [1]. The prognosis for the patients is highly correlated to the stage of disease at the time of diagnosis. Lung cancer is usually diagnosed in an advanced stage, which is frequently too late for surgical intervention, and therefore, it usually becomes incurable. During the past few years, the application of microarray technology has revolutionized cancer genomics, making possible the simultaneous evaluation of the expression of thousands of genes. Newly discovered gene signatures in lung [2] and breast cancer [3] may predict disease outcome and contribute to the design of novel therapeutic targets. The use of gene expression profiles in routine clinical practice is highly dependent on precise identification and robust validation of these gene signatures, which relies upon a high-throughput RT-PCR-based technology is available. Quantitative real time PCR (qRT-PCR) is one of the most powerful quantification methods for gene expression analysis. This technology has been applied to identify molecular tumor biomarkers [4], splice variants of target genes [5], and microRNAs [6], and to quantify circulating DNA [7], with the final goal of improving diagnosis and predicting clinical outcome [8]. In these studies, target gene expression is usually quantified in relation to a stably expressed reference gene, simultaneously determined in the sample [9]. Although it is assumed that these reference genes are constitutively expressed in certain tissues and under certain circumstances, the literature shows that the expression levels of some of the "classic" endogenous control genes may in fact vary in different tissues, cell types, and disease stages [10]. It is then clear that if US Food and Drug Administration (FDA) or other Regulatory Agencies are to approve any diagnostic or prognostic test based on qRT-PCR, the proof of the stability of the proposed reference genes will be a major requirement. Therefore, the selection of suitable reference genes is a key prerequisite to control the variability of clinical samples. Recent lung cancer molecular profiling studies have employed a group of widely used endogenous control genes, such as GAPDH [11], beta-actin (ACTB) [12], TATA-binding protein (TBP) [4], 18s-rRNA [13], HMBS [5] and phenylalanine hydroxylase [14], for RT-PCR. Such genes were selected in the past as reference genes for non- or semi-quantitative techniques and have been used for many years in most experiments to measure qualitative gene expression changes. These widely used reference genes were not selected for specific tissue types or organs and were mainly validated in cell lines. The advent of qRT-PCR allows for the accurate quantification of expression changes, albeit some studies have continued using these old reference genes without a re-evaluation of their suitability as endogenous control genes. The requirement for a specific validation of the currently used reference genes is compelling and the need for robust stable endogenous genes for lung cancer is urgent. We studied a panel of sixteen genes (some of them frequently used as endogenous controls) and analyzed their suitability as reference genes in both lung cell lines and clinical lung samples. From those, we identified Importin 8 (IPO8) as the most suitable gene for normalizing clinical lung specimens.

Methods

Tumor tissues and cultured cells

Tumor samples were obtained from Non-Small Cell Lung Cancer (NSCLC) patients who underwent tumor resectional surgery at the University Hospital of Navarra (Pamplona, Spain) and at the Hospital Marqués de Valdecilla (Santander, Spain), under approved ethical protocols and informed consent from each patient (See Supplemental Table 1, Additional file 1). Surgically removed samples (the tumor and its corresponding matched normal tissue) were snap-frozen in liquid nitrogen. A 5 μm section was cut with a cryostat, and analyzed by histology. The general strategy of our analysis is illustrated in Figure 1. In a first study, consisting in the analysis of GAPDH and HPRT1 mRNA quantification by qRT-PCR analysis, samples from the Set A of patients were used (Set A). A second analysis was conducted with the Human Endogenous Control Plate (#4367563, Applied Biosystems) using the Set B of samples. A third analysis included the clinical validation of the selected optimally performing reference genes and was carried out on Sets A+C primary tumor samples and their paired non-malignant lung tissues.
Figure 1

Schematic diagram of the overall procedure for the identification of accurate reference genes. General strategy to identify the most accurate reference genes for lung cancer mRNA quantification analysis in three different sets of samples.

Schematic diagram of the overall procedure for the identification of accurate reference genes. General strategy to identify the most accurate reference genes for lung cancer mRNA quantification analysis in three different sets of samples. Non-malignant human bronchial epithelial (NHBE) and small airways epithelial cells (SAEC), and seven NSCLC cell lines (NCI-H460, NCI-H1385, NCI-H157, NCI-H1648, NCI-H23, NCI-H441 and SK-MES-1) were used. Non-tumor cells were obtained from CAMBREX (NJ, USA), and tumor cells were obtained from ATCC (VA, USA). Non-malignant cells were grown in Bronchial Epithelial Cell Basal Medium (CAMBREX, NJ, USA), and NSCLC cell lines in RPMI-1640 supplemented with 10% fetal calf serum (Invitrogen, Carlsbad, CA).

RNA extraction and qRT-PCR

Total RNA was isolated using the AllPrep DNA/RNA mini Kit (Qiagen, CA, USA) as described by the manufacturer. RNA concentrations and the A260/A280 ratio were measured with a NanoDrop® ND-1000 (NanoDrop Technologies, Montchanin, DE, USA). The threshold inclusion values for the RNA samples were > 1.90 for the A260/A280 ratio. The absence of contaminating DNA was analyzed by running the samples through 2% agarose gels. RNA quality was also determined in a Bioanalyzer platform (Agilent, CA, USA). Two micrograms RNA were reverse transcribed. Before transcription, RNA was denatured for 5 min at 65°C followed by cooling on ice. First strand cDNA synthesis was carried out with SuperScript™ III Reverse Transcriptase (Invitrogen) and random primers (Invitrogen) in a total volume of 20 μl. Reverse transcription was performed at 42°C for 1 h followed by 72°C for 15 min. Finally, RNase H was added to the reaction mixture for 20 min at 37°C. cDNA was stored at -80°C until RT-PCR analysis. Each RNA sample was controlled for genomic DNA contamination by a reaction mix without reverse transcriptase addition. All cDNAs were diluted 1:10 before being used as PCR template. Measurement of the expression of candidate genes was performed with the TaqMan® Low Density Human Endogenous Control Panel (Applied Biosystems), according to the manufacturer's protocol. This plate contains sixteen human endogenous candidate genes (Table 1). qRT-PCR was performed with an Applied Biosystems 7900HT Fast Real-time PCR System. PCR efficiencies were calculated according to Rasmussen [15] and the standard curves generated in the qRT- PCR were plotted as Ct values versus logarithms of the given concentrations of the DNA templates.
Table 1

Genes included in the TaqMan® low density human endogenous control panel

Gene symbolGene nameFunction
18S18S Ribosomal RNAOne 18S molecule makes the small subunit of the ribosome
ACTBBeta-ActinCytoskeletal structural protein
B2MBeta-2-MicroglobulinBeta-chain of major histocompatibility complex class I molecules
GAPDHGlyceraldehyde-3-phosphate dehydrogenaseOxidoreductase in glycolysis and gluconeogenesis
GUSBBeta-GlucuronidaseDegradation of dermatan and keratan sulfates
HMBSHydroxymethylbilane synthaseHeme synthesis, porphyrin metabolism
HPRT1Hypoxanthine phosphoribosyl transferasePurine synthesis in salvage pathway
IPO8Importin8Function in nuclear protein import
PGK1Phosphoglycerate kinaseGlycolytic enzyme
POLR2ARNA Polymerase IICatalyzes the RNA synthesis from DNA
PPIAPeptidylprolyl isomerase ACatalyzes the cis-trans isomerization of proline imidic peptide bonds in oligopeptides, accelerating folding
RPLP0Ribosomal large P0Ribosome biogenesis and assembly
TBPTATA binding proteinGeneral RNA polymerase II transcription factor
TFRCTransferrin receptorCellular uptake of iron
UBCUbiquitin CProtein degradation
YWHAZTyrosinmonooxygenase/Tryp-tophanmonooxygenase activation proteinSignal transduction by binding to phosphorylated serine residues on a variety of signaling molecules
Genes included in the TaqMan® low density human endogenous control panel

Determination of gene stability

To evaluate suitability of candidates as reference genes, we applied two powerful previously published Microsoft Excel-based applications: 1) GeNorm [16], which calculates gene stability as the standard deviation (SD) of the log2-transformed expression ratios of each reference gene. The program is available on the Internet . Ct values were converted into relative quantities for analysis with GeNorm, considering the PCR efficiencies of the genes. 2) NormFinder [17], that uses a model-based approach to estimate expression stability based on intra- and intergroup variations for candidate endogenous control genes. It is also freely available on the Internet .

Analysis of the absolute variation of Ct values

Ct variations were expressed as ΔCt, the difference between Ct Normal and Ct Tumor (ΔCt = Ct Normal-Ct Tumor). To analyze the absolute variation of Ct values, we calculated the mean and the standard error of the mean (SEM) of absolute values of ΔCt ("|ΔCt|") for each gene.

Microarray data analysis

Three lung cancer patient microarrays (HuGene-FL, HG-U95A, HG-U133A), previously described [18-20], were analyzed. The raw datasets are publicly available at: . For analysis, significant differences in a specific reference gene candidate expression, between normal and tumor samples from patients (lung adenocarcinomas and squamous cell lung carcinomas) were identified by ANOVA. p-values < 0.05 were considered statistically significant.

Statistical analysis

All statistical evaluations were carried out using the SPSS software package. Correlations between genes were determined by Pearson's test. All p-values < 0.01 were considered statistically significant in this analysis. Normal distributions were assessed with the Shapiro-Wilk's W test. Differences in gene expressions between nonmalignant and malignant samples were calculated by the Student's t test for paired data with normal distribution, or by Wilcoxon's test for paired data following non-parametric distribution. p-values were considered significant when p < 0.05.

Results

HPRT1 and GAPDH, two genes commonly used for normalization, are inappropriate reference genes for human lung tissue analyses

GAPDH and HPRT1 have been recommended as suitable reference genes for lung cancer research [21]. Thus, we first evaluated expression levels of these genes in the Set A samples. The analysis revealed that gene mRNA levels were significantly higher in tumors than in non-malignant tissues for both genes: GAPDH (p = 0.0001) and HPRT1 (p = 0.00003). Ct variations (ΔCt = Ct Normal-Ct Tumor) were calculated for each gene (Supplemental Table 2, Additional file 1). Ideally, a good reference gene should have |ΔCt| values close to zero with low SEM. However, as shown in Figure 2, neither GAPDH (|ΔCt| = 2.27 ± 0.31) nor HPRT1 (|ΔCt| = 1.81 ± 0.26) showed this pattern.
Figure 2

Variations of GAPDH and HPRT1 expression levels. Mean ± SEM of absolute values of ΔCt (|ΔCt| = |Ct Normal-Ct Tumor|) of the two commonly used reference genes (GAPDH and HPRT1) in paired lung clinical samples (Set A).

Variations of GAPDH and HPRT1 expression levels. Mean ± SEM of absolute values of ΔCt (|ΔCt| = |Ct Normal-Ct Tumor|) of the two commonly used reference genes (GAPDH and HPRT1) in paired lung clinical samples (Set A). We also performed a statistical analysis of three lung cancer microarrays previously published [18-20]. The ANOVA analysis (p < 0.05) confirmed that there was a significant increase in both GAPDH and HPRT1 expression levels in tumor samples compared to normal tissues (Supplemental Table 3, Additional file 1). In summary, in lung clinical samples, as described in several other cancers [22-25], the expression of the two most commonly used reference genes is heterogeneous and, consequently, not valid for gene expression normalization.

Variable expression of sixteen endogenous control genes in both lung cell lines and clinical samples

In the next experiments, we used the Low Density Endogenous Control Panel on human samples and cultured cells, with the goal of identifying suitable genes for normalization. We first analyzed the efficiency of the PCR assay. The linear correlation coefficient (R2) of the standard curves of all the genes ranged from 0.9942 to 0.999. Based on the slopes of the standard curves, the amplification efficiencies of the standards were from 91% to 100%, which were derived from the formula E = (101/-slope -1) × 100 [15]. The Ct values of the 16 genes in all the samples were within 10.7 and 35.3 cycles. Differential expression levels and dispersion of individual Ct values from the mean Ct value were calculated for the 16 genes (Table 2). Except for 18S, which was the gene with the highest expression, all the other candidates showed Cts ranging from 19 to 33. In NSCLC cell lines, non-malignant cells, and lung tumors, the gene with the lowest mRNA levels was TBP, whereas the gene with the highest expression was ACTB. However, in normal clinical samples, HMBS showed the lowest level of expression (Ct = 32.68 ± 0.76), and B2M was highest expressed transcript (Ct = 22.94 ± 0.75) (Table 2).
Table 2

Comparison of mean cycle threshold (Ct ± SD) values across different sample groups

Cultured cellsClinical samples
TumorNormalTumorNormal
18S12.88 ± 0.6111.24 ± 0.7312.60 ± 1.6213.79 ± 1.00
ACTB22.20 ± 1.0719.30 ± 0.2623.07 ± 1.3623.59 ± 1.15
B2M24.99 ± 0.5123.46 ± 0.4323.21 ± 1.1822.94 ± 0.75
GAPDH22.61 ± 0.7220.64 ± 0.0024.48 ± 1.9227.13 ± 1.11
GUSB29.36 ± 0.7328.61 ± 0.3628.81 ± 1.3629.77 ± 1.04
HMBS29.23 ± 1.1128.41 ± 0.4530.89 ± 1.4732.68 ± 0.76
HPRT127.07 ± 0.7126.20 ± 0.4029.10 ± 1.9030,63 ± 0.97
IPO829.65 ± 1.1928.97 ± 0.0330.43 ± 1.3730.91 ± 0.97
PGK124.97 ± 0.8323.15 ± 0.3225.23 ± 1.6327.56 ± 0.92
POLR2A27.95 ± 0.9326.37 ± 0.0628.55 ± 1.3229.58 ± 1.06
PPIA23.16 ± 0.8221.49 ± 0.2324.37 ± 1.4726.00 ± 1.07
RPLP023.01 ± 0.7121.02 ± 0.3724.34 ± 1.8226.01 ± 0.98
TBP30.95 ± 1.7529.00 ± 0.9631.79 ± 2.1232.59 ± 1.16
TFRC26.71 ± 0.7626.64 ± 0.1327.94 ± 2.1828.65 ± 1.06
UBC23.11 ± 0.6220.83 ± 0.1023.82 ± 2.2924.78 ± 1.48
YWHAZ29.25 ± 0.6826.72 ± 0.2829.47 ± 2.0531.06 ± 0.85
Comparison of mean cycle threshold (Ct ± SD) values across different sample groups The SD of Ct values for each control gene was also calculated in the samples (Table 2). For NSCLC cell lines, B2M (SD = 0.51) expression showed the lowest SD. On the contrary, in non-malignant cells, GAPDH (SD = 0.00) was the gene with the lowest variability. B2M had the lowest SD for both tumor and normal clinical samples (1.18 and 0.75, respectively). Because of the different patterns of expression of cultured cells and clinical samples, we decided to conduct further analyses separately. In the search for the most stable reference candidates, the gene expression stability was analyzed with GeNorm and NormFinder softwares.

In lung cell lines, five genes (including GAPDH and HPRT1) perform optimally as endogenous control genes

As explained in Materials and Methods, GeNorm and NormFinder are two mathematical tools recently developed to identify expression stability of a set of candidate genes. The model-based approach (NormFinder) selects the candidates with minimal combined inter- and intra-group expression variation. The pairwise comparison approach (GeNorm) selects genes with a low intra-group variation and roughly the same no vanishing intergroup variation. GeNorm calculates the gene expression stability measure "M" of one gene, based on the average pairwise variation between all studied genes. The lowest M values characterize genes with the most stable expression. Expression stabilities were first evaluated with GeNorm in cell lines (Table 3). In the ranking of expression stability, these genes were top-classified (M < 0.5): PPIA and RPLPO>GAPDH>18S>HPRT1. Genes with middle stability (0.5 POLR2A>HMBS>PGK1>UBC>ACTB. The less stable genes were B2M, TFRC, YWHAZ, IPO8 and TBP, whose M values were higher than 0.7 (Table 3).
Table 3

Candidate reference genes for normalization of qRT-PCR (in lung cultured cells) ranked according to their expression stability by GeNorm and NormFinder programs

GeNormNormFinder
GenesStability valueGenesStability value
PPIA0.377Most stable genesPPIA0.078
RPLPO0.377RPLP00.199
GAPDH0.387HPRT10.232
18S0.430GAPDH0.244
HPRT10.47318S0.257
GUSB0.509GUSB0.328
POLR2A0.547POLR2A0.407
HMBS0.580UBC0.408
PGK10.614PGK10.420
UBC0.644HMBS0.425
ACTB0.681ACTB0.508
B2M0.719B2M0.529
TFRC0.761TFRC0.583
YWHAZ0.801YWHAZ0.631
IPO80.852IPO80.758
TBP0.917Least stable genesTBP0.863
Candidate reference genes for normalization of qRT-PCR (in lung cultured cells) ranked according to their expression stability by GeNorm and NormFinder programs Using NormFinder with arbitrary cut-off values of 0.4 and 0.6 in the cell lines, the most stable genes were PPIA>RPLPO>HPRT1>GAPDH>18S>GUSB (Table 3). Genes with intermediate stability included POLR2A>UBC>PGK1>HBMS>ACTB>B2M>TFRC. Finally, YWHAZ, IPO8, and TBP were the least stable genes. Considering results from both softwares, PPIA, RPLPO, GAPDH, HPRT1, and 18S can be considered accurate reference genes in lung cancer cell lines. We further determined correlations between genes by Pearson's test, considering statistically significant p-values < 0.01 (supplemental Text S1 and supplemental Table 4, Additional file 1). Interestingly, our five candidate genes (PPIA, RPLPO, GAPDH, HPRT1, and 18S) exhibited a very strong correlation among themselves (r = 0.949–0.814).

Determination of a set of five candidate genes to be used as reference genes in clinical samples

In clinical samples, GeNorm identified the following genes with M values < 0.5 (very stable): IPO8, ACTB>POLR2A>18S; genes with M values ranging from 0.5 to 0.7 included PPIA>HMBS>RPLPO>YWHAZ. The group with M values >0.7 (not suitable for normalization) were PGK1>HPRT1>TBP>GAPDH>UBC>B2M>GUSB>TFRC (Table 4). Using NormFinder in clinical samples we observed that the most stable candidates were the following: PPIA>POLR2A>18S>HMBS>RPLPO>IPO8>ACTB>YWHAZ, with stability values < 0.4 (Table 4). Genes with stability values between 0.4 and 0.6 were HPRT1, TBP, PGK1, UBC, GUSB, and TFRC. The worse stable genes were GAPDH and B2M.
Table 4

Rank of expression stability (in lung tissues) calculated by GeNorm and NormFinder softwares

GeNormNormFinder
GenesStability valueGenesStability value
IPO80.245Most stable genesPPIA0.253
ACTB0.245POLR2A0.267
POLR2A0.34418S0.276
18S0.466HMBS0.318
PPIA0.530RPLP00.323
HMBS0.564IPO80.336
RPLP00.599ACTB0.360
YWHAZ0.658YWHAZ0.383
PGK10.717HPRT10.430
HPRT10.774TBP0.474
TBP0.823PGK10.484
GAPDH0.872UBC0.510
UBC0.931GUSB0.535
B2M0.984TFRC0.586
GUSB1.045GAPDH0.653
TFRC1.101Least stable genesB2M0.701
Rank of expression stability (in lung tissues) calculated by GeNorm and NormFinder softwares Consequently, the best three genes using GeNorm were IPO8, ACTB, and POLR2A, whereas the best three candidates using NormFinder were PPIA, POLR2A, and 18S. We conclude from both analyses that IPO8, ACTB, POLR2A, 18S and PPIA are suitable reference genes for lung biopsies. Furthermore, these genes exhibited a high Pearson correlation among themselves (r = 0.981–0.857) (supplemental Text S2 and Supplemental Table 6, Additional file 1). Moreover, IPO8 was the gene with the highest number of significant correlations (supplemental Table 7, Additional file 1). Since we had two different top-ranked candidates, depending on the type of analysis, we carried out further experiments to identify the best normalizing gene among the best five genes. This analysis was performed using the Set C sample, and calculating the means ± SEM of |ΔCt| (|Ct Normal-Ct Tumor|) for the aforementioned five genes (Figure 3). The ranking of these genes according to this criterion was as follows: 18S(0.563 ± 0.123)>IPO8(0.638 ± 0.222)>ACTB(0.875 ± 0.153)>POLR2A(0.878 ± 0.242)>PPIA(1.633 ± 0.201). The tests of paired data showed no significant differences between normal and malignant tissues for IPO8 (p = 0.877), whereas significant differences were found for all the other genes: PPIA (p = 0.000), 18S (p = 0.007), POLR2A (p = 0.011) and ACTB (p = 0.046). Therefore, IPO8 can be considered the best candidate, taking into account all these criteria (high stability, low mean ± SEM of |ΔCt| with no significant differences between paired samples, and number of correlations).
Figure 3

Average of absolute values of ΔCt of IPO8, ACTB, POLR2A, 18S and PPIA. Mean ± SEM of |ΔCt| (=|Ct Normal-Ct Tumor|) of five selected optimally performing reference genes (IPO8, ACTB, POLR2A, 18S and PPIA) in paired lung clinical samples (Set C).

Average of absolute values of ΔCt of IPO8, ACTB, POLR2A, 18S and PPIA. Mean ± SEM of |ΔCt| (=|Ct Normal-Ct Tumor|) of five selected optimally performing reference genes (IPO8, ACTB, POLR2A, 18S and PPIA) in paired lung clinical samples (Set C).

IPO8 as the most accurate reference gene for clinicopathological specimens

In order to further validate IPO8 as the best control gene for lung tissues, a third analysis was performed in Sets A+C samples. In addition, we used this analysis to reexamine PPIA as a putative control gene (since it was top-classified by NormFinder) and to further validate the inaccuracy of GAPDH and HPRT1 as normalizing genes for clinicopathological lung specimens. In this case, we used |ΔCt| ± SEM, but not expression stability based on GeNorm and NormFinder, because the validity of this latter analysis relies on examining a large number of genes (typically 5 to 10) [16]. The test of paired data revealed no significant differences between normal and malignant samples for IPO8 mRNA levels (|ΔCt| = 0.70 ± 0.09). On the contrary, the three other genes showed significantly different |ΔCt| values when comparing non-malignant with malignant tissues (Figure 4). IPO8 was also the gene with the lowest SEM (Figure 4). Therefore, these results confirmed that IPO8 is the best reference gene for normalizing lung tissue samples. PPIA showed |ΔCt| = 1.45 ± 0.17, and GAPDH and HPRT1 had the highest difference in |ΔCt| (on average) between normal and tumor samples: 2.42 ± 0.20 for GAPDH and 1.91 ± 0.21 for HPRT1.
Figure 4

IPO8 as the most accurate reference gene in lung specimens. Average (mean ± SEM) of |ΔCt| of the two commonly used reference genes (GAPDH and HPRT1), PPIA and the novel reference gene IPO8 in paired lung clinical samples (Sets A+C).

IPO8 as the most accurate reference gene in lung specimens. Average (mean ± SEM) of |ΔCt| of the two commonly used reference genes (GAPDH and HPRT1), PPIA and the novel reference gene IPO8 in paired lung clinical samples (Sets A+C).

Discussion

The identification of novel diagnostic tools and therapeutic targets for lung cancer relies on the accurate normalization with reference genes whose expression remains constant in both normal and malignant tissues. Stringent requirements for selecting endogenous controls are essential, and the task of identifying normalization genes is not trivial. Several recent papers have demonstrated that classical "housekeeping" genes such as GAPDH, HPRT1, and ACTB (β-actin) are inaccurate to normalize different types of clinical samples [23,25-27] In the present study, we have analyzed the expression of a panel of 16 genes in lung cancer cell lines and biopsies, with the goal of identifying the most accurate candidate to be used as a reference gene. The main finding of our study is the identification of importin-8 (IPO8) as a very robust reference gene for lung clinical specimens, which could become the gold-standard endogenous gene for lung tissues. One first conclusion is that GAPDH and HPRT1, the two most commonly reference genes used in the literature, are not suitable for the normalization of gene expression lung biopsies. Nonetheless, our data also show that the use of both genes is perfectly appropriate for expression studies using lung cell lines. In contrast, in clinical specimens we have clearly shown a significant increase in GAPDH and HPRT1 mRNA levels in tumors (as compared to normal matched tissues) and low expression stability. Despite GAPDH was widely used in the past, its use as a reference gene has recently been challenged in the majority of tumor types, including melanoma [22], liver [23], bladder [24], renal cancer [25], prostate [26], gastroesophagic and pancreatic cancer [27], and colon adenocarcinoma [28]. In vitro assays have demonstrated that GAPDH contributes to diverse cellular functions related to glycolysis, nuclear RNA export, DNA replication and repair, exocytosis, and cytoskeletal organization [29]. GAPDH was also suggested to play a role in the pathogenesis of cancer [29]. Remarkably, antisense oligodeoxynucleotides targeting GAPDH inhibit cell proliferation and induce apoptosis in cervical carcinoma cells. Taken together our results and data from the literature, in spite of a previous study proposing GAPDH as a good normalizing gene for lung biopsies [21], we strongly suggest not to use this gene for gene expression normalizing purposes in lung. In our search for the most accurate gene to normalize lung specimens, we ranked the 16 candidate genes according to expression stability, and mean of |ΔCt| ± SEM values. According to GeNorm, the best three genes in terms of expression stability were IPO8, ACTB, and POLR2A, whereas using NormFinder PPIA, POLR2A, and 18S were top-classified. From both analyses, we consequently proposed a set of five genes (IPO8, ACTB, POLR2A, 18S and PPIA) as suitable reference genes for lung specimens. Considering the lowest |ΔCt|, 18S was top-ranked (0.563 ± 0.123), closely followed by IPO8 (0.638 ± 0.222). However, statistical comparison of expression levels between normal and malignant tissues found no differences exclusively for IPO8, but not for any other gene. In addition, expression of IPO8 strongly correlated (r2 > 0.9) with that of 18S, ACTB, and POLR2A. IPO8 had also the highest number of gene correlations. In view of all these results we conclude that IPO8 is the most robust reference gene for lung cancer studies. Indeed, we further validated the accuracy of IPO8 as a reference gene in a different set of samples and found again that Ct values for normal samples were statistically similar to those of tumors. Interestingly, IPO8 has never been proposed as a potential reference gene in cancer research. Importin 8 (IPO8), a gene located at 12p11.21, which encodes a protein of 1037 aminoacids, is a member of a class of approximately 20 potential Ran targets that share a sequence motif related to the Ran-binding site of importin-beta. This protein binds to the nuclear pore complex and, along with RanGTP and RANBP1, inhibits the GAP stimulation of the Ran GTPase. The importin-alpha/beta complex and the GTPase Ran mediate nuclear import of proteins with a classical nuclear localization signal [30]. Despite its accuracy in the normalization of lung clinical samples, IPO8 is not the best option for in vitro studies. Although |ΔCt| for IPO8 was also very low in cell lines, its expression is not stable (according to Genorm and NormFinder analysis). In cell lines, PPIA was top-classified in terms of expression stability, followed by RPLPO, 18S, and HPRT1. 18S was included in the group of optimally performing endogenous genes in all the analyses of our study, for both cell lines and clinical samples. Therefore, 18S could be an alternative to IPO8 when a study required the use of cell lines and biopsies with a single reference gene. However, 18S rRNA levels are extremely high (Ct values between 12.6–12.8) in comparison to other target housekeeping mRNAs, which may increase the risk of introducing quantification errors. In addition, some studies have suggested that mRNA transcripts should not be normalized with a ribosomal RNA, because of their unrelated expression mechanisms [31]. Systematic comparisons of gene sets in different types of tumors have recently led to the selection of a variety of optimal reference genes: SDHA and TBP for bladder [24], 18S for gastric and colorectal [27], PPIA and TBP for renal [25], HPRT1 for prostate [26], SFRS4 for hepatocellular carcinoma [23], or B2M for colon adenocarcinoma [28]. It seems clear that a single definitive universal reference gene has not been identified yet, and may be very difficult to find, as tissue specific gene expression is the basis for tissue and organ differentiation. Consequently, appropriate control genes for each specific tumor type need to be selected among a variety of candidates, using stringent mathematical criteria.

Conclusion

We can draw several important conclusions from our study: a) GAPDH and HPRT1 are not suitable genes to normalize lung specimens but are appropriate when using lung cell lines; b) The best performing reference genes for lung cell lines are not coincident with those of clinical samples; c) PPIA is a novel reference gene for lung cell lines; d) Finally, and most importantly, we have described for the first time that Importin-8 is the best performing gene to normalize clinicopathological lung samples and should be considered as the main option when using lung biopsies. We believe that this finding will help further studies to normalize potential new targets for diagnosis and treatment of lung cancer.

Abbreviations

cDNA: complementary DNA; NSCLC: non-small cell lung cancer; RT-PCR: reverse transcription-PCR.

Authors' contributions

PAN and JA were the main contributors to the manuscript, carrying out most of the experiments, data analysis, design, interpretation of the results, and writing most of the manuscript. MDL and JGR collected the samples and made critical contributions to the analysis of results. MJR and BAS participated in the study design and performed part of the experiments. DB, IV, MJP, and RP contributed to the data analysis. LMM and AC participated in the study design, supervised the work and partially wrote the manuscript. All authors read and approved the manuscript.

Additional file 1

Supplementary material Click here for file
  30 in total

1.  Clinical validity of the lung cancer biomarkers identified by bioinformatics analysis of public expression data.

Authors:  Bumjin Kim; Hyun Joo Lee; Hye Young Choi; Youngah Shin; Seungyoon Nam; Gilju Seo; Dae-Soon Son; Jisuk Jo; Jaesang Kim; Jinseon Lee; Jhingook Kim; Kwhanmien Kim; Sanghyuk Lee
Journal:  Cancer Res       Date:  2007-08-01       Impact factor: 12.701

2.  Ribosomal 18S RNA prevails over glyceraldehyde-3-phosphate dehydrogenase and beta-actin genes as internal standard for quantitative comparison of mRNA levels in invasive and noninvasive human melanoma cell subpopulations.

Authors:  D Goidin; A Mamessier; M J Staquet; D Schmitt; O Berthier-Vergnes
Journal:  Anal Biochem       Date:  2001-08-01       Impact factor: 3.365

3.  Altered HOX and WNT7A expression in human lung cancer.

Authors:  R Calvo; J West; W Franklin; P Erickson; L Bemis; E Li; B Helfrich; P Bunn; J Roche; E Brambilla; R Rosell; R M Gemmill; H A Drabkin
Journal:  Proc Natl Acad Sci U S A       Date:  2000-11-07       Impact factor: 11.205

4.  Interleukin-8 messenger ribonucleic acid expression correlates with tumor progression, tumor angiogenesis, patient survival, and timing of relapse in non-small-cell lung cancer.

Authors:  A Yuan; P C Yang; C J Yu; W J Chen; F Y Lin; S H Kuo; K T Luh
Journal:  Am J Respir Crit Care Med       Date:  2000-11       Impact factor: 21.405

Review 5.  Absolute quantification of mRNA using real-time reverse transcription polymerase chain reaction assays.

Authors:  S A Bustin
Journal:  J Mol Endocrinol       Date:  2000-10       Impact factor: 5.098

6.  Cancer statistics, 2008.

Authors:  Ahmedin Jemal; Rebecca Siegel; Elizabeth Ward; Yongping Hao; Jiaquan Xu; Taylor Murray; Michael J Thun
Journal:  CA Cancer J Clin       Date:  2008-02-20       Impact factor: 508.702

7.  GeneChip, geNorm, and gastrointestinal tumors: novel reference genes for real-time PCR.

Authors:  Mark Kidd; Boaz Nadler; Shrikant Mane; Geeta Eick; Maximillian Malfertheiner; Manish Champaneria; Roswitha Pfragner; Irvin Modlin
Journal:  Physiol Genomics       Date:  2007-04-24       Impact factor: 3.107

8.  Quantitative real-time reverse transcription PCR study of the expression of vascular endothelial growth factor (VEGF) splice variants and VEGF receptors (VEGFR-1 and VEGFR-2) in non small cell lung cancer.

Authors:  Eleni Zygalaki; Emily G Tsaroucha; Loukas Kaklamanis; Evi S Lianidou
Journal:  Clin Chem       Date:  2007-06-28       Impact factor: 8.327

9.  Survival prediction of stage I lung adenocarcinomas by expression of 10 genes.

Authors:  Fabrizio Bianchi; Paolo Nuciforo; Manuela Vecchi; Loris Bernard; Laura Tizzoni; Antonio Marchetti; Fiamma Buttitta; Lara Felicioni; Francesco Nicassio; Pier Paolo Di Fiore
Journal:  J Clin Invest       Date:  2007-11       Impact factor: 14.808

10.  Signal recognition particle protein 19 is imported into the nucleus by importin 8 (RanBP8) and transportin.

Authors:  K A Dean; O von Ahsen; D Görlich; H M Fried
Journal:  J Cell Sci       Date:  2001-10       Impact factor: 5.285

View more
  25 in total

1.  Novel functional germline variants in the VEGF receptor 2 gene and their effect on gene expression and microvessel density in lung cancer.

Authors:  Dylan M Glubb; Elisa Cerri; Alexandra Giese; Wei Zhang; Osman Mirza; Emma E Thompson; Peixian Chen; Soma Das; Jacek Jassem; Witold Rzyman; Mark W Lingen; Ravi Salgia; Fred R Hirsch; Rafal Dziadziuszko; Kurt Ballmer-Hofer; Federico Innocenti
Journal:  Clin Cancer Res       Date:  2011-06-28       Impact factor: 12.531

2.  Anaphylatoxin C5a creates a favorable microenvironment for lung cancer progression.

Authors:  Leticia Corrales; Daniel Ajona; Stavros Rafail; Juan J Lasarte; Jose I Riezu-Boj; John D Lambris; Ana Rouzaut; Maria J Pajares; Luis M Montuenga; Ruben Pio
Journal:  J Immunol       Date:  2012-10-01       Impact factor: 5.422

3.  Varied pathways of stage IA lung adenocarcinomas discovered by integrated gene expression analysis.

Authors:  Chengwen Chen; Xuping Fu; Deqiang Zhang; Yuan Li; Yi Xie; Yao Li; Yan Huang
Journal:  Int J Biol Sci       Date:  2011-04-28       Impact factor: 6.580

4.  A large-scale analysis of alternative splicing reveals a key role of QKI in lung cancer.

Authors:  Fernando J de Miguel; María J Pajares; Elena Martínez-Terroba; Daniel Ajona; Xabier Morales; Ravi D Sharma; Francisco J Pardo; Ana Rouzaut; Angel Rubio; Luis M Montuenga; Ruben Pio
Journal:  Mol Oncol       Date:  2016-08-09       Impact factor: 6.603

5.  Development of a novel splice array platform and its application in the identification of alternative splice variants in lung cancer.

Authors:  Ruben Pio; David Blanco; Maria Jose Pajares; Elena Aibar; Olga Durany; Teresa Ezponda; Jackeline Agorreta; Javier Gomez-Roman; Miguel Angel Anton; Angel Rubio; Maria D Lozano; Jose M López-Picazo; Francesc Subirada; Tamara Maes; Luis M Montuenga
Journal:  BMC Genomics       Date:  2010-06-03       Impact factor: 3.969

Review 6.  Novel insight into the role of GAPDH playing in tumor.

Authors:  Chunmei Guo; Shuqing Liu; Ming-Zhong Sun
Journal:  Clin Transl Oncol       Date:  2012-08-22       Impact factor: 3.405

7.  miRNA processing gene polymorphisms, blood DNA methylation age and long-term ambient PM2.5 exposure in elderly men.

Authors:  Jamaji C Nwanaji-Enwerem; Elena Colicino; Lingzhen Dai; Qian Di; Allan C Just; Lifang Hou; Pantel Vokonas; Immaculata De Vivo; Bernardo Lemos; Quan Lu; Marc G Weisskopf; Andrea A Baccarelli; Joel D Schwartz
Journal:  Epigenomics       Date:  2017-11-06       Impact factor: 4.778

8.  Identification of reference genes across physiological states for qRT-PCR through microarray meta-analysis.

Authors:  Wei-Chung Cheng; Cheng-Wei Chang; Chaang-Ray Chen; Min-Lung Tsai; Wun-Yi Shu; Chia-Yang Li; Ian C Hsu
Journal:  PLoS One       Date:  2011-02-24       Impact factor: 3.240

9.  Detection of collagen triple helix repeat containing-1 and nuclear factor (erythroid-derived 2)-like 3 in colorectal cancer.

Authors:  Marco Palma; Lissett Lopez; Margarita García; Nuria de Roja; Tamara Ruiz; Julita García; Elisabet Rosell; Carmen Vela; Paloma Rueda; María-Jose Rodriguez
Journal:  BMC Clin Pathol       Date:  2012-02-09

10.  Overexpression of TMPRSS4 in non-small cell lung cancer is associated with poor prognosis in patients with squamous histology.

Authors:  L Larzabal; P A Nguewa; R Pio; D Blanco; B Sanchez; M J Rodríguez; M J Pajares; R Catena; L M Montuenga; A Calvo
Journal:  Br J Cancer       Date:  2011-11-08       Impact factor: 7.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.