| Literature DB >> 35770000 |
Xiaoxia Wen1, Guishu Yang2, Yongcheng Dong3, Liping Luo4, Bangrong Cao4, Birga Anteneh Mengesha5, Ruiling Zu6, Yulin Liao6, Chang Liu6, Shi Li6, Yao Deng6, Kaijiong Zhang6, Xin Ma3, Jian Huang5, Dongsheng Wang6, Keyan Zhao3, Ping Leng1, Huaichao Luo6.
Abstract
Many studies in recent years have demonstrated that some messenger RNA (mRNA) in platelets can be used as biomarkers for the diagnosis of pan-cancer. The quantitative real-time polymerase chain reaction (RT-qPCR) molecular technique is most commonly used to determine mRNA expression changes in platelets. Accurate and reliable relative RT-qPCR is highly dependent on reliable reference genes. However, there is no study to validate the reference gene in platelets for pan-cancer. Given that the expression of some commonly used reference genes is altered in certain conditions, selecting and verifying the most suitable reference gene for pan-cancer in platelets is necessary to diagnose early stage cancer. This study performed bioinformatics and functional analysis from the RNA-seq of platelets data set (GSE68086). We generated 95 candidate reference genes after the primary bioinformatics step. Seven reference genes (YWHAZ, GNAS, GAPDH, OAZ1, PTMA, B2M, and ACTB) were screened out among the 95 candidate reference genes from the data set of the platelets' transcriptome of pan-cancer and 73 commonly known reference genes. These candidate reference genes were verified by another platelets expression data set (GSE89843). Then, we used RT-qPCR to confirm the expression levels of these seven genes in pan-cancer patients and healthy individuals. These RT-qPCR results were analyzed using the internal stability analysis software programs (the comparative Delta CT method, geNorm, NormFinder, and BestKeeper) to rank the candidate genes in the order of decreasing stability. By contrast, the GAPDH gene was stably and constitutively expressed at high levels in all the tested samples. Therefore, GAPDH was recommended as the most suitable reference gene for platelet transcript analysis. In conclusion, our result may play an essential part in establishing a molecular diagnostic platform based on the platelets to diagnose pan-cancer.Entities:
Keywords: normalization; pan-cancer; platelets; quantitative real time polymerase chain reaction; reference genes
Year: 2022 PMID: 35770000 PMCID: PMC9234127 DOI: 10.3389/fgene.2022.913886
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
An overview of the current reference genes commonly used for studies of pan-cancer and platelets.
| References gene | Cancer type | Sample size | PMID |
|---|---|---|---|
|
| Colorectal cancer (CRC) | 286 CRC patients and 41 healthy controls and 22 patients with ulcerative colitis and 23 patients with Crohn’s disease | 31639773 |
|
| Lung cancer | 48 lung cancer patients and 48 healthy donors | 31552488 |
|
| Colorectal cancer (CRC) and non–small-cell lung cancer (NSCLC) | 19 CRC patients, 16 NSCLC patients, and 4 healthy volunteers | 33955587 |
|
| Non–small-cell lung cancer (NSCLC) | 243 NSCLC patients, 150 healthy controls, and 141 benign pulmonary nodules patients | 31523198 |
|
| Non–small-cell lung cancer (NSCLC) | 10 NSCLC patients and 7 healthy subjects | 33287695 |
|
| Hepatocellular carcinoma (HCC) | 20 HCC patients, 20 liver cirrhosis patients, and 10 healthy subjects | 34469466 |
|
| Lung cancer | 58 healthy donors and 156 lung cancer patients | 30201066 |
Primer sequences of the seven candidate reference genes.
| Gene | Primer sequences (5′–3′) |
|---|---|
|
| F:GCTATACGACCTGCTGCCTTTCT |
| RR:CTCCTTAATGTCACGCACGAT | |
| CTCCTTAATGTCACGCACGAT | |
|
| F:ACCCAGAAGACTGTGGATGG |
| R:TTCAGCTCAGGGATGACCTT | |
|
| F:CCTGCATGAAGTCTGTAACTGAG |
| R:GACCTACGGGCTCCTACAACA | |
|
| F:GAGGCTATCCAGCGTACTCCA |
| R:CGGCAGGCATACTCATCTTTT | |
|
| F:TGCCTCGGGAACAGTAAGAC |
| R:GCCGCCCTCTCCATTAAAC | |
|
| F:CTCCACTGCTGTAGTAACCCG |
| R:GATCCCTCTGACTATTCCCTCG | |
|
| F:TCAGACGCAGCCGTAGACA |
| R:GCATTCCCGTTAGCAGGGG |
FIGURE 1The overall workflow of bioinformatical statistics for screening the candidate reference genes from the platelet RNA sequencing data set.
FIGURE 2Venn diagram of distribution relationship between the candidate genes and the six tumor groups. The 95 genes in the lower right corner are selected from 285 candidate genes.
FIGURE 3The verification of expression stability of seven candidate reference genes of platelet in another data set (GSE89843). (A) Seven candidate reference genes all expressed stably in the platelet sequencing data from 377 healthy individuals. (B) Seven candidate reference genes all expressed stably in the platelet sequencing data from 402 NSCLC patients. Blue: selected (stable expression), red: removed (unstable expression).
Baseline characteristics of all the enrolled subjects.
| Total | Validation group 1 | Validation group 2 | |
|---|---|---|---|
| Gender | |||
|
| 37 (46.25%) | 14 (46.7%) | 23 (46%) |
|
| 43 (53.75%) | 16 (53.3%) | 27 (54%) |
| Age (years) | |||
| Mean (SD) | 57.46 (9.24) | 60.47 (8.50) | 55.6 (9.20) |
| Cancer | |||
| Lung cancer (NSCLC) | 15 (18.75%) | 5 (16.67%) | 10 (20.0%) |
| Colon cancer (CRC) | 16 (20.0%) | 6 (20.0%) | 10 (20.0%) |
| Hepatobiliary cancer (HBC) | 16 (20.0%) | 6 (20.0%) | 10 (20.0%) |
| Breast cancer (BrCa) | 16 (20.0%) | 6 (20.0%) | 10 (20.0%) |
| Healthy control (HC) | 17 (21.25%) | 7 (23.33%) | 10 (20.0%) |
| Stage | |||
| I | 3 (3.75%) | 1 (3.3%) | 2 (4.0%) |
| II | 16 (20.0%) | 7 (23.3%) | 9 (18.0%) |
| III | 13 (16.25%) | 7 (23.3%) | 6 (12.0%) |
| IV | 27 (33.75%) | 8 (26.7%) | 19 (38.0%) |
| NA | 21 (26.25%) | 7 (23.3%) | 14 (28.0%) |
FIGURE 4(A) Validation of the stability of the seven reference genes. (B-H) The validation of each reference gene with the cancerous and health group. CT value reflects the abundance of reference gene expression. The higher the CT value, the lower the expression level, and vice versa. The standard deviation (SD) of CT values is a schematic indicator of the stability of candidate reference gene expression in all the tested samples. The box plots show a box from the first quartile (25th percentiles) to the third quartile (75th percentiles) and the median in the midst (50th percentiles).
FIGURE 5(A) Ranking of the three reference genes based on their expression stability calculated by Delta CT. (B). GeNorm analysis of the three candidate reference genes. (C). Stability value of each of the three candidate reference genes from the NormFinder analysis. (D). BestKeeper algorithm analysis to determine the stability of reference genes, where a low value indicates a more stable expression in the normalization factor. The least stable gene in each step is indicated by arrows.
Ranking of the three reference genes stability.
| Methods | References genes stability value rank | ||
|---|---|---|---|
| First (1st) | Second (2nd) | Third (3rd) | |
| Delta CT |
|
|
|
| BestKeeper |
|
|
|
| NormFinder |
|
|
|
| geNorm |
| — |
|
| Recommended comprehensive ranking |
|
|
|
Bold values indicates that the finally comprehensive ranking of three reference genes after combing other stability methods.
Basic clinical characteristics of the verified subjects.
| Lung cancer | Healthy subjects |
| |
|---|---|---|---|
|
| 21 | 21 | — |
|
| — | — | 0.298 |
|
| 14 (66.7%) | 9 (42.9%) | — |
|
| 7 (33.3%) | 12 (57.1%) | — |
|
| 61.90 (7.01) | 49.48 (8.41) |
|
| Mean (SD) |
Italic values represents the statistically significant different of age between the healthy subjects and lung cancer patients.
FIGURE 6The quantitative analysis results of FLNA gene expression in platelets of lung cancer patients and healthy subjects. ***p < 0.01.