| Literature DB >> 32309342 |
Jie Zheng1, Ming-Yu Zhu2, Fei Wu3, Bin Kang3, Ji Liang3, Fabienne Heskia4, Yun-Feng Shan5, Xin-Xin Zhang6.
Abstract
BACKGROUND: Hepatocellular carcinoma (HCC) is one of the most common and lethal malignancies. Early detection of HCC could largely reduce mortalities. Ultrasonography (US) and serum Alpha Fetoprotein (AFP) test are the screening methods that are most frequently applied to high-risk populations. Due to the poor performance of AFP testing, and the highly operator-dependent nature of US, a biomarker for HCC early diagnosis is highly sought after. We developed a method for HCC screening using a 22-gene expression signature.Entities:
Keywords: Diagnosis; blood; carcinoma; gene expression; hepatocellular
Year: 2020 PMID: 32309342 PMCID: PMC7154425 DOI: 10.21037/atm.2020.01.93
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Clinical Characters
| Characters | Microarray | qPCR | |||||
|---|---|---|---|---|---|---|---|
| Chronic hepatitis | Liver cirrhosis | Hepatocellular carcinoma | Chronic hepatitis | Liver cirrhosis | Hepatocellular carcinoma | ||
| n | 29 | 31 | 38 | 104 | 112 | 100 | |
| Age, y | |||||||
| Mean | 42.24 | 48.84 | 54.26 | 38.14 | 47.36 | 55.87 | |
| Range | 30–77 | 32–65 | 31–81 | 22–77 | 25–72 | 25–81 | |
| Gender | |||||||
| M | 23 | 27 | 32 | 69 | 80 | 80 | |
| F | 6 | 4 | 6 | 35 | 32 | 20 | |
| TNM stage | |||||||
| T | |||||||
| T1 | – | – | 15 | – | – | 44 | |
| T2 | – | – | 6 | – | – | 20 | |
| T3 | – | – | 2 | – | – | 5 | |
| T4 | – | – | 12 | – | – | 14 | |
| N | |||||||
| N1 | – | – | 33 | – | – | 79 | |
| N2 | – | – | 1 | – | – | 3 | |
| M | |||||||
| M0 | – | – | 33 | – | – | 79 | |
| M1 | – | – | 1 | – | – | 3 | |
| AFP >20 | 3 | 7 | 30 | 13 | 28 | 55 | |
| AFP ≤20 | 26 | 24 | 8 | 91 | 84 | 45 | |
| ALT (IU/L) | |||||||
| Mean | 48.24 (n=29) | 56.23 (n=31) | 92 (n=38) | 53.15 (n=78) | 51.51 (n=94) | 54.25 (n=55) | |
| Range | 11–304 | 12–254 | 9–766.2 | 11–304 | 12–274 | 9–129.2 | |
| AST (IU/L) | |||||||
| Mean | 33.07 (n=29) | 50.87 (n=31) | 101.2 (n=38) | 35.54 (n=78) | 51.08 (n=94) | 62.31 (n=56) | |
| Range | 13–129 | 18–171 | 14.2–596.9 | 13–129 | 17–275 | 16–399 | |
| Bilirubin (μmol/L) | |||||||
| Mean | 16.42 (n=28) | 21.07 (n=31) | 42.49 (n=38) | 16.69 (n=78) | 34.98 (n=93) | 34.92 (n=52) | |
| Range | 4.7–39 | 7.3–51.2 | 4.9–592.3 | 4.7–39.2 | 5–436.4 | 4.9–530.4 | |
| Creatinine (μmol/L) | |||||||
| Mean | 80.27 (n=15) | 76.1 (n=21) | NA | 76.83 (n=40) | 73.91 (n=56) | 66.29 (n=50) | |
| Range | 63–104 | 55–103 | NA | 47–141 | 31–181 | 36–104.7 | |
| Albumin (g/L) | |||||||
| Mean | 43.71 (n=28) | 40.29 (n=31) | 40.17 (n=38) | 44.1 (n=76) | 39.85 (n=92) | 39.17 (n=54) | |
| Range | 37–50 | 27–51 | 23.3–79.8 | 36.9–51 | 26.8–51 | 24.2–64.2 | |
| Etiology | |||||||
| CHB | 26 | 28 | 35 (27 with LC) | 84 | 93 | 82 (66 with LC) | |
| CHC | 2 | 0 | 1 (1 with LC) | 15 | 6 | 0 | |
| AIH | 0 | 0 | 0 | 1 | 3 | 1 (1 with LC) | |
| Alcohol | 0 | 1 | 0 | 0 | 3 | 0 | |
| Cryptogenic | 1 | 2 | 2 | 4 | 7 | 17 (7 with LC) | |
M, male; F, female; CHB, chronic hepatitis B positive; LC, liver cirrhosis; CHC, chronic hepatitis C positive; AIH, autoimmune hepatitis.
Figure 1Two strategies for feature selection and model construction. (A) mRMR-SVM process: In each LOOCV iteration, models were trained via SVM and based on genes selected by mRMR method. Signature with best performance were selected; (B) lasso-SVM process: Lasso feature selection was sequentially combined with SVM method. The optimum number of genes were selected as signature according to the model performance in a 10-fold cross validation; (C) qPCR results: genes selected via the two methods were combined for qPCR validation. Lasso-SVM procedure was applied on qPCR results to generated a 22-gene signature, including 12 genes selected via mRMR-SVM alone, 4 genes selected via lasso-SVM alone, and 6 via both processes.
The 22-gene signature
| Probe Set ID | Gene symbol | Gene title | UniGene ID | Log fold change | Adj. P value |
|---|---|---|---|---|---|
| 11759232_at |
| Megakaryocyte and Platelet Inhibitory Receptor G6b | Hs.247879 | 1.478274023 | 2.86E-18 |
| 11748570_a_at |
| Fatty acid hydroxylase domain Containing 2 | Hs.519694 | 1.372467571 | 4.91E-16 |
| 11738596_x_at |
| Platelet factor 4 variant 1 | Hs.72933 | 1.33841041 | 1.58E-12 |
| 11733817_s_at |
| Four and a half LIM domains 1 | Hs.435369 | 0.899517331 | 3.48E-06 |
| 11754395_a_at |
| Bromodomain containing 4 | Hs.187763 | 0.896721633 | 4.09E-09 |
| 11757485_x_at |
| Family with sequence similarity 129, member B | Hs.522401 | 0.634247118 | 1.98E-10 |
| 11728289_a_at |
| TBC1 domain family, member 2 | Hs.371016 | 0.573178366 | 5.92E-07 |
| 11729797_s_at |
| Sp2 transcription factor | Hs.514276 | 0.485558387 | 2.31E-07 |
| 11756215_x_at |
| Ubiquitin A-52 residue ribosomal protein fusion product 1 | Hs.5308 | 0.455526734 | 0.002286894 |
| 11729784_a_at |
| CKLF-like MARVEL transmembrane | Hs.195685 | 0.452691469 | 0.026888906 |
| Domain containing 2 | |||||
| 11723822_a_at |
| Zinc finger protein 862 | Hs.731923 | 0.359312518 | 9.94E-10 |
| 11757505_a_at |
| High density lipoprotein binding protein | Hs.471851 | 0.246682066 | 0.018919396 |
| 11729610_a_at |
| Engulfment and cell motility 1 | Hs.434989 | 0.178715767 | 0.099388789 |
| 11748635_s_at |
| Stromal antigen 3-like 1/2/3 (pseudogene) | Hs.632310 | –0.162123966 | 0.003566646 |
| Hs.661254 | |||||
| Hs.666638 | |||||
| 11760799_x_at |
| Major histocompatibility complex, class II, DP beta 1 | Hs.485130 | –0.23944138 | 0.129352788 |
| 11731523_s_at |
| Zinc finger protein 592 | Hs.79347 | –0.277762844 | 0.005197294 |
| 11715718_a_at |
| Zinc finger, HIT-type containing 1 | Hs.211079 | –0.362872 | 9.76E-05 |
| 11730824_at |
| COX19 cytochrome c oxidase | Hs.121593 | –0.364304295 | 2.56E-09 |
| 11716794_a_at |
| Myosin light chain 6 | Hs.632717 | –0.534250299 | 6.38E-05 |
| 11721695_s_at |
| Dual specificity phosphatase 2 | Hs.1183 | –0.988291168 | 3.57E-13 |
| 11756740_a_at |
| LRRC75A antisense RNA 1 | Hs.368934 | –1.164800967 | 7.31E-23 |
| 11715357_s_at |
| Ribosomal protein S21 | Hs.190968 | –1.29775912 | 7.24E-19 |
Figure 2Functional annotation of differentially expressed genes. (A) Biological process: platelet degranulation and blood coagulation were significant in up-regulated genes; (B) KEGG pathway: platelet activation was most significant pathway in up-regulated genes. Ribosome was most significant in down regulated genes.
Figure 3Signature performance in diagnosis of HCC. (A) ROC curve for all HCC patients vs. non-cancer patients (CH and LC); (B) ROC curve for HCC patients vs. non-cancer patients in the subgroup with AFP levels ≤20 ng/mL; (C) dot plot for probability values of CH, LC, and HCC group; (D) table for probability values of CH, LC, and HCC with different tumor size. HCC, hepatocellular carcinoma; CH, chronic hepatitis; LC, liver cirrhosis.