| Literature DB >> 34083687 |
Shi-Ang Qi1,2, Qian Wu3,4, Zhenpu Chen2, Wei Zhang1, Yongchun Zhou2, Kaining Mao1, Jia Li2, Yuanyuan Li3, Jie Chen5, Youguang Huang6, Yunchao Huang7.
Abstract
Lung cancer is the leading cause of human cancer mortality due to the lack of early diagnosis technology. The low-dose computed tomography scan (LDCT) is one of the main techniques to screen cancers. However, LDCT still has a risk of radiation exposure and it is not suitable for the general public. In this study, plasma metabolic profiles of lung cancer were performed using a comprehensive metabolomic method with different liquid chromatography methods coupled with a Q-Exactive high-resolution mass spectrometer. Metabolites with different polarities (amino acids, fatty acids, and acylcarnitines) can be detected and identified as differential metabolites of lung cancer in small volumes of plasma. Logistic regression models were further developed to identify cancer stages and types using those significant biomarkers. Using the Variable Importance in Projection (VIP) and the area under the curve (AUC) scores, we have successfully identified the top 5, 10, and 20 metabolites that can be used to differentiate lung cancer stages and types. The discrimination accuracy and AUC score can be as high as 0.829 and 0.869 using the five most significant metabolites. This study demonstrated that using 5 + metabolites (Palmitic acid, Heptadecanoic acid, 4-Oxoproline, Tridecanoic acid, Ornithine, and etc.) has the potential for early lung cancer screening. This finding is useful for transferring the diagnostic technology onto a point-of-care device for lung cancer diagnosis and prognosis.Entities:
Year: 2021 PMID: 34083687 PMCID: PMC8175557 DOI: 10.1038/s41598-021-91276-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of grouping of samples.
| Group | Number of Samples | Age | Histology | Gender | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Range | Median | ADC | SCC | SCLC | Others | Unknown | Male | Female | ||
| Stage I | 30 | 37–65 | 51.5 | 27 | 2 | 0 | 1 | 0 | 11 | 19 |
| Stage II | 5 | 41–55 | 45 | 4 | 0 | 1 | 0 | 0 | 3 | 2 |
| Stage III | 17 | 38–70 | 52 | 8 | 5 | 1 | 2 | 1 | 14 | 3 |
| Stage IV | 12 | 32–65 | 55 | 10 | 1 | 0 | 0 | 1 | 6 | 6 |
| Healthy Control | 50 | 33–69 | 51 | N/A | N/A | N/A | N/A | N/A | 25 | 25 |
| Total | 114 | 32–70 | 50.5 | 49 | 8 | 2 | 3 | 2 | 59 | 55 |
| Stage I | 15 | 40–60 | 50 | 14 | 0 | 0 | 0 | 1 | 6 | 9 |
| Stage II | 2 | 40–50 | 45 | 1 | 1 | 0 | 0 | 0 | 1 | 1 |
| Stage III | 9 | 37–67 | 56 | 2 | 4 | 3 | 0 | 0 | 7 | 2 |
| Stage IV | 6 | 46–68 | 47 | 4 | 0 | 1 | 0 | 1 | 2 | 4 |
| Unknown | 2 | 57–59 | 58 | 0 | 1 | 1 | 0 | 0 | 1 | 1 |
| Healthy Control | 25 | 31–68 | 50 | N/A | N/A | N/A | N/A | N/A | 11 | 14 |
| Total | 59 | 32–70 | 50 | 21 | 6 | 5 | 0 | 2 | 28 | 31 |
Five most significant metabolomic biomarkers for lung cancer screening.
| HMDB or MetPA* Number | AUC | |||
|---|---|---|---|---|
| Palmitic acid | HMDB0000220 | 0.86 | 5.94E−14 | 7.81E−16 |
| Heptadecanoic acid | HMDB0002259 | 0.84 | 5.28E−12 | 1.90E−14 |
| 4-Oxoproline | METPA0228 | 0.83 | 2.65E−10 | 2.69E−11 |
| Tridecanoic acid | HMDB0000910 | 0.81 | 6.95E−13 | 1.48E−12 |
| Ornithine | HMDB0000214 | 0.81 | 4.60E−10 | 9.74E−12 |
*MetPA (Metabolomic Pathway Analysis).
Figure 1(a) Orthogonal projection on latent structure discriminant analysis (OPLS-DA) score plot shows the ability to separate lung cancer patients from healthy controls. (b) Box and whisker plots for the top 5 most important metabolites between healthy control and lung cancer groups. (c)-(e) Receiver-operating characteristic (ROC) curves for discriminating healthy controls and lung cancer patients [(c) ROC curves of the logistic model using top 5 metabolomic markers; (d) ROC curves of the logistic model using top 10 metabolomic markers; (e) ROC curves of the logistic model using top 20 metabolomic markers].
Performance of logistic regression models with various biomarkers for discriminating healthy controls and lung cancer patients.
| Top 5 significant metabolites | Top 10 significant metabolites | Top 20 significant metabolites | ||||
|---|---|---|---|---|---|---|
| Discovery | Validation | Discovery | Validation | Discovery | Validation | |
| AUC | 0.918 (± 0.103) | 0.869 | 0.973 (± 0.065) | 0.947 | 0.947 (± 0.125) | 0.964 |
| Accuracy | 0.836 (± 0.155) | 0.829 | 0.902 (± 0.162) | 0.857 | 0.893 (± 0.159) | 0.900 |
| Precision | 0.850 (± 0.174) | 0.829 | 0.933 (± 0.214) | 0.866 | 0.903 (± 0.204) | 0.905 |
| Recall | 0.855 (± 0.208) | 0.829 | 0.890 (± 0.161) | 0.857 | 0.908 (± 0.117) | 0.900 |
Figure 2(a) Orthogonal projection on latent structure discriminant analysis (OPLS-DA) scores plot shows the ability to discriminate early-stage lung cancer patients, advanced-stage lung cancer patients and healthy controls. (b) Box and whisker plots for the top 5 most important metabolites between healthy control and different lung cancer stage groups. (c)-(e) Receiver-operating characteristic (ROC) curves for discriminating healthy controls, early-stage patients, and advanced-stage lung cancer patients, where class 0 represents healthy controls, class 1 represents early-stage patients, and class 2 represents advanced-stage patients [(c) ROC curves of the logistic model using top 5 metabolomic markers; (d) ROC curves of the logistic model using top 10 metabolomic markers; (e) ROC curves of the logistic model using top 20 metabolomic markers].
Most significant biomarkers for discriminating healthy controls, early-stage patients, and advanced-stage lung cancer patients.
| HMDB Number | AUC | P-value (ANOVA test) | P-value (Kruskal–Wallis test) | |
|---|---|---|---|---|
| Palmitic acid | HMDB0000220 | 0.77 | 8.08E−16 | 1.99E−14 |
| Heptadecanoic acid | HMDB0002259 | 0.75 | 5.02E−14 | 6.56E−13 |
| Ornithine | HMDB0000214 | 0.73 | 2.80E−09 | 1.71E−10 |
| Tridecanoic acid | HMDB0000910 | 0.73 | 4.08E−13 | 7.32E−12 |
| Stearic acid | HMDB0000827 | 0.72 | 5.01E−11 | 2.51E−10 |
Most significant biomarkers for discriminating different lung cancer types.
| HMDB IDS | AUC | P value (ANOVA test) | P value (Kruskal–Wallis test) | |
|---|---|---|---|---|
| Palmitic acid | HMDB0000220 | 0.78 | 3.99E−15 | 5.62e−14 |
| Heptadecanoic acid | HMDB0002259 | 0.75 | 8.86E−13 | 3.75e−12 |
| Ornithine | HMDB0000214 | 0.73 | 1.22E−07 | 8.76e−10 |
| Pentadecanoic acid | HMDB0000826 | 0.69 | 1.68E−05 | 1.04e−05 |
| Acylcarnitine C8:1 | NA | 0.69 | 1.67E−04 | 8.81e−06 |
Figure 3(a) Orthogonal projection on latent structure discriminant analysis (OPLS-DA) scores plot shows the ability to discriminate three lung cancer types and healthy controls. (b) Box and whisker plots for the top 5 most important metabolites between healthy control and different lung cancer types. (c)-(e) Receiver-operating characteristic (ROC) curves for discriminating lung cancer types, where class 0 represents healthy controls, class 1 represents adenocarcinomas, class 2 represents SCC lung cancer, and class 3 represents SCLC [(c) ROC curves of the logistic model using top 5 metabolomic markers; (d) ROC curves of the logistic model using top 10 metabolomic markers; (e) ROC curves of the logistic model using top 20 metabolomic markers].