| Literature DB >> 34685549 |
Leander Van Neste1, Kirk J Wojno1, Ricardo Henao2,3, Shrikant Mane4, Howard Korman5,6, Jason Hafron7, Kenneth Kernen7, Rima Tinawi-Aljundi7, Mathew Putzi8, Amin I Kassis1,9, Philip W Kantoff10,11.
Abstract
The primary objective of this study is to detect biomarkers and develop models that enable the identification of clinically significant prostate cancer and to understand the biologic implications of the genes involved. Peripheral blood samples (1018 patients) were split chronologically into independent training (n = 713) and validation (n = 305) sets. Whole transcriptome RNA sequencing was performed on isolated phagocytic CD14+ and non-phagocytic CD2+ cells and their gene expression levels were used to develop predictive models that correlate to adverse pathologic features. The immune-transcriptomic model with the highest performance for predicting adverse pathology, based on a subtraction of the log-transformed expression signals of the two cell types, displayed an area under the curve (AUC) of the receiver operating characteristic of 0.70. The addition of biomarkers in combination with traditional clinical risk factors (age, serum prostate-specific antigen (PSA), PSA density, race, digital rectal examination (DRE), and family history) enhanced the AUC to 0.91 and 0.83 for the training and validation sets, respectively. The markers identified by this approach uncovered specific pathway associations relevant to (prostate) cancer biology. Increased phagocytic activity in conjunction with cancer-associated (mis-)regulation is also represented by these markers. Differential gene expression of circulating immune cells gives insight into the cellular immune response to early tumor development and immune surveillance.Entities:
Keywords: CD14+; CD2+; boosting; cancer; cells; gradient; immune; phagocytosis; transcriptomics
Mesh:
Year: 2021 PMID: 34685549 PMCID: PMC8533765 DOI: 10.3390/cells10102567
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Cohort demographics for discovery and validation data sets. p-values are for significance testing for differences between the discovery and validation sets. Missing represents the percentage of cases where data was missing or unavailable.
| Level | Validation | Discovery |
| Missing | |
|---|---|---|---|---|---|
|
| 305 | 713 | |||
| age (mean (SD)) | 62.69 (7.60) | 64.09 (7.89) | 0.009 | 0 | |
| race (%) | AA | 38 (12.5) | 69 (9.7) | 0.006 | 0 |
| Caucasian | 227 (74.4) | 593 (83.2) | |||
| Other | 25 (8.2) | 29 (4.1) | |||
| Unknown | 15 (4.9) | 22 (3.1) | |||
| family history (%) | First | 59 (19.3) | 132 (18.5) | 0.764 | 0 |
| None | 204 (66.9) | 494 (69.3) | |||
| Positive | 7 (2.3) | 12 (1.7) | |||
| Second | 16 (5.2) | 27 (3.8) | |||
| Unknown | 19 (6.2) | 48 (6.7) | |||
| DRE (%) | Abnormal/Positive | 74 (24.3) | 114 (16.0) | 0.002 | 0 |
| Normal/Negative (T1c) | 203 (66.6) | 550 (77.1) | |||
| Unknown | 28 (9.2) | 49 (6.9) | |||
| volume (median (IQR)) | 40.20 (29.60, 54.08) | 40.85 (30.78, 57.85) | 0.264 | 9.6 | |
| psa_total (median (IQR)) | 4.97 (3.80, 7.10) | 5.12 (3.80, 7.30) | 0.3 | 1.2 | |
| psa_density (median (IQR)) | 0.12 (0.08, 0.19) | 0.12 (0.07, 0.19) | 0.929 | 10.8 | |
| cores_positive (median (IQR)) | 1.00 (0.00, 4.00) | 1.00 (0.00, 4.00) | 0.973 | 0 | |
| cores_percent (median (IQR)) | 4.00 (0.00, 30.00) | 5.00 (0.00, 35.00) | 0.862 | 0 | |
| site (%) | Unknown | 0 (0.0) | 55 (7.7) | <0.001 | 0 |
| CU | 147 (48.2) | 577 (80.9) | |||
| MIU | 62 (20.3) | 46 (6.5) | |||
| Urology Austin | 96 (31.5) | 35 (4.9) | |||
| Gleason Group (%) | 0 | 145 (47.5) | 337 (47.3) | 0.963 | 0 |
| 1 | 32 (10.5) | 78 (10.9) | |||
| 2 | 66 (21.6) | 166 (23.3) | |||
| 3 | 38 (12.5) | 77 (10.8) | |||
| 4 | 13 ( 4.3) | 27 ( 3.8) | |||
| 5 | 11 (3.6) | 28 (3.9) | |||
| Adverse Pathology (%) | 0 | 245 (80.9) | 577 (80.9) | 1 | 0.2 |
| 1 | 58 (19.1) | 136 (19.1) |
ROC analysis results for various models showing discovery and validation AUC values with confidence intervals. Results are shown for discovery set (disc), and independent validation set (val). Colors indicate clinical variables used in models. Only validation AUC results by age tertial are shown—significance testing p-values are in Supplementary Figure S3. PSAT = total PSA and PSAD = PSA density. Clinical models are color coded with and without genomic component for ease of identifying the boost in performance achieved by adding genomics (CD14/CD2). Yellow (PSAT, Age), Blue (PSAT, Age, Race, DRE, FamH), and Orange (PSAD, Age).
| Model | AUC (disc) | AUC (val) | AUC (val) | ||
|---|---|---|---|---|---|
| Age (42,60) | Age (60,66) | Age (66,87) | |||
| Age | 0.60 (0.54, 0.65) | 0.56 (0.47, 0.65) | |||
| PSAT | 0.72 (0.67, 0.77) | 0.67 (0.59, 0.75) | 0.63 (0.48, 0.77) | 0.74 (0.59, 0.89) | 0.66 (0.52, 0.82) |
| Vol | 0.60 (0.55, 0.66) | 0.72 (0.65, 0.80) | 0.75 (0.65, 0.85) | 0.82 (0.67, 0.96) | 0.62 (0.48, 0.76) |
| PSAD | 0.77 (0.72, 0.81) | 0.78 (0.71, 0.85) | 0.76 (0.64, 0.89) | 0.88 (0.80, 0.97) | 0.74 (0.61, 0.87) |
| Clinical (PSAT, Age) | 0.72 (0.67, 0.77) | 0.67 (0.59, 0.75) | 0.60 (0.46, 0.75) | 0.73 (0.58, 0.88) | 0.67 (0.54, 0.80) |
| Clinical (PSAT, Age, Race, DRE, FamH) | 0.73 (0.69, 0.78) | 0.73 (0.65, 0.80) | 0.70 (0.56, 0.83) | 0.72 (0.56, 0.88) | 0.72 (0.61, 0.83) |
| Clinical (PSAD, Age) | 0.78 (0.74, 0.82) | 0.78 (0.71, 0.85) | 0.73 (0.61, 0.86) | 0.89 (0.80, 0.97) | 0.76 (0.64, 0.88) |
| CD2 | 0.80 (0.76, 0.84) | 0.63 (0.55, 0.71) | 0.67 (0.53, 0.80) | 0.53 (0.35, 0.71) | 0.61 (0.49, 0.74) |
| CD14 | 0.82 (0.77, 0.86) | 0.59 (0.51, 0.67) | 0.60 (0.46, 0.74) | 0.54 (0.36, 0.73) | 0.63 (0.51, 0.75) |
| CD14/CD2 | 0.72 (0.67, 0.77) | 0.70 (0.62, 0.77) | 0.76 (0.65, 0.88) | 0.63 (0.47, 0.79) | 0.68 (0.55, 0.81) |
| CD14/CD2 + Clinical (PSAT, Age) | 0.99 (0.99, 1.00) | 0.75 (0.67, 0.82) | 0.73 (0.60, 0.86) | 0.72 (0.57, 0.88) | 0.76 (0.64, 0.88) |
| CD14/CD2 + Clinical (PSAT, Age, Race, DRE, FamH) | 0.97 (0.95, 0.99) | 0.76 (0.69, 0.83) | 0.78 (0.66, 0.90) | 0.72 (0.55, 0.89) | 0.75 (0.63, 0.86) |
| CD14/CD2 + Clinical (PSAD, Age) | 0.91 (0.88, 0.93) | 0.83 (0.77, 0.89) | 0.79 (0.67, 0.92) | 0.91 (0.84, 0.97) | 0.80 (0.70, 0.90) |
Figure 1ROC curves for genomics only CD2, CD14, and CD14/CD2 ratio models. AUC values and confidence intervals are shown in the white area of Table 2.
Figure 2ROC curves for CD14/CD2 ratio model compared to those models including age, PSA, and PSAD. AUC values and confidence intervals are shown in Table 2 above.
Figure 3Top-ranked, enriched pathways and ontologies represented by the 120 genes in the best performing model according to MSigDB hallmark (A), KEGG (B), and gene ontology biological processes (C). Only terms that had a false discovery rate < 0.1 (or <0.01 for gene ontology (C)) are shown.