| Literature DB >> 32267939 |
Jie Ping1, Xingyi Guo1, Fei Ye1, Jirong Long1, Loren Lipworth1, Qiuyin Cai1, William Blot1, Xiao-Ou Shu1, Wei Zheng1.
Abstract
African American (AA) women have an excess breast cancer mortality than European American (EA) women. To investigate the contribution of tumor biology to this survival health disparity, we compared gene expression profiles in breast tumors using RNA sequencing data derived from 260 AA and 155 EA women who were prospectively enrolled in the Southern Community Cohort Study (SCCS) and developed breast cancer during follow-up. We identified 59 genes (54 protein-coding genes and 5 long intergenic non-coding RNAs) that were expressed differently between EA and AA at a stringent false discovery rate (FDR) < 0.01. A gene signature was derived with these 59 genes and externally validated using the publicly available Cancer Genome Atlas (TCGA) data from180 AA and 838 EA breast cancer patients. Applying C-statistics, we found that this 59-gene signature has a high discriminative ability in distinguishing AA and EA breast cancer patients in the TCGA dataset (C-index = 0.81). These findings may provide new insight into tumor biological differences and the causes of the survival disparity between AA and EA breast cancer patients.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32267939 PMCID: PMC7359770 DOI: 10.1093/carcin/bgaa035
Source DB: PubMed Journal: Carcinogenesis ISSN: 0143-3334 Impact factor: 4.944
Clinicopathological characteristics of breast cancer patients according to race groups in the SCCS and TCGA
| Variable | SCCS ( | TCGA ( | ||||
|---|---|---|---|---|---|---|
| Race | AA | EA |
| AA | EA |
|
| No. of subjects (%) | 260 (62.7) | 155 (37.3) | 180 (17.7) | 838 (82.3) | ||
| Age at diagnosis, mean (SD) | 58.5 (9.0) | 60.6 (9.4) | 0.029 | 56.3 (13.5) | 59.2 (13.1) | 0.007 |
| PAM50 subtype, no. (%) | ||||||
| Luminal A | 112 (43.1) | 78 (50.3) | 0.474 | 48 (26.7) | 410 (48.9) | <0.001 |
| Luminal B | 36 (13.8) | 24 (15.5) | 34 (18.9) | 180 (21.5) | ||
| Basal-like | 67 (25.8) | 32 (20.6) | 67 (37.2) | 120 (14.3) | ||
| HER2-enriched | 29 (11.2) | 12 (7.7) | 22 (12.2) | 75 (8.9) | ||
| Normal-like | 16 (6.2) | 9 (5.8) | 9 (5.0) | 53 (6.3) | ||
| TNBC, no. (%) | ||||||
| Yes | 47 (18.1) | 21 (13.5) | 0.285 | 33 (18.3) | 73 (8.7) | <0.001 |
| Noa | 213 (81.9) | 134 (86.5) | 147 (81.7) | 765 (91.3) | ||
| ER, no. (%) | ||||||
| Negative | 85 (33.3) | 36 (24.8) | 0.095 | 69 (39.2) | 148 (18.6) | <0.001 |
| Positive | 170 (66.7) | 109 (75.2) | 107 (60.8) | 648 (81.4) | ||
| Undeterminedb | 5 | 10 | 4 | 42 | ||
| PR, no. (%) | ||||||
| Negative | 110 (43.7) | 52 (36.1) | 0.173 | 88 (50) | 229 (28.9) | <0.001 |
| Positive | 142 (56.3) | 92 (63.9) | 88 (50) | 564 (71.1) | ||
| Undeterminedb | 8 | 11 | 4 | 45 | ||
| HER2, no. (%) | ||||||
| Negative | 165 (85.5) | 97 (84.3) | 0.914 | 74 (83.1) | 450 (77.4) | 0.378 |
| Positive | 28 (14.5) | 18 (15.7) | 16 (16.9) | 131 (22.6) | ||
| Undeterminedb | 67 | 40 | 91 | 257 | ||
| AJCC stage, no. (%) | ||||||
| Stage I | 74 (36.1) | 65 (51.2) | 0.02 | 32 (18.2) | 145 (17.7) | 0.614 |
| Stage II | 83 (40.5) | 46 (36.2) | 106 (60.2) | 465 (56.6) | ||
| Stage III | 38 (18.5) | 14 (11.0) | 34 (19.3) | 196 (23.9) | ||
| Stage IV | 10 (4.9) | 2 (1.6) | 4 (2.3) | 15 (1.8) | ||
| Stage X or unknown | 55 | 28 | 4 | 19 | ||
aAll participants with undetermined for any or all the three components were included in the ‘No’ group.
bUndetermined or not evaluated—not included in statistical tests to evaluate racial differences.
The set of 59 genes found to be significantly differential expressed in the SCCS at FDR-adjusted P value < 0.01
| ENSEMBL ID | Gene symbol |
| FDR (SCCS) |
|
|---|---|---|---|---|
| ENSG00000255374 | TAS2R43 | <0.001 | <0.001 | <0.001 |
| ENSG00000130950 | NUTM2F | <0.001 | <0.001 | <0.001 |
| ENSG00000241945 | PWP2 | <0.001 | <0.001 | <0.001 |
| ENSG00000259604 | Lnc-ALDH1A3-1 | <0.001 | 0.005 | <0.001 |
| ENSG00000160221 | GATD3A | <0.001 | <0.001 | <0.001 |
| ENSG00000176681 | LRRC37A | <0.001 | <0.001 | <0.001 |
| ENSG00000142178 | SIK1 | <0.001 | 0.001 | <0.001 |
| ENSG00000134184 | GSTM1 | <0.001 | <0.001 | <0.001 |
| ENSG00000261701 | HPR | <0.001 | 0.008 | <0.001 |
| ENSG00000158488 | CD1E | <0.001 | <0.001 | <0.001 |
| ENSG00000196436 | NPIPB15 | <0.001 | <0.001 | <0.001 |
| ENSG00000173678 | SPDYE2B | <0.001 | <0.001 | <0.001 |
| ENSG00000224635 | AL391095.1 | <0.001 | <0.001 | <0.001 |
| ENSG00000106113 | CRHR2 | <0.001 | 0.004 | <0.001 |
| ENSG00000105499 | PLA2G4C | <0.001 | <0.001 | <0.001 |
| ENSG00000204147 | ASAH2B | <0.001 | 0.002 | <0.001 |
| ENSG00000197822 | OCLN | <0.001 | 0.008 | <0.001 |
| ENSG00000197728 | RPS26 | <0.001 | <0.001 | <0.001 |
| ENSG00000168255 | POLR2J3 | <0.001 | <0.001 | <0.001 |
| ENSG00000203666 | EFCAB2 | <0.001 | 0.003 | <0.001 |
| ENSG00000106686 | SPATA6L | <0.001 | <0.001 | <0.001 |
| ENSG00000152689 | RASGRP3 | <0.001 | 0.008 | <0.001 |
| ENSG00000178295 | GEN1 | <0.001 | <0.001 | <0.001 |
| ENSG00000215908 | CROCCP2 | <0.001 | <0.001 | <0.001 |
| ENSG00000154237 | LRRK1 | <0.001 | 0.001 | <0.001 |
| ENSG00000026950 | BTN3A1 | <0.001 | 0.007 | <0.001 |
| ENSG00000149292 | TTC12 | <0.001 | 0.004 | <0.001 |
| ENSG00000198060 | MARCH5 | <0.001 | 0.004 | <0.001 |
| ENSG00000154096 | THY1 | <0.001 | <0.001 | <0.001 |
| ENSG00000111671 | SPSB2 | <0.001 | 0.008 | <0.001 |
| ENSG00000135723 | FHOD1 | <0.001 | 0.004 | <0.001 |
| ENSG00000150477 | KIAA1328 | <0.001 | 0.009 | <0.001 |
| ENSG00000231389 | HLA-DPA1 | <0.001 | 0.008 | <0.001 |
| ENSG00000131669 | NINJ1 | <0.001 | 0.009 | <0.001 |
| ENSG00000181481 | RNF135 | <0.001 | 0.009 | <0.001 |
| ENSG00000203747 | FCGR3A | <0.001 | 0.002 | <0.001 |
| ENSG00000144026 | ZNF514 | <0.001 | 0.001 | <0.001 |
| ENSG00000186468 | RPS23 | <0.001 | <0.001 | <0.001 |
| ENSG00000175745 | NR2F1 | <0.001 | 0.009 | <0.001 |
| ENSG00000171503 | ETFDH | <0.001 | 0.002 | <0.001 |
| ENSG00000068654 | POLR1A | <0.001 | <0.001 | <0.001 |
| ENSG00000185684 | EP400P1 | <0.001 | <0.001 | <0.001 |
| ENSG00000279457 | WASH9P | <0.001 | 0.004 | <0.001 |
| ENSG00000280670 | CCDC163 | <0.001 | 0.008 | <0.001 |
| ENSG00000149089 | APIP | <0.001 | <0.001 | <0.001 |
| ENSG00000273136 | NBPF26 | <0.001 | <0.001 | <0.001 |
| ENSG00000140263 | SORD | <0.001 | <0.001 | <0.001 |
| ENSG00000277203 | F8A1 | <0.001 | <0.001 | <0.001 |
| ENSG00000223865 | HLA-DPB1 | <0.001 | <0.001 | <0.001 |
| ENSG00000280071 | GATD3B | <0.001 | <0.001 | <0.001 |
| ENSG00000019169 | MARCO | <0.001 | <0.001 | <0.001 |
| ENSG00000262202 | Lnc-GRAP-3 | <0.001 | <0.001 | <0.001 |
| ENSG00000132207 | SLX1A | <0.001 | <0.001 | <0.001 |
| ENSG00000215374 | FAM66B | <0.001 | 0.001 | <0.001 |
| ENSG00000016402 | IL20RA | <0.001 | <0.001 | <0.001 |
| ENSG00000170160 | CCDC144A | <0.001 | 0.005 | <0.001 |
| ENSG00000275464 | LOC102724159 | <0.001 | <0.001 | <0.001 |
| ENSG00000101197 | BIRC7 | <0.001 | 0.008 | <0.001 |
| ENSG00000162877 | PM20D1 | <0.001 | <0.001 | <0.001 |
Figure 1.Hierarchical clustering of 59 genes significantly differentially expressed between AA and EA patients after adjusting for age at diagnosis, PAM50 subtypes, tumor, nodes, and metastases stages, probabilistic estimation of expression residuals factors and batch effects for both the SCCS and TCGA.