| Literature DB >> 25787708 |
Xiaoyan Huang1, Matteo Dugo2, Maurizio Callari3, Marco Sandri4, Loris De Cecco2, Barbara Valeri5, Maria Luisa Carcangiu5, Jingyan Xue1, Rui Bi6, Silvia Veneroni3, Maria Grazia Daidone3, Sylvie Ménard4, Elda Tagliabue4, Zhimin Shao1, Jiong Wu1, Rosaria Orlandi4.
Abstract
The recent dramatic increase in breast cancer incidence across China with progressive urbanization and economic development has signaled the urgent need for molecular and clinical detailing of breast cancer in the Chinese population. Our analyses of a unique transethnic collection of breast cancer frozen specimens from Shanghai Fudan Cancer Center (Chinese Han) profiled simultaneously with an analogous Caucasian Italian series revealed consistent transcriptomic data lacking in batch effects. The prevalence of Luminal A subtype was significantly lower in Chinese series, impacting the overall prevalence of estrogen receptor (ER)-positive disease in a large cohort of Chinese/Caucasian patients. Unsupervised and supervised comparison of gene and microRNA (miRNA) profiles of Chinese and Caucasian samples revealed extensive similarity in the comprehensive taxonomy of transcriptional elements regulating breast cancer biology. Partition of gene expression data using gene lists relevant to breast cancer as "intrinsic" and "extracellular matrix" genes identified Chinese and Caucasian subgroups with equivalent global gene and miRNA profiles. These findings indicate that in the Chinese and Caucasian groups, breast neoplasia and the surrounding stromal characteristics undergo the same differentiation and molecular processes. Transcriptional similarity across transethnic cohorts may simplify translational medicine approaches and clinical management of breast cancer patients worldwide.Entities:
Keywords: Breast cancer; Chinese/Caucasian; gene expression profiling; miRNA profiling; race/ethnicity
Mesh:
Substances:
Year: 2015 PMID: 25787708 PMCID: PMC4529340 DOI: 10.1002/cam4.442
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Clinical, pathological, and molecular characteristics of 78 Chinese and 97 Caucasian primary consecutive breast cancer patients surgically treated at Cancer Hospital-Fudan University of Shanghai and INT of Milan, respectively, and analyzed for gene and miRNA expression
| Chinese series (%) | Caucasian series (%) | ||
|---|---|---|---|
| Number of patients | 78 (100) | 97 (100) | |
| Age (years) | |||
| Median | 50 | 60 | 0.009 |
| Range | 32–78 | 35–85 | |
| ≤50 | 41 (53) | 31 (32) | |
| >50 | 37 (47) | 66 (68) | |
| Size (cm) | |||
| Median | 2.5 | 2.0 | 0.124 |
| Range | 1.0–7.0 | 0.4–11.0 | |
| ≤2.0 | 47 (60) | 45 (46) | |
| >2.0 | 31 (40) | 50 (52) | |
| NA | 0 (0) | 2 (2) | |
| Histological grade | |||
| I + II | 40 (51) | 46 (47) | 0.195 |
| III | 26 (33) | 48 (49) | |
| NA | 12 (16) | 3 (4) | |
| Lymph node | |||
| Positive | 35 (45) | 46 (47) | 0.498 |
| Negative | 36 (46) | 36 (37) | |
| NA | 7 (9) | 15 (16) | |
| ER status (IHC) | |||
| Positive | 52 (67) | 75 (77) | 0.162 |
| Negative | 26 (33) | 22 (23) | |
| ER status (gene) | |||
| Positive | 45 (58) | 69 (71) | 0.090 |
| Negative | 33 (42) | 28 (29) | |
| HER2 status (IHC) | |||
| Positive | 10 (13) | 10 (10) | 0.780 |
| Negative | 68 (87) | 87 (90) | |
| HER2 status (gene) | |||
| Positive | 12 (15) | 11 (11) | 0.574 |
| Negative | 66 (85) | 86 (89) | |
| IHC subtypes | |||
| ER−/PG−/HER2− | 17 (22) | 13 (13) | 0.218 |
| HER2+/ER− | 6 (8) | 4 (4) | |
| Luminal HER2− | 51 (65) | 70 (72) | |
| Luminal HER2+ | 4 (5) | 10 (10) | |
ER, estrogen receptor; NA, not available; IHC, immunohistochemistry; HER2, human epidermal growth factor receptor 2.
Age and ER status of 1057 and 1047 primary consecutive breast cancer patients surgically treated in the years 2005–2006 at Cancer Hospital-Fudan University of Shanghai and INT of Milan, respectively
| Chinese series (%) | Caucasian series (%) | ||
|---|---|---|---|
| Number of patients | 1057 (100) | 1047 (100) | |
| Age (years) | |||
| Median | 51 | 59 | <0.001 |
| Range | 23–85 | 25–92 | |
| ≤50 | 487 (46) | 324 (31) | |
| >50 | 570 (54) | 723 (69) | |
| ER status | |||
| Positive | 663 (63) | 885 (85) | <0.001 |
| Negative | 394 (37) | 162 (15) | |
ER (estrogen receptor) status was assessed by immunohistochemistry.
Figure 1Global transcriptional similarity of Chinese and Caucasian samples. Dendrogram, obtained by hierarchical clustering of gene expression (A) and microRNA (miRNA) (B) data of the Chinese–Caucasian cohort, separated samples independently of race/ethnicity. The color bar at the bottom of the dendrogram shows hormone receptor status (assessed by immunohistochemistry) and race/ethnicity. Global genes and miRNA from the Chinese datasets were partioned by hierarchical clustering and the identified groups of correlated genes and miRNAs were validated on Caucasian dataset. The number of genes (C) and miRNAs (D) included in each cluster pair was conserved.
Figure 2Intrinsic classification of Chinese and Caucasian breast tumors. Unsupervised hierarchical clustering of Chinese (A) and Caucasian (B) samples on the PAM50 intrinsic gene list assigned samples to four breast intrinsic subtypes. Unclassified samples are indicated in gray. Color bars show lymph nodal status (N), histological grade, ERBB2, and estrogen receptor (ER) positivity evaluated according to gene expression level, intrinsic subtype assignment by PAM50, and extracellular matrix (ECM) subtype assignment by the Large Average Submatricies (LAS) biclustering algorithm. (C) Heat map of the subclass association matrix obtained by Subclass Mapping. Gene profiles of intrinsic subtypes identified in the Chinese dataset were nearly identical to those of corresponding subtypes defined in the Caucasian dataset. False discovery rate (FDR) (white) represents the significance of the similarity. ITA and CHINA indicate Caucasian Italian and Chinese samples, respectively. (D) Principal component analysis (PCA) of miRNA expression data in Chinese and Caucasian samples. Grouping of Chinese and Caucasian samples was more associated with intrinsic subtypes than to race/ethnicity.
Frequency of PAM50 intrinsic subtypes in independent series of primary breast cancer from Caucasian and Chinese patients
| Caucasian series | Chinese series | ||||
|---|---|---|---|---|---|
| Number of samples | 97 | 207 | 78 | 81 | 100 |
| Sample origin | Milan | Oxford | Shanghai | Taiwan | Singapore |
| GEO identifier | GSE59590 | GSE22219 | GSE59590 | GSE48390 | GSE36722 |
| PAM50 intrinsic subtype | |||||
| Basal-like | 15 (15) | 34 (16) | 10 (13) | 8 (10) | 8 (8) |
| ERBB2 | 12 (12) | 32 (15) | 17 (22) | 15 (19) | 17 (17) |
| Luminal A | 49 (51) | 88 (41) | 21 (27) | 15 (19) | 25 (25) |
| Luminal B | 15 (16) | 42 (19) | 23 (29) | 35 (43) | 38 (38) |
| Undetermined | 6 (6) | 20 (9) | 7 (9) | 8 (10) | 12 (12) |
| Luminal A/Luminal B ratio | 3.19 | 2.16 | 0.93 | 0.44 | 0.66 |
GEO, Gene Expression Omnibus.
Figure 3ECM3 and ECM1 subtyping in Chinese and Caucasian breast tumors. Heat map shows the expression profile of ECM3 and ECM1 genes in Chinese (A) and Caucasian (B) tumors identified by LAS biclustering. (C) Heat map of the subclass association matrix obtained by SubClass Mapping revealed nearly identical gene expression of ECM subtypes identified in the Chinese and the Caucasian datasets. FDR (white) represents the significance of the similarity. ITA and CHINA indicate Caucasian Italian and Chinese samples, respectively. (D) PCA analysis of microRNA (miRNA) expression data in Chinese and Caucasian ECM3 and ECM1 breast tumors indicates the greater role of ECM classification than of race/ethnicity in grouping of Chinese and Caucasian samples. ECM, extracellular matrix; PCA, principal component analysis; FDR, false discovery rate; LAS, Large Average Submatricies.
Figure 4Effect of race/ethnicity and subtypes on gene and microRNA (miRNA) expression. The bar plot reports the number of differentially expressed genes (A) and miRNAs (B) identified by comparison of Chinese and Caucasian samples or by pairwise comparison between subtypes. The number of significant features was dramatically higher when subtypes were compared.