| Literature DB >> 26484152 |
Miljana Tanić1, Kira Yanowski1, Eduardo Andrés2, Gonzalo Gómez-López2, María Rodríguez-Pinilla Socorro3, David G Pisano2, Beatriz Martinez-Delgado1, Javier Benítez4.
Abstract
Hereditary breast cancer constitutes only 5-10% of all breast cancer cases and is characterized by strong family history of breast and/or other associated cancer types. Only ~ 25% of hereditary breast cancer cases carry a mutation in BRCA1 or BRCA2 gene, while mutations in other rare high and moderate-risk genes and common low penetrance variants may account for additional 20% of the cases. Thus the majority of cases are still unaccounted for and designated as BRCAX tumors. MicroRNAs are small non-coding RNAs that play important roles as regulators of gene expression and are deregulated in cancer. To characterize hereditary breast tumors based on their miRNA expression profiles we performed global microarray miRNA expression profiling on a retrospective cohort of 80 FFPE breast tissues, including 66 hereditary breast tumors (13 BRCA1, 10 BRCA2 and 43 BRCAX), 10 sporadic breast carcinomas and 4 normal breast tissues, using Exiqon miRCURY LNA™ microRNA Array v.11.0. Here we describe in detail the miRNA microarray expression data and tumor samples used for the study of BRCAX tumor heterogeneity (Tanic et al., 2013) and biomarkers associated with positive BRCA1/2 mutation status (Tanic et al., 2014). Additionally, we provide the R code for data preprocessing and quality control.Entities:
Keywords: Hereditary breast cancer; Microarray; miRNA
Year: 2014 PMID: 26484152 PMCID: PMC4535901 DOI: 10.1016/j.gdata.2014.11.008
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Clinico-pathological data for hereditary breast tumors.
| BRCA1 | BRCA2 | BRCAX | |||||
|---|---|---|---|---|---|---|---|
| Total no. | n = 13 | n = 10 | n = 43 | ||||
| n | (%) | n | (%) | n | (%) | ||
| Age at diagnosis | |||||||
| Mean | 40.33 | 42.5 | 47.75 | ||||
| Range | 28–55 | 35–56 | 25–95 | ||||
| Grade | |||||||
| 1 | 0 | (0%) | 2 | (20%) | 4 | (9.3%) | |
| 2 | 1 | (8.3%) | 3 | (30%) | 20 | (48.8%) | |
| 3 | 11 | (91.7%) | 5 | (50%) | 17 | (41.5%) | |
| Estrogen receptor | |||||||
| Positive | 2 | (15.4%) | 7 | (70%) | 18 | (48.6%) | |
| Negative | 11 | (84.6%) | 3 | (30%) | 19 | (51.4%) | |
| Progesteron receptor | |||||||
| Positive | 2 | (15.4%) | 7 | (70%) | 14 | (36.8%) | |
| Negative | 11 | (84.6%) | 3 | (30%) | 24 | (63.2%) | |
| HER2 | |||||||
| Positive | 0 | (0%) | 2 | (20%) | 9 | (23.7%) | |
| Negative | 13 | (100%) | 8 | (80%) | 29 | (76.3%) | |
| Ki-67 | |||||||
| 1 (0–5%) | 4 | (30.8%) | 3 | (42.9%) | 15 | (48.3%) | |
| 2 (6–25%) | 5 | (23.1%) | 2 | (28.6%) | 11 | (35.5%) | |
| 3 (> 25%) | 4 | (30.8%) | 2 | (28.6%) | 5 | (21.6%) | |
| Subtype | |||||||
| Luminal A | 2 | (15.4%) | 6 | (60%) | 13 | (37.1%) | |
| Luminal B | 0 | (0%) | 2 | (20%) | 6 | (17.1%) | |
| HER2 | 0 | (0%) | 0 | (0%) | 3 | (8.6%) | |
| Triple negative | 11 | (84.6%) | 2 | (20%) | 13 | (37.1%) | |
| Lymph node | |||||||
| Positive | 5 | (50%) | 4 | (57.1%) | 18 | (48.6%) | |
| Negative | 5 | (50%) | 3 | (42.9%) | 17 | (51.4%) | |
Breast cancer cases were classified into four subtypes based on IHC-model [15]. In bold is the number of samples per category (BRCA1, BRCA2 and BRCAX) for which there was available information on clinico-pathological feature in question.
Fig. 1Shows boxplots representing summaries of the signal intensity distributions of the arrays A) before normalization, B) after quantile normalization. Each box corresponds to one array.
Fig. 2Shows a pseudocolor heatmap of the distances between arrays. The color scale is chosen to cover the range of distances encountered in the dataset. The distance d between two arrays a and b is computed as the mean absolute difference (L1-distance) between the data of the arrays (using the data from all probes without filtering). Identified outliers are marked with an asterisk (*).
Fig. 3A) shows representative MA plots for arrays with lowest (top 4) and highest (bottom 4) Hoeffing's D-statistic. M and A are defined as: M = log2(I1) − log2(I2), A = 1/2 (log2(I1) + log2(I2)), where I1 is the intensity of the array studied, and I2 is the intensity of a “pseudo”-array that consists of the median across arrays. The value of Da is shown in the panel headings. B) Shows a bar chart of the Da, the outlier detection criterion from the previous figure. The bars are shown in the original order of the arrays. A threshold of 0.15 was used, which is indicated by the vertical line. None of the arrays exceeded the threshold and was considered an outlier.
| Organism/tissue | |
| Array type | Exiqon miRCURY LNA™ microRNA Array v.11.0 |
| Data format | Raw data: .txt files; normalized data: SOFT, MINIML, TXT; Code: .RData file |
| Experimental factors | Hereditary breast tumors, sporadic breast tumors, normal breast tissue |
| Experimental features | Global miRNA expression profiling of formalin-fixed paraffin — embedded tissue (FFPE) breast tissues |
| Consent | All patients gave their written informed consent for use of exceeding pathological material in research |
| Sample source location | Samples were collected from Spanish hospitals |