| Literature DB >> 32365829 |
Carolina Cortes-Urrea1,2, Fernando Bueno-Gutiérrez1, Melissa Solarte2, Miguel Guevara-Burbano3, Fabian Tobar-Tosse4, Patricia E Vélez-Varela5, Juan Carlos Bonilla6, Guillermo Barreto2, Jaime Velasco-Medina7, Pedro A Moreno3, Javier De Las Rivas1.
Abstract
Cancer is one of the leading causes of mortality worldwide. Breast cancer is the most frequent cancer in women, and in recent years it has become a serious public health problem in Colombia. The development of large-scale omic techniques allows simultaneous analysis of all active genes in tumor cells versus normal cells, providing new ways to discover the drivers of malignant transformations. Whole exome sequencing (WES) was obtained to provide a deep view of the mutational genomic profile in a set of cancer samples from Southwest Colombian women. WES was performed on 52 tumor samples from patients diagnosed with invasive breast cancer, which in most cases (33/52) were ductal luminal breast carcinomas (IDC-LM-BRCA). Global variant call was calculated, and six different algorithms were applied to filter out false positives and identify pathogenic variants. To compare and expand the somatic tumor variants found in the Colombian cohort, exome mutations and genome-wide expression alterations were detected in a larger set of tumor samples of the same breast cancer subtype from TCGA (that included DNA-seq and RNA-seq data). Genes with significant changes in both the mutational and expression profiles were identified, providing a set of genes and mutations associated with the etiology of ductal luminal breast cancer. This set included 19 single mutations identified as tumor driver mutations in 17 genes. Some of the genes (ATM, ERBB3, ESR1, TP53) are well-known cancer genes, while others (CBLB, PRPF8) presented driver mutations that had not been reported before. In the case of the CBLB gene, several mutations were identified in TCGA IDC-LM-BRCA samples associated with overexpression of this gene and repression of tumor suppressive activity of TGF-β pathway.Entities:
Keywords: DESeq2.; Limma-Voom; RNA-seq; SNPs; bioinformatics; breast cancer; cancer genomics; differential expression; genetic variant; whole exome sequencing
Mesh:
Year: 2020 PMID: 32365829 PMCID: PMC7277822 DOI: 10.3390/biom10050698
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Workflow of the parallel analysis of whole exome sequencing (WES) DNA-seq data and expression RNA-seq data from ductal luminal breast cancer samples, done to select pathogenic variants and corresponding altered genes in a set of Southwest Colombian patients.
Genes and associated exonic variants, presented as driver mutations (known or predicted) of breast cancer in women from Southwest Colombia.
| Gene HGNC Symbol | Nucleotide Change | Protein AA Change | dbSNP_ID(rs) | Frequency in IDC-LM-BRCA | Cancer-Genome Interpreter Prediction | SNPs (Known, Reported, New) | Human Population with More Frequency |
|---|---|---|---|---|---|---|---|
| ABCB4 | c.G2363A | p.R788Q | rs8187801 | 3/33 | Driver_mutation | reported | ExAC_AFR |
| ATM | c.C7375G | p.R2459G | rs730881383 | 1/33 | Driver_mutation | reported | ExAC_OTH |
| ATM | c.C7468T | p.L2490F | rs753262623 | 1/33 | Driver_mutation | reported | ExAC_SAS |
| CD36 | c.G1016T | p.G339V | rs146027667 | 1/33 | Driver_mutation | known | ExAC_OTH |
| CHD8 | c.C871T | p.L291F | rs192989929 | 1/33 | Driver_mutation | reported | ExAC_OTH/ExAC_AMR |
| DPYD | c.A2846T | p.D949V | rs67376798 | 1/33 | known in cancer | reported | ExAC_NFE |
| EPHA1 | c.C2371T | p.R791C | rs766301333 | 1/33 | Driver_mutation | reported | ExAC_NFE |
| ERBB3 | c.G2167C | p.V723L | rs189789018 | 1/33 | Driver_mutation | known | ExAC_AMR |
| ESR1 | c.G1138C | p.E380Q # | rs1057519827 | 1/33 | Driver_mutation | known | all populations similar |
| MLH1 | c.A1129G | p.K377E | rs35001569 | 1/33 | Driver_mutation | reported | ExAC_NFE |
| MSH3 | c.T2732G | p.L911W | rs41545019 | 2/33 | Driver_mutation | reported | ExAC_NFE |
| NOTCH1 | c.G2983A | p.G995S ## | rs868369610 | 1/33 | Driver_mutation | reported | all populations similar |
| NOTCH4 | c.G2504T | p.G835V | rs9267835 | 2/33 | Driver_mutation | known | ExAC_AFR/ExAC_AMR |
| STAT6 | c.C1069T | p.R357W | rs776930978 | 1/33 | Driver_mutation | reported | all populations similar |
| TP53 | c.G338T | p.G113V | rs121912656 | 1/33 | Driver_mutation | reported | ExAC_EAS |
| TP53 | c.T215A | p.L72Q | rs1057519997 | 1/33 | Driver_mutation | reported | all populations similar |
| UPF3B | c.G1082A | p.R361H | rs143538947 | 1/33 | Driver_mutation | reported | ExAC_AFR |
| CBLB | c.G1972A | p.G658S | locus (chr:3q13.11;exon:13) | 1/33 | Driver_mutation | new |
|
| PRPF8 | c.G4153T | p.V1385F | locus (chr:17p13.3;exon:25) | 1/33 | Driver_mutation | new |
|
Populations are represented in the EXAC data. AFR: African/American, AMR: Latino, EAS: East Asian, FIN: Finish, NFE: Non-Finnish European, SAS: South Asian, OTH: Other. # ESR1 protein E380Q: This mutation is currently used as biomarker in BRCA. NOTCH1 protein G995S: This mutation is currently used as biomarker in BRCA.
Figure 2Scatter plots, volcano plots and proportional Venn diagrams showing the results of the differential expression analysis of RNA-seq data from 510 ductal luminal breast cancer samples versus 89 healthy control samples. As indicated in the article, the analyses were done using two algorithms: Limma-Voom (upper scatter plot and volcano plot) and DESeq2 (lower scatter plot and volcano plot). Upregulated genes are marked in red and downregulated genes are marked in blue.
Figure 3Plot of the chromosomal distribution (location) and differential expression (significance) of 81 genes that presented highly pathogenic mutations and expression alterations in ductal luminal breast cancer (IDC-LM-BRCA) patients.