| Literature DB >> 29760470 |
Mahalakshmi Kumaran1, Preethi Krishnan1, Carol E Cass2, Roland Hubaux3, Wan Lam3, Yutaka Yasui4, Sambasivarao Damaraju5,6.
Abstract
Copy Number Variants (CNVs) are a class of structural variations of DNA. Germline CNVs are known to confer disease susceptibility, but their role in breast cancer warrants further investigations. We hypothesized that breast cancer associated germline CNVs contribute to disease risk through gene dosage or other post-transcriptional regulatory mechanisms, possibly through tissue specific expression of CNV-embedded small-noncoding RNAs (CNV-sncRNAs). Our objectives are to identify breast cancer associated CNVs using a genome wide association study (GWAS), identify sncRNA genes embedded within CNVs, confirm breast tissue (tumor and normal) expression of the sncRNAs, correlate their expression with germline copy status and identify pathways influenced by the genes regulated by sncRNAs. We used an association study design and accessed germline CNV data generated on Affymetrix Human SNP 6.0 array in 686 (in-house data) and 495 (TCGA data) subjects served as discovery and validation cohorts. We identified 1812 breast cancer associated CNVs harboring miRNAs (n = 38), piRNAs (n = 9865), snoRNAs (n = 71) and tRNAs (n = 12) genes. A subset of CNV-sncRNAs expressed in breast tissue, also showed correlation with germline copy status. We identified targets potentially regulated by miRNAs and snoRNAs. In summary, we demonstrate the potential impact of embedded CNV-sncRNAs on expression and regulation of down-stream targets.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29760470 PMCID: PMC5951800 DOI: 10.1038/s41598-018-25801-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic of the study design adopted. The flowchart depicts the overall study design, summary of the datasets, and experimental platforms used at each stage of the analysis. Detailed protocols and data analysis methods are discussed in the methods section.
Figure 2Genome wide distribution of germline CNVs. (A) Distribution of genomic features overlapping germline CNVs. Figure shows a Venn diagram of the genome wide distribution of germline CNVs associated (q < 0.05) with breast cancer. Represented genic regions were: protein coding (25%) and non-protein coding genes including pseudogenes and small and long non-coding RNAs (23.9%). An overlap of these regions (10.4%) capture non-coding RNAs originating from the intronic regions of the coding genes. 40.7% of CNVs do not show embedded genes (genome build hg19), hence labelled as non-genic regions. (B) Distribution of associated CNV-sncRNAs across the chromosomes. Figure illustrates the distribution of breast cancer associated CNVs (q < 0.05) harboring small non-coding RNA genes (miRNA, piRNA, tRNA and snoRNAs) for all chromosomes.
Figure 3Expression profiles of small non-coding RNAs in breast tumor and adjacent normal tissues (HiSeq). Figure illustrates the expression profiles from the four classes of sncRNAs between tumor and adjacent normal tissues. Individual bar graphs capture the expressed total sncRNAs and CNV-sncRNAs. Data presented is from TCGA Illumina Hiseq (n = 254 cases and 18 adjacent normal).
Germline CNVs in discovery cohort showing association with breast cancer risk and expression of embedded small RNAs in breast tumor tissues from TCGA.
| Discovery Dataset | TCGA Dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| CNV region | Cytoband | length (bps) | p-value | q-value | CNV frequency gain/loss (%) | CNV region | Small RNAs expressed in breast tissues | |
| Cases | Controls | |||||||
| *chr14:101513466–101514318 | 14q32.31 | 853 | 7.71E-05 | 9.21E-04 | 5/1 | 0/0 | chr14:101513466–101517099 | hsa-miR-539-5p (+), hsa-miR-889-3p (+) |
| *chr14:101515194–101519779 | 14q32.31 | 4586 | 4.84E-05 | 6.52E-04 | 5/1 | 0/0 | chr14:101513466–101517099; chr14:101517099–101527707 | hsa-miR-655-3p (+), hsa-miR-487a-5p |
| *chr14:101519779–101525402 | 14q32.31 | 5624 | 5.53E-05 | 7.27E-04 | 5/1 | 0/0 | chr14:101517099–101527707 | hsa-miR-134-3p (+), hsa-miR-134-5p (+), hsa-miR-323b-3p (+), hsa-miR-382-5p (+), hsa-miR-485-3p (+), hsa-miR-382-3p |
| *chr14:101525779-101527707 | 14q32.31 | 1929 | 8.94E-04 | 5.41E-03 | 4/1 | 0/0 | chr14:101517099-101527707 | hsa-miR-154-3p (+), hsa-miR-154-5p (+), |
| chr19:4437681–4494605 | 19p13.3 | 56925 | 3.09E-04 | 2.53E-03 | 3/2 | 0/0 | chr19:4424993–4664433 | hsa-miR-4746-5p (+) |
| chr1:149676729–149684202 | 1q21.2 | 7474 | 9.33E-06 | 1.77E-04 | 2/5 | 0/16 | chr1:149676729–149684202 | hsa-piR-20636 |
| chr15:25296245–25297449 | 15q11.2 | 1205 | 4.32E-04 | 3.26E-03 | 5/1 | 0/0 | chr15:25296245–25297449 | snoRNA_SNORD116-1-201 (+) |
| chr15:25297449–25300158 | 15q11.2 | 2710 | 5.92E-07 | 1.92E-05 | 8/1 | 0/0 | chr15:25298903–25300158 | snoRNA_SNORD116-2-201 (+) |
| *chr15:25300158–25306451 | 15q11.2 | 6294 | 2.26E-07 | 8.49E-06 | 9/1 | 0/0 | chr15:25300158–25304384; chr15:25305396–25308383 | snoRNA_SNORD116-3-201 (+) |
| chr15:25307985–25310508 | 15q11.2 | 2524 | 6.12E-08 | 2.82E-06 | 9/1 | 0/0 | chr15:25305396–25308383; chr15:25308383–25310928 | snoRNA_SNORD116-6-201 (+) |
| chr15:25310508–25316405 | 15q11.2 | 5898 | 9.95E-08 | 4.25E-06 | 9/1 | 0/0 | chr15:25310928–25318258 | snoRNA_SNORD116-8-201 (+) |
| chr15:25316405–25318258 | 15q11.2 | 1854 | 2.62E-07 | 9.64E-06 | 8/1 | 0/0 | chr15:25310928–25318258 | snoRNA_SNORD116-9-201 (+) |
| chr15:25318258–25324279 | 15q11.2 | 6022 | 9.95E-08 | 4.25E-06 | 8/2 | 0/0 | chr15:25318258–25325686 | snoRNA_SNORD116-9-201 (+), |
| chr15:25324512–25325686 | 15q11.2 | 1175 | 2.87E-06 | 6.76E-05 | 6/2 | 0/0 | chr15:25318258–25325686 | snoRNA_SNORD116-14-201 (+) |
| chr15:25325686–25326762 | 15q11.2 | 1077 | 4.61E-06 | 9.87E-05 | 6/1 | 0/0 | chr15:25325686–25326762 | snoRNA_SNORD116-15-201 (+) |
| chr16:2011427–2016398 | 16p13.3 | 4972 | 6.98E-04 | 4.58E-03 | 3/2 | 0/1 | chr16:2011427–2016398 | snoRNA_SNORA10-201 (−), snoRNA_SNORA64-201 (−) |
| chr19:3975155–3984201 | 19p13.3 | 9047 | 3.09E-04 | 2.53E-03 | 3/2 | 0/0 | chr19:3768181–4110048 | snoRNA_SNORD37-201 (−) |
| chr1:148580449–148606453 | 1q21.2 | 26005 | 7.50E-09 | 4.65E-07 | 7/14 | 10/32 | chr1:148580449–148632305 | chr1.trna108-AsnGTT (−) |
| chr1:148705208-148768557 | 1q21.2 | 63350 | 7.26E-04 | 4.72E-03 | 4/11 | 4/22 | chr1:148662374–148789654 | chr1.trna107-AsnGTT (−) |
| chr1:149598086–149617469 | 1q21.2 | 19384 | 4.48E-10 | 4.08E-08 | 9/12 | 2/29 | chr1:149598086–149631220 | chr1.trna30-AsnGTT (+), |
| chr1:149661965-149670179 | 1q21.2 | 8215 | 3.70E-06 | 8.35E-05 | 4/8 | 1/19 | chr1:149652461–149676729 | chr1.trna94-GluTTC (−) |
| chr1:149670179–149676729 | 1q21.2 | 6551 | 3.60E-06 | 8.17E-05 | 2/6 | 0/17 | chr1:149652461–149676729 | chr1.trna92-PheGAA (−) |
| chr1:149676729–149684202 | 1q21.2 | 7474 | 9.33E-06 | 1.77E-04 | 2/5 | 0/16 | chr1:149676729–149684202 | chr1.trna90-ValCAC (−), chr1.trna91-GlyCCC (−) |
| chr6:26286287–26287456 | 6p22.2 | 1170 | 2.38E-04 | 2.13E-03 | 3/4 | 0/1 | chr6:26274458–26287456 | chr6.trna2-MetCAT (+) |
| *chr19:1381502–1407359 | 19p13.3 | 25858 | 1.23E-04 | 1.29E-03 | 4/2 | 0/0 | chr19:1342160–1547869 | chr19.trna1-AsnGTT (+), chr19.trna14-PheGAA (−) |
| *chr19:4658652–4771070 | 19p13.3 | 112419 | 3.09E-04 | 2.53E-03 | 3/2 | 0/0 | chr19:4714925–4751218 | chr19.trna13-ValCAC (−), chr19.trna2-GlyTCC (+) |
This table represents the selected CNV regions associated with breast cancer that also included one of the four classes of sncRNAs. The statistics represented in this table is based on the discovery dataset (cases/control = 686) and includes the CNV region mapped in validation dataset (TCGA). These sncRNAs were expressed in the breast tissue (either breast tumor or adjacent normal tissues or both) in the TCGA dataset. The rows marked with *symbol indicates the CNVs that are also seen as copy number variable regions in 1000 genomes Phase 3 project.