| Literature DB >> 28117658 |
Sandeep Subramanian1, Srilakshmi Chaparala2, Viji Avali2, Madhavi K Ganapathiraju3,4.
Abstract
BACKGROUND: DNA palindromes are a unique pattern of repeat sequences that are present in the human genome. It consists of a sequence of nucleotides in which the second half is the complement of the first half but appearing in reverse order. These palindromic sequences may have a significant role in DNA replication, transcription and gene regulation processes. They occur frequently in human cancers by clustering at specific locations of the genome that undergo gene amplification and tumorigenesis. Moreover, some studies showed that palindromes are clustered in amplified regions of breast cancer genomes especially in chromosomes (chr) 8 and 11. With the large number of personal genomes and cancer genomes becoming available, it is now possible to study their association to diseases using computational methods. Here, we conducted a pilot study on chromosomes 8 and 11 of cancer genomes to identify computationally the differentially occurring palindromes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 28117658 PMCID: PMC5260791 DOI: 10.1186/s12920-016-0232-3
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Regions that have highest density of palindromes. The number of the palindromes per 3000 bases is computed. The top 2% of the windows that are most dense with palindromes are shown for chromsomes 8 and 11 in (a) and (c). Corresponding numbers were computed for one of the TCGA samples, and the difference with respect to reference genome (tumor – reference) are shown in (b) and (d)
Fig. 2List of Palindromes. List of palindromes in (a) chr8 and (b) chr11 with significant changes in tumor and normal samples in TCGA brca dataset and minimum changes in 1000 Genomes with the following color coding: absence of the palindrome (orange), bigger than in the reference genome (pale blue), smaller than in the reference genome (pale green), perfect palindrome in the reference genome which is now a near palindrome with a single mismatch the central eight bases (dark grey) and palindrome that had the same length as in the reference genome but had a variant (burgundy). The left side of the figure represents normal samples and the right side tumor samples. Oncogenes and/or brca genes are highlighted in green
Fig. 3Changes in specific palindromes in eight genes. Three palindromes that have changed most significantly are shown for eight genes (KMT2A, TNKS, CADM1, RAD21, NUP98, TRPS1, NRG1 and NBN). For each palindrome of any gene, its difference in the length in comparison to reference genome are shown separately for 1000 g, normal and tumor genomes. The size of the circle is proportional to the number of samples out of the 69 considered in each set. C represents Control, N represents Normal and T represents Tumor in the figure
Table of statistical significance values for Chr8 (a) and Chr11 (b). Genes with their regions, whether the gene is oncogene or brca gene with the statistical significance are shown in separate columns
| Gene region | Gene symbol | Oncogene | Breast cancer gene | Odds ratio |
|---|---|---|---|---|
| Chromosome 8 | ||||
| intronic | SPIDR | FALSE | FALSE | 12.5 |
| intronic | EMC2 | FALSE | FALSE | 4.6 |
| intergenic | CASC9 | FALSE | FALSE | 3.9 |
| intronic | RAD54B | FALSE | FALSE | 3.6 |
| intronic | SH2D4A | FALSE | FALSE | 3.6 |
| intronic | MAK16 | FALSE | FALSE | 3.5 |
| intronic | UBXN2B | FALSE | FALSE | 3.4 |
| intronic | KIAA1429 | FALSE | FALSE | 3.3 |
| intronic | UBXN2B | FALSE | FALSE | 3.0 |
| exonic | SLC26A7 | FALSE | FALSE | 3.0 |
| intronic | KIAA0196 | FALSE | FALSE | 2.8 |
| intronic | POTEA | FALSE | FALSE | 2.8 |
| intergenic | UNC5D | FALSE | FALSE | 2.7 |
| intronic | CA1 | FALSE | FALSE | 2.7 |
| intronic | EIF3E | TRUE | TRUE | 2.5 |
| intronic | CPQ | FALSE | FALSE | 2.5 |
| exonic | USP17L7 | FALSE | FALSE | 2.5 |
| intronic | SH2D4A | FALSE | FALSE | 2.5 |
| exonic | C8orf37 | FALSE | FALSE | 2.5 |
| intronic | SDC2 | FALSE | FALSE | 2.4 |
| exonic | DCAF13 | FALSE | FALSE | 2.4 |
| intronic | CPQ | FALSE | FALSE | 2.4 |
| intronic | CCAR2 | TRUE | FALSE | 2.4 |
| ncRNA_intronic | LOC392232 | FALSE | FALSE | 2.4 |
| exonic | GRHL2 | FALSE | FALSE | 2.4 |
| intronic | TRPS1 | TRUE | TRUE | 2.3 |
| Chromosome 11 | ||||
| intergenic | LOC283177 | FALSE | FALSE | 17.0 |
| intronic | NFRKB | FALSE | FALSE | 14.9 |
| intronic | DCDC5 | FALSE | FALSE | 6.0 |
| intronic | C2CD3 | FALSE | FALSE | 5.0 |
| intergenic | LOC283177 | FALSE | FALSE | 4.2 |
| intergenic | LOC102724301 | FALSE | FALSE | 4.1 |
| intergenic | LOC283177 | FALSE | FALSE | 3.9 |
| intergenic | FAM86C2P | FALSE | FALSE | 3.9 |
| intronic | KIRREL3 | TRUE | FALSE | 3.8 |
| intronic | NUCB2 | FALSE | FALSE | 3.5 |
| intronic | MYO7A | FALSE | FALSE | 3.4 |
| intergenic | MIR8068 | FALSE | FALSE | 3.2 |
| intergenic | OR4A5 | FALSE | FALSE | 3.1 |
| intronic | CADM1 | TRUE | FALSE | 3.0 |
| intergenic | LOC283177 | FALSE | FALSE | 3.0 |
| exonic | NXPE1 | FALSE | FALSE | 2.9 |
| intronic | CWF19L2 | FALSE | FALSE | 2.8 |
| intronic | PRCP | FALSE | FALSE | 2.8 |
| intergenic | LOC283177 | FALSE | FALSE | 2.7 |
| intergenic | LOC283177 | FALSE | FALSE | 2.7 |
| intronic | MYO7A | FALSE | FALSE | 2.6 |
| downstream | OR51B4 | FALSE | FALSE | 2.6 |
| intergenic | LOC283177 | FALSE | FALSE | 2.5 |