| Literature DB >> 26777304 |
Jagpreet Singh Nanda1, Rahul Kumar1, Gajendra P S Raghava1.
Abstract
We have developed a database called dbEM (database of Epigenetic Modifiers) to maintain the genomic information of about 167 epigenetic modifiers/proteins, which are considered as potential cancer targets. In dbEM, modifiers are classified on functional basis and comprise of 48 histone methyl transferases, 33 chromatin remodelers and 31 histone demethylases. dbEM maintains the genomic information like mutations, copy number variation and gene expression in thousands of tumor samples, cancer cell lines and healthy samples. This information is obtained from public resources viz. COSMIC, CCLE and 1000-genome project. Gene essentiality data retrieved from COLT database further highlights the importance of various epigenetic proteins for cancer survival. We have also reported the sequence profiles, tertiary structures and post-translational modifications of these epigenetic proteins in cancer. It also contains information of 54 drug molecules against different epigenetic proteins. A wide range of tools have been integrated in dbEM e.g. Search, BLAST, Alignment and Profile based prediction. In our analysis, we found that epigenetic proteins DNMT3A, HDAC2, KDM6A, and TET2 are highly mutated in variety of cancers. We are confident that dbEM will be very useful in cancer research particularly in the field of epigenetic proteins based cancer therapeutics. This database is available for public at URL: http://crdd.osdd.net/raghava/dbem.Entities:
Mesh:
Year: 2016 PMID: 26777304 PMCID: PMC4726101 DOI: 10.1038/srep19340
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Architecture of dbEM database.
Figure 2Bar graph showing the counts of various types of mutations in epigenetic proteins.
Figure 3Circos plot depicting the gene expression and copy number variation.
Moving from outer to inner circles, first circle represents the chromosomes on which epigenetic genes have been mapped. Gene expression data has been represented in second and third circles as scatter diagram and heat map, respectively (red and green dots denote gene expression values greater and less than 10, respectively). Fourth and fifth circles represent copy number variation (CNV) as scatter diagram and heat map, respectively (red and green dots have values greater and less than 2, respectively). Each dot represents a cancer cell line.
Top 10 mutated epigenetic proteins in cancer.
| S. No. | Protein | CCLE Mutation count | COSMIC Mutation count | Sum of CCLE & COSMIC count (S) | Variants Count in 1000 Genome (V) | S/V |
|---|---|---|---|---|---|---|
| 1 | DNMT3A | 8 | 1663 | 1671 | 82 | 20.38 |
| 2 | HDAC2 | 60 | 49 | 109 | 6 | 18.17 |
| 3 | KDM6A | 60 | 246 | 306 | 17 | 18.00 |
| 4 | TET2 | 34 | 1608 | 1642 | 115 | 14.28 |
| 5 | ARID1A | 172 | 821 | 993 | 87 | 11.41 |
| 6 | PRDM9 | 0 | 333 | 333 | 33 | 10.09 |
| 7 | HDAC3 | 29 | 39 | 68 | 7 | 9.71 |
| 8 | PRMT6 | 0 | 32 | 32 | 4 | 8.00 |
| 9 | HDAC6 | 35 | 111 | 146 | 22 | 6.64 |
| 10 | RBBP7 | 0 | 58 | 58 | 9 | 6.44 |