| Literature DB >> 26055100 |
Jaeyoung Choi1, Ki-Tae Kim2, Aram Huh1, Seomun Kwon1, Changyoung Hong1, Fred O Asiegbu1, Junhyun Jeon3, Yong-Hwan Lee4.
Abstract
Over the past two decades, epigenetics has evolved into a key concept for understanding regulation of gene expression. Among many epigenetic mechanisms, covalent modifications such as acetylation and methylation of lysine residues on core histones emerged as a major mechanism in epigenetic regulation. Here, we present the database for histone-modifying enzymes (dbHiMo; http://hme.riceblast.snu.ac.kr/) aimed at facilitating functional and comparative analysis of histone-modifying enzymes (HMEs). HMEs were identified by applying a search pipeline built upon profile hidden Markov model (HMM) to proteomes. The database incorporates 11,576 HMEs identified from 603 proteomes including 483 fungal, 32 plants and 51 metazoan species. The dbHiMo provides users with web-based personalized data browsing and analysis tools, supporting comparative and evolutionary genomics. With comprehensive data entries and associated web-based tools, our database will be a valuable resource for future epigenetics/epigenomics studies.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26055100 PMCID: PMC4460409 DOI: 10.1093/database/bav052
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
List of the 35 genomes used in analysis of gene duplication and loss
| Species name | Kingdom | Phylum | Subphylum |
|---|---|---|---|
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Pezizomycotina | |
| Fungi | Ascomycota | Saccharomycotina | |
| Fungi | Ascomycota | Saccharomycotina | |
| Fungi | Ascomycota | Taphrinomycotina | |
| Fungi | Basidiomycota | Agaricomycotina | |
| Fungi | Basidiomycota | Agaricomycotina | |
| Fungi | Basidiomycota | Agaricomycotina | |
| Fungi | Basidiomycota | Agaricomycotina | |
| Fungi | Basidiomycota | Agricomycotina | |
| Fungi | Basidiomycota | Pucciniomycotina | |
| Fungi | Basidiomycota | Pucciniomycotina | |
| Fungi | Basidiomycota | Ustilaginomycotina | |
| Fungi | Blastocladiomycota | N/D | |
| Fungi | Chytridiomycota | N/D | |
| Fungi | Microsporodia | N/D | |
| Fungi | Zygomycota | Mucoromycotina | |
| Fungi | Zygomycota | Mucoromycotina | |
| Chromista | Oomycota | Oomycotina | |
| Viridiplantae | Streptophyta | N/D | |
| Viridiplantae | Streptophyta | N/D | |
| Drosophila | Metazoa | Arthropoda | N/D |
| Metazoa | Nematoda | N/D | |
| Metazoa | Chordata | Craniata |
Figure 1.An identification pipeline and prediction summary of dbHiMo. A schematic flowchart of the dbHiMo pipeline and distribution of the predicted genes across the taxonomy. (A) In silico prediction pipeline consists of three steps: (i) collection of the reference sequences for each gene family, (ii) multiple sequence alignment and retrieval of well-conserved regions and (iii) construction of sequence profiles and searching on the proteomes. (B) The average numbers of genes belonging to the five main categories for a given taxonomy were summarized to show overview of the prediction results.
List of gene families available in dbHiMo
| Category | Gene family | Number of genes | Number of genomes |
|---|---|---|---|
| Histone acetyltransferase | ELP3 | 634 | 589 |
| (HAT; GNAT | GCN5 | 641 | 536 |
| Hpa2 | 70 | 70 | |
| Hpa3 | 68 | 67 | |
| PCAF | 74 | 51 | |
| Histone acetyltransferase | Esa1 | 691 | 526 |
| (HAT; MYST | HBO1 | 75 | 53 |
| MOF | 127 | 93 | |
| MOZ_MORF | 39 | 14 | |
| Sas2 | 519 | 475 | |
| Sas3 | 145 | 144 | |
| Tip60 | 212 | 182 | |
| Histone deacetylase | ClassI | 1832 | 602 |
| ClassIIA | 115 | 44 | |
| ClassIIB | 628 | 524 | |
| ClassIII | 1658 | 589 | |
| ClassIV | 120 | 83 | |
| Histone methyltransferase | DOT1 | 248 | 247 |
| (HMT) | DOT1L | 19 | 14 |
| PRMT_1 | 1375 | 590 | |
| PRMT_2 | 36 | 17 | |
| SET1 | 442 | 420 | |
| SET2 | 533 | 508 | |
| SET5 | 201 | 200 | |
| Histone demethylase | JARID1 | 209 | 140 |
| (HDM) | JHDM1 | 63 | 39 |
| JHDM2 | 73 | 34 | |
| JHDM3_JMJD2 | 551 | 472 | |
| PHF2_PHF8 | 64 | 22 | |
| UTX_JMJD3 | 92 | 50 |
aGNAT is the abbreviation for Gcn5-related N-acetyltransferases (35).
bMYST is named after its members, including MOZ, Ybf2 (Sas3), Sas2 and Tip60 (35).
Figure 2.Web functionality available on dbHiMo website. (A) Web interface of dbHiMo supports browsing methods (i) by species (or genome), (ii) gene family or (iii) target site in histone. (B) Bioinformatics tools are available on the web; (i) sequence similarity searches (BLAST and BLASTMatrix), (ii) multiple sequence alignment (ClustalW) and (iii) prediction of sequence(s) provided by users. (C) Analytic functions are provided including (i) distribution chart/table of genes in a genome, (ii) distribution across the taxonomy for a given gene family, (iii) domain architecture analysis and (iv) distribution of genes from a sequence collection in Favorite Browser. (D) Sequence collections in Favorite Browser can be further analysed by the tools available at the CFGP 2.0 and other sister databases.
Figure 3.Duplications and losses calculated for a HAT enzyme, GCN5/PCAF. The reconciled tree of GCN5/PCAF sequences from 35 species covering fungi, Oomycetes, animals and plants was constructed. The numbers of gene duplication (D) and loss (L) events are condensed to the species tree and shown in the corresponding internal node. The number of genes and the species name are presented next to the leaf nodes. Species names are abbreviated as the following (ordered by appearance in the tree): Nc (Neurospora crassa), Pa (Podospora anserina), Mo (Magnaporthe oryzae 70–15), Cg (Colletotrichum graminicola M1.001), Fo (Fusarium oxysporum f. sp. lycopersici), Fg (F. graminearum), Bc (Botrytis cinerea), Bg (Blumeria graminis f. sp. hordei DH14), Mg (Mycosphaerella graminicola), Af (Aspergillus fumigatus Af293), An (A. nidulans FGSC A4), Hc (Histoplasma capsulatum H88), Ci (Coccidioides immitis RS), Sp (Schizosaccharomyces pombe 132), Sc (Saccharomyces cerevisiae S288C), Ca (Candida albicans SC5314), Ml (Melampsora laricis-populina 98AG31), Pg (Puccinia graminis f. sp. tritici), Um (Ustilago maydis 521), Cn (Cryptococcus neoformans var. grubii H99), Lb (Laccaria bicolor), Pc (Phanerochaete chrysosporium RP-78), Sl (Serpula lacrymans S7.9), Hi (Heterobasidion irregulare TC 32–1), Bd (Batrachochytrium dendrobatidis JAM81), Pb (Phycomyces blakesleeanus NRRL1555), Ro (Rhizopus oryzae), Am (Allomyces macrogynus), Pi (Phytophthora infestans), Ec (Encephalitozoon cuniculi), Os (Oryza sativa), At (Arabidopsis thaliana), Ce (Caenorhabditis elegans), Dm (Drosophila melanogaster) and Hs (Homo sapiens).