| Literature DB >> 26558257 |
Hailong Zhao1, Chunqin Zheng1, Hongjuan Cui1.
Abstract
As an important economic insect, Bombyx mori is also a useful model organism for lepidopteran insect. SET-domain-containing proteins belong to a group of enzymes named after a common domain that utilizes the cofactor S-adenosyl-L-methionine (SAM) to achieve methylation of its substrates. Many SET-domain-containing proteins have been shown to display catalytic activity towards particular lysine residues on histones, but emerging evidence also indicates that various nonhistone proteins are specifically targeted by this clade of enzymes. To explore their diverse functions of SET-domain superfamily in insect, we identified, cloned, and analyzed the SET-domains proteins in silkworm, Bombyx mori. Firstly, 24 genes containing SET domain from silkworm genome were characterized and 17 of them belonged to six subfamilies of SUV39, SET1, SET2, SUV4-20, EZ, and SMYD. Secondly, SET domains of silkworm SET-domain family were intraspecifically and interspecifically conserved, especially for the catalytic core "NHSC" motif, substrate binding site, and catalytic site in the SET domain. Lastly, further analyses indicated that silkworm SET-domain gene BmSu(var)3-9 owned different characterization and expression profiles compared to other invertebrates. Overall, our results provide a new insight into the functional and evolutionary features of SET-domain family.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26558257 PMCID: PMC4628957 DOI: 10.1155/2015/161287
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
SET-domain genes identified from the SilkDB.
| Subfamily | Gene name | Gene holoname | Name in SilkDB | Conserved domain | ORF length (bp) | Chromosome number | Scaffold | ESTs | Probe |
| Methylase activity in other species |
|---|---|---|---|---|---|---|---|---|---|---|---|
| SUV39 |
| Euchromatic histone-lysine N-methyltransferase 2 | BGIBMGA007949 | Ankyrin repeats, Pre-SET, and ALAD | 4161 | 15 | nscaf2888 | 7 | sw00703, sw17591, | 3 | H3K9, H3K27 |
|
| SET-domain, bifurcated 1 | BGIBMGA011407 | Tudor, MBD, Pre-SET, and Post-SET | 2895 | 23 | Nscaf3026 | No | sw21024, sw18443 | 0.0 | H3K9 | |
|
| Suppressor of variegation 3-9 | BGIBMGA011500, BGIBMGA011499 | CHROMO, Pre-SET, and Post-SET | 1782 | 23 | Nscaf3027 | 12 | sw11427, sw04471 | 0.0 | H3K9 | |
|
| |||||||||||
| SET1 |
| Myeloid/lymphoid or mixed-lineage leukemia | BGIBMGA010221 | PHD, FYRN, FYRC, and Post-SET | 12657 | 7 | nscaf2986 | 2 | sw12266, sw14974 | 0.0 | H3K4 |
|
| SET-domain-containing 1 | BGIBMGA012978 | Post-SET | 4080 | 16 | Nscaf3058 | No | sw14237, sw14238, | 0.0 | H3K4 | |
|
| Trithorax-related | BGIBMGA002503, BGIBMGA002502 | PHD, FYRN, and Post-SET | — | 9 | nscaf2511 | 4 | sw05846, sw04238, | 0.0 | H3K4 | |
|
| Myeloid/lymphoid or mixed-lineage leukemia 5 | BGIBMGA004310 | PHD | 20 | nscaf2789 | No | sw00989, sw16148 | 2 | H3K4 | ||
|
| SET-domain-containing 2 | BGIBMGA003106 | AWS, Post-SET | 3633 | 4 | nscaf2589 | No | sw11671, sw11672 | 2 | H3K36 | |
|
| |||||||||||
| SET2 |
| Nuclear receptor binding SET-domain protein 1 | BGIBMGA002246 | PWWP, PHD, AWS, and Post-SET | 6600 | 26 | nscaf2330 | 3 | sw11742 | 4 | H3K36, H4K20 |
|
| Absent, small, or homeotic discs 1 | BGIBMGA001497 | AWS, Post-SET, BROMO, PHD, and BAH | 8754 | 21 | nscaf2136 | 2 | sw11562 | 0.0 | H3K4, H3K9, and H4K20 | |
|
| |||||||||||
| SUV4-20 |
| Suppressor of variegation 4-20 | BGIBMGA000919 | — | 2094 | 13 | Nscaf1898 | 4 | sw13536 | 2 | H4K20 |
|
| |||||||||||
| EZ |
| Enhancer of zeste | BGIBMGA014476 | SANT, CxC | — | Scaffold707 | 6 | sw15702, sw16217 | 0.0 | H3K27 | |
|
| |||||||||||
| SMYD | BGIBMGA014048 | zf-MYND | 1782 | — | Nscaf3115 | 1 | sw10812 | 3 | — | ||
| BGIBMGA007907 | — | 1881 | 15 | nscaf2888 | No | sw11109 | 2 | — | |||
| BGIBMGA002076 | — | 2232 | 1 | nscaf2210 | 1 | sw19609 | 1 | — | |||
| BGIBMGA011073 | — | 1155 | 23 | Nscaf3015 | No | sw20741 | 1 | — | |||
| BGIBMGA002109 | — | 1071 | 1 | nscaf2210 | No | sw18112, sw02496 | 2 | — | |||
| BGIBMGA008839 | zf-MYND | 3 | nscaf2927 | No | sw17374, sw17507 | 4 | — | ||||
| BGIBMGA008939 | zf-MYND | 3 | nscaf2930 | No | sw08614 | 2 | — | ||||
| BGIBMGA008838 | — | 3 | nscaf2927 | No | sw21409 | 2 | — | ||||
| BGIBMGA008923 | — | 3 | nscaf2930 | No | sw21256 | 2 | — | ||||
| BGIBMGA003139 | — | 4 | nscaf2589 | No | sw05603 | 1 | — | ||||
|
| |||||||||||
| Others |
| SET-domain-containing (lysine methyltransferase) 8 | BGIBMGA012853 | — | 912 | 16 | Nscaf3058 | 1 | — | 3 | H4K20 |
|
| SET-domain and mariner transposase fusion gene | BGIBMGA007557 | — | 810 | 15 | nscaf2887 | 2 | — | 2 | — | |
Figure 1SET-domain sequence alignment analysis. The figure shows invariant residues in binding to the catalytic substrate AdoMet (S-adenosyl-L-methionine) the target lysine, catalytic site, the most conserved NHSC motif, F/Y switch controlling whether the product is a mono-, di-, or trimethylated histone, and the pseudoknot structure formed by two conserved SET motifs to form an active site in a location immediately next to the peptide-binding cleft. Sc: Saccharomyces cerevisiae; Hs: Homo sapiens; Dm: Drosophila melanogaster.
Figure 2Phylogenetic analysis of SUV4-20 subfamily. Pt: Pan troglodytes; Mm: Mus musculus; Bt: Bos taurus; Rn: Rattus norvegicus; Xl: Xenopus laevis; Dr: Danio rerio; Ag: Anopheles gambiae str. PEST; Har: Harpegnathos saltator; Ae: Acromyrmex echinatior; Cf: Camponotus floridanus (the abbreviations of species mentioned above are omitted here).
Figure 3Phylogenetic analysis of SUV39 subfamily. Nc: Neurospora crassa; Sp: Schizosaccharomyces pombe; Am: Apis mellifera; Ld: Leptinotarsa decemlineata. Ldheterochromatin represents the heterochromatin protein of Leptinotarsa decemlineata SUV39 subfamily. DIM5 and CLR4 are representative SUV39 subfamily members of Neurospora crassa and Schizosaccharomyces pombe. The bright green on the phylogenetic tree represents silkworm members (the abbreviations of species mentioned above are omitted here). Domains: CHROMO (CHRromatin Organization MOdifier), Pre-SET (Cys-rich putative Zn2+-binding domain that occurs at N-terminal to some SET domains), and Post-SET (Cysteine-rich motif following a subset of SET domains).
Figure 4Alignment of SETDB1 SET domains. Hs: Homo sapiens; Bt: Bos taurus; Mm: Mus musculus; Xl: Xenopus laevis; Dr: Danio rerio; Bm: Bombyx mori; Dm: Drosophila melanogaster.
Figure 5Phylogenetic analysis of SET2 subfamily. Cf: Camelus ferus; Ip: Ictalurus punctatus; Cm: Chelonia mydas; Tc: Tribolium castaneum; Cs: Clonorchis sinensis; Cb: Cerapachys biroi; Ae: Acromyrmex echinatior; Aa: Aedes aegypti; Cq: Culex quinquefasciatus; Cc: Ceratitis capitata. The bright green on the phylogenetic tree represents silkworm members (the abbreviations of species mentioned above are omitted here).
Figure 6Developmental expression profile of silkworm SET-domain genes. 1: whole silkworm; 2: ovary; 3: testis; 4: blood; 5: midgut; 6: fat body; 7: body wall; 8: Malpighian tube; 9: silk gland; 10: head.
Figure 7Expression patterns of BmSu(var)3-9 during embryogenesis day 1 to day 9 in silkworm. RT-PCR was performed to detect the expression patterns of BmSu(var)3-9 using specific primers and Actin3 gene was used as the internal control at each time point.
Figure 8Larval stage 5 day 3 expression profile of BmSu(var)3-9. 1: whole silkworm; 2: ovary; 3: testis; 4: blood; 5: midgut; 6: fat body; 7: body wall; 8: Malpighian tube; 9: silk gland; 10: head.