| Literature DB >> 31719763 |
Ridip Kumar Gogoi1, Ringhoilal Chorei1, Himanshu Kishore Prasad1.
Abstract
The methylotrophic yeast Komagataella phaffii is an industrial workhorse yeast species that has been widely used in biotechnology industries for recombinant protein production. Genome sequencing of this yeast in 2009 have enabled scientists to assign and characterize functions to most of its proteins while few hypothetical proteins remain uncharacterized. Therefore, it is of interest to characterize the hypothetical protein coding gene PAS_chr2-2_0152 as SET containing the ZNF-MYND (SMYD) domain. They share a homology with other methylotrophic and non-methylotrophic yeast species together with known SMYD proteins of Homo sapiens, with conserved distinctive SMYD domain patterns. A homology model is developed using the crystal structure of human histone-lysine methyl transferase smyd3 as template. These data points to that the hypothetical protein is a potential histones and non-histone lysine methyl transferase regulating cell cycle, chromatin remodeling, DNA damage response, homologous recombination and transcription in Komagataella phaffii. Data also suggests the evolutionary syntenic conservation of DNA damage regulator (RFX) and lysine methyl transferase (SMYD) genes in some yeast lineages, pointing to a conserved role requiring further confirmation.Entities:
Keywords: Hypothetical protein; Komagataella phaffii; SMYD; methyltransferase; phylogenetic analysis
Year: 2019 PMID: 31719763 PMCID: PMC6822524 DOI: 10.6026/97320630015542
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Blast global alignment of PAS_chr2-2_0152 (KpSMYD protein) (Accession No. XP_002492054.1) with Hits from BLASTP using NR and UniProtKb/Swiss-Prot database
| S. No | Name Assigned | Accession No. | Protein Description | Organism | Length (AA) | Percent Identity (%) |
| 1 | MgSMYD | XP_001482836.1 | Hypothetical protein | Meyerozyma guilliermondii ATCC 6260 | 637 | 20 |
| 2 | OpSMYD | XP_013935809.1 | Hypothetical protein | Ogataea parapolymorpha DL-1 | 597 | 22 |
| 3 | KcSMYD | XP_022459850.1 | Uncharacterized protein | Kuraishia capsulata CBS 1993 | 732 | 22 |
| 4 | DhSMYD | XP_462014.2 | DEHA2G10846p | Debaryomyces hansenii CBS767 | 725 | 21 |
| 5 | CaSMYD | XP_713490.2 | Hypothetical protein | Candida albicans SC5314 | 630 | 17 |
| 6 | SpSMYD | NP_596514.1 | putative histone lysine methyltransferase Set6 | Schizosaccharomyces pombe | 483 | 16 |
| 7 | SMYD1 | NP_001317293.1 | SMYD1 isoform 1 | Homo sapiens | 490 | 17 |
| 8 | SMYD2 | NP_064582.2 | SMYD2 | Homo sapiens | 433 | 16 |
| 9 | SMYD3 | NP_001161212.1 | SMYD3 isoform1 | Homo sapiens | 428 | 17 |
Figure 1Phylogenetic analysis and sequence conservation of SMYD proteins (A). Neighbor Joining (NJ) phylogenetic tree representing KpSMYD protein with other SMYD proteins is shown. The accession numbers of the protein sequences were mentioned in Table 1. The tree was constructed from the multiple sequence alignment of whole SMYD protein sequences from BLASTP hits using NR and UniProtKB/Swiss-Prot database with 1000 bootstrap replicas following JTT substitution model with MegaX. (B) Study of the Synteny analysis of SMYD proteins using NCBI's Sequence Viewer. (C) SMYD domain organization in the SMYD proteins.
Figure 2Structure analysis of the KpSMYD protein and cartoon representation of the 3D model (A) Secondary structure prediction using PSIPRED web server. (B) Hydrophobicity surface view of the 3D model predicted by Phyre2 using the template from PDB crystal structure molecule c3mekA chain A of Homo sapiens entitled human histone-lysine n-methyltransferase smyd3 in complex with sadenosyl- l-methionine. (C) Cartoon representation of the 3D model with the gray-colored coil, gold-colored strand, and dark cyan represents helix with white-colored inside. (D) Ramachandran Plot showing the validation of the modeled protein
Data for a homology model of KpSMYD (PAS_chr2-2_0152) using Phyre2 is given. Secondary structure information of the modeled protein is presented in the table with the template information.
| Protein Model | Residues | Resolution (Å) | Template Information | Sequence identity to the template (%) | Organism | Secondary structure Information | ||||||
| modeled at 100% confidence | ||||||||||||
| Residues | Coverage (%) | PDB | Chain | PDB Header | Description | Disordered | Alpha helix (%) | Beta strand (%) | ||||
| (%) | ||||||||||||
| PAS_chr2-2_0152 | 408 | 55 | 1.75 | 3MEK | A | Trans | SET and ZNF-MYND domain containing protein 3 | 19 | Homo sapiens | 14 | 53 | 8 |
| ferase |