| Literature DB >> 17490476 |
Harpreet Kaur Saini1, Daniel Fischer.
Abstract
BACKGROUND: Mimivirus isolated from A. polyphaga is the largest virus discovered so far. It is unique among all the viruses in having genes related to translation, DNA repair and replication which bear close homology to eukaryotic genes. Nevertheless, only a small fraction of the proteins (33%) encoded in this genome has been assigned a function. Furthermore, a large fraction of the unassigned protein sequences bear no sequence similarity to proteins from other genomes. These sequences are referred to as ORFans. Because of their lack of sequence similarity to other proteins, they can not be assigned putative functions using standard sequence comparison methods. As part of our genome-wide computational efforts aimed at characterizing Mimivirus ORFans, we have applied fold-recognition methods to predict the structure of these ORFans and further functions were derived based on conservation of functionally important residues in sequence-template alignments.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17490476 PMCID: PMC1876218 DOI: 10.1186/1471-2164-8-115
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Highly confident 3D structural and functional predictions for 6 Mimivirus ORFans.
| 119.8 | 1fj2A | Human acyl protein Thioesterase 1 | α/β-Hydrolase | Carboxylesterase/Thioesterase | Catalytic triad, 'HG' oxyanion motif and PROSITE signature pattern of carboxylesterase conserved. | |
| 108.4 | 1q7tA | Metal-Dependent Deacetylase | LmbE-like | Metal-Dependent Deacetylase | Deacetyl 'AHPDDE' sequence motif and metal binding residues conserved. | |
| 82.1 | 1r28A | B-cell Lymphoma 6 (bcl6) BTB domain | POZ domain | BTB domain | Residues Asp and Arg/Lys in the charged pocket of BTB domain, important for transcriptional repression conserved. | |
| 63.9 | 1yj5A | Mammalian Polynucleotide Kinase | P-loop containing nucleoside triphosphate hydrolases | P-loop kinases | P-loop (walker-A and walker-B motifs) and arginine motif conserved. | |
| 70.7 | 1ewnA | Human AAG DNA repair Glycosylase | FMT C-terminal domain-like | 3-methyladenine DNA glycosylase | Active site residues Glu, Tyr, His, Tyr and Arg conserved. | |
| 62.1 | 1ap8 | Translation Initiation factor eIF4e | Translation Initiation factor eIF4e | Translation Initiation factor eIF4e | Trp residues important for CAP binding conserved. | |
Fold recognition results for ORFan R843.
| 1 | 119.75 | INUB_01 | 1fj2A | Human Acyl Protein Thioesterase 1 | Carboxylesterase/thioesterase 1 |
| 2 | 113.75 | FFA3_01 | 1fj2A | Human Acyl Protein Thioesterase 1 | Carboxylesterase/thioesterase 1 |
| 3 | 110.75 | INUB_03 | 1auoA | Carboxylesterase | Carboxylesterase/thioesterase 1 |
| 4 | 108.25 | 3DPS_01 | 1fj2A | Human Acyl Protein Thioesterase 1 | Carboxylesterase/thioesterase 1 |
| 5 | 107.88 | BasD_01 | 1fj2A | Human Acyl Protein Thioesterase 1 | Carboxylesterase/thioesterase 1 |
| 6 | 105.25 | 3DPS_02 | 1auoA | Carboxylesterase | Carboxylesterase/thioesterase 1 |
| 7 | 100.38 | ORF2_03 | 1fj2A | Human Acyl Protein Thioesterase 1 | Carboxylesterase/thioesterase 1 |
| 8 | 99.12 | FUG2_02 | 1auoA | Carboxylesterase | Carboxylesterase/thioesterase 1 |
| 9 | 86.88 | FFA3_02 | 1gkkA | Feruloyl Esterase Domain | |
| 10 | 86.75 | FFA3_05 | 1gklA | s954a Mutant of the Feruloyl Esterase |
Figure 1a) Sequence-structure alignment of the ORFan R843 and the template human acyl protein thioesterase 1 (PDB code: 1fj2, Chain: A). The predicted secondary structure (H: helix and E: β-strand) of R843 is shown above its sequence. The observed secondary structure of 1fj2A is shown below its sequence. Identical residues are marked in black with blue background. The secondary structural elements of 1fj2 are represented as α and β and are numbered from A-F and 3–8 respectively. The conserved catalytic residues, S, D and H are shown as stars and highlighted with bold black letters in magenta background. The conserved pentapeptide sequence G-X-S-X-G with active site serine is shown in green color. The conserved PROSITE signature pattern of carboxylesterases is shown in grey boxes. The residues probably forming the oxyanion hole are marked in bold black letters in yellow background. b) Ribbon diagram of predicted model of ORFan R843. The model was generated from INUB and refined using Nest [53]. α-helices and β-sheets are colored in blue and yellow. The remaining secondary structure is in white color. α-helices and β-strands of the canonical fold are numbered alphabetically from A to F and numerically from 3 to 8 respectively (the canonical strands 1 and 2 are missing). The residues forming the catalytic triad (Ser200, Asp261 and His292) are represented in ball-and-sticks. The figure is drawn using MOLSCRIPT program [54].
Fold recognition results for ORFan R277.
| 1 | 63.88 | FFA3_01 | 1yj5A | Polynucleotide kinase | 2.7.1.78 | - |
| 2 | 59.00 | BasD_01 | 1yj5A | Polynucleotide kinase | 2.7.1.78 | - |
| 3 | 58.25 | FUG2_01 | 1yj5A | Polynucleotide kinase | 2.7.1.78 | - |
| 4 | 54.75 | FFA3_03 | 1ly1A | T4 polynucleotide kinase | 2.7.1.78 | Nucleotide and Nucleoside Kinases |
| 5 | 54.38 | INUB_01 | 1ly1A | T4 polynucleotide kinase | 2.7.1.78 | Nucleotide and Nucleoside Kinases |
| 6 | 53.50 | FUG2_09 | 1shkA | Shikimate kinase | 2.7.1.71 | Shikimate kinase |
| 7 | 52.62 | FFA3_02 | 1qhsA | Chloramphenicol phosphotransferase | 2.7.1.- | Chloramphenicol phosphotransferase |
| 8 | 52.62 | 3DPS_01 | 1ly1A | T4 polynucleotide kinase | 2.7.1.78 | Nucleotide and Nucleoside Kinases |
| 9 | 52.38 | FUG2_06 | 1shkA | Shikimate kinase | 2.7.1.71 | Shikimate kinase |
| 10 | 51.75 | FFA3_06 | 1qhnA | Chloramphenicol phosphotransferase | 2.7.1.- | Chloramphenicol phosphotransferase |
Figure 2a) Sequence-structure alignment of ORFan R277 and the template mammalian polynucleotide kinase (PDB code: 1yj5, Chain: A). The ATP binding conserved motifs, Walker-A (P-loop) and Walker-B are highlighted in black with green and yellow backgrounds respectively. The conserved arginine motif in 1yj5 and R277 is shown in dark and light pink background respectively. b) Ribbon diagram of predicted model of ORFan R277. The model was generated from FFAS03 [55] and refined using Nest [53]. α-helices and β-strands are numbered numerically. The ATP-binding P-loop is highlighted in red color. The possible ATP interacting arginine residues (R103 and R107) present in the helical lid (helix α5) covering the P-loop are shown as sticks.
Fold recognition results for ORFan L529.
| 1 | 62.14 | FFA3_03 | 1ap8 | Translation Initiation Factor eIF4E | Translation Initiation Factor eIF4E |
| 2 | 61.14 | FFA3_02 | 1r58A | Translation Initiation Factor eIF4E in complex with m7gdp and eif4gi | Translation Initiation Factor eIF4E |
| 3 | 52.14 | 3DPS_02 | 1ap8 | Translation Initiation Factor eIF4E | Translation Initiation Factor eIF4E |
| 4 | 51.57 | FFA3_01 | 1ipbA | eIF4E complexed with 7-methyl gpppa | Translation Initiation Factor eIF4E |
| 5 | 49.71 | INUB_02 | 1ej1A | eIF4E bound to 7-methyl-gdp | Translation Initiation Factor eIF4E |
| 6 | 48.57 | BasD_01 | 1ap8 | Translation Initiation Factor eIF4E | Translation Initiation Factor eIF4E |
| 7 | 47.14 | INUB_01 | 1ap8 | Translation Initiation Factor eIF4E | Translation Initiation Factor eIF4E |
| 8 | 44.29 | ORF2_01 | 1ap8 | Translation Initiation Factor eIF4E | Translation Initiation Factor eIF4E |
| 9 | 41.86 | 3DPS_01 | 1ej1A | eIF4E bound to 7-methyl-gdp | Translation Initiation Factor eIF4E |
Figure 3a) Sequence-structure alignment ORFan L529 and yeast translation initiation factor (PDB code: 1ap8). The eight conserved tryptophans of 1ap8 are numbered as 1–8. The conserved terminal tryptophans (Trp-1 & Trp-8) are shown in yellow boxes. The consensus motif (S/T)VxxxFW is highlighted in green color. b) Ribbon diagram of predicted model of ORFan L529. The model was built from FFAS03 [55] and refined using Nest [53]. The model shows curved antiparallel β-sheet with two long helices present on one side. The conserved tryptophan residues (Trp25 and Trp124) required for cap recognition are shown in ball-and-sticks.
Highly confident 3D structural predictions for 10 Mimivirus ORFans.
| 184.8 | 1gxr | WD40 region of human groucho/tle1 | 7-bladed beta propeller | |
| 96.1 | 1s78 | Receptor protein-tyrosine kinase erbb-2 | Leucine rich repeat | |
| 84.3 | 1c3j | Glucosyltransferase | UDP-Glycosyltransferase/glycogen phosphorylase | |
| 67.5 | 1qpe | lymphocyte-specific kinase | Protein kinase like | |
| 67.3 | 1gh2 | Catalytic Domain of A New Human Thioredoxin-Like Protein | Thioredoxin fold | |
| 60.6 | 1m38 | Inorganic Pyrophosphatase | OB-fold | |
| 58.8 | 1gr3 | Crystal structure of a collagen VIII | TNF-like | |
| 55.6 | 1r0w | Cystic fibrosis transmembrane conductance regulator nucleotide-binding domain | P-loop containing nucleoside triphosphate hydrolases | |
| 53.9 | 1ji6 | Insecticidal Bacterial Del Endotoxin Cry3Bb1 | Galactose-binding domain-like | |
| 52.7 | 1ciy | Insecticidal Toxin | Galactose-binding domain-like | |