Michael J Allen, Julie A Howard, Kathryn S Lilley, William H Wilson.
Abstract
BACKGROUND: Emiliania huxleyi virus 86 (EhV-86) is the type species of the genus Coccolithovirus within the family Phycodnaviridae. The fully sequenced 407,339 bp genome is predicted to encode 473 protein-coding sequences (CDSs) and is the largest Phycodnaviridae genome sequenced to date. The majority of EhV-86 CDSs exhibit no similarity to proteins in the public databases.
Year: 2008; PMID: 18346272; PMCID: PMC2322966; DOI: 10.1186/1477-5956-6-11
Source DB: PubMed; Journal: Proteome Sci; ISSN: 1477-5956; Impact factor: 2.480
Figure 1. SDS-PAGE of EhV-86 virion proteins.
Proteins identified by LC-MS in purified EhV-86 virions.
| TrEMBL | Gene | Expression profile (a) | Number of peptides | MW, kDa (b) | Mascot score (c) |
| Q4A3B2 | ehv015 | 2–4 h p.i. | 1 | 14.6 | 79 |
| Q4A399 | ehv034 | > 4 h p.i. | 2 | 18.7 | 117 |
| Q4A398 | ehv035 | 1–2 h p.i. | 16 | 141.4 | 600 |
| Q4A397 | ehv036 | 2–4 h p.i. | 2 | 18.6 | 97 |
| Q4A395 | ehv038 | 2–4 h p.i. | 1 | 12.5 | 48 |
| Q4A378 | ehv055 | unknown | 3 | 34.1 | 59 |
| Q4A373 | ehv060 | > 4 h p.i. | 1 | 212.2 | 207 |
| Q4A366 | ehv067 | > 4 h p.i. | 4 | 41.9 | 145 |
| Q4A348 | ehv085 | 2–4 h p.i. | 22 | 59.9 | 1224 |
| Q4A333 | ehv100 | 2–4 h p.i. | 2 | 40.0 | 208 |
| Q4A2Y5 | ehv149 | 1–2 h p.i. | 8 | 40.0 | 285 |
| Q4A2W6 | ehv168 | > 4 h p.i. | 4 | 18.6 | 239 |
| Q4A2V9 | ehv175 | > 4 h p.i. | 2 | 40.6 | 59 |
| Q4A2V2 | ehv182 | > 4 h p.i. | 4 | 22.7 | 248 |
| Q4A2U4 | ehv189 | unknown | 2 | 45.4 | 109 |
| Q4A2U2 | ehv191 | 1–2 h p.i. | 1 | 93.8 | 46 |
| Q4A2T8 | ehv195 | > 4 h p.i. | 1 | 22.1 | 57 |
| Q4A2T3 | ehv200 | > 4 h p.i. | 3 | 34.1 | 157 |
| Q4A2N2 | ehv250 | > 4 h p.i. | 1 | 11.9 | 100 |
| Q4A2H9 | ehv301 | > 4 h p.i. | 2 | 32.0 | 139 |
| Q4A2F4 | ehv325 | > 4 h p.i. | 1 | 15.8 | 43 |
| Q4A2E6 | ehv333 | unknown | 1 | 13.5 | 35 |
| Q4A2D9 | ehv340 | > 4 h p.i. | 1 | 14.4 | 48 |
| Q4A2C3 | ehv356 | unknown | 2 | 81.0 | 39 |
| Q4A2C1 | ehv358 | > 4 h p.i. | 1 | 17.2 | 45 |
| Q4A237 | ehv442 | > 4 h p.i. | 1 | 19.4 | 58 |
| Q4A225 | ehv454 | 2–4 h p.i. | 3 | 74.4 | 119 |
| Q4A218 | ehv461 | 1–2 h p.i. | 4 | 32.9 | 36 |
(a) Data from Allen et al., 2006; p.i. indicates the time post infection when the transcript is first seen [25]. (b) Predicted MW. (c) Where a protein was identified in multiple bands, the highest score is shown.
Unique peptides used in the identification of proteins from the EhV-86 virion.
| Gene | Unique peptides |
| ehv015 | RDIILDPNASPSDKR |
| ehv034 | KCIAPDYNKN, KVLNETVSGYFRR |
| ehv035 | KDRPLISENGRY, KDSEIEDLEEQNNSLDRD, KEGYDQNFIGVPSYAVRD, KGIIGVALLEGKG, KIPYVYLNPYLKR, KITAPTAALAAEAAKL, KLAGVYGCGSKT, KLATTVASDIETRK, KNILSGDLEKE, KNYDDSVFFKD, KQIETITAELEPLAEKD, KQMEQLQFEKD, KTSTDLANCTTKV, KVGGPYTVISRN, RATAQSEHVAQLLSIETNKN, RLSNLGVLSTNNQILNKN |
| ehv036 | KESEADLAEAKR, RELGEATDDLGDAKK |
| ehv038 | KTTLSDITAEIADKR |
| ehv055 | KDDVDAWKE, KDDVDAWKEESFVMRA, KTDFNSAVVKS |
| ehv060 | KIDSWEPGELAELYVDSTRV |
| ehv067 | KELNLVLPPGTKG, KLAVIEEIDNKL, KLIIPAETARH, RYMTPLDVARE |
| ehv085 | KANKDAGDHFNFSGIGGRD, KDAGDHFNFSGIGGRD, KDAGDHFNFSGIGGRDPVVSAELLFNNTARV, KEQLIAEAKN, KFTNGLAGLLYSN-, KIVLPGLKV, KVGGATIDTIWSELLFAMEELMGRA, KYNAAPLPVAAQMQSTEMPDFDYAYWTEAIGFHLIKR, RASLECTYVHLEAAERD, RDALTANAGTQLIVQHQAHLQQVSSNNVTARL, RDPVVSAELLFNNTARV, RLDSVELALTLQDDFGAAHDANSELFVFARS, RLTETIGRT, RNVPISDDHLRA, RQEQILYVPLPWYFTKH, RQGDLLSWMYLKI, RQGDLLSWMYLKI, RRLTETIGRT, RRPTELMKA, RRPTELMKA, RSNLVVLHAERN, RVTQKPAVWWRA |
| ehv100 | KTTPAIGLGPPDKY, KGTCIGNLTQCTTEKG |
| ehv149 | KCIPDLATICTGKL, KKYDCAPGTKV, KLEPGADNNCVIKA, KLNNVSTGAKK, KVGPLGEKC, KYDCAPGTKV, RAAAAWAATRG, RGMAGSAAGATSSAAKS |
| ehv168 | KNSALMEMVKS, KSTMGAGELEVARQ, KWTGAAAAGAAAPSAADVIYKR, RGVYGPQPAGSDSSTGKT |
| ehv175 | RRPPNILVKM, RYFEDIFNNPRN |
| ehv182 | KEISDPEIVDLKY, KSTCMFEADRS, KYDEESSSPARK, RFVVGDFIINNQGKL |
| ehv189 | KVVDSLYDFRI, RYNAQQSIRD |
| ehv191 | KQNLGQSDGNLLRA |
| ehv195 | RSNNQYNVQRR |
| ehv200 | KSNGYDDNFVGVNKS, KVMAVSATGTTARV, RVNVSPYWPRN |
| ehv250 | KSFEDAANTPGYLSARS |
| ehv301 | RSMNPNDIRT, RSNEVNDTMIARS |
| ehv325 | KEQPNTVSGERV |
| ehv333 | KGYDVAAVQRI |
| ehv340 | KAIGEGMEPGMIRA |
| ehv356 | RGQTDPSQNPVVDTRF, KNPSIIGAAEKY |
| ehv358 | KSADELNTLVKE |
| ehv442 | KYANGSNVTLYYDPKN |
| ehv454 | KIPTATVTTRQ, RWSGDYLEIKK, KSAVTSITLLTDLEQVRV |
| ehv461 | KTNAIELRR, KVDVYSLSPKN, RLTEELRF, RVGAHGPVEIRV |
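The peptide lists above can be cross-checked against the "Number of peptides" column of the LC-MS table. The following is a minimal sketch of such a check, not part of the original study: the function name `parse_peptide_row` and the three sample rows are chosen here for illustration, and the parser assumes each row has the form `| gene | peptide, peptide, ... |`.

```python
def parse_peptide_row(row: str) -> tuple[str, list[str]]:
    """Split one '| gene | pep1, pep2 |' row into (gene, [peptides])."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    gene, peptide_cell = cells[0], cells[1]
    # Tolerate stray whitespace and trailing commas left by text extraction.
    peptides = [p.strip() for p in peptide_cell.split(",") if p.strip()]
    return gene, peptides


# Three rows copied from the peptide table (illustrative subset only).
rows = [
    "| ehv015 | RDIILDPNASPSDKR |",
    "| ehv055 | KDDVDAWKE, KDDVDAWKEESFVMRA, KTDFNSAVVKS |",
    "| ehv461 | KTNAIELRR, KVDVYSLSPKN, RLTEELRF, RVGAHGPVEIRV |",
]

# "Number of peptides" values for the same genes, from the LC-MS table.
reported = {"ehv015": 1, "ehv055": 3, "ehv461": 4}

counts = {}
for row in rows:
    gene, peptides = parse_peptide_row(row)
    counts[gene] = len(peptides)

# Genes whose listed peptides disagree with the reported count.
mismatches = {
    gene: (counts.get(gene), expected)
    for gene, expected in reported.items()
    if counts.get(gene) != expected
}
print(mismatches)
```

Repeated sequences within a row are counted separately, because the reported totals appear to include them (for example, the 22 peptides listed for ehv085 include two sequences that each appear twice).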
Analysis of proteins identified in the EhV-86 virion.
| Gene Number | Top BLAST Hit (a) | BLAST Score (E-value) (a) | TMs (b) | InterProScan Results (c) |
| ehv015 | hypothetical protein, | 0.009 | 1 | No hits reported. |
| ehv034 | predicted protein, | 0.016 | 1 | No hits reported. |
| ehv035 | similar to SMC2 protein, | 0.058 | 2 | No hits reported. |
| ehv036 | HlyD family secretion protein, | 0.004 | 2 | No hits reported. |
| ehv038 | hypothetical protein, | 0.32 | 1 | No hits reported. |
| ehv055 | hypothetical protein, | 5e-06 | 6 | No hits reported. |
| ehv060 | No significant match | n/a | 1 | C type lectin 2 domain |
| ehv067 | Hypothetical protein, | 1.4 | 0 | No hits reported. |
| ehv085 | major capsid protein, Heterosigma akashiwo virus 01 | 7e-39 | 0 | Capsid domain (iridovirus like) |
| ehv100 | predicted protein, | 5e-10 | 2 | No hits reported. |
| ehv149 | hypothetical protein, | 0.43 | 2 | C type lectin 1 domain |
| ehv168 | hypothetical protein, | 2.4 | 1 | No hits reported. |
| ehv175 | Putative serine/threonine protein kinase, | 0.66 | 0 | Protein Kinase |
| ehv182 | diaminopimelate decarboxylase, | 0.48 | 1 | No hits reported. |
| ehv189 | pol-like protein, | 2.0 | 0 | No hits reported. |
| ehv191 | No significant match | n/a | 1 | Proline rich extensin signature |
| ehv195 | hypothetical protein, | 0.27 | 2 | No hits reported. |
| ehv200 | hypothetical protein, | 0.23 | 1 | No hits reported. |
| ehv250 | GCN5-related N-acetyltransferase, | 9.6 | 1 | No hits reported. |
| ehv301 | NB-ARC domain containing protein, | 0.31 | 0 | No hits reported. |
| ehv325 | envelope glycoprotein, Simian immunodeficiency virus | 1.1 | 1 | No hits reported. |
| ehv333 | CRISPR-associated protein, Cse1 family, | 0.35 | 2 | No hits reported. |
| ehv340 | Putative fimbrial associated sortase-like protein, | 0.42 | 1 | No hits reported. |
| ehv356 | No match | n/a | 1 | No hits reported. |
| ehv358 | hypothetical protein, | 2e-08 | 1 | Thioredoxin domain |
| ehv442 | conserved hypothetical protein, | 0.008 | 2 | No hits reported. |
| ehv454 | hemocyanin isoform 1, | 2.2 | 2 | No hits reported. |
| ehv461 | Fatty acid/phospholipid synthesis protein, | 2.6 | 1 | No hits reported. |
(a) BLASTP analysis [12] against nonredundant protein sequences, performed on 12 December 2007. (b) Transmembrane (TM) domains predicted by HMMTOP v2.0. (c) Function and structure predicted by InterProScan [10].