| Literature DB >> 32354127 |
Jeroen Wagemans1, Jessica Tsonos1,2,3, Dominique Holtappels1, Kiandro Fortuna1, Jean-Pierre Hernalsteens2, Henri De Greve3,4, Leandro F Estrozi5, Maria Bacia-Verloop5, Christine Moriscot6, Jean-Paul Noben7, Guy Schoehn5, Rob Lavigne1.
Abstract
The phAPEC6 genome encodes 551 predicted gene products, with the vast majority (83%) of unknown function. Of these, 62 have been identified as virion-associated proteins by mass spectrometry (ESI-MS/MS), including the major capsid protein (Gp225; present in 1620 copies), which shows a HK97 capsid protein-based fold. Cryo-electron microscopy experiments showed that the 350-kbp DNA molecule of Escherichia coli virus phAPEC6 is packaged in at least 15 concentric layers in the phage capsid. A capsid inner body rod is also present, measuring about 91 nm by 18 nm and oriented along the portal axis. In the phAPEC6 contractile tail, 25 hexameric stacked rings can be distinguished, built of the identified tail sheath protein (Gp277). Cryo-EM reconstruction reveals the base of the unique hairy fibers observed during an initial transmission electron microscopy (TEM) analysis. These very unusual filaments are ordered at three annular positions along the contractile sheath, as well as around the capsid, and may be involved in host interaction.Entities:
Keywords: HK97-fold; cryo-EM; jumbo phage
Mesh:
Substances:
Year: 2020 PMID: 32354127 PMCID: PMC7247149 DOI: 10.3390/ijms21093119
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Electron microscopy of phAPEC6. (A) Negative staining image of phAPEC6. Six short tail fibers (arrows) on the base plate are clearly visible. (B) Different phAPEC6 particles display hairy appendages, sticking out from the tail but also from the capsid (arrows). Those attached to the capsid seem to be trimers (inset). The scale bar represents 50 nm.
Figure 2Circular representation of the genome of the giant Escherichia phage vB_EcoM_PhAPEC6. The inner circle represents a GC plot. The 551 identified coding sequences (CDS) are depicted by arrows or boxes if a function could be predicted or not, respectively. The 62 CDS in green, scattered along the genome, are structural proteins detected in the mature virion by mass spectrometry. The CDS indicated in light blue are exonucleases; dark blue corresponds to DNA-associated proteins such as DNA polymerases, topoisomerases, helicases and ssDNA-binding proteins; orange to proteases and dark grey to chromosome condensation domains. CDS in purple and red are involved in general metabolism and host lysis, respectively.
Structural proteome of PhAPEC6.
| Protein | Function | MW (kDa) | Unique Peptide Count | Sequence Coverage (%) |
|---|---|---|---|---|
| gp003 | DNA condensation protein | 41.39 | 4 | 11.6 |
| gp005 | DNA condensation protein | 40.25 | 2 | 5.7 |
| gp006 | hypothetical protein | 41.36 | 1 | 3.3 |
| gp007 | DNA condensation protein | 30.06 | 3 | 17.5 |
| gp009 | DNA condensation protein | 39.30 | 3 | 10.8 |
| gp010 | hypothetical protein | 41.39 | 1 | 3.1 |
| gp035c | chromosome segregation protein | 46.39 | 8 | 28.8 |
| gp037c | hypothetical protein | 14.83 | 5 | 48.0 |
| gp077 | hypothetical protein | 42.88 | 10 | 37.8 |
| gp078c | hypothetical protein | 29.73 | 4 | 21.4 |
| gp149c | hypothetical protein | 25.61 | 4 | 29.2 |
| gp150c | hypothetical protein | 21.33 | 5 | 34.8 |
| gp158c | hypothetical protein | 90.30 | 9 | 17.1 |
| gp159c | topoisomerase II small subunit | 54.54 | 2 | 8.1 |
| gp161c | hypothetical protein | 28.98 | 6 | 28.9 |
| gp162c | hypothetical protein | 60.05 | 10 | 25.4 |
| gp167c | hypothetical protein | 20.66 | 4 | 23.4 |
| gp179c | hypothetical protein | 21.48 | 2 | 15.0 |
| gp182c | hypothetical protein | 12.33 | 1 | 11.7 |
| gp192c | hypothetical protein | 36.90 | 7 | 32.0 |
| gp203c | RecA-like recombination protein | 41.58 | 2 | 6.7 |
| gp204c | single-stranded DNA binding protein | 38.89 | 4 | 17.1 |
| gp205c | hypothetical protein | 48.48 | 2 | 8.4 |
| gp210 | hypothetical protein | 20.92 | 6 | 69.8 |
| gp211 | hemagglutinin repeat-containing protein | 24.30 | 13 | 69.9 |
| gp215c | hypothetical protein | 26.56 | 5 | 26.3 |
| gp225c | major head protein | 42.18 | 14 | 74.9 |
| gp226c | scaffolding protein | 42.21 | 2 | 4.9 |
| gp227c | prohead core scaffold and protease | 23.01 | 2 | 15.0 |
| gp228c | hypothetical protein | 31.12 | 5 | 16.7 |
| gp230c | portal vertex protein of head | 64.56 | 9 | 21.1 |
| gp231c | hypothetical protein | 79.72 | 1 | 1.4 |
| gp238 | hypothetical protein | 27.50 | 2 | 12.8 |
| gp239 | hypothetical protein | 12.37 | 1 | 8.7 |
| gp240 | hypothetical protein | 85.40 | 5 | 10.0 |
| gp241 | lysozyme family protein | 95.90 | 7 | 9.9 |
| gp244 | baseplate wedge | 131.18 | 21 | 24.2 |
| gp245 | hypothetical protein | 381.88 | 17 | 7.6 |
| gp248 | proximal tail protein | 51.67 | 4 | 13.9 |
| gp250c | hypothetical protein | 40.33 | 9 | 40.2 |
| gp251c | hypothetical protein | 21.95 | 2 | 13.4 |
| gp253 | hypothetical protein | 40.99 | 14 | 65.6 |
| gp256 | tail fiber protein | 180.38 | 15 | 15.9 |
| gp259c | thymidylate synthase | 35.51 | 3 | 10.7 |
| gp262c | hypothetical protein | 25.02 | 1 | 6.4 |
| gp265c | ATPase | 49.52 | 6 | 19.0 |
| gp267c | hypothetical protein | 35.96 | 1 | 4.9 |
| gp268c | hypothetical protein | 51.12 | 6 | 17.8 |
| gp270c | hypothetical protein | 16.20 | 4 | 41.1 |
| gp273c | hypothetical protein | 40.42 | 7 | 29.9 |
| gp274c | hypothetical protein | 28.76 | 3 | 16.3 |
| gp275c | hypothetical protein | 25.07 | 2 | 9.2 |
| gp277c | tail sheath monomer | 97.05 | 36 | 68.0 |
| gp280c | neck protein | 29.93 | 4 | 20.5 |
| gp306c | hypothetical protein | 16.90 | 6 | 65.8 |
| gp314c | tail fiber protein | 105.34 | 8 | 13.0 |
| gp315c | hypothetical protein | 4.66 | 2 | 69.0 |
| gp323c | hypothetical protein | 98.62 | 4 | 5.6 |
| gp341c | PhoH-like protein | 53.21 | 8 | 30.9 |
| gp347 | hypothetical protein | 14.10 | 8 | 75.2 |
| gp549 | DNA condensation protein | 41.77 | 4 | 13.2 |
| gp551 | DNA condensation protein | 41.23 | 5 | 14.2 |
Figure 3Detailed 3D reconstruction of the phAPEC6 capsid: (A) Isosurface representation of the phAPEC6 head 3D reconstruction at a 10-Å resolution. The particle is colored according to its diameter (from red to blue). (B) Central slice of the phAPEC6 head visualizing 15 concentric layers of DNA (black arrows). (C) Close-up view of one 6-fold axis. The major capsid protein (MCP) is colored in orange and the decoration protein in yellow. The black oval highlights a local 2-fold axis. (D) Fitting of the HK97 MCP X-ray structure into the phAPEC6 hexamer density seen from the outside of the particle (an arrow highlights the long alpha helix present in both HK97 and phAPEC6 MCP) and (E) seen from the inside of the particle. (F) Detail of the hexamer central protein along a 2-fold axis, displaying the fiber-anchoring protein. (G) Side view of the fiber-anchoring protein. (H) Close-up view of the 5-fold axis. The pentamer is colored in blue and highlighted by a black circle. The same protein as the one colored in yellow in part C is visible around the 5-fold axis.
Figure 4Cryo-electron microscopy of phAPEC6. (A) Low-dose and high-dose cryo-electron microscopy images of phAPEC6 demonstrate the presence of an inner body. (B) Cryo-electron microscopy images of the contracted form of phAPEC6. The capsid has released all the DNA, the tail is contracted and the presence of a little piece of host cell membrane is visible (arrow).
Figure 53D reconstruction of the entire tail. (A) A negative staining image of phAPEC6. The different fibers are labeled 1*, 2*, 3* and 4* (for the baseplate one). (B) 3D reconstruction of the entire tail at a normal threshold (resolution 25 Å). The baseplate plus fiber are colored in yellow and pink; the different fibers anchored in the tail (1*, 2* and 3*) are in light pink; a decoration protein present at 2 levels of the tail is colored in brown and the collar and the portal are colored respectively in blue and red. (C) Distal part of the tail at a high contour level: the fibers located at the 2*-level are visible. The same color code as in B has been used. (D) Detail of the baseplate at a higher contour level. The host membrane patch (green), as well as some fibers connected to the membrane, are visible (red). (E) Bottom view of the baseplate showing the complex organization of this part.