| Literature DB >> 20353581 |
Andrew R Timms1, Joanna Cambray-Young, Andrew E Scott, Nicola K Petty, Phillippa L Connerton, Louise Clarke, Kathy Seeger, Mike Quail, Nicola Cummings, Duncan J Maskell, Nicholas R Thomson, Ian F Connerton.
Abstract
BACKGROUND: Our understanding of the dynamics of genome stability versus gene flux within bacteriophage lineages is limited. Recently, there has been a renewed interest in the use of bacteriophages as 'therapeutic' agents; a prerequisite for their use in such therapies is a thorough understanding of their genetic complement, genome stability and their ecology to avoid the dissemination or mobilisation of phage or bacterial virulence and toxin genes. Campylobacter, a food-borne pathogen, is one of the organisms for which the use of bacteriophage is being considered to reduce human exposure to this organism.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20353581 PMCID: PMC2853527 DOI: 10.1186/1471-2164-11-214
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of basic parameters for CP220 and CPt10 phage genomes
| Genome size bpa (+/- variable repeats) | Genome | CDS | Number of | Coding | Strand Bias | |
|---|---|---|---|---|---|---|
| CP220 | 177 493/171 841 | 27.4 | 28.2 | 194 | 88.4 | 173/194 |
| CPt10 | 175 720/173 299 | 27.3 | 28.0 | 201 | 89.7 | 180/201 |
a Genome sizes are given both for the sequences as currently assembled and excluding repeat regions which may be variable in size.
b The number of CDS regions and the coding density is based on the proportion of the genome covered by discrete coding regions. It does not account for CDS regions produced from internal initiation sites or alternative gene products produced by splicing, post-translational processing or other mechanisms that may increase the number of protein products produced from the genome.
Figure 1Genome alignments of CP220 and CPt10. Nucleotide alignment of the virulent bacteriophages CP220 (top) and CPt10 (bottom) generated using the Artemis Comparison Tool (ACT). The CDS regions are indicated by bars top and bottom showing the 'forward' and 'reverse' strands. CDS regions are coloured according to the following key: grey - areas of nucleotide identity, white - significantly different CDS regions present in only one of the genomes, solid black - repeat regions (numbered according to the current genome annotation), red - DNA modification proteins, blue - insertion elements, pink - radical SAM proteins. Red bars indicate regions of sequence homology, the diagonal lines link multiple repeat regions in both genomes indicating the conserved nature of these sequences throughout both genomes.
Comparison of T4 and CP220 Structural Proteins
| CP220 CDS | T4 Lengtha | CP220 Lengtha | T4 Gene Functionb | CP220/T4 Identity (%) |
|---|---|---|---|---|
| CPT_0001 | 610 | 744 | Gp17 Terminase subunit with nuclease and ATPase activity; binds single-stranded DNA, Gp16 and Gp20 | 143/436 (32%) |
| CPT_0005 | 516 | 503 | Gp39 DNA topoisomerase large subunit | 186/516 (35%) |
| CPT_0009 | 342 | 347 | Gp61 DNA primase subunit | 73/330 (22%) |
| CPT_0010 | 319 | 304 | Gp44 Clamp loader subunit, DNA polymerase accessory protein | 95/319 (29%) |
| CPT_0011 | 305 | 352 | RNaseH ribonuclease | 56/151 (37%) |
| CPT_0029 | 157 | 178 | Gp49 EndoVII packaging and recombination endonuclease VII | 28/108 (25%) |
| CPT_0030 | 524 | 569 | Gp20 Portal vertex protein of head | 130/456 (28%) |
| CPT_0033 | 487 | 429 | Gp30 DNA ligase | 128/484 (26%) |
| CPT_0034 | 659 | 516 | Gp18 Tail sheath monomer | 58/235 (24%) |
| CPT_0037 | 134 | 120 | Gp25 Baseplate wedge subunit | 26/66 (39%) |
| CPT_0041 | 660 | 1214 | Gp6 Baseplate wedge subunit | 86/389 (22%) |
| CPT_0045 | 163 | 252 | Gp19 Tail tube protein | 49/176 (27%) |
| CPT_0046 | 150 | 158 | Gp4 Head completion protein | 44/147 (29%) |
| CPT_0048 | 185 | 255 | Gp55 Sigma factor for T4 late transcription | 33/123 (26%) |
| CPT_0051 | 521 | 444 | Gp23 Major head protein | 131/412 (31%) |
| CPT_0053 | 659 | 578 | Gp18 Tail sheath protein | 149/436 (34%) |
| CPT_0058 | 163 | 196 | Gp19 Tail tube protein | 41/170 (24%) |
| CPT_0115 | 898 | 882 | Gp43 DNA polymerase | 244/901 (27%) |
| CPT_0125 | 475 | 445 | Gp41 DNA primase-helicase subunit | 120/453 (26%) |
| CPT_0148 | 587 | 472 | UvsW RNA-DNA and DNA-DNA helicase, ATPase | 124/407 (30%) |
| CPT_0174 | 272 | 456 | Gp15 Tail sheath stabilizer and completion protein | 54/214 (25%) |
| CPT_0177 | 301 | 306 | Gp32 Single-stranded DNA binding protein | 65/212 (30%) |
| CPT_0181 | 374 | 361 | RnlA RNA ligase 1 and tail fiber attachment catalyst | 80/297 (26%) |
| CPT_0193 | 212 | 236 | Gp21 Prohead core protein protease | 52/153 (33%) |
a Length refers to the number of amino-acid residues in the CDS and are calculated from the genomic sequence
b T4 gene product and functions are according to Miller [26]
c Identity and E values are from NCBI BLAST results, performed on 5th January 2009
Figure 2Repeat regions present in CP220. PCR amplification of repeat regions from CP220 showing the range of products produced from a single population. Lanes; M - size in kbp, 1 - size marker (1 kb DNA ladder, Invitrogen, Paisley, UK), 2 - repetitive region (RR) as currently annotated in the CP220 genome RR1, 3 - RR3/4, 4 - RR6, 5 - RR8, 6 - RR9, 7 - RRx, 8 - RR7.
Figure 3Virion proteins from CP220. SDS-PAGE analysis of CP220 phage virion proteins. M - molecular size markers. Numbers refer to protein sequences excised from the gel along with the equivalent genome CDS designation and description in T4 if known (the protein molecular masses correspond with the translation products of the nucleotide sequence, kDa); 1 - CPT_0030 - Gp20 portal vertex protein of head (66.3), 2 - CPT_0053 - Gp18 tail sheath (64.2), 3 - CPT_0034 - Gp18 tail sheath (59.2), 4 - CPT_0052 - Gp18 tail sheath (58.4), 5 - CPT_0103 - unknown (57.2), 6 - CPT_0051 - Gp23 major capsid (48.6), 7 - CPT_0186 - Gp48 base plate tail tube cap (34), 8 - CPT_0175 - Gp19 tail tube (27.4), 9 - CPT_0118 - unknown, neck protein (26.9), 10 - CPT_0045 - Gp19 tail tube (28.7), 11 - CPT_0046 - Gp4 head completion (18.6).
Putative CP220 CDSs showing homology to proteins in Campylobacter and closely related species Helicobacter, Arcobacter and Wolinella
| CP220 CDS | CP220 | Source Organism, Functionb |
|---|---|---|
| CPT_0017 | 264 | |
| CPT_0040 | 70 | |
| CPT_0066 | 328 | |
| CPT_0084 | 342 | |
| CPT_0095 | 144 | |
| CPT_0105 | 202 | |
| CPT_0123 | 143 | |
| CPT_0127 | 205 | |
| CPT_0137 | 306 | |
| CPT_0145 | 80 | |
| CPT_0158 | 182 | |
| CPT_0164 | 181 | |
| CPT_0165 | 212 | |
| CPT_0172 | 955 | |
| CPT_0191 | 171 | |
a Length refers to the number of amino-acid residues in the CDS and are calculated from the genomic sequence
b Gene product and putative function
c Identity and E values are from NCBI BLAST results, performed on 5th January 2009