| Literature DB >> 20865039 |
Carsten Friis1, Trudy M Wassenaar, Muhammad A Javed, Lars Snipen, Karin Lagesen, Peter F Hallin, Diane G Newell, Monique Toszeghy, Anne Ridley, Georgina Manning, David W Ussery.
Abstract
Campylobacter jejuni strain M1 (laboratory designation 99/308) is a rarely documented case of direct transmission of C. jejuni from chicken to a person, resulting in enteritis. We have sequenced the genome of C. jejuni strain M1, and compared this to 12 other C. jejuni sequenced genomes currently publicly available. Compared to these, M1 is closest to strain 81116. Based on the 13 genome sequences, we have identified the C. jejuni pan-genome, as well as the core genome, the auxiliary genes, and genes unique between strains M1 and 81116. The pan-genome contains 2,427 gene families, whilst the core genome comprised 1,295 gene families, or about two-thirds of the gene content of the average of the sequenced C. jejuni genomes. Various comparison and visualization tools were applied to the 13 C. jejuni genome sequences, including a species pan- and core genome plot, a BLAST Matrix and a BLAST Atlas. Trees based on 16S rRNA sequences and on the total gene families in each genome are presented. The findings are discussed in the background of the proven virulence potential of M1.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20865039 PMCID: PMC2928727 DOI: 10.1371/journal.pone.0012253
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1AFLP on strains isolated from abattoir.
Isolates were from flock 1 (designated F1) and flock 2 (F2) and a selection of caecal isolates and crate swabs is shown. Two AFLP banding patterns were recognized of which the lower one is identical to that of the human isolate M1.
Figure 2Genome Atlas of C. jejuni M1.
The Atlas shows various nucleotide and structural properties as well as the genes of the M1 genome in relation to chromosomal position. The intrinsic curvature, stacking energy and position preference (a measure of helix flexibility, where green color denotes flexible sequences) are calculated from simple weight matrix models previously published [35], while global direct and inverted repeats are derived from BLASTN alignments [65]. The second inner-most circle displays the GC-skew, visible as the bias of G's towards the replication leading strand. Genes discussed in the text are marked on the atlas. A zoomable version is available online in the supplementary section.
Figure 3Pan-genome plots of Campylobacter.
The columns give the number of new gene families introduced with the addition of each new genome to the consideration. The curves shows the evolution of the pan genome as an accumulated sum of gene families (blue line) and the core genome (red line), as described in the Methods. The lower-bound estimates of the sizes of the C. jejuni core and pan-genomes are indicated by the dotted lines.
Figure 4Gene Ontology terms associated with core and auxiliary genes.
The figure shows the percentage of the total core and auxiliary genomes for strain M1 which could be associated with a given role based on Gene Ontology terms. The Gene Ontology terms used to connect each protein to a functional category is given in parenthesis below each category. The total fraction of each set which could not be assigned is also presented.
Figure 5Blast Matrix of C. jejuni.
The figure presents the absolute number of gene families preserved between any two species along with the total number of families between them. The relative percentage between these numbers is also given and is used as a basis for the color intensity. The orange squares deserve special mention since these are based on alignments of the organism against itself, and thus show the internal homology within each organism's proteome. The highest homology was found between genomes M1 and 81116, highlighted by a box.
Figure 616S rRNA and Pan-genomic tree of the Campylobacter genus.
The left dendrogram displays the distance between the genomes based on the sequence of the 16S rRNA gene. The right dendrogram is based on presence or absence of the gene families in the pan-genome. The relative manhattan distance indicates the proportion of the pan-genome where genomes differ in present/absent status. Bootstrap values are given as percentages to indicate the stability of the branching.
Genes identified in a comparison between strains 81116 and M1 as unique relative to the comparator.
| Number of Proteins | ||
| BLASTP %-identity | M1 proteome (1624 proteins) vs. 81116 proteome | 81116 proteome (1623 proteins) vs. M1 proteome |
| 100% | 1166 | 1194 |
| ≥90% | 392 | 322 |
| ≥80% | 23 | 19 |
| ≥70% | 7 | 13 |
| ≥60% | 6 | 8 |
| ≥50% | 2 | 11 |
| ≥40% | 0 | 4 |
| ≥30% | 4 | 5 |
| ≥20% | 5 | 11 |
| <20% | 19 | 36 |
| Proteins with <20% identity in the other genome | ||
|
|
| |
*Gene names given in bold refer to the sequences having matches to verified proteins in UniProtKB/Swiss-Prot, while italics indicate a match to the NCBI nr database.
Figure 7Blast Atlas of C. jejuni M1.
The proteome of the M1 strain was aligned against the proteomes of 20 other Campylobacter genomes using BLASTP and the results are displayed as colored circles with increasing color intensity signifying increased similarity. Only BLAST results of proteins are shown. The three rRNA islands are marked as well as the lipooligosaccharide (LOS) and extracellular polysaccharide biosynthesis (PBS) clusters.
Campylobacter Genomes included in this study.
| Species Name | Project ID | Contigs | Accession | Reference |
|
| 38041 | 1 | CP001900 | This Study |
|
| 16229 | 10 | AANK01000000 | - |
|
| 17953 | 1 | CP000814.1 |
|
|
| 16135 | 3 | CP000538.1, CP000549.1, CP000550.1 |
|
|
| 17341 | 1 | NZ_AASL00000000 |
|
|
| 16367 | 5 | AANT02000000 | - |
|
| 16265 | 14 | AANJ01000001 |
|
|
| 21037 | 20 | ABGQ01000000 |
|
|
| 17055 | 19 | AASY01000000 |
|
|
| 17163 | 1 | CP000768.1 | - |
|
| 16267 | 35 | AANQ0100000 | - |
|
| 8 | 1 | AL111168.1 |
|
|
| 303 | 1 | CP000025.1 |
|
|
| 12516 | 38 | AAFL01000000 |
|
|
| 17159 | 3 | CP000792.1, CP000793.1, CP000794.1 | - |
|
| 17161 | 1 | CP000767.1 | - |
|
| 16293 | 1 | CP000487.1 | - |
|
| 20083 | 2 | CP000775.1, CP000776.1 | - |
|
| 12517 | 2 | CP000932.1, CP000933.1 |
|
|
| 31017 | 89 | ACFU01000000 | - |
|
| 12518 | 20 | AAFJ01000000 |
|
If the genome project listed any plasmids, these were included in the number of contigs and their accession numbers are included.
*For genomes represented by multiple contigs the published sequence may still be incomplete.