| Literature DB >> 29293692 |
Clifford G Clark1, Chih-Yu Chen2,3, Chrystal Berry1, Matthew Walker1, Stuart J McCorrister2, Patrick M Chong2, Garrett R Westmacott2.
Abstract
Whole genome sequencing (WGS) has been used to assess the phylogenetic relationships, virulence and metabolic differences, and the relationship between gene carriage and host or niche differentiation among populations of C. jejuni isolates. We previously characterized the presence and expression of CJIE4 prophage proteins in four C. jejuni isolates using WGS and comparative proteomics analysis, but the isolates were not assessed further. In this study we compare the closed, finished genome sequences of these isolates to the total proteome. Genomes of the four isolates differ in phage content and location, plasmid content, capsular polysaccharide biosynthesis loci, a type VI secretion system, orientation of the ~92 kb invertible element, and allelic differences. Proteins with 99% sequence identity can be differentiated using isobaric tags for relative and absolute quantification (iTRAQ) comparative proteomic methods. GO enrichment analysis and the type of artefacts produced in comparative proteomic analysis depend on whether proteins are encoded in only one isolate or common to all isolates, whether different isolates have different alleles of the proteins analyzed, whether conserved and variable regions are both present in the protein group analyzed, and on how the analysis is done. Several proteins encoded by genes with very high levels of sequence identity in all four isolates exhibited preferentially higher protein expression in only one of the four isolates, suggesting differential regulation among the isolates. It is possible to analyze comparative protein expression in more distantly related isolates in the context of WGS data, though the results are more complex to interpret than when isolates are clonal or very closely related. Comparative proteomic analysis produced log2 fold expression data suggestive of regulatory differences among isolates, indicating that it may be useful as a hypothesis generation exercise to identify regulated proteins and regulatory pathways for more detailed analysis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29293692 PMCID: PMC5749857 DOI: 10.1371/journal.pone.0190836
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary data for the 4 complete sequenced isolates and genomes.
| Characteristic | Isolate | |||
|---|---|---|---|---|
| 00–0949 | 01–1512 | 00–6200 | 00–1597 | |
| HS (Penner) serotype | HS:2 | HS:2 | HS:4,13 | HS:9,37 |
| MLST sequence type (ST) | 8 | 8 | 806 | 930 |
| MLST clonal complex (CC) | ST-21 | ST-21 | ST-21 | NA |
| Genes | 1,974 | 1,971 | 1,755 | 1,817 |
| CDS | 1,884 | 1,885 | 1,673 | 1,721 |
| Pseudo Genes | 36 | 32 | 28 | 42 |
| CRISPR arrays | 1 | 1 | 1 | 1 |
| rRNAs (5S, 16S, 23S) | 3, 3, 3 | 3, 3, 3 | 3, 3, 3 | 3, 3, 3 |
| Complete rRNAs (5S, 16S, 23S) | 3, 3, 3 | 3, 3, 3 | 3, 3, 3 | 3, 3, 3 |
| tRNAs | 44 | 44 | 44 | 44 |
| ncRNA | 1 | 1 | 1 | 1 |
| Frameshifted genes | 25 | 22 | 26 | 33 |
| Frameshifted genes on monomer runs | 3 | 3 | 4 | 11 |
| Frameshifted genes not on monomer runs | 3 | 4 | 3 | 2 |
| Plasmids | 2 | 2 | 0 | 0 |
| Size of invertible region (bp) | 88,465 | 88,465 | 89,946 | 96,109 |
GenBank accession numbers for the isolate data in this table: 00–0949, NZ_CP010301; 00–1512, NZ_CP010072; 00–6200, NZ_CP010307, 00–1597, NZ_CP010306.
NA, not applicable, meaning the ST was not assigned to a clonal complex.
Fig 1Alignment showing the genome organization of the four C. jejuni isolates used in this study.
The figure was prepared using Progressive Mauve (Mauve 20150226 build 10 [44]).
Fig 2Comparison of C. jejuni isolate pangenome comparison.
The BLAST atlas was obtained using GView Server [43] and further annotated using Adobe Illustrator.
Distribution in genomes and expression of unique chromosomal genes of the 4 isolates.
| Hypervariable region, genomic element, or prophage | 00–0949 + 00–1512 (HS:2) | 00–6200 (HS:4) | 00–1597 (HS:9) | |||
|---|---|---|---|---|---|---|
| # unique genes | # proteins detected | # unique genes | # proteins detected | # unique genes | # proteins detected | |
| CJIE1 + CJIE1 variant prophages | 76 | 13 | 0 | NA | 0 | NA |
| CJIE4 prophage | 0 | NA | 2 | 1 | 0 | NA |
| LOS biosynthesis gene cluster | 5 | 3 | 0 | NA | 1 | 1 |
| flagellar glycosylation gene cluster | 1 | 1 | 1 | 1 | 3 | 1 |
| capsule biosynthesis gene cluster | 15 | 15 | 11 | 8 | 20 | 18 |
| T6SS island, not T6SS gene cluster | 0 | 0 | 0 | 0 | 32 | 6 |
| T6SS gene cluster | 0 | 0 | 0 | 0 | 13 | 9 |
| putative invertible element | 0 | 0 | 0 | 0 | 6 | 1 |
| not in a defined genomic element | 19 | 3 | 16 | 4 | 52 | 17 |
| Total | 115 | 34 | 30 | 14 | 127 | 53 |
NA, not applicable
Fig 3Detection of proteins from each of the four isolates.
A. Heat map comparing relative log2 fold change values of proteins for each isolate using 00–0949 as the reference. Only exclusive peptides were used for analysis in Scaffold. The scale bar shows the relationship of color to log2 fold change values. Labels to the right of each lane identify the isolate and experimental replicate associated with that lane. B. PCA comparing the first two components when only exclusive peptides were used for analysis in Scaffold. C. Heat map comparing relative log2 fold change values of proteins for each isolate using 00–0949 as the reference when non-exclusive peptides were included in the Scaffold analysis. D. PCA comparing the first two components when non-exclusive peptides were included in the Scaffold analysis.
Fig 4UpSet plots summarizing differential protein expression analysis for the four isolates.
The horizontal bar graph at the bottom left of each figure shows the total number of proteins with differences in log2 fold change expression for each pair of isolates. Joined black circles to the right of these bar graphs indicate that the same differentially expressed proteins were common to the pairwise isolate comparisons shown at left. The vertical bar graph at the top quantitates the number of proteins with similar log2 fold change expression differences in the isolate comparisons. A. Results obtained using only exclusive peptides in the Scaffold pair-wise differential expression analyses. B. Results obtained when non-exclusive peptides were included in the Scaffold pair-wise differential expression analyses. Note the difference in scale between the two bar graphs.
The most highly enriched and depleted GO categories in comparisons between isolates when analysis includes only exclusive peptides.
| Enrichment factor | |||||
|---|---|---|---|---|---|
| GO category enriched | 00–0949 | 01–1512 | 00–0949 | 01–1512 | 00–6200 |
| monosaccharide biosynthetic process | 4.398 | 4.584 | |||
| alcohol biosynthetic process | 3.998 | 4.167 | |||
| carbohydrate biosynthetic process | 3.776 | 3.704 | |||
| cellular carbohydrate biosynthetic process | 3.665 | 3.820 | |||
| alcohol metabolic process | 3.529 | ||||
| nucleotide-sugar metabolic process | 4.775 | ||||
| purine ribonucleotide binding | 0.067 | 0.059 | |||
| purine nucleotide binding | 0.066 | 0.059 | |||
| hydrolase activity | 0.065 | 0.058 | |||
| nucleotide binding | 0.057 | 0.050 | |||
| small molecule metabolic process | 0.050 | 0.045 | |||
| oxidoreductase activity, acting on the CH-NH2 group of donors | 5.252 | 5.264 | |||
| FMN binding | 5.252 | 5.264 | |||
| acid-amino acid ligase activity | 4.596 | 4.606 | |||
| transaminase activity | 4.377 | 4.387 | |||
| sequence-specific DNA binding | 4.377 | 4.387 | |||
| purine nucleotide binding | 0.045 | ||||
| hydrolase activity | 0.044 | ||||
| nucleotide binding | 0.038 | ||||
| binding | 0.016 | ||||
| catalytic activity | 0.013 | ||||
| biosynthetic process | 0.023 | ||||
| primary metabolic process | 0.018 | ||||
| cellular metabolic process | 0.016 | ||||
| cellular process | 0.015 | ||||
| metabolic process | 0.015 | ||||
| coenzyme metabolic process | 2.309 | ||||
| cofactor binding | 2.145 | ||||
| cofactor metabolic process | 2.124 | ||||
| cellular nitrogen compound biosynthetic process | 1.739 | ||||
| small molecule metabolic process | 1.556 | ||||
| RNA binding | 0.099 | ||||
| nucleoside-triphosphatase activity | 0.099 | ||||
| pyrophosphatase activity | 0.095 | ||||
| hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides | 0.094 | ||||
| hydrolase activity, acting on acid anhydrides | 0.094 | ||||
Results were ordered by enrichment factor and filtered by statistical significance (Benjamini-Hochberg FDR ≤0.05).
Values > 1 indicate enrichment in the first isolate in each comparison; values < 1 indicate enrichment in the second isolate in each comparison (depletion in the first isolate in each comparison).
Total enriched and depleted GO categories in comparisons between isolates when analysis includes non-exclusive peptides.
| Enrichment factor | |||||
|---|---|---|---|---|---|
| GO category enriched | 00–0949 | 01–1512 | 00–0949 | 01–1512 | 00–6200 |
| cellular process | 0.190 | 0.181 | 0.035 | 0.123 | |
| primary metabolic process | 0.157 | 0.149 | 0.043 | 0.153 | |
| cellular metabolic process | 0.140 | 0.134 | 0.038 | 0.137 | |
| metabolic process | 0.126 | 0.120 | 0.035 | 0.123 | |
| nitrogen compound metabolic process | 0.103 | 0.056 | 0.134 | ||
| catalytic activity | 0.125 | 0.150 | 0.214 | ||
| binding | 0.117 | 0.187 | 0.267 | ||
| nucleobase-containing compound metabolic process | 0.091 | ||||
| cellular macromolecule metabolic process | 0.079 | 0.104 | |||
| macromolecule metabolic process | 0.075 | 0.099 | |||
| cellular nitrogen compound metabolic process | 0.060 | 0.143 | |||
| cellular biosynthetic process | 0.059 | 0.141 | |||
| biosynthetic process | 0.054 | 0.129 | |||
| metabolic process | 0.035 | 0.123 | |||
Results were ordered by enrichment factor and filtered by statistical significance (Benjamini-Hochberg FDR ≤0.05).
Values > 1 indicate enrichment in the first isolate in each comparison; values < 1 indicate enrichment in the second isolate in each comparison (depletion in the first isolate in each comparison).