| Literature DB >> 35963929 |
Matthew Russell1, Andrew Currin1, William Rowe1, Guo-Qiang Chen1,2, Perdita Barran3, Nigel S Scrutton4.
Abstract
Despite its greener credentials, biomanufacturing remains financially uncompetitive compared with the higher carbon emitting, hydrocarbon-based chemical industry. Replacing traditional chassis such as E. coli with novel robust organisms, are a route to cost reduction for biomanufacturing. Extremophile bacteria such as the halophilic Halomonas bluephagenesis TD01 exemplify this potential by thriving in environments inherently inimical to other organisms, so reducing sterilisation costs. Novel chassis are inevitably less well annotated than established organisms. Rapid characterisation along with community data sharing will facilitate adoption of such organisms for biomanufacturing. The data record comprises a newly sequenced genome for the organism and evidence via LC-MS based proteomics for expression of 1160 proteins (30% of the proteome) including baseline quantification of 1063 proteins (27% of the proteome), and a spectral library enabling re-use for targeted LC-MS proteomics assays. Protein data are annotated with KEGG Orthology, enabling rapid matching of quantitative data to pathways of interest to biomanufacturing.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35963929 PMCID: PMC9376085 DOI: 10.1038/s41597-022-01610-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Schematic of data acquisition and processing pipeline. Left: Acquisition of genome; Middle: Acquisition of mass spectrometry data; Right: Data processing pipeline.
Table of iRT standard peptides.
| Sequence | Stock concentration/pmol µL−1 | Amount on column/pmol | Used for Quantification |
|---|---|---|---|
| LFLQFGAQGSPFLK | 1.50 | 0.15 | |
| YILAGVENSK | 1.62 | 0.16 | X |
| TPVISGGPYEYR | 1.63 | 0.16 | X |
| LGGNEQVTR | 1.67 | 0.17 | X |
| VEATFGVDESNAK | 1.67 | 0.17 | X |
| GAGSSEPVTGLDAK | 1.68 | 0.17 | X |
| TPVITGAPYEYR | 2.90 | 0.29 | |
| GTFIIDPGGVIR | 3.20 | 0.32 | |
| DGLDAASYYAPVR | 3.89 | 0.39 | |
| ADVTPADFSEWSK | 5.00 | 0.5 | |
| GTFIIDPAAVIR | 15.00 | 1.5 |
Fig. 2Quality metrics for protein identification and spectral library. (a) Plot of target and total (target and decoy) protein identifications at given false discovery rate (FDR), 2% cut-off is indicated. (b) Depth of coverage of E. coli quantified in the paxDB database. (c) Distribution of disparity in iRT index for to E. coli peptides in this analysis with that published by Moritz et al. (d) Distribution of spectrum similarity scores for E. coli between this analysis and Moritz et al. (e and f) Number of distinct peptides identified per protein identified by only, or at least one, prototypic peptide for (e) H. bluephagenesis TD01 and (f) E. coli.
Fig. 3Comparison of protein quantification estimates against published data with generalised linear model overlayed (a) Control E. coli data plotted against PAXdb data. (b) H. bluephagenesis protein quantification plotted against quantification of most homologous E. coli protein as determined by Blast algorithm from paxDB data.
| Measurement(s) | Genome • Proteomic Profile |
| Technology Type(s) | PacBio Sequel System • nanoflow liquid chromatography-electrospray ionisation mass spectrometry |
| Sample Characteristic - Organism | Halomonas sp. TD01 |