| Literature DB >> 33903600 |
Uxue Ulanga1, Matthew Russell2, Stefano Patassini3, Julie Brazzatti2, Ciaren Graham4,5, Anthony D Whetton2,3, Robert L J Graham6,7,8.
Abstract
Murine models are amongst the most widely used systems to study biology and pathology. Targeted quantitative proteomic analysis is a relatively new tool to interrogate such systems. Recently the need for relative quantification on hundreds to thousands of samples has driven the development of Data Independent Acquisition methods. One such technique is SWATH-MS, which in the main requires prior acquisition of mass spectra to generate an assay reference library. In stem cell research, it has been shown pluripotency can be induced starting with a fibroblast population. In so doing major changes in expressed proteins is inevitable. Here we have created a reference library to underpin such studies. This is inclusive of an extensively documented script to enable replication of library generation from the raw data. The documented script facilitates reuse of data and adaptation of the library to novel applications. The resulting library provides deep coverage of the mouse proteome. The library covers 29519 proteins (53% of the proteome) of which 7435 (13%) are supported by a proteotypic peptide.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33903600 PMCID: PMC8076245 DOI: 10.1038/s41597-021-00896-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Plots of the target and total protein count for each sub-library. (a) MEF, (b) Day six of MEF-iPSC transition, (c) IPSC and feeder cells (MEFs) mixed, (d) iPSC and (e) tissue data sets. In each case the vertical line shows the chosen FDR threshold.
Fig. 2Plots showing quality of libraries after filtering to Swissprot protein entries. (a) Histogram of mouse protein abundance from paxDB database with proteins identified by proteotypic peptides in the final library overlayed. The library covers almost all of the most abundant proteins in the mouse proteome. (b) The numbers of proteins identified by 1–4 and 5 or more proteotypic peptides. (c) The difference in iRT score between peptides identified in both cell culture samples and re-analysed tissue samples. (d) Distribution of similarity scores (cosine of matched signal intensity vectors) between cell culture and tissue derived data. (e) Venn diagram for numbers of proteins uniquely identified across all combination of sample sets.
Fig. 3Plots showing applicability of library to SWATH analysis. (a) A histogram of spectral similarity scores between library spectra and SWATH analysis transition relative intensity. (b) percentage deviation between libraries predicted and measured retention time. (c) Plot of log2 transformed sum of “top 3” protein quantification from a pair of injections plotted against each other. A linear model through the data is plotted over the data.
| Measurement(s) | database type spectral library • protein expression data |
| Technology Type(s) | liquid chromatography-electrospray ionisation time-of-flight mass spectrometry • SWATH MS protein profiling assay |
| Factor Type(s) | treatment • cell development stage |
| Sample Characteristic - Organism | Mus musculus |