| Literature DB >> 33230158 |
Harshi Weerakoon1,2,3, Jeremy Potriquet1,4, Alok K Shah1,5, Sarah Reed6, Buddhika Jayakody6, Charu Kapil7, Mukul K Midha7, Robert L Moritz7, Ailin Lepletier1,8, Jason Mulvenna1, John J Miles9,10,11, Michelle M Hill12,13.
Abstract
Data independent analysis (DIA) exemplified by sequential window acquisition of all theoretical mass spectra (SWATH-MS) provides robust quantitative proteomics data, but the lack of a public primary human T-cell spectral library is a current resource gap. Here, we report the generation of a high-quality spectral library containing data for 4,833 distinct proteins from human T-cells across genetically unrelated donors, covering ~24% proteins of the UniProt/SwissProt reviewed human proteome. SWATH-MS analysis of 18 primary T-cell samples using the new human T-cell spectral library reliably identified and quantified 2,850 proteins at 1% false discovery rate (FDR). In comparison, the larger Pan-human spectral library identified and quantified 2,794 T-cell proteins in the same dataset. As the libraries identified an overlapping set of proteins, combining the two libraries resulted in quantification of 4,078 human T-cell proteins. Collectively, this large data archive will be a useful public resource for human T-cell proteomic studies. The human T-cell library is available at SWATHAtlas and the data are available via ProteomeXchange (PXD019446 and PXD019542) and PeptideAtlas (PASS01587).Entities:
Mesh:
Substances:
Year: 2020 PMID: 33230158 PMCID: PMC7683684 DOI: 10.1038/s41597-020-00744-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Primary human T-cell spectral library generation workflow. Main steps used in (a) isolation of human peripheral blood mononuclear cells (PBMCs), purification of ex vivo CD3+ T-cells and in vitro activation (three ex vivo CD3+ T-cell samples and one in vitro activated CD3+ T-cell sample were obtained from PBMCs extracted from four different buffy coats). (b) Proteomic sample preparation, peptide fractionation and LC-MS/MS data acquisition. (c) Generation of individual spectral libraries for separate DDA-MS dataset. (d) Generation of a combined assay library compatible for multiple SWATH-MS data analysis platforms and quality control of assay libraries.
Fig. 2Statistics of the primary human T-cell spectral library. (a) Statistics of individual spectral libraries used in the generation of human T-cell spectral library. (b) Protein, peptide and peptide spectral matches (PSMs) of individual libraries as function of protein FDR. (c) Venn diagram depicting the protein overlaps between different spectral libraries. (d) Frequency distribution of peptides of different lengths in human T-cell spectral library. (e) Frequency distribution of proteins with different peptide numbers in human T-cell spectral library. (f) First 30 significant immune related canonical pathways represented by the T-cell spectral library (Core analysis, Ingenuity pathway analysis, Qiagen Bioinformatics, USA) (g) Protein coverage of the human T-cell spectral library for T-cell receptor (TCR) signaling pathways. Proteins highlighted in pink are included in human T-cell spectral library. (SpLib_1 - in vitro activated human T-cell spectral library, SpLib_2, 3 and 4 - ex vivo human T-cell spectral libraries).
Fig. 3SWATH-MS data analysis against human T-cell and Pan human libraries and comparison with DDA-MS data. (a) Experimental protocol used to obtain human CD4+ T-cells at different in vitro activated status. Samples were obtained from three different donors. (b) Number of peptides and proteins detected by the analysis of CD4+ T-cells by SWATH-MS against human T-cell and pan-human spectral libraries and DDA-MS. (c) Proportional Venn diagram depicting the overlap between quantified SWATH-MS proteins against the two spectral libraries. (d) Proportional Venn diagram depicting the overlap between all the quantified SWATH-MS proteins and DDA-MS proteins. (e) Box plots showing the missing peptide intensity values in SWATH-MS and DDA-MS data (central lines and boxes represent means and 95% confidence intervals respectively while whiskers are 2.5 to 97.5 percentiles). (f) Distribution of Pearson correlation coefficient between protein expression data obtained by SWATH-MS and DDA-MS for human CD4+ T-cell time series experiment.
| Measurement(s) | Proteome • protein expression data |
| Technology Type(s) | Proteomic Profiling • nanoflow liquid chromatography-tandem mass spectrometry • SWATH MS protein profiling assay • Data-Dependent Acquisition • mass spectrometry |
| Factor Type(s) | time following T-cell activation |
| Sample Characteristic - Organism | Homo sapiens |