| Literature DB >> 32795611 |
Tiansheng Zhu1, Yi Zhu2, Yue Xuan3, Huanhuan Gao4, Xue Cai4, Sander R Piersma5, Thang V Pham5, Tim Schelfhorst5, Richard R G D Haas5, Irene V Bijnsdorp6, Rui Sun4, Liang Yue4, Guan Ruan4, Qiushi Zhang4, Mo Hu7, Yue Zhou7, Winan J Van Houdt8, Tessa Y S Le Large9, Jacqueline Cloos10, Anna Wojtuszkiewicz10, Danijela Koppers-Lalic11, Franziska Böttger12, Chantal Scheepbouwer13, Ruud H Brakenhoff14, Geert J L H van Leenders15, Jan N M Ijzermans16, John W M Martens17, Renske D M Steenbergen18, Nicole C Grieken18, Sathiyamoorthy Selvarajan19, Sangeeta Mantoo19, Sze S Lee20, Serene J Y Yeow20, Syed M F Alkaff19, Nan Xiang4, Yaoting Sun4, Xiao Yi4, Shaozheng Dai21, Wei Liu4, Tian Lu4, Zhicheng Wu1, Xiao Liang4, Man Wang22, Yingkuan Shao23, Xi Zheng23, Kailun Xu23, Qin Yang24, Yifan Meng24, Cong Lu25, Jiang Zhu25, Jin'e Zheng25, Bo Wang26, Sai Lou27, Yibei Dai28, Chao Xu29, Chenhuan Yu30, Huazhong Ying30, Tony K Lim19, Jianmin Wu22, Xiaofei Gao31, Zhongzhi Luan21, Xiaodong Teng26, Peng Wu24, Shi'ang Huang25, Zhihua Tao28, Narayanan G Iyer20, Shuigeng Zhou32, Wenguang Shao33, Henry Lam34, Ding Ma24, Jiafu Ji22, Oi L Kon20, Shu Zheng23, Ruedi Aebersold35, Connie R Jimenez5, Tiannan Guo36.
Abstract
To address the increasing need for detecting and validating protein biomarkers in clinical specimens, mass spectrometry (MS)-based targeted proteomic techniques, including the selected reaction monitoring (SRM), parallel reaction monitoring (PRM), and massively parallel data-independent acquisition (DIA), have been developed. For optimal performance, they require the fragment ion spectra of targeted peptides as prior knowledge. In this report, we describe a MS pipeline and spectral resource to support targeted proteomics studies for human tissue samples. To build the spectral resource, we integrated common open-source MS computational tools to assemble a freely accessible computational workflow based on Docker. We then applied the workflow to generate DPHL, a comprehensive DIA pan-human library, from 1096 data-dependent acquisition (DDA) MS raw files for 16 types of cancer samples. This extensive spectral resource was then applied to a proteomic study of 17 prostate cancer (PCa) patients. Thereafter, PRM validation was applied to a larger study of 57 PCa patients and the differential expression of three proteins in prostate tumor was validated. As a second application, the DPHL spectral resource was applied to a study consisting of plasma samples from 19 diffuse large B cell lymphoma (DLBCL) patients and 18 healthy control subjects. Differentially expressed proteins between DLBCL patients and healthy control subjects were detected by DIA-MS and confirmed by PRM. These data demonstrate that the DPHL supports DIA and PRM MS pipelines for robust protein biomarker discovery. DPHL is freely accessible at https://www.iprox.org/page/project.html?id=IPX0001400000.Entities:
Keywords: Data-independent acquisition; Diffuse large B cell lymphoma; Parallel reaction monitoring; Prostate cancer; Spectral library
Mesh:
Substances:
Year: 2020 PMID: 32795611 PMCID: PMC7646093 DOI: 10.1016/j.gpb.2019.11.008
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1Workflow for building DPHL
A. Schematic representation of DDA shotgun proteomics data acquisition. Numbers in parentheses indicate the number of DDA files per tissue type. B. Protein identification and iRT detection from DDA raw files using pFind. C. SiRT detection and calibration. D. CiRT detection and calibration. E. Generation of DPHL. Details of the commands are presented in File S18. DDA, data-dependent acquisition; DIA, data-independent acquisition; iRT, indexed retention time; PCT, pressure cycling technology; SCX, strong cation-exchange; SiRT, synthetic iRT; CiRT, common internal iRT; DPHL, DIA pan-human library.
Figure 2Comparison of DPHL and PHL
A. Venn diagram showing the comparison of transition groups (i.e., peptide precursors), peptides, protein groups, and proteins in DPHL and PHL. B. Visualization of tissue intersections using R package UpSet. C. Bar plots displaying the number of transition groups, peptides, protein groups, proteins in DPHL library for each sample type. PHL, pan-human spectral library.
Figure 3PCa proteome using 60-min gradient DIA
A. Number of protein groups and peptide precursors identified using SiRT and CiRT. B. Technical reproducibility of proteome matrix using CiRT and SiRT. C. Comparison of protein quantification based on MS intensity using the SiRT and CiRT methods. D. 2D plane t-SNE plot of disease classes, color coded by sample type using CiRT and SiRT. E. Boxplots showing the expression (MS intensity) of the significantly dysregulated proteins; P values adjusted with Benjamini & Hochberg are shown under each protein name. ROC curves of the proteins were also shown. R1, technical replicate 1; R2, technical replicate 2; PCa, prostate cancer; BPH, benign prostate hyperplasia; t-SNE, t-distributed stochastic neighbor embedding; FASN, fatty acid synthetase, UniProtKB: P49327; TPP1, tripeptidyl-peptidase 1, UniProtKB: O14773; SPON2, spondin-2, UniProtKB: Q9BUD6.
Figure 4DIA analysis of plasma samples from DLBCL patients and HC subjects
A. Technical reproducibility for protein quantification of four plasma samples from two DLBCL patients and two healthy control subjects. B. 2D plane t-SNE plot showing that proteomes are separated. C. Volcano plot showing significantly down-regulated (blue) and up-regulated (red) proteins in 37 plasma samples (19 samples from DLBCL patients and 18 samples from HC subjects). D. The relative protein expression as calculated by MS intensity for CRP and SAA1. P values adjusted with Benjamini & Hochberg are shown under each protein name Left: Boxplot and ROC curve of CRP. Right: Boxplot and ROC curve of SAA1. DLBCL, diffuse large B cell lymphoma; HC, healthy control; CRP, C-reactive protein, UniProtKB: P02741; SAA1, serum amyloid A1, UniProtKB: P0DJI8.
Figure 5PRM validation of TPP1, FASN, and SPON2 across 73 peptide samples from 53 PCa patients
Two best flying peptides were selected for each protein. For each peptide, boxplot shows the relative abundance of the peptide across 73 PRM runs as calculated from MS intensity (on the left), and XIC demonstrates a representative peak group of the peptide (on the right). P values are computed using Student’s t test. PRM, parallel reaction monitoring; XIC, extracted ion chromatogram.