| Literature DB >> 35194950 |
Yaoting Sun1,2,3, Lu Li1,2,3, Yan Zhou1,2,3, Weigang Ge4, He Wang4, Runxin Wu2,3,5, Wei Liu4, Hao Chen4, Qi Xiao1,2,3, Xue Cai1,2,3, Zhen Dong2,3, Fangfei Zhang2,3, Junhong Xiao6, Guangzhi Wang6, Yi He7, Jinlong Gao2,3, Oi Lian Kon8, Narayanan Gopalakrishna Iyer8,9, Haixia Guan10, Xiaodong Teng11, Yi Zhu2,3, Yongfu Zhao6, Tiannan Guo1,2,3.
Abstract
Thyroid nodules occur in about 60% of the population. A major challenge in thyroid nodule diagnosis is to distinguish between follicular adenoma (FA) and carcinoma (FTC). Here, we present a comprehensive thyroid spectral library covering five types of thyroid tissues. This library includes 121 960 peptides and 9941 protein groups. This spectral library can be used to quantify up to 7863 proteins from thyroid tissues, and can also be used to develop parallel reaction monitoring (PRM) assays for targeted protein quantification. Next, to stratify follicular thyroid tumours, we compared the proteomes of 24 FA and 22 FTC samples, and identified 204 differentially expressed proteins (DEPs). Our data suggest altered ferroptosis pathways in malignant follicular carcinoma. In all, 31 selected proteins effectively distinguished follicular tumours. Of those DEPs, nine proteins were further verified by PRM in an independent cohort of 18 FA and 19 FTC. Together, we present a comprehensive spectral library for DIA and targeted proteomics analysis of thyroid tissue specimens, and identified nine proteins that could potentially distinguish FA and FTC.Entities:
Keywords: data-independent acquisition; mass spectrometry; proteomics; spectral library; thyroid nodules
Mesh:
Year: 2022 PMID: 35194950 PMCID: PMC9019893 DOI: 10.1002/1878-0261.13198
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 7.449
Fig. 1Generation, validation and application of a comprehensive thyroid‐specific spectral library. Left panel, generation of the spectral library. Five types of thyroid tissues were collected and prepared for proteomic analysis using pressure cycling technology (PCT). Three pooled thyroid samples were fractionated using strong cation exchange (SCX) or high‐pH reversed‐phase chromatography. Each peptide fraction was analysed using data‐dependent acquisition (DDA) MS for spectral library generation using Spectronaut v14.6. Middle panel, validation of the spectral library. The established library was validated by four DIA data acquisition strategies. Right panel, application of spectral library for proteomic analysis of follicular thyroid adenoma and carcinoma in a multicentre study. N*, the pericancer tissues.
Statistics of the thyroid‐specific spectral library
| Library | |
|---|---|
| Transition groups | 925 330 |
| Peptide precursors | 157 548 |
| Peptides | 121 960 |
| Protein groups | 9941 |
| Proteotypic proteins | 9826 |
Fig. 2Characterization and statistics of the thyroid‐specific spectral library. (A) Distribution of peptide precursor m/z. (B) Counts of different precursor charge states. (C) Distribution of identified peptides lengths. (D) Modified peptides numbers and distribution of three modifications. (E) Numbers of proteotypic peptides for each protein and their corresponding ratios and counts. (F) Ion counts of each fragment type. (G) Venn diagrams of proteins and peptides in our thyroid‐specific spectral library, the PHL and DPHL libraries. (H) Proteins are annotated according to two classification systems, subcellular location (words in red) and function type (words in black). Each curve represents one protein, linking the protein function type with the corresponding subcellular location. (I) A total of 340 kinases (orange dots) belonging to seven families (highlighted by the different tree colours) are identified in our library.
Fig. 3Results from a technical validation of our thyroid‐specific spectral library searched by Spectronaut. Four datasets were acquired with single‐shot DIA (dataset 1), PulseDIA (dataset 2), pre‐fractionation DIA (dataset 3) and a combination of pre‐fractionation and PulseDIA (dataset 4). Identified (A) peptides and (B) proteins were obtained by searching against our thyroid‐specific spectral library. Coefficient of variation of (C) peptides and (D) proteins abundance in tumours (T_CV) and their pericancer tissues (N_CV).
Fig. 4Stratification of follicular thyroid carcinoma and adenoma based on prototype analysed by DIA and the established spectral library. (A) Venn diagram shows 98.3% overlap of protein identification between FA and FTC. (B) The t‐SNE plot shows FA and FTC cannot be separated based on the expression of 7777 proteins. (C) Volcano plot highlights differentially expressed proteins with the threshold of |log2(fold change)| ≥ 1 and P value < 0.05. Red and orange points indicate upregulated proteins, while the light and dark blue points denote the downregulated proteins. Those six annotated points in light blue or orange are proteins with adjusted P values < 0.05. The red and dark blue points are overlapped with favourable/unfavourable prognosis genes of thyroid cancer acquired from TCGA transcriptomics data. (D) Protein expression plots for top six DEPs substantially dysregulated in FA and FTC. (E) Pathway enrichment analysis for 204 DEPs. Z‐score represents the degree of activation of the enriched pathway. (F) Unsupervised hierarchical clustering heatmap of Pearson correlation coefficients between every two samples on 31 proteins selected by geNetClassifier.
Fig. 5PRM‐MS analysis for nine selected proteins. (A) A representative peak group chromatography of a peptide precursor (left) and a box plot (right) showing the protein abundance in FA and FTC. Statistical significance was calculated by two‐tailed Student’s t‐test. (B) Heatmap showing the z‐score scaled expression of nine proteins in each sample, and the average expression of nine proteins in 18 FA and 19 FTC.
Nine differentially expressed proteins confirmed by PRM
| Uniprot ID | Symbol | Entrez Gene Name | Subcellular location | log2(Fold change, FTC/FA) | Adjusted |
|---|---|---|---|---|---|
|
| PTPRE | Protein tyrosine phosphatase receptor type E | Plasma Membrane | 1.2910 | 1.18E‐05 |
|
| LRP4 | LDL‐receptor‐related protein 4 | Plasma Membrane | 1.2679 | 2.44E‐04 |
|
| KPNA2 | Karyopherin subunit alpha 2 | Nucleus | 1.0668 | 2.27E‐04 |
|
| FRMD3 | FERM domain containing 3 | Plasma Membrane | 1.2227 | 3.65E‐04 |
|
| SMOC2 | SPARC‐related modular calcium binding 2 | Extracellular Space | 0.8629 | 1.01E‐02 |
|
| GPD1 | Glycerol‐3‐phosphate dehydrogenase 1 | Cytoplasm | 1.1141 | 1.85E‐02 |
|
| GMIP | GEM interacting protein | Cytoplasm | 1.0664 | 2.63E‐02 |
|
| CA4 | Carbonic anhydrase 4 | Plasma Membrane | 0.9398 | 2.69E‐02 |
|
| ITIH5 | Inter‐alpha‐trypsin inhibitor heavy chain 5 | Extracellular Space | 0.9516 | 2.69E‐02 |