| Literature DB >> 34950757 |
Harshi Weerakoon1,2,3, John J Miles4,5, Ailin Lepletier4,6, Michelle M Hill1,7.
Abstract
Regulatory T cells (Tregs) play a core role in maintaining immune tolerance, homeostasis, and host health. High-resolution analysis of the Treg proteome is required to identify enriched biological processes and pathways distinct to this important immune cell lineage. We present a comprehensive proteomic dataset of Tregs paired with conventional CD4+ (Conv CD4+) T cells in healthy individuals. Tregs and Conv CD4+ T cells were sorted to high purity using dual magnetic bead-based and flow cytometry-based methodologies. Proteins were trypsin-digested and analysed using label-free data-dependent acquisition mass spectrometry (DDA-MS) followed by label free quantitation (LFQ) proteomics analysis using MaxQuant software. Approximately 4,000 T cell proteins were identified with a 1% false discovery rate, of which approximately 2,800 proteins were consistently identified and quantified in all the samples. Finally, flow cytometry with a monoclonal antibody was used to validate the elevated abundance of the protein phosphatase CD148 in Tregs. This proteomic dataset serves as a reference point for future mechanistic and clinical T cell immunology and identifies receptors, processes, and pathways distinct to Tregs. Collectively, these data will lead to a better understanding of Treg immunophysiology and potentially reveal novel leads for therapeutics seeking Treg regulation.Entities:
Keywords: Conventional T cell; LC-MS/MS; Proteomics; Regulatory T cell; Tandem mass spectrometry
Year: 2021 PMID: 34950757 PMCID: PMC8671522 DOI: 10.1016/j.dib.2021.107687
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Study workflow. A. Workflow for sorting Tregs and Conventional (Conv) CD4+. PBMC obtained from three volunteers underwent two rounds of magnetic-activated cell sorting (MACS) followed by flow cytometry based cell sorting (FACS), yielding Tregs and Conv CD4+ at high purity. B. Proteomic sample preparation and mass spectrometry (LC-MS/MS) analysis. Peptide samples were prepared from whole cell lysate using protein co-precipitation with trypsin in methanol. The resulting tryptic peptides were desalted before LC-MS/MS analysis on Obitrap Fusion Tribrid inline coupled to nano ACQUITY UPLC. C. DDA-MS data were deconvoluted using the MaxQuant search engine against the UniProt/SwissProt human proteome. Differential expression analysis was performed on only high-quality label-free protein intensity data D. Orthogonal validation of CD148 (PTPRJ) enrichment in Treg cells.
Description of the data files deposited in ProteomeXchange data repository under the data identification number of PXD022095.
| Title of the file/folder | Description | |
|---|---|---|
| 1 | Rep1_Treg.raw | .raw file of Treg cells - Replicate 1 |
| 2 | Rep2_Treg.raw | .raw file of Treg cells - Replicate 2 |
| 3 | Rep3_Treg.raw | .raw file of Treg cells - Replicate 3 |
| 4 | Rep1_nonTreg.raw | .raw file of Conv CD4+ T cells - Replicate 1 |
| 5 | Rep2_nonTreg.raw | .raw file of Conv CD4+ T cells - Replicate 2 |
| 6 | Rep3_nonTreg.raw | .raw file of Conv CD4+ T cells - Replicate 3 |
| 7 | search.zip | MaxQuant ouput files resulted from the analysis of the above .raw files against UniProt/SwissProt human reviewed proteome |
| 8 | parameters.txt | Parameters used in the data analysis through MaxQuant search engine |
| 9 | human_proteome_reviewed_25102017.fasta | UniProt/SwissProt proteome database used in the analysis |
| 10 | Treg_nonTreg_protein_quantification.txt | MaxQuant ouput files giving the protein quantification data and LFQ normalised protein intensities |
Fig. 2Summary of DDA-MS proteomic dataset in human Tregs. Label-free DDA-MS data were analyzed using MaxQuant search engine against UniProt/SwissProt human reviewed proteome database. MS/MS spectral data determined at 1% FDR were selected for further analysis across Tregs (blue) and Conv CD4+ (orange). A. The number of MS/MS spectra detected in each sample. B. Mean MS/MS spectra per protein per sample. C. Total number of peptides and unique+razor peptides detected in each sample. D. Mean unique+razor peptides per protein per sample. E. Percentage of amino acid sequence coverage from the peptide dataset (unique+razor). F. Mean percentage of amino acid sequence coverage per peptide per sample (unique+razor). G. Total number of proteins quantified with single or multiple UniProt entries. Here, 92% of proteins had single entries (light grey), and 8% of proteins had multiple peptide entries (dark grey). H. The number of proteins quantified per sample. Error bars with standard deviation are shown.
Fig. 3Tregs and Conv CD4+ T cell exhibit divergent proteomes. A. Word clouds of biological processes and reactome pathways enriched in Treg versus Conv CD4+. B. Volcano plot showing differential protein abundance of proteins between Treg and Conv CD4+, using q < 0.05 and log2FC >1 or -1 as cut-off (dotted lines). When comparing Conv CD4+, proteins significantly upregulated in Treg (red) and downregulated (blue) are shown, along with CD49f (ITGA6) and CD148 (PTPRJ) which are shown in green. Gene lists of the top 10 most upregulated and top 10 most downregulated proteins are shown. C. Proteomic data quantifying CD148 levels in Treg and Conv CD4+. Pairwise comparison showed higher abundance of CD148 in Treg cells in all donors (q < 0.0001, multiple t-test with false discovery rate correction). D. Flow cytometry data for cell surface CD148 in Treg and Conv CD4+ T cells in six healthy donors denoted by different colors. Note some of the data points overlap (*p < 0.05, Mann Whitney U test). E. Representative histogram from one donor displaying the fluorescence intensity difference of CD148 between two T cell populations.
Chromatographic and mass spectrometry parameters used in sample analysis.
| Parameter | Description/Settings | |
|---|---|---|
| Mass Spectrometer | Orbitrap Fusion™ Tribrid™ (Thermo Fisher Scientific, USA) | |
| Total LC gradient | 175 minutes | |
| Buffers | A | 0.1% FA |
| B | 100% ACN + 0.1% FA | |
| LC gradient | 5% at 3 minutes, 9% at 10 minutes, 26% at 120 minutes, 40% at 145 minutes, 80% at 152 minutes, 80% at 157 minutes and 1% at 160 minutes | |
| Trap column | Symmetry C18 trap, 2G VM trap (Waters, USA), | |
| Column | BEH C18 (Waters, USA), 130Å, 1.7 µm particle size, 75 µm × 200 mm | |
| Flow rate | 0.3 µl/ minutes | |
| Ion source | EASY-Max NG™ ion source (Thermo Fisher Scientific, USA | |
| Ion spray voltage | 1900 V | |
| Heating temperature | 285°C | |
| Data acquisition method | DDA-MS | |
| MS1 mass range | 380 – 1500 m/z | |
| Injection time | 50 milliseconds | |
| Resolution | 120,000 FWHM | |
| Charge state | Rejecting +1, Selecting +2 to +7 | |
| No. of ions selected to trigger MS2 | 15 | |
| Injection time | 70 milliseconds | |
| Dynamic exclusion | 90 seconds | |
| Cycle time | 2 seconds | |
Parameters used in DDA-MS data analysis and label-free quantification using MaxQuant software and MaxLFQ.
| Parameter | Settings |
|---|---|
Digestive enzyme | Trypsin |
Maximum number of miscleavages | 2 |
Fixed modification | Carbamidomethylation (Cystiene) |
Variable modifications | Oxidation (Methionine), Acetylation (Protein N-terminal) |
Precursor mass tolerance | ± 20 ppm |
Product mass tolerance | ± 40 ppm |
Maximum peptide charge | +7 |
Match between runs | Yes (0.7 minutes match time window and 20 minutes alignment time window) |
Protein FDR | 0.01 |
Peptide spectral match (PSM) FDR | 0.01 |
Minimum peptides | 1 |
Peptide for protein quantification | Unique + Razor |
Peptide selection for quantification | Unmodified peptides and only the peptides modified with oxidation and N terminal acetylation |
No. of peptides for quantification | ≥ 2 |
| Subject | Immunology |
| Specific subject area | Regulatory T cells (Tregs) play a key role in directing adaptive immunity, particularly enforcing immune tolerance. However, few studies have examined the |
| Type of data | Tables and Figures. |
| How data were acquired | Orbitrap Fusion™ Tribrid™ (Thermo Fisher Scientific, USA) inline coupled to nano ACQUITY UPLC (Waters, USA) was used to acquire label-free proteomic data using data-dependent acquisition (DDA-MS). |
| Data format | Raw and analysed data. |
| Parameters for data collection | CD3+, CD4+, CD25high, CD127low, FOXP3+ and CD3+, CD4+, CD25−, FOXP3− cells were identified as Tregs and Conv CD4+, respectively. Peptides were separated using a 160 mins chromatographic gradient at 0.3 µl/min flow rate while ionised at 1900 V and 285 °C. MS1 ranged, and resolutions were 380 – 1500 m/z and 120,000 FWHM, respectively. Injection time for MS1 and MS2 were 50 ms and 70 ms, respectively. Fifteen ions were selected to trigger MS2 at 90 s dynamic exclusion. The total cycle time was two seconds. |
| Description of data collection | Label-free proteomics data were acquired on peptide samples prepared from Treg and Conv CD4+ from peripheral blood mononuclear cells (PBMC) from three healthy volunteers (age 30-35 years) at the QIMR Berghofer Medical Research Institute (QIMRB), Brisbane, Australia. |
| Data source location | Raw proteomic data are available via ProteomeXchange |
| Data accessibility | Repository name: ProteomeXchange |
| Related research article | H. Weerakoon, J. Straube, K. Lineburg, L. Cooper, S. Lane, C. Smith, S. Alabbas, J. Begun, J.J. Miles, M.M. Hill, A. Lepletier, Expression of CD49f defines subsets of human regulatory T cells with divergent transcriptional landscape and function that correlate with ulcerative colitis disease activity., Clin. Transl. Immunol. 10 (2021) e1334. |