| Literature DB >> 33786345 |
Mostafa J Khan1, Heather Desaire2, Oscar L Lopez3,4, M Ilyas Kamboh4,5,6, Renã A S Robinson1,7,8,9,10.
Abstract
Here we present a plasma proteomics dataset that was generated to understand the importance of self-reported race for biomarker discovery in Alzheimer's disease. This dataset is related to the article "Why inclusion matters for Alzheimer's disease biomarker discovery in plasma" [1]. Plasma samples were obtained from clinically diagnosed Alzheimer's disease and cognitively normal adults of African American/Black and non-Hispanic White racial and ethnic backgrounds. Plasma was immunodepleted, digested, and isobarically tagged with commercial reagents. Tagged peptides were fractionated using high pH fractionation and resulting fractions analysed by liquid chromatography - mass spectrometry (LC-MS/MS & MS3) analysis on an Orbitrap Fusion Lumos mass spectrometer. The resulting data was processed using Proteome Discoverer to produce a list of identified proteins with corresponding tandem mass tag (TMT) intensity information.Entities:
Keywords: African American; Alzheimer's disease; Biomarker; Black; Plasma; Proteomics; disparities
Year: 2021 PMID: 33786345 PMCID: PMC7988280 DOI: 10.1016/j.dib.2021.106923
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Plasma proteomics workflow for Set 1 (N=73) and Set 2 (N=40) samples. Samples were randomized into eight batches for Set 1 and four batches for Set 2 with one QC pool sample and representative samples from each group contained in each batch. The samples were randomly assigned TMT channels for both experiments. The experimental workflow was maintained except for the digestion step, where ammonium bicarbonate based in solution digestion was used for Set 1, while urea-based filter-assisted sample preparation digestion was employed in Set 2.
Fig. 2Correlation plot of average normalized TMT reporter ion intensities for all proteins between different batches for both Set 1 and Set 2. Both displayed positive correlation among all the batches. In case of Set 1, batch 3 and batch 6 showed the best correlation with an R2 value of 0.9972, while batch 2 and batch 7 showed the lowest co-relation with an R2 value of 0.9765. On average Set 1 had an R2 value of 0.99. On the other hand, for Set 2, batch 1 and batch 2 had the best correlation with an R2 value of 0.9973, batch 2 and batch 4 had the lowest correlation, with an R2 value of 0.9902. On average Set 1 had an R2 value of 0.9939.
Fig. 3a) Venn diagram of identified proteins common between Set 1 and Set 2. The pyramid plots highlight the distribution of proteins identified with a given range of peptide spectral matches (PSMs). b) Line plot of theoretical protein concentration's for plasma [2] against the experimentally obtained relative abundances for these proteins using normalized TMT reporter ion intensities of the top 100 most abundant proteins in plasma. The insert is a zoomed-in version of proteins in the lower concentration range of the line plot. The black line represents the theoretical protein concentrations, while the orange and blue line represent corresponding TMT reporter ion abundances for Set 1 and Set 2 proteins, respectively. Abbreviation: CP- Ceruloplasmin, C3- Complement C3, APOA1-Apolipoprotein A1, APOA2- Apolipoprotein A2, FGA- Fibrinogen alpha chain, AHSG- Alpha-2-HS-glycoprotein, FGB- Fibrinogen beta chain, C4A- Complement C4-A, APOB- Apolipoprotein B, SAA4- Serum amyloid A4, FGG- Fibrinogen gamma chain, APOE- Apolipoprotein E, SERPINA7- Thyroxine-binding globulin, F2- Prothrombin, CLU- Clusterin, PON1- Serum paraoxonase/arylesterase 1, SAA1- Serum amyloid A1, APOL1- Apolipoprotein L1, HBB- Hemoglobin subunit beta, C1R- Complement C1r subcomponent, C1S- Complement C1s subcomponent, CFI- Complement factor I, HBD- Hemoglobin subunit delta.
| Subject | Chemistry, Biology, Neuroscience |
| Specific subject area | Alzheimer's disease, quantitative proteomics, health disparities |
| Type of data | Table |
| How data were acquired | Liquid chromatography separation coupled to high-resolution mass spectrometer (LC-MS/MS), MS3 quantification using Tandem Mass Tagging (TMT) strategies. LC parameters: Nano UHPLC (Thermo Scientific) coupled with autosampler, 105 min gradient, self-packed trap column (100 µM ID × 2.5 cm) and analytical column (100 µM ID × 25 cm). MS parameters: Orbitrap Fusion Lumos mass spectrometer (Thermo Scientific), data dependent acquisition (top speed precursor selection), synchronous precursor selection (top 10) for MS3 quantification. |
| Data format | Raw mass spectrometry files, PD read out files |
| Parameters for data collection | Plasma samples obtained from African American probable Alzheimer's disease (AD) (N=30) and cognitively normal (CN) (N=26) individuals, non-Hispanic White probable Alzheimer's disease (N=29) and cognitively normal (N=28) individuals. The patients had an average age of 71.8 (CN) and 75.4 (AD), average MMSE score of 27 (CN) and 15 (AD). No significant differences among the groups in terms of age, sex and presence of comorbidities. |
| Description of data collection | Samples were randomly divided into two experimental sets (Set 1 N=73 sample and Set 2 N=40 samples). Further, for Set 1, samples were divided into 8 batches, whereas there were 4 batches for Set 2 samples, with representation of all four groups in all batches. Plasma samples were immunodepleted of the top 6 most abundant proteins and digested using trypsin-LysC mix. Resulting peptides were labelled using TMT 10/11-plex tags. Labelled peptides were further fractionated using high pH fractionation. Finally, each fraction was randomly injected in duplicates into the mass spectrometer and subject to LC-MS/MS, MS3 analysis. |
| Data source location | Vanderbilt University |
| Data accessibility | Proteomics Identification Database (PRIDE) |
| Related research article | M.J. Khan, H. Desaire, O.L. Lopez, M.I. Kamboh, R.A.S. Robinson, Why inclusion matters for Alzheimer's disease biomarker discovery in plasma Journal of Alzheimer's Disease (2021) Jan 5. doi: |