Literature DB >> 29085871

Proteomics dataset: The colon mucosa from inflammatory bowel disease patients, gastrointestinal asymptomic rheumatoid arthritis patients, and controls.

Tue Bjerg Bennike1, Thomas Gelsing Carlsen2, Torkell Ellingsen3, Ole Kristian Bonderup4,5, Henning Glerup4, Martin Bøgsted6,7, Gunna Christiansen8, Svend Birkelund1, Vibeke Andersen9,10, Allan Stensballe1.   

Abstract

The datasets presented in this article are related to the research articles entitled "Neutrophil Extracellular Traps in Ulcerative Colitis: A Proteome Analysis of Intestinal Biopsies" (Bennike et al., 2015 [1]), and "Proteome Analysis of Rheumatoid Arthritis Gut Mucosa" (Bennike et al., 2017 [2]). The colon mucosa represents the main interacting surface of the gut microbiota and the immune system. Studies have found an altered composition of the gut microbiota in rheumatoid arthritis patients (Zhang et al., 2015; Vaahtovuo et al., 2008; Hazenberg et al., 1992) [5], [6], [7] and inflammatory bowel disease patients (Morgan et al., 2012; Abraham and Medzhitov, 2011; Bennike, 2014) [8], [9], [10]. Therefore, we characterized the proteome of colon mucosa biopsies from 10 inflammatory bowel disease ulcerative colitis (UC) patients, 11 gastrointestinal healthy rheumatoid arthritis (RA) patients, and 10 controls. We conducted the sample preparation and liquid chromatography mass spectrometry (LC-MS/MS) analysis of all samples in one batch, enabling label-free comparison between all biopsies. The datasets are made publicly available to enable critical or extended analyses. The proteomics data and search results, have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples.

Entities:  

Keywords:  Colon mucosa; Colonoscopy; Dataset; Inflammatory bowel diseases; Neutrophil extracellular traps; Proteomics; Rheumatoid arthritis; Sigmoidoscopy; Ulcerative colitis

Year:  2017        PMID: 29085871      PMCID: PMC5650644          DOI: 10.1016/j.dib.2017.09.059

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The dataset contains the largest number of identified human proteins from colon mucosa biopsies as of 2017. The dataset was obtained in one batch, allowing for label-free comparison of the colon mucosa of ulcerative colitis patients, rheumatoid arthritis patients, and controls. The first dataset of the colon mucosa of gastrointestinal healthy RA patients. The datasets can be analyzed for novel proteome effects of disease and treatments. The datasets allow for extended statistical analysis, and we encourage such collaborations.

Data

The datasets in this article provides information on the proteome of the colon mucosa of inflammatory bowel disease patients with ulcerative colitis [1], gastrointestinal healthy rheumatoid arthritis patients [2], and controls. The study was motivated by the finding of an altered composition of the gut microbiota in rheumatoid arthritis patients [5], [6], [7] and inflammatory bowel disease patients [8], [9], [10]. All biopsies were handled on-site by the project group to limit technical variance. The biopsies were randomized, digested using a modified filter-aided sample preparation protein digestion protocol, and analyzed in technical triplicates by high-throughput proteomics on a Q Exactive mass spectrometer. All experimental factors were kept constant, allowing for a label-free quantitative analysis between all samples. The unprocessed proteomics data files (Table 1) and processed search result files (Table 2), have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository, with the dataset identifier PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples [13], [14].
Table 1

Raw-datafiles in PXD001608 and PXD003082. All samples were analyzed in technical triplicates, and all raw-files are named accordingly (e.g. Ctrl_10_3 is the third repeat of control 10). “Poor R” signifies a Pearson's correlation coefficient R<0.95 between the technical repeats, and additional data validation is recommended for studies including these datafiles. The number of identified proteins in each replicate is given, wo/w the MaxQuant match between runs feature which transfer MS/MS information between different LC-MS/MS analysis. RA: Rheumatoid arthritis, UC: ulcerative colitis, NA: Not available.

FilenameSample#Proteins matching OFF#Proteins matching ONDataset IDComment
Ctrl_1Control4362, 4279, 42415967, 5903, 5897PXD001608
Ctrl_2Control3613, 3607, 35955657, 5664, 5723PXD001608
Ctrl_3Control4203, 4203, 41885936, 5952, 5939PXD001608
Ctrl_4Control4245, 4268, 41915863, 5865, 5849PXD001608
Ctrl_5Control3961, 3881, 39035694, 5683, 5632PXD001608
Ctrl_6Control4290, 4281, 42425966, 5959, 5932PXD001608
Ctrl_7Control4080, 4099, 40975856, 5822, 5817PXD001608
Ctrl_8Control4269, 4325, 43365968, 5986, 5966PXD001608
Ctrl_9Control4560, 4549, 39746103, 6126, 6103PXD001608
Ctrl_10Control3974, 3993, 39725781, 5817, 5779PXD001608
UC_1UC4290, 4229, 43635870, 5852, 5907PXD001608
UC_2UC3328, 3285, 33285148, 5101, 5139PXD001608
UC_3UC4455, 4472, 44826068, 6082, 6041PXD001608
UC_4UC4236, 4005, NA5685, 5773, NAPXD001608UC_4_3 poor R
UC_5UC3097, 3174, NA5051, 5017, NAPXD001608UC_5_3 poor R
UC_6UC4458, 4424, 44826118, 6108, 6115PXD001608
UC_7UC3657, 3686, 36935647, 5626, 5575PXD001608
UC_8UC3356, 3288, 33035237, 5207, 5164PXD001608
UC_9UC4681, 4700, 47036220, 6233, 6223PXD001608
UC_10UC3762, 3688, 36745587, 5557, 5562PXD001608
RA_1RA3900, 3789, 37555658, 5644, 5632PXD003082
RA_2RA4347, 4141, 42856023, 5997, 5973PXD003082
RA_3RA4654, 4678, 46296204, 6179, 6180PXD003082
RA_4RA4298, 4287, 42675933, 5925, 5899PXD003082
RA_5RA3472, 3521, 34915402, 5390, 5344PXD003082
RA_6RA3545, 3485, 35265735, 5742, 5728PXD003082
RA_7RA4538, 4476, 44596057, 6062, 6071PXD003082
RA_8RA3619, 3439 NA5317, 5310, NAPXD003082RA_8_3 poor R
RA_9RA3361, NA, 3427,5256, NA, 5187PXD003082RA_9_2 poor R
RA_10RA4417, 4354, 43615991, 6010, 6023PXD003082
RA_11RA3094, 3055, 30515005, 5050, 5006PXD003082
Table 2

Additional submitted Search and FASTA files in PXD001608 and PXD003082. RA: Rheumatoid arthritis, UC: ulcerative colitis.

FilenameDataset IDContentDescription
CombinedTxtFiles.zipPXD001608Zipped MaxQuant combined txt folder.Result of the label-free quantitative analysis of UC and controls in MaxQuant. The content of each file is described in “tables.pdf”.
131008_Swissprot_Human_Ref_ proteome.fastaPXD001608Protein FASTA database file.Protein database used for the UC and controls analysis.
MaxQuantOutput.zipPXD003082Zipped MaxQuant combined txt folder.Result of the label-free quantitative analysis of UC, RA, and controls in MaxQuant. The content of each file is described in “tables.pdf”.
UniprotHumanProteome P000005640Isoforms.fastaPXD003082Protein FASTA database file.Protein database used for the UC, RA, and controls analysis.
FASTA file parameters.txtPXD003082FASTA info.Information regarding the database.
Raw-datafiles in PXD001608 and PXD003082. All samples were analyzed in technical triplicates, and all raw-files are named accordingly (e.g. Ctrl_10_3 is the third repeat of control 10). “Poor R” signifies a Pearson's correlation coefficient R<0.95 between the technical repeats, and additional data validation is recommended for studies including these datafiles. The number of identified proteins in each replicate is given, wo/w the MaxQuant match between runs feature which transfer MS/MS information between different LC-MS/MS analysis. RA: Rheumatoid arthritis, UC: ulcerative colitis, NA: Not available. Additional submitted Search and FASTA files in PXD001608 and PXD003082. RA: Rheumatoid arthritis, UC: ulcerative colitis. A cumulated 6768 proteins (FDR<1%) were identified, representing the largest proteome dataset of the colon mucosa so far. Additionally, the dataset represents the first analysis of the colon mucosa of gastrointestinal healthy rheumatoid arthritis patients. The data-analysis result from the analysis with MaxQuant can be downloaded as zipped txt-files, the context of which are described in the tables.pdf also in the zipped file. The result-file proteinGroups.txt, contains all identified proteins at <1% FDR, and information regarding each protein, e.g. the corresponding label-free relative quantitation value (LFQ). Additional information regarding the participants can be found in the publications.

Experimental design, materials and methods

Study cohort and sample collection

The sample material was extracted and processed as described in [1] and 2]. Colon mucosal biopsies (roughly 1 mm3) were sampled 40 cm from the anus by sigmoidoscopy, at the Regional Hospital Silkeborg Denmark, from 10 ulcerative colitis patients, 11 rheumatoid arthritis patients and 10 controls in the period from 2012 to 2013. The biopsies were immediately transferred to cryotubes and snap-frozen in liquid nitrogen followed by storage at minus 80 °C until proteomics sample preparation. All participants had given a written informed consent prior to participation in the study, and the project was approved by The Regional Scientific Ethical Committee (S-20120204) and the Danish Data Protection Agency (2008-58-035).

Proteomic sample preparation

The biopsies were randomized, and enzymatic digested using a modified filter-aided sample preparation protein [17], [18], [19], [20], [21], [22]. Briefly explained, the biopsies were homogenized in 0.5 mL cold sample buffer (5% sodium deoxycholate, 50 mM triethylammonium bicarbonate, pH 8.5). The lysate protein concentration was estimated by absorbance at 280 nm measured using a NanoDrop 1000 UV–vis spectrophotometer (Thermo Scientific, Waltham, MA, USA). Additionally, the concentration of four biopsy lysates was determined using a bicinchoninic acid assay (BCA) with bovine serum albumin as standard, measured using an Infinite microplate reader (Tecan, Männedorf, Switzerland). The nanodrop measurements were calibrated using the BCA results. 100 µg protein was transferred to 30 kDa molecular weight cutoff spin-filters (Millipore, Billerica, MA, USA) to facilitate buffer exchanges by centrifugation at 15,000g for 15 min between all steps. Protein disulfide bonds were reduced by addition of 100 µL 10 mM tris(2-carboxyethyl)phosphine (Thermo Scientific, Waltham, MA, USA) and alkylated by addition of 100 µL 50 mM 2-iodoacetamide (Sigma-Aldrich, St. Louis, MO, USA) in sample buffer. Two µg sequencing grade modified trypsin (Promega, Madison, WI, USA) diluted in lysis buffer with 0.5% sodium deoxycholate was added to the spin-filter, and the proteins were digested to peptides overnight at 37 °C. The peptide material was eluted from the spin-filter and purified by phase inversions with 1:1 (v/v) ethyl acetate with 1% formic acid, and dried down in a vacuum centrifuge overnight, and stored at −80 °C for a maximum of one week prior to analysis.

Proteomic analysis

The peptides were analyzed by LC-MS/MS using an UltiMate 3000 UPLC system (Thermo Scientific, Waltham, MA, USA) coupled online to a Q Exactive plus mass spectrometer (Thermo Scientific). Five µg peptide material was loaded onto a 2 cm reverse phase C18-material trapping column and separated on a 50 cm analytical column, both from Acclaim PepMap100 (Thermo Scientific). The liquid phase consisted of 96% solvent A (0.1% formic acid) and 4% solvent B (0.1% formic acid in acetonitrile), at a flow rate of 300 nL/min. The peptides were eluted from the column by increasing to 8% solvent B and subsequently to 30% solvent B on a 225 min ramp gradient, and introduced into the mass spectrometer by a picotip emitter for electrospray ionization (New objective, Woburn, MA, USA). The mass spectrometer was operated in positive mode with data-dependent acquisition, alternating between survey spectra and isolation/fragmentation spectra using a top12 method. Selected eluting peptides were excluded from re-analysis for 30 s. All biopsies were analyzed in triplicates in a random order.

Data processing

The generated RAW-files were searched with MaxQuant 1.5.2.8 software against the Uniprot Homo sapiens reference proteome database with isoforms (UP000005640, last modified 2015-01-16, entry count 90,434) [23], [24]. Standard settings were employed, with the following abundant peptide modifications included in the search: Carbamidomethylated(C) (fixed), N-terminal protein acetylation (variable), oxidation(M) (variable), and deamidation (N or Q) (variable) [11], [12]. The match between runs feature in MaxQuant was enabled to allow the transfer of confident peptides identifications across LC-MS/MS runs, based on accurate mass-to-charge and retention time. Identified proteins and peptides were filtered to <1% false discovery rate [25]. Label-free quantitation was enabled in MaxQuant to report protein and peptide relative quantities using standard parameters.

Funding sources

The Lundbeck Foundation Denmark (R181-2014-3372), and the Carlsberg Foundation (CF14-0561) are acknowledged for grants enabling the project (TBB grants). Knud and Edith Eriksens Memorial Foundation (“Knudog Edith EriksensMindefond”) and Ferring are acknowledge for grants, enabling the collection of the biological sample material (VA grant). The Obelske Family Foundation and the Svend Andersen Foundation are acknowledged for grants supporting the analytical platform being part of the Danish National Platform for Proteomics (PRO-MS) (AS grants).
Subject areaBiology
More specific subject areaCharacterization of the proteome of the colon mucosa of ulcerative colitis patients, gastrointestinal healthy rheumatoid arthritis patients, and controls.
Type of dataRaw- mass spectrometry files and text/excel files
How data was acquiredMass Spectrometry Liquid Chromatography
Data was acquired using a high-resolution/high-accuracy Q Exactive plus (Thermo Scientific) mass spectrometer.
Data formatRaw- and analyzed data.
Experimental factorsHuman colon mucosal biopsies from ulcerative colitis patients, gastrointestinal healthy rheumatoid arthritis patients, and controls.
Experimental featuresBiopsies were extracted by colonoscopy and immediately snap-frozen with liquid nitrogen. The biopsies were tryptic digested and analyzed by electrospray ionization liquid chromatography mass spectrometry.
Data source locationThe Laboratory for Medical Mass Spectrometry, Department of Health Science and Technology, Aalborg University, Fredrik Bajers Vej 7E, 9220 Aalborg East, Denmark
Data accessibilityThe proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository, [13], [14], [15], [16] with dataset identifiers:
PXD001608 – Ulcerative colitis patients, and controls.
PXD003082 – Gastrointestinal healthy rheumatoid arthritis patients.
Direct download links: http://www.ebi.ac.uk/pride/archive/projects/PXD001608
http://www.ebi.ac.uk/pride/archive/projects/PXD003082
  21 in total

1.  Sample preparation and digestion for proteomic analyses using spin filters.

Authors:  Linda L Manza; Sheryl L Stamer; Amy-Joan L Ham; Simona G Codreanu; Daniel C Liebler
Journal:  Proteomics       Date:  2005-05       Impact factor: 3.984

Review 2.  Interactions between the host innate immune system and microbes in inflammatory bowel disease.

Authors:  Clara Abraham; Ruslan Medzhitov
Journal:  Gastroenterology       Date:  2011-05       Impact factor: 22.682

Review 3.  Are intestinal bacteria involved in the etiology of rheumatoid arthritis? Review article.

Authors:  M P Hazenberg; I S Klasen; J Kool; J G Ruseler-van Embden; A J Severijnen
Journal:  APMIS       Date:  1992-01       Impact factor: 3.205

4.  Fecal microbiota in early rheumatoid arthritis.

Authors:  Jussi Vaahtovuo; Eveliina Munukka; Mika Korkeamäki; Reijo Luukkainen; Paavo Toivanen
Journal:  J Rheumatol       Date:  2008-06-01       Impact factor: 4.666

Review 5.  Biomarkers in inflammatory bowel diseases: current status and proteomics identification strategies.

Authors:  Tue Bennike; Svend Birkelund; Allan Stensballe; Vibeke Andersen
Journal:  World J Gastroenterol       Date:  2014-03-28       Impact factor: 5.742

6.  Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment.

Authors:  Xochitl C Morgan; Timothy L Tickle; Harry Sokol; Dirk Gevers; Kathryn L Devaney; Doyle V Ward; Joshua A Reyes; Samir A Shah; Neal LeLeiko; Scott B Snapper; Athos Bousvaros; Joshua Korzenik; Bruce E Sands; Ramnik J Xavier; Curtis Huttenhower
Journal:  Genome Biol       Date:  2012-04-16       Impact factor: 13.583

7.  The PRoteomics IDEntification (PRIDE) Converter 2 framework: an improved suite of tools to facilitate data submission to the PRIDE database and the ProteomeXchange consortium.

Authors:  Richard G Côté; Johannes Griss; José A Dianes; Rui Wang; James C Wright; Henk W P van den Toorn; Bas van Breukelen; Albert J R Heck; Niels Hulstaert; Lennart Martens; Florian Reisinger; Attila Csordas; David Ovelleiro; Yasset Perez-Rivevol; Harald Barsnes; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Mol Cell Proteomics       Date:  2012-09-04       Impact factor: 5.911

8.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

9.  Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ.

Authors:  Jürgen Cox; Marco Y Hein; Christian A Luber; Igor Paron; Nagarjuna Nagaraj; Matthias Mann
Journal:  Mol Cell Proteomics       Date:  2014-06-17       Impact factor: 5.911

10.  Proteome stability analysis of snap frozen, RNAlater preserved, and formalin-fixed paraffin-embedded human colon mucosal biopsies.

Authors:  Tue Bjerg Bennike; Kenneth Kastaniegaard; Simona Padurariu; Michael Gaihede; Svend Birkelund; Vibeke Andersen; Allan Stensballe
Journal:  Data Brief       Date:  2016-02-06
View more
  2 in total

Review 1.  The emerging role of neutrophilic extracellular traps in intestinal disease.

Authors:  Feng Chen; Yongqiang Liu; Yajing Shi; Jianmin Zhang; Xin Liu; Zhenzhen Liu; Jipeng Lv; Yufang Leng
Journal:  Gut Pathog       Date:  2022-06-22       Impact factor: 5.324

2.  Abincol® (Lactobacillus plantarum LP01, Lactobacillus lactis subspecies cremoris LLC02, Lactobacillus delbrueckii LDD01), an oral nutraceutical, pragmatic use in patients with chronic intestinal disorders.

Authors:  Luigi Bonavina; Andrea Arini; Leonardo Ficano; Donato Iannuzziello; Luigi Pasquale; Salvo Emanuele Aragona; Giorgio Ciprandi; Italian Study Group On Digestive Disorders
Journal:  Acta Biomed       Date:  2019-07-10
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.