| Literature DB >> 33543004 |
Juliane Merl-Pham1, Trayambak Basak2, Larissa Knüppel3, Deepak Ramanujam4, Mark Athanason2, Jürgen Behr5,6, Stefan Engelhardt4, Oliver Eickelberg3, Stefanie M Hauck1, Roberto Vanacore2, Claudia A Staab-Weijnitz3.
Abstract
Lung fibrosis is characterized by excessive deposition of extracellular matrix (ECM), in particular collagens, by fibroblasts in the interstitium. Transforming growth factor-β1 (TGF-β1) alters the expression of many extracellular matrix (ECM) components produced by fibroblasts, but such changes in ECM composition as well as modulation of collagen post-translational modification (PTM) levels have not been comprehensively investigated. Here, we performed mass spectrometry (MS)-based proteomics analyses to assess changes in the ECM deposited by cultured lung fibroblasts from idiopathic pulmonary fibrosis (IPF) patients upon stimulation with transforming growth factor β1 (TGF-β1). In addition to the ECM changes commonly associated with lung fibrosis, MS-based label-free quantification revealed profound effects on enzymes involved in ECM crosslinking and turnover as well as multiple positive and negative feedback mechanisms of TGF-β1 signaling. Notably, the ECM changes observed in this in vitro model correlated significantly with ECM changes observed in patient samples. Because collagens are subject to multiple PTMs with major implications in disease, we implemented a new bioinformatic platform to analyze MS data that allows for the comprehensive mapping and site-specific quantitation of collagen PTMs in crude ECM preparations. These analyses yielded a comprehensive map of prolyl and lysyl hydroxylations as well as lysyl glycosylations for 15 collagen chains. In addition, site-specific PTM analysis revealed novel sites of prolyl-3-hydroxylation and lysyl glycosylation in type I collagen. Interestingly, the results show, for the first time, that TGF-β1 can modulate prolyl-3-hydroxylation and glycosylation in a site-specific manner. Taken together, this proof of concept study not only reveals unanticipated TGF-β1 mediated regulation of collagen PTMs and other ECM components but also lays the foundation for dissecting their key roles in health and disease. The proteomic data has been deposited to the ProteomeXchange Consortium via the MassIVE partner repository with the data set identifier MSV000082958.Entities:
Keywords: 3-HyP, 3-hydroxyproline; 4-HyP, 4-hydroxyproline; AGC, automatic gain control; ANXA11, annexin A11; BGN, biglycan; COL1A1, collagen-I alpha 1 chain; Collagen; Collagen post-translational modifications; DCN, decorin; ECM, extracellular matrix; Extracellular matrix; FN1, fibronectin 1; G-HyK, galactosylhydroxylysine; GG-HyK, glucosylgalactosylhydroxylysine; HyK, hydroxylysine; HyP, hydroxyproline; ILD, interstitial lung disease; IPF, idiopathic pulmonary fibrosis; LH, lysyl hydroxylase; LOX(L), lysyl oxidase(-like); LTBP2, latent-transforming growth factor β -binding protein 2; Lysyl glycosylation; Lysyl hydroxylation; P3H, prolyl-3-hydroxylase; P4H, prolyl-4-hydroxylase; PAI1, plasminogen activator inhibitor 1; PCA, principal component analysis; PLOD (LH), procollagen-lysine,2-oxoglutarate 5-dioxygenases (lysyl hydroxylases); PTM, post-translational modification; Prolyl hydroxylation; Pulmonary fibrosis; SEMA7A, semaphorin 7a; TGF-β, transforming growth factor β; TGM2, transglutaminase 1; Transforming growth factor-β; VCAN, versican; Xaa, Xaa position in the Gly-Xaa-Yaa repeat in triple-helical collagen; Yaa, Yaa position in the Gly-Xaa-Yaa repeat in triple-helical collagen; α-SMA, α-smooth muscle actin
Year: 2019 PMID: 33543004 PMCID: PMC7852317 DOI: 10.1016/j.mbplus.2019.04.002
Source DB: PubMed Journal: Matrix Biol Plus ISSN: 2590-0285
Fig. 1Overview of experimental approach.
Fig. 2(A) Relative contribution of ECM proteins to the total number of protein IDs classified by cellular localization (upper panel, left) and distribution of ECM protein IDs according to matrisome classification [42] (upper panel, right); relative contribution of ECM proteins in terms of total protein abundance (lower panel, left) and distribution of ECM protein abundance according to matrisome classification [42] (lower panel, right). (B) Principal component analysis of control (yellow) and TGF-β1-treated (blue) distinct biological replicates of all protein IDs (top) and ECM proteins subset (bottom)
Fig. 3Relative protein abundance in IPF fibroblast ECM and their regulation by TGF-β1. (A) Summed normalized protein abundances for collagens and proteoglycans. (B) Summed normalized protein abundances for ECM glycoproteins. (C) Heat map showing fold-change of ECM proteins in response to TGF-β1 treatment. Levels of proteins depicted in red font were significantly changed (p < 0.05).
Fig. 4Transcriptional and post-translational regulation of ECM and ECM-associated proteins in TGF-β1 treated IPF fibroblasts. Fold-change derived from qRT-PCR analyses of transcripts of proteins quantified as decreased (A) and increased (B) by label-free proteomics profiling of ECM derived from TGF-β1-treated IPF fibroblast (Fig. 3B). Each bar represents the fold-change in TGF-β1-treated samples. Data is plotted as mean ± SD (n = 4 for each transcript) using GraphPad Prism 5. Significant changes are denoted by single asterisk (*p < 0.05) and double asterisks (**p < 0.01) as determined by the two-tailed paired t-test. (C) Western blot analyses of representative ECM and ECM-associated proteins in cell lysates (≈intracellular) from control and TGF-β1-treated IPF fibroblast. Two independently derived primary fibroblast lines are shown.
Collagen chain stoichiometries for type I, IV, V, and VI collagens.
| Collagen type | Detected chains | Control | TGF-β1 |
|---|---|---|---|
| I | COL1A1 - α1(I) | 1 | 1 |
| COL1A2 - α2(I) | 0.62 ± 0.22 | 0.56 ± 0.15 | |
| IV | COL4A1 - α1(IV) | 1 | 1 |
| COL4A2 - α2(IV) | 0.56 ± 0.22 | 0.48 ± 0.20 | |
| V | COL5A1 - α1(V) | 1 | 1 |
| COL5A2 - α2(V) | 1.77 ± 0.57 | 1.74 ± 0.25 | |
| COL5A3 - α3(V) | 0.04 ± 0.02 | 0.02 ± 0.01 | |
| VI | COL6A1 - α1(VI) | 1 | 1 |
| COL6A2 - α2(VI) | 0.64 ± 0.29 | 0.67 ± 0.19 | |
| COL6A3 - α3(VI) | 0.84 ± 0.37 | 0.89 ± 0.25 |
Fig. 5Correlation analysis of the here presented data set with a publicly available ECM data set from human lung fibrosis samples. Names are given for proteins that deviate by a log2 ratio > 1 from the perfect correlation. Collagens are given in red, ECM glycoproteins in green, proteoglycans in blue, ECM regulators in purple, ECM-affiliated proteins in brown, and secreted factors in black.
Fig. 6Bioinformatics pipeline for the global identification and quantitation of site-specific PTM in collagens. Thermo .raw MS/MS files were initially searched with Myrimatch to identify the proteins present in the fibroblast ECM. The subset of identified proteins were used as database to perform a second search in which sequence motifs were defined for the identification of site-specific PTM in collagens present in ECM. The resulting *.pep.XML output files from Myrimatch were parsed by PeptideProphet to compute the probability score (0–1) for each peptide spectrum match (PSM). The PeptideProphet parsed .pep.XML output file was imported into Skyline along with all the raw MS/MS files in *.mz.XML format (converted from *.RAW by MSConvert) to generate the spectral library (.blib). This spectral library was used for the targeted extraction of all the PTM modified and unmodified peptide species for each specific site. MS1 area was computed for each peptide for different samples from Skyline.
Mass spectrometry-based identification of site-specific hydroxylation and glycosylation of different collagens identified in the ECM of human lung fibroblast.
| Collagen chain | 3-HyP Sites (Gly-HyP-HyP) | Glucosylgalactosyl-hydroxylysine | Galactosyl-hydroxylysine | Hydroxylysine |
|---|---|---|---|---|
| COL1A1 | P426, P555, P567, P690, P771,P885, P894, P897, P1119, P1122, P1164, P1179 | K265, K277, K448, K520, K586, K862 | K277, K448, K520, K586 | K265, K277, K286, K352, K397, K442, K505, K538, K586, K742, K781, K826, K862, K934 |
| COL1A2 | P101, P419, P593, P782, P797, P806, P809, P902 | K189 | – | K189, K198, K354, K498, K621, K846 |
| COL2A1 | P1096 | – | – | K608, K848 |
| COL3A1 | P940, P1147, P1162 | K584, K662, K740, K743, K779, K932 | – | K108, K251, K350, K461, K503, K584, K629, K779, K860, K923 |
| COL4A1 | P372, P478, P647, P786, P803, P1424, P1436 | K295, K298, K322, K343, K361, K497, K527 | – | – |
| COL4A2 | P197, P424, P733 | K225, K409, K453, K456, K883, K1106, K1175, K1247, K1373 | – | K597, K907 |
| COL5A1 | P500, P516, P575, P599, P761, P770, P779, P833, P992, P995, P1322 | K582, K774, K1038, K1074, K1311 | – | K963, K1125, K1326, K1473 |
| COL5A2 | P136, P409, P442, P448, P658, P919, P1177, P1198 | K320, K386, K554, K785 | – | K431, K860 |
| COL5A3 | P576, P1113 | K502, K1132 | – | – |
| COL6A1 | P582, P588 | K546 | – | – |
| COL6A2 | – | K545 | – | – |
| COL6A3 | – | K2052, K2103, K2170, K2209, K2212 | – | K2121 |
| COL12A1 | P2775, P2831, P2837, P2855, P2858, P2976, P3022, P3025 | – | – | – |
| COL16A1 | P1402 | K658, K1116 | – | K1245 |
| COL18A1 | P862, P868, P871, P907, P910 | – | – | – |
Fig. 7Comprehensive map of hydroxylation and O-linked glycosylation sites in human COL1A1. Lung fibroblast ECM was used to generate Collagen I PTM maps. MS identified collagen peptide sequences are shown in black which together represent an overall sequence coverage of 85% for the processed form of COL1A1. Signal peptide encompassing amino acid residues 1–22 are indicated. The green and purple arrows delimit the N-terminal (23–161) and C-terminal (1219–1464) pro-peptide sequences, respectively. Sequences not identified in this study are colored gray. “P” indicates 4-hydroxyproline occurring in the Yaa position of Gly-Xaa-Yaa motif, and a bold “P*” indicates 3-hydroxyproline in the Xaa position of Gly-Xaa-HyP motif. In addition, hydroxylation in red “P” residues occurring in the Xaa position followed by either Ala, Val, Met, Arg, Asp, or Glu in the Yaa position of Gly-Xaa-Yaa motif is reported but cannot be defined as either 3- or 4-HyP solely on the current MS/MS strategy [21]. Green and orange diamonds denote glucosyl and galactosyl sugar moieties attached to hydroxylysine (bold “K”). A summary of the PTMs is presented in Table 2, and PSMs for O-glycosylated lysine and 3-hydroxyproline sites are provided in Supplementary Figs. S3–S19. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 8Quantitation of site-specific hydroxylation occupancy of P771 in COL1A1 from control and TGF-β1 treated human lung fibroblasts. (A, B) MS/MS spectra of peptide 764GLTGPIGPP+16GPAGAP+16GDK+16781 in which Pro771 is either unmodified (m/z 803.9086+2) or 3-hydroxylated (m/z 811.9047+2). (C) Skyline MS1 quantitation showing the extracted ion chromatograms for the 3-HyP771-containing peptide 764GLTGPIGPP+16GPAGAP+16GDK+16781 from the ECM digest of control and TGF-β1 treated human fibroblasts. (D) Determination of 3-hydroxylation occupancy (%) for P771 calculated from the relative abundances of unmodified and 3-hydroxylated peptides obtained by Skyline MS1 quantitation of control and TGF-β1 samples. MS1 area for 3-HyP771 modified peptide was normalized by dividing the area of modified peptide to the summed area of all the unmodified and modified peptide “764GLTGPIGPPGPAGAPGDK781” species. The occupancy (%) of 3-HyP771 in COL1A1 was increased about 3.7 fold in TGF-β1 treated fibroblast ECM (Table 3). Each dot and square represents the measurement of 3-HyP771 occupancy in each control and TGF-β treated samples. Data is plotted as mean ± SEM (n = 3 in each group) using GraphPad Prism 5. The increased hydroxylation occupancy for Pro771 in TGF-β1 treated samples was significant (*p < 0.05, two-tailed unpaired ‘t’-test).
Quantitative occupancy (%) of 3-hydroxyproline (3-HyP) sites identified in COL1A1 from lung fibroblast ECM. Results are expressed as mean ± SEM (n = 3 in each group). Significant statistical differences were estimated by two-tailed unpaired ‘t’-test (*p < 0.05, ns > 0.05).
| COL1A1 | Occupancy (%) of 3-HyP sites | |
|---|---|---|
| Control | TGF-β1 | |
| P426 | 1.5 ± 0.5 | 2.8 ± 0.8ns |
| P555,567 | 3.0 ± 1.5 | 1.6 ± 0.19ns |
| P690 | 1.6 ± 0.6 | 1.2 ± 0.08ns |
| P771 | 1.9 ± 1.1 | 7.2 ± 0.5* |
| P885,894,897 | 0.6 ± 0.4 | 0.8 ± 0.4ns |
| P1119,1122 | 9.4 ± 4.9 | 11.9 ± 1.2ns |
| P1164 | 42.7 ± 11 | 41.4 ± 2.2ns |
Numbering of collagen alpha 1 chain (I) corresponds to that of the translated protein product starting from the signal peptide.
Fig. 9Skyline quantitation of micro-heterogeneity of the K277 site in COL1A1 from human lung fibroblasts. (A) Chromatographic elution profiles of unmodified (K), hydroxylysine (HyK), galactosyl-hydroxylysine (G-HyK), and glucosylgalactosyl-hydroxylysine (GG-HyK) modified (K277) peptide in human COL1A1 from lung fibroblast ECM digests. (B) Graphical representation of Skyline quantitation of micro-heterogenic distribution of unmodified K277 (black), HyK277 (maroon), G-HyK277 (blue) and GG-HyK277 (green) species in COL1A1 from the ECM digest of control and TGF-β1 treated human fibroblasts. The different colors in bar represent the occupancy of different forms at K277 site in COL1A1 with mean ± SEM (n = 3 in each group) (see Table 4).
Quantitative micro-heterogenic occupancy (%) of O-glycosylated lysine sites identified in COL1A1 from lung fibroblast ECM. Results are expressed as mean ± SEM (n = 3 in each group). Statistically significant differences were estimated by two-tailed unpaired ‘t’-test (*p < 0.05, ns > 0.05). N/D, not detected; HyK, hydroxylysine; G-HyK, galactosylhydroxy-lysine; GG-HyK, glucosylgalactosylhydroxylysine.
| COL1A1 | Modification | Micro-heterogeneity (%) | |
|---|---|---|---|
| Control | TGF-β1 | ||
| K277 | K | 39.7 ± 8.2 | 29.8 ± 3.4ns |
| HyK | 17.9 ± 1.1 | 12.1 ± 2.2ns | |
| G-HyK | 16.7 ± 0.8 | 14.2 ± 3.9ns | |
| GG-HyK | 25.5 ± 7.2 | 43.7 ± 6.6ns | |
| K448 | K | N/D | N/D |
| HyK | N/D | N/D | |
| G-HyK | 30.5 ± 11.6 | 31 ± 0.8ns | |
| GG-HyK | 69.5 ± 11.5 | 69 ± 0.8ns | |
| K520 | K | N/D | N/D |
| HyK | N/D | N/D | |
| G-HyK | 56.5 ± 21 | 34 ± 9.5ns | |
| GG-HyK | 43.5 ± 21 | 66 ± 9.5ns | |
| K586 | K | 28 ± 9.3 | 13.6 ± 7.1ns |
| HyK | 3.6 ± 1.1 | 1.1 ± 0.17 ns | |
| G-HyK | 29.4 ± 7.4 | 29.5 ± 6.7ns | |
| GG-HyK | 38.9 ± 12.5 | 55.7 ± 13.3ns | |
Numbering of collagen alpha 1 chain (I) corresponds to that of the translated protein product starting from the signal peptide.