| Literature DB >> 24695516 |
David M Hudson1, Rachel Werther1, MaryAnn Weis1, Jiann-Jiu Wu1, David R Eyre1.
Abstract
Approximately half the proline residues in fibrillar collagen are hydroxylated. The predominant form is 4-hydroxyproline, which helps fold and stabilize the triple helix. A minor form, 3-hydroxyproline, still has no clear function. Using peptide mass spectrometry, we recently revealed several previously unknown molecular sites of 3-hydroxyproline in fibrillar collagen chains. In fibril-forming A-clade collagen chains, four new partially occupied 3-hydroxyproline sites were found (A2, A3, A4 and (GPP)n) in addition to the fully occupied A1 site at Pro986. The C-terminal (GPP)n motif has five consecutive GPP triplets in α1(I), four in α2(I) and three in α1(II), all subject to 3-hydroxylation. The evolutionary origins of this substrate sequence were investigated by surveying the pattern of its 3-hydroxyproline occupancy from early chordates through amphibians, birds and mammals. Different tissue sources of type I collagen (tendon, bone and skin) and type II collagen (cartilage and notochord) were examined by mass spectrometry. The (GPP)n domain was found to be a major substrate for 3-hydroxylation only in vertebrate fibrillar collagens. In higher vertebrates (mouse, bovine and human), up to five 3-hydroxyproline residues per (GPP)n motif were found in α1(I) and four in α2(I), with an average of two residues per chain. In vertebrate type I collagen the modification exhibited clear tissue specificity, with 3-hydroxyproline prominent only in tendon. The occupancy also showed developmental changes in Achilles tendon, with increasing 3-hydroxyproline levels with age. The biological significance is unclear but the level of 3-hydroxylation at the (GPP)n site appears to have increased as tendons evolved and shows both tendon type and developmental variations within a species.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24695516 PMCID: PMC3973637 DOI: 10.1371/journal.pone.0093467
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Protein sequence alignment of the collagen (GPP)n motif from phylogenetically diverse animals.
Conservation of the (GPP)n motif is shown in red for fibrillar collagens from early chordates through amphibians, birds and mammals. Genomic sequences are from Ensembl. Lamprey and ciona collagen sequences are from Petromyzon marinus transcript: COL2A1 ENSPMAT00000009617 and Ciona intestinalis transcript: FCOL1 ENSCINT00000014311, respectively.
Summary of 3Hyp occupancy in the (GPP)n of type I and II collagen α-chains.
| Species | Tendon | Bone | Skin | Cartilage | |||
| α1(I) | α2(I) | α1(I) | α2(I) | α1(I) | α2(I) | α1(II) | |
| Human | 1.6 (65%) | 2.0 (80%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0.3 (30%) |
| Mouse | 1–2 (80%) | 1–2 (80%) | 0 (0%) | 0.3 (30%) | 0 (0%) | 0.3 (30%) | 0.5 (50%) |
| Bovine | 0.2 (20%) | 1.2 (50%) | 0 (0%) | 0.3 (30%) | 0 (0%) | 0.3 (30%) | 0.4 (40%) |
| Chicken | 0.1 (10%) | 1.1 (70%) | 0 (0%) | 0.8 (50%) | 0 (0%) | 0.7 (50%) | 0.1 (10%) |
| Xenopus | 0.1 (10%) | 0.1 (10%) | 0.1 (10%) | 0.1 (10%) | 0.1 (10%) | 0.1 (10%) | 0 (0%) |
| Species | Tendon | Notochord | |||||
| Unclassified α1 | Unclassified α1 | ||||||
| Lamprey | 0.15 (15%) | 0.05 (5%) | |||||
The table shows the average number of 3Hyp residues per (GPP)n motif with the percentage of α-chains containing at least one 3Hyp residue per (GPP)n given in parentheses. The percentage of each posttranslational variant was determined based on the ratio of the heights of the m/z peaks. For example, the human tendon α1(I) (GPP)n tryptic peptide, TGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQEK, was found to be a mix of eight distinct molecular species giving a hydroxylation (±16 Da) ladder, each representing a posttranslational variant (Figure 2). The molecular location of the each hydroxylated residue (3Hyp, 4Hyp and Hyl) was determined using MS/MS (Figure S1). The C-terminal lysine was predominantly hydroxylated in all Achilles tendons. In this scroll, the 1270.73+ m/z (peptide species containing four 3Hyp residues and five 4Hyp) represents 9% of the total population and the other variations are as follows: 1265.93+ (three 3Hyp residues and five 4Hyp, 10%); 1260.53+ (three 3Hyp residues and four 4Hyp, 13%); 1254.63+ (two 3Hyp residue and four 4Hyp, 19%); 1249.63+ (one 3Hyp residue and four 4Hyp, 16%); 1244.13+ (no 3Hyp residues and four 4Hyp, 18%); 1238.63+ (no 3Hyp residues and three 4Hyp residue, 10%); 1233.13+ (no 3Hyp residues and two 4Hyp residue, 5%). From these percentages, the average number of 3Hyp residues was estimated per α-chain. In this example the calculation is (4×9%)+(3×10%)+(3×13%)+(2×19%)+(1×16%) = mean content of 1.6 3Hyp per α1(I) from human tendon. The 3Hyp content in mouse tendon type I collagen was observed to vary markedly with animal age, in the range between one and two 3Hyp residues per (GPP)n as indicated in the table.
Figure 2Mass spectra of (GPP)n containing tryptic peptides from adult animal tendons.
Full scan spectra from LC-MS profiles of in-gel trypsin digests of α1(I) from human, chicken and xenopus tendon with 6% SDS-PAGE lanes at left (A). MS/MS fragmentation spectrum of the parent ion (1265.33+) from human tendon (B). The sequence is shown with b and y ion breakages. P*, 4Hyp; P#, 3Hyp; K*, Hyl. The cross-linking telopeptide Lys of the human peptide was fully hydroxylated in all posttranslational variants.
Figure 3Edman N-terminal sequence analysis confirming 3-hydroxyproline in the (GPP)n motif of tendon type I collagen.
The (GPP)n-containing tryptic peptides from the α1(I) and α2(I) α-chains of rat tail tendon were recovered as a chromatographic pool and sequenced simultaneously. Sequential phenylthiohydantoin-derivative HPLC chromatograms are shown for sequencer cycles 8-15 (sequencer cycles 1-7 were as predicted from the known sequences of both α-chains). The new residues at each cycle are highlighted in blue for the α1-chain and green for the α2-chain. The 3Hyp residue gave a distinctive double peak as previously reported [18].
Figure 4Mass spectra showing prolyl 3-hydroxylation distributed throughout the whole fibril.
Collagen was solubilized from adult human tendon using SDS extraction (A) and CNBr digestion (B). Lanes of 6% (A) and 12% (B) SDS-PAGE gels are shown to the left. Similar levels of 3Hyp (∼two 3Hyp per α2(I) chain) were observed using each approach. The 12102+ and 12182+ ions in both ion ladders represent unrelated peptides with a 2+ charge (these ions are indicated with φ).
Figure 5Developmental control of prolyl 3-hydroxylation in tendon.
Reduced levels of 3Hyp were observed in fetal tendon relative to adult tissue. MS scan of fetal human Achilles tendon α1(I) with 6% SDS-PAGE lane at left (A). The 1248.43+ ion contains a mix of two peptide posttranslational variants (one 3Hyp and four 4Hyp; and no 3Hyp and five 4Hyp). MS scan of fetal human Achilles tendon α2(I) (B).