| Literature DB >> 28704418 |
J Des Parkin1, James D San Antonio2, Anton V Persikov3, Hayat Dagher1, Raymond Dalgleish4, Shane T Jensen5, Xavier Jeunemaitre6,7, Judy Savige1.
Abstract
Collagen III is critical to the integrity of blood vessels and distensible organs, and in hemostasis. Examination of the human collagen III interactome reveals a nearly identical structural arrangement and charge distribution pattern as for collagen I, with cell interaction domains, fibrillogenesis and enzyme cleavage domains, several major ligand-binding regions, and intermolecular crosslink sites at the same sites. These similarities allow heterotypic fibril formation with, and substitution by, collagen I in embryonic development and wound healing. The collagen III fibril assumes a "flexi-rod" structure with flexible zones interspersed with rod-like domains, which is consistent with the molecule's prominence in young, pliable tissues and distensible organs. Collagen III has two major hemostasis domains, with binding motifs for von Willebrand factor, α2β1 integrin, platelet binding octapeptide and glycoprotein VI, consistent with the bleeding tendency observed with COL3A1 disease-causing sequence variants.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28704418 PMCID: PMC5509119 DOI: 10.1371/journal.pone.0175582
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Arrangement of five overlapping collagen III α1 chains in the fibril D-period with overlap and gap zone indicated.
Fig 2Arrangement of collagen I α1 (top), α2 (middle) and collagen III α1 (bottom) chains demonstrates approximate alignment of charge across all three molecules in the D-period.
Fig 3a. Collagen III D-period demonstrating location of GPP residues (in red) proposed to confer stability and atypical triplets (GAA, GGP, etc, in other colors) proposed to contribute flexibility.; b. Each monomer of the collagen III D-period was divided into ten bins of equal size (about 23 residues) starting at the N-terminus. The number of GPP triplets and atypical triplets (GAA, GGP, etc) were added for all five monomers and plotted against bin number; c. Relative triple helix stability for amino acid sequences of each monomer within the collagen III D-period was predicted using the collagen stability calculator (http://semimajor.net/collagen_calculator/). Monomers 1–5 represent five segments of the collagen III sequence, beginning at the N-terminus. Three comprise approximately 234 amino acid residues but the two that included the gap zone were shorter. Examining these stability curves together suggests that some regions of the D-period are more stable, and others less so, suggesting a “flexi-rod” model for collagen III; d.Top: Schematic of type III collagen monomer with sites for cell interactions and remodeling flanked by crosslinks (X) and hemostasis domain (H). Bottom: Stability modeling indicates clusters of atypical collagen sequences of lower stability (springs) are interspersed with rigid zones (rods) hosting sequences that carry out crucial biologic functions.
Fig 4Human collagen III interactome demonstrating the positions of putative Cell interaction domain, Fibrillogenesis and enzyme cleavage domain, Major ligand-binding regions (MLBR1, 2 and 3), Hemostasis domains, and miscellaneous ligand binding sites and structural features on the collagen triple helix in the fibril’s D-period.
Sequences or residues comprising binding sites or structural features are indicated by labels that appear beneath the relevant sequence or residue(s) and approximate their spans. Some enzyme cleavage sites are indicated by arrows pointed to the approximate center of the enzyme’s recognition sequence. Abbreviations used in the interactome for the first time include: ang, angiogenesis; AR, A-rich; coll, collagen; hep, heparin/heparan sulfates; FBP, fibronectin-binding protein; and PFR, platelet fibrinogen receptor.
Major structural features and ligand binding sites for collagen III.
| Name | Motif | Sequence | Residues | Comments | Detection method | Reference |
|---|---|---|---|---|---|---|
| Structural features | ||||||
| C cross-links | 1043–1044 | Forms bonds between collagen III α1 chains | X Ray crystallography | [ | ||
| K cross-links | 108–113951–956 | Cross links collagen III and collagen I or II | Enzyme digestion and sequencing | [ | ||
| Modification sites and sites related to assembly, trafficking and turnover | ||||||
| 4 P-hydroxylation | N/A | Multiple sites | Stabilizes collagen triple helix | Protein sequence | [ | |
| O-linked glycosylation | 107–114 | Regulates cross-link maturation | Cyanogen bromide fragments, protein sequence | [ | ||
| N-linked glycosylation | 236–238 | Unlikely in native collagen | Protein sequence | NetNGly 1.0 | ||
| NC1 recognition sequence | C-terminal propeptide | C-terminal to onset of triple helix | Highly conserved sequence, critical to triple helix nucleation | Protein sequence | [ | |
| HSP47 | Multiple sites, in the same locations as collagen I α1 and α2 | Procollagen-specific | Protein sequence | [ | ||
| Secreted Protein Acidic and Rich in C (SPARC) | 423–428 | Matricellular protein with roles in cell differentiation and wound healing | Rotary shadowing, cyanogen bromide, synthetic peptides | [ | ||
| Membrin | 574–577 | Transports proteins within the Golgi | Protein sequence | Minimotif | ||
| Glycation sites | Select K residues | Undefined | Advanced glycation end-products may form at select residues | Sites in collagen I α1 (434) and α2 chains (453, 479, 924); cross-fibril glycation zone for type I/III fibrils, blue stripe, | [ | |
| Cleavage sites | ||||||
| Procollagen III N-proteinase | .…/1-4 | Cleaves native collagen III | Protein sequence | [ | ||
| ADAMTS-2 | …./1-4 | N-propeptidase that cleaves collagens I, II III, probably distinct from procollagen III N-proteinase | Protein sequence | [ | ||
| Meprin a/b | …./1-4 | Cleaves both N- and C- propeptides | SDS-PAGE | [ | ||
| Bone morphogenetic protein 1 (BMP1; Procollagen C-proteinase, PCP) | C-propeptide | 1065-1068/… | Cleaves C-propeptide | SDS-PAGE | [ | |
| Metalloproteinase-1 (MMP1, 3 and 9) | 792–803 | Cleaves collagen III ¾ along the molecule; activates other MMPs | Agarose gels and sequencing | [ | ||
| MMP3 (stromelysin 1) | 457–466 | Cleaves collagen III ¾ along the molecule; activates other MMPs | Agarose gels and sequencing | [ | ||
| High temperature requirement A 1 (HtrA 1) serine protease | C-propeptide | C-terminal to triple helix | Serine protease, binds to C-terminus | Solid phase binding assay | [ | |
| Clostridium collagenase | 160–163 | Cleavage | Protein sequence | [ | ||
| Caspase 1 | C-terminal to triple helix | Cysteine-aspartic protease that activates proteins by cleavage, digests ECM | Protein sequence | [ | ||
| Caspase 3–7 | 311–315 | Cysteine-aspartic protease, involved in apoptosis and digests extracellular matrix | Protein sequence | [ | ||
| Cathepsin D | 8–16 | Non-specific | Protein sequence | [ | ||
| Caspase 3–7 | 311–315 | Cysteine-aspartic protease, involved in apoptosis an ddiegests ECM | Protein sequence | [ | ||
| Neutrophil elastase | 799/800 and 800/801 | Cleaves in MMP enzyme cleavage domain | Protein sequence | [ | ||
| Trypsin | Multiple sites | Cleaves in MMP enzyme cleavage domain | Protein sequence | [ | ||
| Thermolysin | 804/805 | Cleaves in MMP enzyme cleavage domain | Protein sequence | [ | ||
| Thrombin | 365/366 | Cleavage | Protein sequence | [ | ||
| Factor Xa | 118/119 | Cleavage | Protein sequence | [ | ||
| Binding sites for extracellular matrix molecules, cells and other ligands | ||||||
| Extracellular molecules | ||||||
| Collagen I, II | N-terminal site unresolved | 99–119, 1052–1068 | Form heterotypic fibrils in tissue-specific manner | CNBr fragments and synthetic peptides with Western blots | [ | |
| Keratan sulfate proteoglycan (KSPG) | 474–484 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | ||
| 588–598 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | |||
| Dermatan sulfate proteoglycan (DSPG) | 624–634 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | ||
| 645–655 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | |||
| 879–889 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | |||
| 909–919 | Proposed to regulate spacing between collagen fibrils | Protein sequence | [ | |||
| Heparin | 110–113 | Inhibits angiogenesis | Solid phase binding assay, endothelial tube formation assays | [ | ||
| Pigment Epithelium-derived Factor (PEDF, SERPIN F1) | 109–120 | Complexes with collagen in cornea and vitreous | Solid phase binding assay | [ | ||
| Decoron (decorin core protein) | 877–883 | Inferred from sequence, position and site on A clade map | Rotary shadowing, X Ray crystallography | [ | ||
| Fibronectin-binding protein | Binds about 122 nm from C-terminus of collagen I | Undefined binding sequence | about 800 | Links fibronectin and collagen III. Binds near MMP cleavage site and near decorin with which it interacts | Rotary shadowing, X Ray crystallography | [ |
| Secreted protein Acidic and Rich in Cysteine (SPARC, BM40, | 423–428 | Secreted glycoprotein important in mineralization | CNBr fragments, synthetic peptides and rotary shadowing | [ | ||
| Thrombospondin | Binds close to both N- and C-termini | Undefined binding sequence(s) | Unknown | Platelet aggregation and antiangiogenic, mediates cell-cell and cell-ECM binding | Rotary shadowing | [ |
| Tenascin X | Unknown | Unknown | Unknown | Binds to collagen III and decorin | Solid phase binding assay | [ |
| Cells | ||||||
| Integrin α1β1, α2β1 | 84–89 | High affinity, expressed on platelets | Rotary shadowing and synthetic peptides, collagen III toolkit | [ | ||
| Integrin α2β1 | 135–140, | High affinity | Rotary shadowing and synthetic peptides, collagen III toolkit | [ | ||
| Integrin α2β1 | 525–530 | Low affinity | Rotary shadowing and synthetic peptides, collagen III toolkit | [ | ||
| Integrin α2β1 | 573–578 | High affinity | Rotary shadowing and synthetic peptides, collagen III toolkit | [ | ||
| Integrin α2β1 | 834–839 | High affinity | Rotary shadowing and synthetic peptides, collagen III toolkit | [ | ||
| RGD | 938–940 | Integrin- binding site, not usually relevant in native collagen | Protein sequence | [ | ||
| DDR2 (Discoidin domain receptor -2) | 423–428 | Cell surface collagen receptor, tyrosine kinase receptor that activates fibroblasts | Synthetic peptides | [ | ||
| LAIR-1 (Leukocyte associated Ig-like | 537–563 | Inhibits immune | Solid phase binding to collagen III and | [ | ||
| OSCAR (Osteoclast Associated Receptor) | 651–662 | Tyrosine kinase-coupled receptor that costimulates osteoclastogenesis | Solid phase binding to synthetic peptides and type II toolkit | [ | ||
| nCAM1 | Motif unknown | Unknown | One binding site | Membrane glycoprotein but also part of the ECM | Solid phase binding assays of ligand to native proteins | [ |
| MAG (Myelin- associated glycoprotein) | Motif unknown | Unknown | Two binding sites | Membrane glycoprotein, but also part of the ECM | Solid phase radio-ligand binding assay | [ |
| Hemostasis | ||||||
| Von Willebrand Factor (VWF) | Multimeric glycoprotein, adheres to collagen and platelet surface receptors at high shear rate flow | Synthetic triple helical peptides and solid phase binding assay | [ | |||
| Platelet fibrinogen receptor (IIb/IIIa) | 72–76 | Controversial, biological significance unknown | Protein sequence | [ | ||
| Glycoprotein VI | 537–563 | Platelet membrane protein mediating collagen-induced aggregation | Platelet and GPVI receptor binding to collagens and triple helical peptides in solid phase assays | [ | ||
| Platelet-binding octapeptide, kindlin-3 | 502–509 | Mediates platelet adhesion independent of GpVI and GpIa/IIa | Platelet binding to collagen peptides in solid phase binding assay | [ | ||
| Platelet-derived growth factor (PDGF) | Motif not known | Unknown | C terminal part of collagen III | Binds to a tyrosine kinase receptor to stimulate cell growth | Solid phase binding assay | [ |
| Proteins of infectious agents | ||||||
| Langerin | N-terminal to triple helix | C-type lectin | Mass spectrometry | [ | ||
| YadA (Yersinia adhesion A) | 24–32 | Outer membrane | Solid phase binding to synthetic triple-helical | [ | ||
| AAEL010235 | 419–427 | Anticoagulant from saliva of blood sucking insect, Aedes aegypti; inhibits binding of vWF, glycoprotein VI and α2β1integrin | Solid phase binding assay | [ | ||
| Calin | Motif unknown | Unknown | About | Anticoagulant in leech saliva (Hirudo medicinalis) | Inhibits VWF binding to collagen III | [ |
| Leech antiplatelet agent (LAPP) | Bovine | 537–554 | Anticoagulant in | Inhibits VWF binding to collagen III | [ | |
| MIP (Macrophage | Motif unknown | Unknown | Unknown | Virulence factor found in Legionella | Solid phase binding assay | [ |
| Miscellaneous ligands | ||||||
| G protein-coupled receptor 56 (GPR56) | Motif unknown | Unknown | Unknown | Orphan G-protein coupled receptor, important in cortical development | Solution phase “pull down” binding assay | [ |
| PR47 (47 Kd platelet receptor for collagen III) | Motif unknown | Unknown | Unknown | Membrane receptor for collagen III similar to collagen I receptor | Solid phase binding assay | [ |
Fig 5Missense mutation distribution in collagen III.
Missense mutation frequency along the residues of the collagen III chain. This demonstrates the non-random distribution of variation.