Literature DB >> 34381990

Collagen fibril assembly: New approaches to unanswered questions.

Christopher K Revell1,2, Oliver E Jensen1,2, Tom Shearer1,3, Yinhui Lu2, David F Holmes2, Karl E Kadler2.   

Abstract

Collagen fibrils are essential for metazoan life. They are the largest, most abundant, and most versatile protein polymers in animals, where they occur in the extracellular matrix to form the structural basis of tissues and organs. Collagen fibrils were first observed at the turn of the 20th century. During the last 40 years, the genes that encode the family of collagens have been identified, the structure of the collagen triple helix has been solved, the many enzymes involved in the post-translational modifications of collagens have been identified, mutations in the genes encoding collagen and collagen-associated proteins have been linked to heritable disorders, and changes in collagen levels have been associated with a wide range of diseases, including cancer. Yet despite extensive research, a full understanding of how cells assemble collagen fibrils remains elusive. Here, we review current models of collagen fibril self-assembly, and how cells might exert control over the self-assembly process to define the number, length and organisation of fibrils in tissues.
© 2021 The Author(s).

Entities:  

Keywords:  Collagen; Diameter; Mechanical properties; Plasma membrane; Polymer length; Self-assembly; Tendon

Year:  2021        PMID: 34381990      PMCID: PMC8334717          DOI: 10.1016/j.mbplus.2021.100079

Source DB:  PubMed          Journal:  Matrix Biol Plus        ISSN: 2590-0285


Collagen assembly is a complex multiscale problem

Collagen fibrils appeared at the dawn of multicellular evolution [1], [2], [3], when cells acquired the ability to construct tissues using semi-rigid extracellular polymers to provide long-range mechanical connectivity and sites for cell attachment. These fibrils have evolved to be the primary tensile element in a wide range of tissues, where they are organized into architectural structures ranging from gels, to parallel bundles, and orthogonal lattices. Collagen fibrils are roughly cylindrical with two tapered tips [4], can be up to 500 nm in diameter, and centimetres in length [5]. The remarkable versatility of collagen to assemble into fibrils of different sizes and pack into a variety of tissue architectures explains how cells were able to construct the skeletons of dinosaurs exceeding 40 m in length, yet use the same fibrils to build a transparent cornea or the stapedius tendon, which is a mere 125 μm in length [6]. The fibrils range in abundance from a few percent in endocrine organs such as pancreas [7] to 15% in lung [8] and 90% of the mass of tissues such as tendon [9], [10]. Overall, collagen fibrils account for 25% of total protein in vertebrates [11]. Collagen self-assembly is an inherently multiscale process, in both space and time, that involves the aggregation of monomers to form larger-scale organised structures [Fig. 1]. A multiscale system integrates processes occurring at different length and time scales, making any attempt at understanding the system as a whole challenging. For example, collagen fibrils may be centimetres in length [5], and yet they are formed from monomers with a diameter of only ~1.5 nm; monomers can be produced by cells at a rate of 2x105 per hour [12], and yet crosslinked collagen structures survive for decades. Understanding the assembly and organisation of fibrils thus crosses 8 orders of magnitude in space and 4–5 orders of magnitude in time. Significant challenges exist in bridging this gap in scales. Progress has been made at the macro (large) scale in terms of understanding fibril and tissue mechanics, and at the micro (small) scale in the chemistry of collagen monomers, but the intermediate (mesoscale) regime remains poorly understood. However, the mesoscale regime contains some of the most interesting and fundamental problems in collagen fibril assembly. In this review, we will outline current understanding of collagen fibril assembly. We will first summarise the basic properties of collagen, before describing how in vitro studies, and corresponding mathematical models, have given hints at understanding its self-assembly. Finally, we will outline the limitations of this work, how in vivo collagen assembly differs from self-assembly observed in vitro, and how new approaches could further our understanding of this problem.
Fig. 1

Diagram to demonstrate how processes involved in the assembly of collagenin vivo cross multiple scales in both space and time. Lines between processes indicate interactions; those lines that are dotted show hypothesised or poorly understood links. The precise length and time scales of the growth and organisation of fibrils in vivo are unknown, as demonstrated by the faded edges of this process bubble, but it is clear that this particular step crosses several scales and is critical for in vivo development.

Diagram to demonstrate how processes involved in the assembly of collagenin vivo cross multiple scales in both space and time. Lines between processes indicate interactions; those lines that are dotted show hypothesised or poorly understood links. The precise length and time scales of the growth and organisation of fibrils in vivo are unknown, as demonstrated by the faded edges of this process bubble, but it is clear that this particular step crosses several scales and is critical for in vivo development.

The multiscale nature of collagen

Collagen is multiscale in space

Collagen monomers are ~300 nm length and ~1.36 nm in diameter in solution [13], with an effective packing diameter of 1.52 nm [14], and assemble in their millions to generate fibrils with a wide variety of diameters. The narrowest collagen fibrils occur in vitreous humor (~10.5–12.0 nm) [15] and cartilage, where “thin” (~16 nm) and thicker (~25–50 nm) fibrils occur [16] alongside hyaluronan and large proteoglycans in a fibre composite gel that resists compression. Relatively thin fibrils (~25 nm) can be found in cornea [17], where they are organized in an exquisite orthogonal lattice [18], [19] that is mechanically strong and yet completely transparent. In contrast, “thick” collagen fibrils occur in tendon, where the ability of the fibrils to transmit high forces is important. “Narrow” (~50 nm) fibrils are deposited by embryonic tenocytes into bundles that are parallel to the tendon long axis. However, soon after birth (in mouse) this unimodal distribution of diameters changes to a multimodal distribution with three peaks at 50 nm, 150 nm, and 250 nm [20]. Much is known about the diameter of fibrils in tissues, but relatively little is known about fibril length. The magnification required to resolve a single fibril in a tissue is such that tracking the fibril from one end to another would require a prohibitively large number of separate images, so direct measurement of fibril lengths in vivo is extremely challenging. Theoretical work based on the frequency of fibril tips in sample images has estimated mean fibril lengths on the order of 10 mm in rat tail [5]. However, the paucity of fibril ends (or tips) introduces large errors in this estimate. Other studies have observed whole fibrils in echinoderms (starfish) and found a maximum length of 600 μm [21]. To further complicate matters, fibrils can fuse by tip-to-tip interactions [22] and by tip-to-shaft interactions to generate Y-shaped branches [23]. Consequently, the question of “How long is a fibril?” is not simple to answer. It most probably varies with developmental stage, anatomical site, and the stage of growth of the fibril. However, if Y-shaped structures are common, then the fibril length is poorly defined. What is clear, however, is that collagen monomers can be assembled into structures several orders of magnitude longer than the monomers themselves.

Collagen is multiscale in time

In vertebrates, collagen fibril assembly begins when the vascular and musculoskeletal systems need to withstand hydrodynamic, compressive, and tensile loads. In mouse, fibril assembly begins at around 14.5 days after fertilisation (embryonic day 14.5 or E14.5) [24], and around embryonic day 10 in chick [25]. By the end of embryonic development, just 9 days later in mice, the fibrils have grown in number and length sufficiently to transmit forces through tendons and ligaments [Fig. 4] for locomotion, and establish the fibrous framework of organs including liver, lung, blood vessels, and skin. This underscores that assembly and organisation of collagen can occur relatively quickly. However, studies in mice have shown that the three-dimensional organization of fibrils that was established during embryonic and early postnatal growth remains unchanged after birth [26]. Thus, the collagen fibril bundles in tendon and ligament, basketweaves in skin and blood vessels, and orthogonal lattices in cornea, form stable, lifelong frameworks, maintained over orders of magnitude longer than the timescales of their original assembly.
Fig. 4

Electron micrograph showing a cross section of mouse tail tendon at embryonic 18.5 days. Collagen fibrils appear as bundles of regularly-spaced dark spots, demonstrating the perfect alignment and high degree of spatial order in the long-range organisation of tendon fibrils.

Not only is the collagen matrix effectively permanent, but the constituent fibrils are too. It has been shown that the collagen in fibrils in tendon and cartilage remain throughout life without renewal or turnover [27], [28], [29]. This means that the collagen molecules that are assembled into fibrils during embryonic growth are the same molecules that are in the tissue when the organism dies, perhaps decades later. The high stability of collagen is also apparent in collagens that have been extracted from the remains of extinct animals preserved in permafrost [30], [31], where other proteins and nucleic acids have long since degraded. The permanency of collagen has advantages: for example, the fibrils provide a stable framework that shields cells against damaging forces. It is estimated that the tendons in a galloping horse experience ~16 kN per kilogram of body mass [32], forces that would readily destroy the resident tenocyte cells if it were not for the stressshielding properties of the collagen fibrils that make up the majority of the mass of the tissue. However, the permanence of fibrils means that if they are damaged or removed by injury or disease, they cannot be replaced in a like-for-like manner. For example, during wound-healing, the new collagen fibrils that are deposited have a different organization and packing compared to the surrounding tissue, leading to the appearance of a scar. These observations demonstrate that collagen crosses orders of magnitude in time, much as it does in space.

Molecular structure and spatial organisation of collagen

Collagen nomenclature

The criteria for classifying a protein as a collagen have not been established precisely. All collagens comprise three polypeptide chains (called α-chains) wound into a supercoiled triple helix in which each α-chain is formed from repeating Gly-X-Y triplets [33], a residue of glycine, frequently combined with imino acids proline and hydroxyproline [34], [35]. As with all proteins, these polypeptide chains have an N- and C- terminal end; in collagens, the triple helix is formed by association of the C-termini and zippering of the helix in a C to N direction [36]. However, not all proteins that contain a triple helix are considered collagens [37]. It is generally agreed that there are 28 distinct “types” of collagen in vertebrates [37], [38], but almost 200 in C. elegans [39]. Collagens can be homotrimers comprising α-chains encoded by a single gene or heterotrimers comprising two or three different types of α-chain encoded by different genes. Each α-chain is identified by an Arabic number and a Roman numeral in parentheses. For example, type II collagen has three identical α1(II) chains and type III collagen has three identical α1(III) chains, encoded by genes Col2a1 and Col3a1, respectively. In the gene nomenclature, the first Arabic number refers to the collagen “type” and the second Arabic number refers to the α-chain. Some collagens are heterotrimers of two or three different α-chains. For example, type I collagen is a heterotrimer of α1(I)2α2(I) encoded by two genes, Col1a1 and Col1a2. Type V collagen is an example of a heterotrimeric collagen that can exist in two forms: α1(V)2α2(V) and α1(V)α2(V)α3(V).

Collagen molecules are synthesized as procollagens

The classical fibril-forming collagens are types I, II, III, V, and XI, with collagens XXIV and XXVII included on the basis of structural similarities. Analysis of protein-coding variations in the human genome showed that genes encoding collagens I, II, III, V, XI and XXVII are intolerant to protein-truncating variants [40], which definitively illustrates the requirement of collagen fibrils for life. An important feature of fibrillar collagens is that they are synthesized as procollagens, containing an uninterrupted triple helix with globular domains at each end. These globular domains are the N- and C-propeptides, which are removed by procollagen N-proteinases and C-proteinases, respectively [Fig. 2a]. There are three N-proteinases: ADAMTS-2, −3, and −14 [41]. It has been shown that the N-proteinases exhibit maximal activity when procollagen is properly folded into a triple helix [42]. In fact, mutations in collagen genes that change the structure of the triple helix (even hundreds of residues C-terminal to the N-proteinase cleavage sites) can stop cleavage of the N-propeptides [43], [44] and result in abnormal fibril assembly [45]. It was postulated in these studies that as the procollagen triple helix folds in a C to N direction, changes in chain alignment caused by amino acid substitutions or deletions would be propagated to the N-terminus and change the conformation of the procollagen molecule where N-proteinase cleaves. The reason why N-proteinases require a triple helical conformation of procollagen molecule is unclear. Electron microscope evidence suggests that the N-propeptides of type I procollagen are in a “bent back” conformation [46], which might present the scissile bonds to each of the three α-chains to N-proteinase. Whether or not the N-propeptides of type II and III procollagen are also ’bent back’ needs to be confirmed. These observations point to the N-propeptides having more than simply a propeptide function to help keep procollagen soluble, but might instead suggest an important role in controlling fibril assembly. In contrast, the C-proteinases (which are members of the BMP1/mTld family) have a broad substrate specificity not restricted to procollagens [47].
Fig. 2

a, Diagram of procollagen cleavage to collagen by removal of the N- and C-propeptides from a monomer. b, Diagram showing two possible distribution outcomes for 24 collagen monomers.

a, Diagram of procollagen cleavage to collagen by removal of the N- and C-propeptides from a monomer. b, Diagram showing two possible distribution outcomes for 24 collagen monomers. Following the removal of propeptides, each end of the collagen triple helix terminates at a set of short extra-helical peptides, known as telopeptides [Fig. 2a]. These non–helical strands are critical for normal fibril assembly, and are involved in collagen cross-linking, mediated by the lysyl-oxidase enzyme (LOX), which permanently binds adjacent monomers together. Proteolytic removal of telopeptides has been shown to inhibit collagen I fibril self-assembly [48], and adding synthetic telopeptide-like peptides to a self-assembly system inhibits fibril assembly by blocking binding between telopeptides and neighbouring monomers [49]. The presence of a propeptide (or proprotein) domain that renders a protein inactive is typical of secreted proteinases, growth factors and hormones. The presence of two propeptides, as in procollagen, is unusual. Given the impact of propeptide cleavage on assembly, it is possible that having two propeptides, and different families of proteinases to remove them, provides spatial and temporal control of fibril assembly. Removal of the C-propeptides, to generate pNcollagen, dramatically decreases the solubility of procollagen and is essential for fibril formation [50]. Removal of the N-propeptides is not essential for fibril formation. In the case of type I procollagen, retention of the N-propeptides can modulate the shape of fibrils in vitro [51] and in vivo [46]. Retained N-propeptides of type I and III collagen have been detected in human skin [52].

Collagen fibrils are “molecular alloys”

Collagen fibrils are heterotypic polymers comprising 3 main components: a “major” fibrillar collagen, specifically type I or II collagen; a “minor” fibrillar collagen such as type V in cornea [53] or type XI in cartilage; and a FACIT (Fibril Associated Collagen with Interrupted Triple helices) collagen such as type IX, XII, or XIV, which can also bind a wide range of soluble and membrane-associated molecules [54], [55], [56]. Absence of type I [57] or II [58] collagen leads to major skeletal deficiencies and is incompatible with life; mutations in genes encoding these collagens result in severe skeletal disorders, notably (but not exclusively) osteogenesis imperfecta and achondrogenesis, respectively. Mutations in genes encoding type V and XI collagens cause the Ehlers-Danlos and Stickler/Marshall syndromes, in which affected individuals can have a range of skeletal and occular problems [59]. Studies in developing mouse have shown that the core fibrillar network in cartilage is a cross-linked copolymer of collagens II, IX, and XI [60]. Of special interest, autosomal recessive chondrodysplasia (cho) in mice lacking collagen XI affects cartilage in limbs, ribs, mandibles, and trachea, and is accompanied by the absence of thin fibrils and the appearance of thick fibrils [61]. The thin fibrils have an exquisite 10 + 4 microfibril structure with the inner core and outer shell of microfibrils tilted by ~3° to the fibril axis [62]. The absence of thin fibrils in the cho/cho mouse suggests that collagen XI is either required to initiate the assembly of the thin fibrils, or has a major role in limiting the lateral growth of fibrils. In fibrous, mineralized and vascular tissues, the fibrils are predominately type I collagen with smaller amounts of type III and V collagen, and can have either type XII or XIV at their surfaces [63]. Studies have shown that collagen-I containing fibrils do not form in the absence of collagen V in vivo [64], which illustrates the importance of type V collagen in the assembly of type I collagen-containing fibrils. Collagen XII- and XIV-null mice demonstrate delayed endothelial maturation [65], and mutations in Col12a1 cause myopathic Ehlers-Danlos Syndrome with a clinical phenotype involving both joints and tendons [66]. The fibrils can also bind small proteoglycans including decorin [67], [68], fibromodulin [69], and lumican [70], with important roles in controlling fibrillar alignment and size [71]. These studies have shown that fibrils formed in vivo are “molecular alloys” (the term first coined by [72]) comprising a major, minor, and FACIT collagen. The ability of type I collagen to co-polymerise with types III and V, and for type II collagen to co-polymerise with collagens types XI and III [73], and for these fibrils to bind a variety of molecules at their surfaces, helps to explain the versatility of fibrils as the primary scaffolding component of tissues. However, precisely how the multitude of combinations are controlled in vivo remains unclear.

Long-range organization of fibrils

The three-dimensional organization of the fibrils within a tissue plays a crucial role in determining its mechanical properties, and this organisation varies radically from tissue to tissue. The orientation, degree of alignment, cross-link density, volume fraction, and relative length distribution of the fibrils all contribute to the differing mechanical needs of each tissue. The orientation of the fibrils determines what kinds of loads the tissue can resist effectively. In tendons, the fibrils are strongly aligned with the tendon’s longitudinal axis to withstand uniaxial tensile loading [74]; in arteries, fibrils are arranged helically in lamellar units with alternating chirality [75] to resist radial deformations; in skin, fibrils exhibit a high degree of dispersion, which provides resistance to deformations in multiple directions [76]. In some ways, these tissues behave, mechanically, like modern, man-made, fibre-reinforced composite materials, which are commonly produced using glass, carbon, or aramid fibres for engineering applications. In both cases, the fibres are the main load-bearing components as they have a much higher tangent modulus than the matrix in which they are embedded. Uniaxial composites (like tendons) display transversely isotropic material behaviour, whereas laminated materials (like arteries) are orthotropic. As a result, many of the mathematical methods that are used to model these materials [77] can be borrowed and adapted to model soft tissues. In contrast to these engineering composites, in which the fibres are often embedded in a relatively stiff, solid matrix such as cured epoxy, however, collagen fibres are embedded in a proteoglycan-rich, viscous matrix, which enables the fibres to slide relative to each other [78]. To mimic this structure, biomaterials scientists have created soft composites, consisting of electrospun [79], solution blow spun [80], or three-dimensionally printed [81] polymer microfibres embedded in a hydrogel matrix. The fibres provide mechanical reinforcement, whilst the hydrogels can be loaded with biological agents to improve biocompatibility and allow fibre sliding. The relative length distribution of the fibrils directly determines the shape of a tissue’s stress–strain curve [76], [82], [83]. Within a fixed length of tissue at rest, the fibrils all have an arc length that is longer than the section of tissue, with this extra length being accommodated by fibril tortuosity, or crimp. Each fibril has a slightly different arc length to the others, producing a distribution. As a tissue is stretched, its fibrils straighten one-by-one and contribute to the stiffness of the tissue only once taut. This process is often called collagen recruitment [84], [85], and gives rise to a J-shaped stress–strain curve that is characteristic of all soft tissues. By manufacturing solution blow spun biomaterials with a distribution of wavy fibres, it has recently become possible to reproduce this J-shaped stress–strain behaviour [80]. The role of cross-link density in collagen fibrils is clear: the higher the cross-link density, the stiffer the tissue [86]. Fibril volume fraction is also thought to affect tissue mechanics, but the relationship is less clear. Theoretically, since collagen is generally much stiffer than other extracellular matrix components, the larger the ratio of collagen to non-collagen, the stiffer the tissue should be [87]. Mathematical models often assume that the amount of strain energy stored by the collagenous and non-collagenous parts of the extracellular matrix is linearly proportional to their respective volume fractions [88], [89]. It is extremely difficult to test experimentally whether this theory holds in soft tissues, however, as this would require the production of tissue samples that differ only in their collagen fibril volume fraction, without also affecting other factors such as the relative length distribution. A large and continually developing body of work has begun to disentangle the relationships between collagen architecture and tissue mechanical function, but the mechanisms by which the cells produce such complex three-dimensional structures remain largely unknown. However, it is reasonable to suppose that the lengths, diameters, and organisation of fibrils play a crucial role in determining tissue mechanical properties, and therefore cells must exert careful control of these factors during development.

Collagen self-assembly

Self-assembly in vitro

The nucleation and growth of collagen fibrils can occur in a minimal system of purified collagen molecules in a warm, neutral buffer [90], [91], and indicates an intrinsic tendency to self-assemble. Warming a cold acetic acid solution of collagen, followed by neutralisation, results in an accumulation of early collagen fibrils with smoothly tapered tips when less than 1% of the collagen has assembled [90]. These fibrils are unipolar, meaning the N-to-C orientations of all monomers are aligned [Fig. 3b]. They are typically ~20 nm in diameter and ~5 μm in length. A diameter limitation is evident at this early stage of growth, as the tips show a gradual flattening of the mass profile [Fig. 3a], but lack the abrupt diameter limitation observed in the tips of collagen fibrils formed in embryonic tissues [92]. These early fibrils can readily fuse as their concentration increases and this process contributes to an increase in fibril size leading to the broad range of diameters in the final gel, typically 20–80 nm [90]. Growing short reconstituted fibril seeds in a dilute collagen solution avoids inter-fibrillar fusion, allowing individual fibrils to grow in length at uniform diameter [93]. The fibril assembly pathway is critically dependent both on the intactness of the telopeptides [94] and also on the order of warming and neutralisation of the initial cold, acid solution of collagen: neutralisation followed by warming leads to an accumulation of thin filaments in the early stages of fibril assembly, rather than short fibrils with tapered ends [90].
Fig. 3

a, Comparison of fibril mass profiles in vivo and in vitro. b, Image of a single collagen fibril formed in vitro. The N and C ends of the fibril are deduced by analysis of the D-periodic staining pattern. This fibril is unipolar with a length of 6 μm.

a, Comparison of fibril mass profiles in vivo and in vitro. b, Image of a single collagen fibril formed in vitro. The N and C ends of the fibril are deduced by analysis of the D-periodic staining pattern. This fibril is unipolar with a length of 6 μm. An alternative fibril assembly system, to match the steps occurring in tissue, is based on a starting solution of pCcollagen and C-proteinase [95]. Fibrils assemble along with the enzymatic cleavage of the pCcollagen to form the collagen monomer. These fibrils are found to have two polarised tips with C-termini of the molecules in both tips pointing to a point of maximum diameter in the fibril where molecules are in anti-parallel register [96]. Fibrils of this polarity type, with two N-terminal tips and a central region of polarity reversal, have subsequently been identified in tissue and are described as N,N bipolar fibrils [97], [98]. Despite the symmetry of molecular polarity in the two tips, the fibrils generated in this cell-free system have a shape polarity with a relatively coarse α-tip and fine β-tip. The tips show a near uniform increase in mass per unit length with distance from the end [4]. The gradients of mass per unit length are found to have a dependence on the C-proteinase to pCcollagen ratio used in the assembly system, with higher enzyme:substrate ratios giving rise to greater mass slopes at the tips [99]. Evidence of surface nucleation in collagen fibril growth was obtained from a simple seeding experiment where fibril length fragments from chick embryonic tendon were used as seeds in a solution of type I collagen [100]. Almost all fibril fragments initially had square broken ends after the mechanical extraction procedure, but ~95% of these formed spurs of growth in the solution of collagen monomer, which slowly elongated over several hours to form smoothly tapered tips.

Self-assembly in silico

Theoretical and computational models provide important tools in understanding the mechanisms of complex systems, with the ultimate aim being to formulate a complete multiscale model of a given system. For collagen assembly, the inherent diversity of length and time scales in the hierarchical self-assembly process makes a full multiscale theoretical understanding particularly challenging [Fig. 1]. This problem motivates a variety of theoretical approaches; each approach typically addresses a specific scale in length and time. An organising principle acting across all scales is the idea of free energy minimisation. An energy that reflects the intermolecular interactions, which could be electrostatic, covalent or otherwise, plus any external constraints, such as spatial confinement, can be associated with any configuration of monomers. Self-assembly involves changes in molecular configurations to lower free energy states, against a viscous drag from the environment, generally in the presence of noise associated with Brownian motion. There are likely to be multiple minima in the free energy landscape, the majority representing disorganised configurations, and a fundamental question is how the self-assembly process can be driven towards those few states with high spatial organisation. Existing work on macromolecular assembly has typically involved explicit, discrete Brownian dynamics, Monte-Carlo, or molecular dynamics models [101], [102]. Collagen can be modelled as a semi-flexible diffusive rod using a discrete worm-like-chain model [103] with Brownian motion. However, the narrow diameter of a collagen triple helix relative to the scale of a fibril ensures that any explicit simulation of assembly will impose a large computational cost; very small spatial perturbations could result in very large changes in force for neighbouring monomers, requiring very small time steps in system integration. Such small time steps make simulating the diffusion of a monomer over the lengthscale of a fibril challenging. This computational cost limits the scale of such simulations to small systems in space, or short simulations in time. For example, explicit modelling of collagen triple helix folding has been performed using the OpenMM molecular dynamics library [104], but such systems require only 3 individual worm-like-chains [105]. Modelling larger systems comes with a penalty in simulation length. This issue is exemplified by the work of McCluskey et al. [106], who use the LAMMPS molecular dynamics library [107] to model the in vitro self-assembly of collagen. Each collagen monomer is treated as a flexible chain of 200 1.5 nm beads. This chain is divided into nine regions, with even-numbered segments being slightly longer (37.5 nm) than the odd-numbered ones (30 nm), and attractive interactions between even segments in separate monomers, attractive interactions between odd segments, and repulsive interactions between even and odd segments. Monomers are predicted to aggregate first into dimers and trimers, then into long, narrow filaments a few monomers in cross-section. Thus they predict that axial growth precedes lateral growth, and that monomers aggregate into initially disordered fibrils, which slowly cystrallize into an ordered state with a clear D-band pattern. Accompanying experiments reveal the timescale of the process, showing a rise in turbidity over 20–40 min as monomers in solution assemble and crystallize into fibrils, and the model shows some success in matching this behaviour. However, this model is subject to the computational limits discussed previously, and is only able to model relatively small assembly systems for a short period of time. Furthermore, it gives no consideration to cross-linking, despite experiments in the paper demonstrating that telopeptides are required for assembly. These computations [106] represent the current state of the art in simulating collagen self-assembly. Similar work, also using LAMMPS, has studied assembly of “collagen-mimetic” rods [108], but neglects specific details of collagen itself to achieve simpler dynamics. Other recent studies using coarse-grained molecular dynamics have explored the influence of axial stretch on the organisation of the assembled fibril [109], but without simulating the origin of this organisation. Additional work has studied the self-assembly of rod-like particles [110], and of collagen specifically [111], [112], by diffusion limited aggregation, but it is notable that such principles lead to branching structures that lack the hallmarks of collagen fibril self-assembly. It is worth comparing modelling work on collagen assembly to the broad literature of theoretical work on actin assembly [113], microtubule assembly [114], [115], and amyloid plaque formation [116]. These systems appear similar: all involve the formation of filamentous structures from smaller subunits. However, collagen is again distinguished by its multiple lengthscales. Actin, microtubules, and amyloid plaques form much shorter fibres from globular subunits that lack the enormous aspect ratio of a collagen monomer. These differences make such systems more tractable in traditional particle-based modelling.

Collagen assembly in vivo

Assembly in vitro does not explain in vivo observations

We have seen how past work has begun to elucidate the mechanisms of in vitro collagen assembly. However, complicating matters further, fibrils formed by self-assembly in vitro differ from those observed in vivo. For example, in vivo fibrils show a distinct radius limitation that is not observed in vitro, as was discussed previously and shown in Fig. 3a. Additionally, collagen self-assembled in vitro forms a randomly oriented gel, in which there is no control of fibril length, number, or orientation. A given number of collagen monomers could form any distribution of fibril number, lengths, and diameter [Fig. 2b]; we have previously discussed how these properties are critical in determining tissue mechanics [76], and that in vivo fibrils are very precisely distributed, for example into parallel bundles in tendon [Fig. 4]. We are thus left with a number of unanswered questions: How is fibril diameter controlled in vivo? What determines the number of fibrils in vivo? What determines the lengths of these fibrils? How are fibrils oriented correctly for a given purpose? Given the additional differences between in vivo and in vitro assembly, novel approaches will be required to answer these questions. Electron micrograph showing a cross section of mouse tail tendon at embryonic 18.5 days. Collagen fibrils appear as bundles of regularly-spaced dark spots, demonstrating the perfect alignment and high degree of spatial order in the long-range organisation of tendon fibrils. Additional insights into the molecular regulation of collagen fibril assembly in vivo have come from studies of the Ehlers-Danlos syndrome (EDS), which is a heterogeneous group of connective tissue disorders featuring hyperextensible, fragile, easy bruising skin, joint hypermobility, and abnormal wound-healing. The majority of the genes linked to EDS encode the fibrillar collagens types I, III and V, and enzymes that modify procollagen including ADAMTS2 [117]. These studies showed a direct cause and effect of structural changes in type I-containing collagen fibrils and the mechanical properties of tissues. Tenascin-X is an extracellular matrix glycoprotein found in skin, muscle, tendons and blood vessels, and was the first non-collagen gene shown to cause classical-like EDS [118]. Mao and colleagues showed that the skin of tenascin-X deficient mice had 40% reduction in fibril numbers. Solving the conundrum of how tenascin-X helps to regulate collagen fibril number will be an important step forward in understanding the complexity of cellular control of collagen fibril formation in vivo.

The plasma membrane is the limit of direct cellular control

Uncontrolled self-assembly of collagen fibrils is a combination of chemistry and physics. The results of uncontrolled self-assembly are observed in vitro; the particular issue of in vivo assembly now becomes a question of how cells can influence the environment around them to change the physical and chemical landscape such that ordered fibril development will occur. Whilst the procollagen and collagen molecules are transiting through the secretory pathway the cell has full control of pH and other solution conditions that can influence protein assembly and aggregation. Therefore, even though the procollagen can be processed to pNcollagen, pCcollagen and collagen during transport [24], [119], [120], the cell can exert active negative control of collagen fibrillogenesis [121]. How the cell achieves this is unknown. What is clear, however, is that the plasma membrane is the limit of the cell’s ability to directly control the collagen it produces, and is the final point from which it can influence its environment and must therefore exert control over collagen fibril assembly, fibril number, and matrix organization. The suggestion that the plasma membrane is the site of collagen fibril formation is not new. In the 1940s, fibres were observed at the cell surface [122], and in the 1970s, collagen fibrils were seen in plasma membrane invaginations of embryonic fibroblasts [123]. With improvements in image handling software [124], the commercialization of serial block face-scanning electron microscopy (SBF-SEM) [125], and the development of suitable preparation protocols [126], it has been possible to obtain volumetric three-dimensional reconstructions of embryonic tendon [127] and cornea [18] showing collagen fibrils in plasma membrane invaginations, known as fibripositors. Fibripositors exhibit a range of morphologies, from simple invaginations of the plasma membrane (recessed fibripositors) to finger-like projections (protruding fibripositors) [127]. They are actin-dependent structures and usually contain just one fibril tip. Although transmission EM and SBF-SEM provide high resolution images of fibripositors, the samples are chemically fixed, dried and embedded in resin prior to imaging. Consequently, it is challenging to perform time-resolved studies to obtain a detailed insight into the role, or roles, of fibripositors. For example, we do not know their rate of formation, if they are static or dynamic structures, or if they are exclusively involved in fibril assembly at the exclusion of fibril turnover. More recently, using CRISPR-Cas9 to tag the proα2(I) chain of type I collagen with photoswitchable Dendra2, it has been possible to image, live, collagen fibril synthesis by fibroblasts in culture [128]. All the fibrils synthesized by the cells were attached to the plasma membrane. Also, cells migrating over collagen fibrils pull and align the fibrils to facilitate end-to-end fusion, which had been suggested to be a mode of rapid growth in length of fibrils [129]. In addition to directly interacting with fibrils at the plasma membrane via fibripositors, cells are able to exert indirect influence over longer distances. For example, it has been shown that cells can remodel existing collagen fibril networks around them to transmit forces and communicate with other cells simply by exerting stress on nearby fibrils [130], [131], [132]. Such stresses are propagated through a fibril network by steric interactions with other fibrils. These results give hints as to how cells control in vivo fibril assembly, but we still lack a comprehensive understanding that integrates all factors and scales.

New approaches to unanswered questions

We have previously discussed how collagen assembly is an inherently multiscale process for which a single theoretical framework may never be sufficient to capture all aspects of the system. This is especially true in vivo. We saw how, in past work, explicitly modelling collagen monomers in Brownian dynamics and Monte-Carlo simulations helped to elucidate the behaviour of collagen self-assembly in vitro. We also recognized the limitations of such systems to capture the length scales of both a single monomer and a large fibril due to their computational cost. This cost poses a barrier to using such techniques to study assembly for an in vivo system, in which effects on the scale of cells and tissue cannot be neglected. Just as one need not track every molecule in the air to understand the weather, considering larger lengthscales allows us to avoid explicitly modelling the behaviour of individual collagen monomers. Instead, we can describe properties of the system as functions that vary continuously in space and time. For example, the work of Rutenberg and colleagues [133], [134], [135], [136], [137] has made extensive use of mesoscopic continuum models, inspired by liquid crystals, to study the organisation of monomers within a fibril. Instead of resolving individual monomers, their properties, such as orientation, are described with functions that vary smoothly with position in a fibril. For example, using a so-called “director field” [133], a function that captures molecular orientations from the core of a fibril towards its periphery. Such continuum models require assumptions such as cylindrical symmetry. Balancing energetic penalties for distortions of the molecular array with a surface free energy allows fibril radius to be related to the twist angle of monomers arranged helically relative to the fibril axis [134]. The same group [133] also used a phase field crystal model to represent D band density modulation, the axially periodic striations of fibrillar collagen, demonstrating the coexistence of fibrils of different sizes and distinct structures, indicating an underlying phase transition. A further refinement of this approach incorporates enzymatic cross-linking that is confined to the fibril surface, assuming the enzyme LOX is too large to penetrate beneath, resulting in a core where the director field is insensitive to the D-band surrounded by a shell that is strongly coupled to the D band, with fibril growth being controlled by the availability of collagen outside the fibril [137]. While the energies underpinning these models remain an empirical representation of the underlying intermolecular interactions, these models provide promising mechanistic explanations for internal fibril structure and radius control. However, these studies remain agnostic about the precise mechanisms of assembly, and do not answer the question of how multiple fibrils assemble and organise in vivo to form tendons and other tissues. By moving up in lengthscale again, we can consider macroscopic models that address the organisation and properties of bundles of fibrils in the extracellular space. The organisation of fibrils in the extracellular space on lengthscales comparable with the size of individual cells are described by a variety of macroscale models, ranging from organised bundles of fibrils in tendon to more disordered matrix structures seen in in vitro systems. In the former category, the diffusion of pCcollagen from the periphery of a fibre bundle towards its centre was addressed by Rutenberg et al. [138], who showed that diffusion must be sufficiently rapid for fibril growth to be homogeneous across the bundle. They identified a threshold monomer concentration, determined in terms of uptake rate and diffusion coefficient, necessary for homogeneous fibril expansion to take place. The origin of organization within disordered matrices has been described with both discrete and continuum approaches. For example, discrete approaches have been used to predict the reorganisation of a random fibre network in response to cellular stresses using a computational representation of individual filaments and calculating their interactions explicitly [132]. Work such as this has shown how random fibre networks can be remodelled to produce dense, parallel bundles in response to stress. Continuum methods can approach these problems by describing a disordered matrix as a continuous function that varies in space and time. For example, Grekas and colleagues [139] used a continuum approach to demonstrate that the strongly nonlinear relation between stress and strain for individual fibrils translates at the network level to a mechanical phase transition, with tension applied by cells forming bundles of laterally compressed fibres within the matrix. A network of fibrils can thereby reorganise into “tethers” that radiate out from a contracting cell, allowing the cell to remodel its environment. Continuum and discrete network models can also be combined, as in recent work by Ban and colleagues [140], showing how remodelling into dense fibril tethers may be mediated by crosslinks. These are examples of macroscopic self-assembly, orchestrated by fibroblasts but exploiting mechanical and biochemical interactions, which can extend over appreciable distances. This work on tethers raises the questions: where does a fibril end and a tether begin? Could similar processes be at work in fibril assembly in vivo? The answers to such questions remain unclear, but novel approaches hold great promise in providing new answers to old problems.

Competing interests

The authors declare no competing interests.
  122 in total

Review 1.  Collagen binding proteins.

Authors:  L Svensson; A Oldberg; D Heinegård
Journal:  Osteoarthritis Cartilage       Date:  2001       Impact factor: 6.576

Review 2.  Tip-mediated fusion involving unipolar collagen fibrils accounts for rapid fibril elongation, the occurrence of fibrillar branched networks in skin and the paucity of collagen fibril ends in vertebrates.

Authors:  K E Kadler; D F Holmes; H Graham; T Starborg
Journal:  Matrix Biol       Date:  2000-08       Impact factor: 11.583

3.  Relationship between muscle forces, joint loading and utilization of elastic strain energy in equine locomotion.

Authors:  Simon M Harrison; R Chris Whitton; Chris E Kawcak; Susan M Stover; Marcus G Pandy
Journal:  J Exp Biol       Date:  2010-12-01       Impact factor: 3.312

4.  Polymorphism of stable collagen fibrils.

Authors:  Samuel Cameron; Laurent Kreplak; Andrew D Rutenberg
Journal:  Soft Matter       Date:  2018-06-13       Impact factor: 3.679

5.  Decorin interacts with fibrillar collagen of embryonic and adult human skin.

Authors:  R Fleischmajer; L W Fisher; E D MacDonald; L Jacobs; J S Perlish; J D Termine
Journal:  J Struct Biol       Date:  1991-02       Impact factor: 2.867

6.  Targeted disruption of decorin leads to abnormal collagen fibril morphology and skin fragility.

Authors:  K G Danielson; H Baribault; D F Holmes; H Graham; K E Kadler; R V Iozzo
Journal:  J Cell Biol       Date:  1997-02-10       Impact factor: 10.539

7.  Cells exploit a phase transition to mechanically remodel the fibrous extracellular matrix.

Authors:  Georgios Grekas; Maria Proestaki; Phoebus Rosakis; Jacob Notbohm; Charalambos Makridakis; Guruswami Ravichandran
Journal:  J R Soc Interface       Date:  2021-02-17       Impact factor: 4.118

8.  Giantin is required for intracellular N-terminal processing of type I procollagen.

Authors:  Nicola L Stevenson; Dylan J M Bergen; Yinhui Lu; M Esther Prada-Sanchez; Karl E Kadler; Chrissy L Hammond; David J Stephens
Journal:  J Cell Biol       Date:  2021-05-04       Impact factor: 10.539

9.  Two type XII-like collagens localize to the surface of banded collagen fibrils.

Authors:  D R Keene; G P Lunstrum; N P Morris; D W Stoddard; R E Burgeson
Journal:  J Cell Biol       Date:  1991-05       Impact factor: 10.539

10.  OpenMM 7: Rapid development of high performance algorithms for molecular dynamics.

Authors:  Peter Eastman; Jason Swails; John D Chodera; Robert T McGibbon; Yutong Zhao; Kyle A Beauchamp; Lee-Ping Wang; Andrew C Simmonett; Matthew P Harrigan; Chaya D Stern; Rafal P Wiewiora; Bernard R Brooks; Vijay S Pande
Journal:  PLoS Comput Biol       Date:  2017-07-26       Impact factor: 4.475

View more
  5 in total

1.  Bayesian inference on a microstructural, hyperelastic model of tendon deformation.

Authors:  James Haughton; Simon L Cotter; William J Parnell; Tom Shearer
Journal:  J R Soc Interface       Date:  2022-05-18       Impact factor: 4.293

2.  Collagen Scaffolds Treated by Hydrogen Peroxide for Cell Cultivation.

Authors:  Yuliya Nashchekina; Pavel Nikonov; Nataliya Mikhailova; Alexey Nashchekin
Journal:  Polymers (Basel)       Date:  2021-11-26       Impact factor: 4.329

3.  3D Printing of Monolithic Proteinaceous Cantilevers Using Regenerated Silk Fibroin.

Authors:  Xuan Mu; Constancio Gonzalez-Obeso; Zhiyu Xia; Jugal Kishore Sahoo; Gang Li; Peggy Cebe; Yu Shrike Zhang; David L Kaplan
Journal:  Molecules       Date:  2022-03-26       Impact factor: 4.411

4.  Lysyl oxidase-like 1 deficiency alters ultrastructural and biomechanical properties of the peripapillary sclera in mice.

Authors:  Lauren K Wareham; John Kuchtey; Hang-Jing Wu; Evan Krystofiak; Yusheng Wu; Cynthia A Reinhart-King; Rachel W Kuchtey
Journal:  Matrix Biol Plus       Date:  2022-08-21

Review 5.  Molecular Basis of Pathogenic Variants in the Fibrillar Collagens.

Authors:  Allan J Richards; Martin P Snead
Journal:  Genes (Basel)       Date:  2022-07-04       Impact factor: 4.141

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.