| Literature DB >> 21931622 |
David I Spivak1, Tristan Giesa, Elizabeth Wood, Markus J Buehler.
Abstract
Materials in biology span all the scales from Angstroms to meters and typically consist of complex hierarchical assemblies of simple building blocks. Here we describe an application of category theory to describe structural and resulting functional properties of biological protein materials by developing so-called ologs. An olog is like a "concept web" or "semantic network" except that it follows a rigorous mathematical formulation based on category theory. This key difference ensures that an olog is unambiguous, highly adaptable to evolution and change, and suitable for sharing concepts with other olog. We consider simple cases of beta-helical and amyloid-like protein filaments subjected to axial extension and develop an olog representation of their structural and resulting mechanical properties. We also construct a representation of a social network in which people send text-messages to their nearest neighbors and act as a team to perform a task. We show that the olog for the protein and the olog for the social network feature identical category-theoretic representations, and we proceed to precisely explicate the analogy or isomorphism between them. The examples presented here demonstrate that the intrinsic nature of a complex system, which in particular includes a precise relationship between structure and function at different hierarchical levels, can be effectively represented by an olog. This, in turn, allows for comparative studies between disparate materials or fields of application, and results in novel approaches to derive functionality in the design of de novo hierarchical systems. We discuss opportunities and challenges associated with the description of complex biological materials by using ologs as a powerful tool for analysis and design in the context of materiomics, and we present the potential impact of this approach for engineering, life sciences, and medicine.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21931622 PMCID: PMC3169555 DOI: 10.1371/journal.pone.0023911
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Illustration of multiscale hierarchical structure of protein materials, a summary of multiscale modeling and experimental tools, and an analogy to music (figure adapted from [).
In protein materials (left for the example of spider silk), multifunctional materials are created via the formation of hierarchical structures. The synergistic interaction of structures and mechanisms at multiple scales provides the basis for enhanced functionality of biological materials despite the reliance on few distinct building blocks. Similar to the case of protein materials is music (right), where universal elements such as basic wave forms or a set of available instruments are used in hierarchical assemblies to provide macroscale functionality, and eventually a particular orchestral sound (e.g. a symphony). Universality tends to dominate at smaller levels, whereas diversity is found predominantly at larger, functional levels. The integrated use of computational and experimental methods at multiple scales provides a powerful approach to elucidate the scientific concepts underlying the materiomics paradigm (center).
Figure 2Simple examples of transformations preserving structure in category theory.
Categories consist of objects and arrows which are closed under composition and satisfy certain conditions typical of functions. ψ is a structure-preserving transformation (or covariant functor, or morphism of categories) between the two categories. If the categorical objects in this example are considered as sets of instances, then each instance of the set ‘A man’ is mapped to an instance of the set ‘A tennis ball’. This concept applies to all objects and arrows in the categories.
Figure 3Visualization of protein filaments considered here, and abstraction of how key functional properties (here: mechanical properties under axial extension) can be understood based on the interplay of a set of “building blocks”.
A, Overview over fundamental building blocks of our protein materials. The protein materials considered here are composed of a linear arrangement of three elements, “bricks”, “glue”, and in some cases “lifeline”. Thereby as a design rule, brick and glue need to alternate in order to achieve a stable structure. That is, two brick or glue elements immediately next to each other would not stick together – the chemical reason is that bricks represent the protein's polypeptide backbone and glue represents H-bonding which can only occur between residues in the backbone. The “lifeline” is a third element introduced here, reflecting the situation when there is still a physical connection between bricks even after the glue breaks. Chemically, this resembles the existence of “hidden” polypeptide length such that there exists a “covalent” link between two brick elements even after the H-bond glue has broken. This hidden length is not “visible” before the glue is actually broken. B, Mechanical behavior of each of the building blocks characterized by a description of the failure extension. The hidden length of lifelines is reflected in the fact that the resting extension of the lifeline is roughly equal to the failure extension of the glue. Both the brick and the lifeline have large failure extensions relative to the glue. C, Model of a beta-sheet crystal under axial loading. This resembles a system without a lifeline since after breaking of the H-bond cluster ( = glue) between the layers formed by clusters of polypeptide ( = brick) no physical connection exists. D, Model of a beta-helical protein under axial loading. This resembles a system with a lifeline, as after breaking of the cluster of H-bonds ( = glue) that are formed between clusters of amino acids ( = brick) there still exists a physical connection due to the polypeptide backbone as shown in D ( = lifeline). As shown in E, the existence of a lifeline has major implications on the functional properties of the overall system. A system with a lifeline (D) shows a ductile response, where a connection can be sustained at large extension as compared to the glue alone. In contrast a system without a lifeline (C) shows a brittle response, where only a small extension can be sustained until the material breaks (which equals roughly the failure extension of the glue).
Figure 4Pictured here is an olog, which captures the semantic content of our situations, as described in Sections 3.1 – 3.3.
Each box represents an abstract type, and each arrow represents an aspect (or observable) of that type. Each type refers to a set of intended instances, which we think of as being contained in the box. For example, box E contains ductile sequences of bricks and glue (like a beta-helix or an alpha-helix), whereas box V contains real numbers (like 9.228). Each arrow from a source box to a target box refers to an observation one may make on things in the source box, for which the observed result is a thing in the target box. For example, arrow 11:E→O indicates that one can observe of any ductile material S a pair of numbers (R,r) where R is much greater than r. The meaning of these numbers R and r is enforced by a “commutative diagram” declared in Table 1 (line 6): the number R must refer to the failure extension of the system S and the number r must refer to the failure extension of its glue. This says that a ductile system fails at a much greater extension than its glue elements do. Perhaps a simpler but more mundane observation is made by arrow 37:Q→V which indicates that one can take any pair of real numbers (x,y) and observe the x-coordinate. So the pair (8.0, 3.2) is inside box Q, and it is observed by arrow 37 to have x-coordinate 8.0, which is in box V. Thus, each box is meant to contain an intended set of instances and each arrow is meant to functionally relate two such sets. The rest of the olog is recorded in Tables 1 and 2. Some are commutative diagrams which declare two paths through the olog to be equivalent and some are fiber products which define new types in terms of others.
Commutative diagrams in the olog. *
| Starting point | Ending point | Path 1 | Path 2 | Same result |
| A: a one-dimensional system of bricks, glue, and lifeline | F: a one-dimensional system (S) of bricks (b) and glue (g) | A→E→F | A→F | Each of these paths from A to F simply “forgets” the lifeline. |
| A: a one-dimensional system of bricks, glue, and lifeline | D: a “chain graph” | A→F→D | A→D | Each of these paths from A to D yields the structure graph of the system, which is a “chain graph”. |
| A: a one-dimensional system of bricks, glue, and lifeline | H: a graph | A→D→H | A→G→H | Each of these paths from A to H yields the structure graph of the system. |
| B: a one-dimensional system of bricks and glue without lifeline | F: a one-dimensional system (S) of bricks (b) and glue (g) | B→C→F | B→F | Each of these paths from B to F simply forgets that the system has no lifeline. |
| C: a brittle system (S) of bricks (b) and glue (g) | Q: a pair (x,y) of real numbers | C→F→Q | C→M→Q | Each of these paths from C to Q sets x = failure extension of the system (S), y = failure extension of the glue (g). |
| E: a ductile system (S) of bricks (b) and glue (g) | Q: a pair (x,y) of real numbers | E→F→Q | E→O→Q | Each of these paths from E to Q sets x = failure extension of the system (S), y = failure extension of the glue (g). |
| F: a one-dimensional system (S) of bricks (b) and glue (g) | H: a graph | F→D→H | F→J→H | Each of these paths from F to H yields the structure graph of the system. |
| I: a threesome (b,g,L) of building blocks, serving as bricks, glue, and lifeline | Q: a pair (x,y) of real numbers | I→M→Q | I→K→Q | Each of these paths from I to Q sets x = resting extension of lifeline (L), y = failure extension of glue (g). |
| I: a threesome (b,g,L) of building blocks, serving as bricks, glue, and lifeline | U: a building block | I→K→L→P→U | I→T→U | Each of these paths from I to U yields the lifeline element (L). |
| K: a threesome (b,g,S) of building blocks, serving as bricks, glue, and strong-glue | R: a brick | K→N→R | K→L→R | Each of these paths from K to R yields the same brick element (b). |
| L: a pair (b,S) of building blocks, serving as bricks and strong-glue | Q: a pair (x,y) of real numbers | L→P→Q | L→M→Q | Each of these paths from L to Q sets x = failure extension of brick (b), y = failure extension of strong-glue (S) |
| L: a pair (b,S) of building blocks, serving as bricks and strong-glue | U: a building block | L→P→U | L→R→U | Each of these paths from L to U yields the brick element (b). |
| N: a pair (b,g) of building blocks, serving as bricks and glue | Q: a pair (x,y) of real numbers | N→O→Q | N→P→Q | Each of these paths from N to Q sets x = failure extension of brick (b), y = failure extension of glue (g). |
| N: a pair (b,g) of building blocks, serving as bricks and glue | U: a building block | N→P→U | N→R→U | Each of these paths from N to U yields the brick element (b). |
| N: a pair (b,g) of building blocks, serving as bricks and glue | U: a building block | N→P→U | N→S→U | Each of these paths from N to U yields the glue element (g). |
| P: a pair (B1,B2) of building blocks, such that B2 can connect two instances of B1 | V: a real number | P→Q→V | P→U→V | Each of these paths from P to V yields the failure extension of B1. |
| P: a pair (B1,B2) of building blocks, such that B2 can connect two instances of B1 | V: a real number | P→Q→V | P→U→V | Each of these paths from P to V yields the failure extension of B2. |
*Each sequence of consecutive arrows through the olog (Figure 4) is called a path, which represents a functional relationship between its starting point and its ending point. Two such paths A→B may result in the same function, and the 17 lines of this table record 17 cases of this phenomenon in our olog. The idea is that given an instance of A, each of these paths returns the same instance of type B. By having this additional data, we confine the meaning of the label on each box and arrow – they cannot stray far from our intended meaning without “breaking” these path equalities. Thus this table serves as an additional check on our labels. [For a more diagrammatic description of the same information presented in the typical style of category theory, see Figure S1.].
Fiber product diagrams in the olog.**
| Object | Fiber product object name | Defining attributes | Equated terms | “Idea” |
| A | a one-dimensional system of bricks, glue, and lifeline | D←A→G | D→H←G | A system of bricks, glue, and lifeline is defined as “one-dimensional” if its structure graphs (brick/glue) and (brick/lifeline) are both chains. |
| C | a brittle system of bricks (b) and glue (g) | F←C→M | F→Q←M | A system is defined as “brittle” if its failure extension is roughly equal to the failure extension of its glue. |
| E | a ductile system of bricks (b) and glue (g) | F←E→O | F→Q←O | A system is defined as “ductile” if its failure extension is much greater than the failure extension of its glue. |
| F | a one-dimensional sequence (S) of bricks (b) and glue (g) | D←F→J | D→H←J | A system of bricks and glue is defined as “one-dimensional” if its structure graph is a chain. |
| I | a threesome (b,g,L) of building blocks, serving as bricks, glue, and lifeline | M←I→K | M→Q←K | A strong-glue element is defined as “lifeline” if its resting extension is roughly equal to the failure extension of a glue element. |
| K | a threesome (b,g,S) of building blocks, serving as bricks, glue, and strong-glue | N←K→L | N→R←L | A “brick/glue/strong-glue threesome” is defined to be a brick/glue pair and a brick/lifeline pair where the bricks are the same in both instances. |
| L | a pair (b,S) of building blocks, serving as bricks and strong-glue | M←L→P | M→Q←P | Two building blocks, one of which can connect together two instances of the other, are defined as “bricks and strong-glue” if their failure extensions are roughly equal. |
| N | a pair (b,g) of building blocks, serving as bricks and glue | O←N→P | O→Q←P | Two building blocks, one of which can connect together two instances of the other, are defined as “bricks and glue” if the failure extension of the connector is much less than the failure extension of the connectee. |
**Some boxes in the olog (Figure 4) are defined in terms of others by use of so-called fiber products. For example, object A is defined in terms of three others in relationship, D→H←G: given a system of bricks, glue, and lifeline (D), we observe its structure graph (H) and set it equal to a “chain graph” (G) – in so doing we define “one-dimensionality” for a system. A reader of this olog realizes that our notion of one-dimensionality is not up for interpretation, but directly dependent on the other notions in this olog. By having this additional data, we confine the meaning of 24 labels (8 for boxes, 16 for arrows) in the olog. Thus this table serves to anchor the interpretation of our olog more firmly to its original intention. [For a more diagrammatic description of the same information presented in the typical style of category theory, see Figure S2.].
Figure 5Visual representation of the social network.
A, Overview over fundamental building blocks of our social networks. The social networks considered here are composed of a linear arrangement of three elements, “bricks”, “glue” and in some cases “lifeline”. Thereby, as a design rule, brick and glue need to alternate in order to achieve a stable structure. That is, two brick or glue elements immediately next to each other would not stick together; where the reason is that bricks represent participants with transceivers and glue represents wireless communication that, in our case, can only occur between neighboring participants. The “lifeline” is a third element that is introduced here, reflecting the situation when there is still a physical connection of bricks even after the glue breaks. This reflects the existence of a “hidden” connection in that there exists a physical passageway between two brick elements even after the communication over the wireless connection is no longer feasible. The hidden connection is not “visible” before the glue is actually broken because, for reasons of efficiency, participants will choose to communicate the simple messages wirelessly rather than verbally, as the latter requires much more effort. B, Mechanical behavior of each of the building blocks. The hidden length of lifelines is reflected in the fact that the resting extension of the lifeline is roughly equal to the failure extension of the glue. In other words, lifeline passageways are used only when wireless communication is no longer feasible. Both the brick and the lifeline have large failure extensions relative to the glue because participants and their verbal communication function perfectly well in the presence of noise on the wireless channels. C, Representation of a social network not allowing for face-to-face interaction under stress from wireless noise. This resembles a system without a lifeline, as after noise on the wireless line reaches a critical point, messages can no longer be correctly conveyed. D, Representation of a social network allowing for face-to-face interaction under high levels of wireless noise. This resembles a system with a lifeline, as after messages can no longer be conveyed wirelessly, communication can still take place, due to the physical passageways as shown in D. As shown in E, the existence of a lifeline has major implications on the functional properties of the system. A system with a lifeline (D) shows a ductile response, where a connection can be sustained at large displacements as compared to the glue alone. In contrast a system without a lifeline (C) shows a brittle response, where only a small displacement can be sustained until the material breaks (roughly the failure extension of the glue).
Analogy between protein and social network.***
| Type | Type Labels | Protein Specific | Social-network Specific |
| A | a one-dimensional system of bricks, glue, and lifeline | beta-helix | social network with wireless & physical passageways |
| B | a one-dimensional system of bricks and glue without lifeline | beta-sheet nanocrystal | social network with wireless, without physical passageways |
| C | a brittle system (S) of bricks (b) and glue (g) | brittle protein filament | brittle social network |
| D | a “chain” graph *→*→* … →* | chain shape for protein | chain shape for network |
| E | a ductile system (S) of bricks (b) and glue (g) | ductile protein filament | ductile social network |
| F | a one-dimensional system (S) of bricks (b) and glue (g) | beta-helix / beta-sheet nanocrystal | social network |
| G | a system consisting of bricks connected by glue and lifeline, both structured as in graph G | lifeline protein of specified shape | lifeline social network of specified shape |
| H | a graph | shape of protein | shape of network |
| I | a threesome (b,g,L) of building blocks, serving as bricks, glue, and lifeline | amino cluster, H-bond, backbone | transceiver, wifi system, physical passageway |
| J | a system consisting of bricks connected by glue, structured as in graph G | protein of specified shape | social network of specified shape |
| K | a threesome (b,g,S) of building blocks, serving as bricks, glue, and strong-glue | amino acid cluster, H-bond, backbone | transceiver, wifi system, physical passageway |
| L | a pair (b,S) of building blocks, serving as bricks and strong-glue | amino acid cluster, backbone | transceiver, physical passageway |
| M | a pair (R,r) of real numbers such that R is roughly equal to r | e.g. R = 20.5 r = 23.45 | e.g. R = 20.5 r = 23.45 |
| N | a pair (b,g) of building blocks, serving as bricks and glue | amino acid cluster, H-bond | transceiver, wifi system |
| O | a pair (R,r) of real numbers such that R>>r | e.g. R = 100 r = 20.6 | e.g. R = 100 r = 20.6 |
| P | a pair (B1,B2) of building blocks, such that B2 can connect two instances of B1 | e.g. amino acid and backbone | e.g transceiver and wifi |
| Q | a pair (x,y) of real numbers | e.g. x = 20.55, y = 50.6 | e.g. x = 20.55, y = 50.6 |
| R | a brick | amino acid cluster | transceiver |
| S | a glue | H-bond cluster | wifi connection |
| T | a lifeline | backbone | physical passageway |
| U | a building block | basic unit of material | basic unit of social interaction |
| V | a real number | e.g. 181.2 | e.g. 181.2 |
| W | a resting extension | e.g. 61 Angstrom | e.g. 1/100 error/bit |
***Because our olog (Figure 4) was designed to abstract away the particulars of either the protein or the social network (using terms like “brick” instead of “amino-acid cluster” or “transceiver”), this table serves to remind the reader of the particulars in each case. Each type in the olog is described in these two cases. Some types, such as “a real number”, stand on their own and we merely give examples. Others, such as “a one-dimensional system of bricks, glue, and lifeline” require a bit more description in the concrete cases. For more on this, see relevant sections in the text in “Results and Discussion”. This table provides the necessary description to connect the concrete formulations in the case of our protein and social network to the abstract formulation given by Figure 4.
Figure 6Schematic illustration of the approach discussed here, the representation of complex hierarchical systems such as biological materials (e.g. silk) and language in the same category theory space (olog).
The description of how functional properties emerge in different hierarchical systems can be rigorously described using this approach, and fundamental insight can be derived and compared between different systems. This finds immediate applications in the design of synthetic systems (e.g. novel fiber and bulk materials with tunable functional properties). The poem “The Road Not Taken” shown on the right written by Robert Frost (March 26, 1874 - January 29, 1963), published in 1916 in the collection Mountain Interval. Text from: http://en.wikisource.org/wiki/The_Road_Not_Taken.