Literature DB >> 35756505

Functional amyloids from bacterial biofilms - structural properties and interaction partners.

Ümit Akbey1, Maria Andreasen2.   

Abstract

Protein aggregation and amyloid formation have historically been linked with various diseases such as Alzheimer's and Parkinson's disease, but recently functional amyloids have gained a great deal of interest in not causing a disease and having a distinct function in vivo. Functional bacterial amyloids form the structural scaffold in bacterial biofilms and provide a survival strategy for the bacteria along with antibiotic resistance. The formation of functional amyloids happens extracellularly which differs from most disease related amyloids. Studies of functional amyloids have revealed several distinctions compared to disease related amyloids including primary structures designed to optimize amyloid formation while still retaining a controlled assembly of the individual subunits into classical cross-β-sheet structures, along with a unique cross-α-sheet amyloid fold. Studies have revealed that functional amyloids interact with components found in the extracellular matrix space such as lipids from membranes and polymers from the biofilm. Intriguingly, a level of complexity is added as functional amyloids also interact with several disease related amyloids and a causative link has even been established between functional amyloids and neurodegenerative diseases. It is hence becoming increasingly clear that functional amyloids are not inert protein structures found in bacterial biofilms but interact with many different components including human proteins related to pathology. Gaining a clear understanding of the factors governing the interactions will lead to improved strategies to combat biofilm associated infections and the correlated antibiotic resistance. In the current review we summarize the current state of the art knowledge on this exciting and fast growing research field of biofilm forming bacterial functional amyloids, their structural features and interaction partners. This journal is © The Royal Society of Chemistry.

Entities:  

Year:  2022        PMID: 35756505      PMCID: PMC9172111          DOI: 10.1039/d2sc00645f

Source DB:  PubMed          Journal:  Chem Sci        ISSN: 2041-6520            Impact factor:   9.969


Introduction

Historically the formation of protein aggregates or amyloids has been associated with various diseases such as neurodegenerative diseases.[1] More recently it has been found that amyloids can have biological functions in the organism producing them, hence differing from the pathological amyloids and coining the term functional amyloids.[2] Amyloid fibrils and functional amyloids are a topic of great interest and research, and the exponential growth of the field, research output and citations are shown in Fig. 1. Functional amyloids are found in various organisms ranging from bacteria to humans and currently a whole range is known, Fig. 2.[2] They provide different types of functionality ranging from virulence during infection[3] to structural scaffolds in biofilms,[4] to storage of peptide hormones,[5] to formation of melanin granula in melanocytes,[6] to memory storage[7] and during cell death,[8] and many others.
Fig. 1

The number of publications per year (left y-axis) and total citations (right y-axis) in the Web of Science (WoS) in the research fields of (A) “amyloid fibrils” and (B) “functional amyloid”, by the end of 2021. Remarkable progress in these fields has been made in the recent years as evident from the increase in the research output in those fields.

Fig. 2

(A) Schematic representation of different bacterial biofilm forming functional amyloids: PSMs from S. aureus, CsgA from E. coli, FapC from P. aeruginosa and TasA from B. subtilis. The signal sequences are shown in red, the amyloid regions (AR) in blue and loop/linker and C-terminus regions in grey. The number of amino acids forming the amyloid region is given for each protein in blue. The lengths of the full-length proteins are given in parenthesis in black. The PSMs are short proteins and AR themselves. The CsgA is composed of 5 AR repeats, FapC from 5 AR repeats, whereas, TasA is considered to be composed of a large AR without repeats (note that other hypotheses exist on the nature of TasA amyloid region). (B) The amino acid composition of the five functional amyloids. The Uniprot identification numbers for each protein are given in parenthesis. The consecutive AR repeats are indicated as bold versus normal font for CsgA. (C) Illustration of amyloid fibril construction mechanisms by using different building blocks to the similar final amyloid fibril fold.

Many bacteria form biofilms to survive under stress-inducing conditions to achieve communal living. Biofilms are microbial communities where the microorganisms live in a matrix of hydrated extracellular polymeric substances (EPS) that forms the scaffold of the biofilm.[9] The EPS is produced and secreted by the bacterial cells living in the biofilm, and it is composed of biopolymers such as polysaccharides, proteins and nucleic acids in the form of extracellular DNA (eDNA), Fig. 3. The formation of biofilm by bacteria enables adhesion to surfaces and other bacteria cells, retention of nutrients and water, and additionally acts as a protective barrier. Biofilm-associated pathogenic microbes are protected from antimicrobial agents and host immune system attacks, as a result they are more infectious and difficult to treat. Eighty percent of all chronic infections are related to bacterial biofilms.[10,11]
Fig. 3

(A) Representation of bacteria mediated biofilm formation, initiated by the motile species gathering together into the mature solid biofilm matrix. The molecular details of the biofilm composed of bacteria themselves, polysaccharides, extracellular DNA and protein amyloid fibrils are additionally shown. This figure is reproduced with permission.[9] (B) The wild-type (WT) versus the deletion-mutants (Δ) of B. subtilis bacteria and the biofilm formed by them. The WT, Δeps, ΔtasA and Δeps–tasA. The deletion mutants form bacterial communities that are healthy but with a weakened/disrupted biofilm. This figure is reproduced with permission.[63] (C) Reconstruction of a native-like B. subtilis biofilm from a ΔtasA deletion mutant that cannot form biofilm by itself. The supplementation of recombinantly produced TasA protein to the growth medium of the ΔtasA mutant B. subtilis strain reconstitutes the native-like biofilm. This allows incorporation of isotope-labeled TasA into the biofilm for in vivo NMR spectroscopy without any background signal. (D) 2D 1H–15N NMR spectra of different TasA samples prepared in vitro, soluble monomer, oligomer and fibril, as well as in vivo TasA in biofilm. Spectral differences indicate structural differences and shed light on the biofilm TasA form. For more details on the figure and explanation of the additional indications, the reader is referred to the original article. This figure is reproduced with permission.[81]

Aggregated proteins in the form of amyloid fibrils play a key role in maintaining the structural integrity of biofilms and when these proteins are mutated, the biofilms are disrupted and bacteria become accessible to, e.g., antibiotic treatments. Functional amyloids strengthen biofilms and are a major threat to human health, since the (chronic) infections they cause are difficult to treat due to the biofilm structural integrity and insufficient penetration of drugs, thus promoting antibiotic resistance (antimicrobial resistance, AMR).[12] Targeting biofilms and their amyloid components could be a novel approach to fight AMR, which results in ∼million casualties per year, and is estimated to cause more death than cancer by 2050.[13] However, very little structural information exists about biofilms and their fibrillar components and how these components interact with other proteins. Determining these unknown structures and unraveling the structure–activity relationship of such fibrils will help to develop better therapies. In this current review we focus on functional amyloids related to bacterial biofilm formation or functional bacterial amyloids (FuBAs), more specifically curli from E. coli, FapC from Pseudomonas, TasA from B. subtilis and phenol-soluble modulins (PSMs) from S. aureus, Fig. 2. For an extensive list of various functional amyloids along with coverage of the broader theme of amyloids, functional amyloids, prions and functional prions we refer to other recent excellent reviews.[14-22] Other functional amyloids related to bacteria but not directly involved in the biofilm formation have also been observed. These include but are not limited to S. coelicolor chaplins which facilitates the rising of aerial hyphae,[23,24]M. tuberculosis amyloid pili (MTP) which mediates host interactions during pathogenesis,[25] and X. axonopodis harpins which act as virulence factors.[3,26] Recently other functional amyloids from bacterial biofilms such as the Esp from E. faecalis[27] have also been discovered but due to the recent discovery these are still poorly understood and will not be described in this paper. Moreover, due to the homology between the Salmonella Tafi system and E. coli curli,[28] the Tafi system will also not be addressed here.

Curli from E. coli

Curli was the first discovered biofilm forming functional amyloid fibril and hence curli fibrils from E. coli are the most extensively characterized FuBAs.[4] Curli fibrils have been found to be essential for biofilm formation and attachment to surfaces including plant cells, stainless steel, glass and plastics.[29-33] They also play key roles in interaction with host proteins and invasion of host cells.[34-36] The curli biogenesis involves the expression of two divergently evolved operons, csgBA and csgDEF.[35] CsgA is the major biofilm forming amyloid protein component of curli and CsgB is the minor amyloid protein. CsgA and CsgB are secreted as unstructured monomeric proteins to the cell surface to self-assemble into the final amyloid fibrils. CsgB facilitates CsgA fibrillation as a nucleator and is required for the complete assembly of curli fibrils.[35,37] CsgC is a chaperon-like accessory protein that prevents fibrillation in the periplasm to ensure that CsgA and CsgB are secreted as monomeric units to the extracellular compartment.[38] CsgE is another chaperone-like accessory protein that reversibly interacts with CsgA (in a 1 : 1 ratio) while the pore-forming protein CsgG promotes translocation, thereby achieving specificity for secretion.[39] CsgF and also CsgB help localize CsgA to the cell surface[40] while CsgD regulates the expression of the csgABC operon.[19] This amazingly regulated protein system controls and directs curli amyloid formation and maturation, and prevents the formation of toxic oligomers or fibrils inside the cells that may otherwise harm the bacteria itself. CsgA and CsgB are homologs and share ∼50% sequence identity. They both consist of five imperfect repeats of ∼20 amino acids and the imperfect repeats of CsgA contain highly conserved glutamine and asparagine residues, which are important for the amyloid formation,[37,41]Fig. 2. Repeating the amyloidogenic region several times likely comprises an effective way of fibrillation compared to non-repeat containing proteins.[42] This may explain the fast fibril formation without a lag-phase compared to other amyloids. The repeats R1, R3 and R5 of CsgA are amyloidogenic on their own and can form fibrils similar to wild type (WT) CsgA, whereas R2 and R4 do not form fibrils on their own and are dispensable for WT fibril formation.[40] Interestingly, the amyloidogenic R1 and R5 contain sequences that contribute significantly to the ability of CsgA to bind human proteins such as fibronectin, plasminogen, tissue plasminogen activator, and β2-microglobulin.[43]In vitro in the absence of the other curli components CsgA aggregates in a nucleation dependent manner.[44]

FapC from Pseudomonas

Functional amyloids in Pseudomonas (Fap) have more recently been discovered and are encoded by a single operon with six genes, fapA–F.[45] In addition to the original strain of Pseudomonas and P. fluorescence many other pseudomonas strains also express Fap amyloids, including the pathologically relevant P. aeruginosa. This strain is involved in cystic fibrosis pathology, where it is considered the major pathogen in the development of chronic airway infections and linked to declining lung function.[45-47] In Pseudomonas the Fap system is found by phylogenetic analysis in the beta-, delta-, and gammaproteobacteria where a high fraction of the identified strains are pathogens (39%) or rhizobacteria (36%). This has led to the suggestion that, although the Fap system is not required for pathology as not all pathogens in the identified genera had the Fap genes, the Fap system could be a virulence enhancing factor in the pathogen strains and could also aid in the association with plant roots. Analogous to curli, FapC is the major biofilm forming amyloid protein, whereas FapB is the minor component and a nucleator for FapC.[45,48,49] They are secreted as unfolded monomers similar to CsgA–B, and then fibril formation happens at the cell surface through self-assembly.[46] FapA is a transcription regulator for fapAB and controls the amyloid formation, and interestingly the deletion of it leads to fibrils composed solely of FapB.[48] FapD is a protease making essential modifications to the related Fap proteins. FapE is a minor component playing a role in the amyloid formation, it is found to be part of the mature Fap-fibrils.[48] Finally, FapF is a trimeric β-barrel membrane transporter having an extended domain in the periplasm and gated by a helical plug and it secretes FapC through the outer membrane.[50,51] FapC contains three repeat motifs of ∼37 amino acids that stack into the β-sheet structure of the amyloids, Fig. 2. The three repeat motifs are separated by highly flexible linkers and furthermore also contain highly conserved glutamine and asparagine residues.[45] The three imperfect repeats in FapC are crucial for the stability of functional amyloids,[52] and the presence of imperfect repeats in FapC has been shown to promote aggregation efficiency by decreasing the lag-time of aggregation. Also the repeats reduce the tendency of the formed aggregates to fragment. Interestingly, even a construct missing all three imperfect repeats could still aggregate although the aggregation kinetics observed were irregular and the structures of the aggregates formed were altered and also destabilized.[53] Removal of the repeats through deletion has been shown to increase the degree of compaction of the protein during the activation step of fibril growth through elongation.[54]

PSMs from S. aureus

The functional amyloids comprising the structural scaffold of S. aureus biofilms are composed of various phenol-soluble modulins (PSMs).[55] PSMs are amphipathic peptides that have multiple functions in pathogenesis. They are small peptides of 20–44 amino acids in size, Fig. 2. The genes encoding the core family of PSM peptides are highly conserved and located in psmα operon (αPSM1–αPSM4) and psmβ operon (βPSM1 and βPSM2), and the δ-toxin is encoded within the coding sequence of RNAIII.[56] PSMs have been recognized as a crucial factor for S. aureus biofilm formation, as PSMs strengthen the S. aureus biofilms. Experiments performed on strains that cannot produce α/β-PSMs resulted in biofilm disruption.[55] In their soluble monomeric form they hinder host immune response by recruiting, activating and lysing human neutrophils while also promoting biofilm dissociation.[55] However, PSMs also self-associate to form the amyloid fibrils that fortify the biofilm matrix to better resist disassembly by mechanical stress and matrix degrading enzymes.[57] Surprisingly, when a mature S. aureus biofilm was exposed to “soluble” PSMα1, the biofilm amount was significantly reduced, however, the polymerized fibrillar PSMα1 had no such effect.[55] This indicates a biofilm maturity regulation by the very same peptides producing the biofilm. High expression of αPSMs increases the virulence potential of methicillin-resistant S. aureus.[58] Moreover, PSMα3, the most cytotoxic and lytic PSM, enhances its toxicity to human cells upon fibrillation.[59] Despite lower concentrations, the larger βPSMs, ∼44 residues in length, seem to have the most pronounced impact on biofilm structuring.[60] Finally, PSMs can undergo truncations in vivo to form even smaller peptides (∼6 amino acids) and have an altered function as antimicrobial agents and different structural properties compared to the full-length PSMs.[61]

TasA from B. subtilis

The extrapolymeric matrix of B. subtilis biofilms is mainly composed of exopolysaccharides and the protein TasA,[62]Fig. 3. The gene encoding TasA is located in the tapA–sipW–tasA operon and the polymerization of TasA in vivo requires the expression of all three members of the operon,[62]Fig. 2. TasA is the major biofilm forming functional amyloid fibril and TapA is the minor component. They are both secreted as unfolded monomers, in a mechanism not fully understood yet. SipW is the general secretory pathway, responsible for the secretion of these proteins. When TasA or TapA is mutated (as well as the exopolysaccharide), the native biofilm structure is disrupted, and the bacteria colony loses the wrinkled topology,[62-66]Fig. 3B. Processing through a signal peptidase activity and secretion of TasA and TapA are regulated by SipW, which also regulates expression of genes involved in exopolysaccharide production.[67-69] Recombinantly purified TasA forms amyloid fibers in vitro and can also restore wild-type morphology to the biofilm of a TasA deletion mutant and hence the fibers formed by TasA provide structural integrity to B. subtilis biofilm.[63] The fibers formed by TasA were found to connect individual cells forming a network while also possibly organizing other components in the biofilm matrix. In addition to TasA, the protein BslA (formerly named YuaB) is also found in B. subtilis biofilms where it forms a hydrophobic coat of the surface of the biofilm by self-assembly.[70-72]

Functional amyloid structural features

Currently, to the best of our knowledge, there is no high-resolution structure of any full-length cross-β functional amyloid protein in its fibrillar conformation, except PSMα3 that forms a unique cross-α fibrillar structure.[59] Despite this, there are structures available from small peptide fragments from functional amyloids, monomeric protein forms as well as some structural models. This lack of structural information limits our understanding of functional amyloids and the formation of biofilms to a large extent, and hampers the strategic fight against, e.g., biofilm related infections, pathogenesis and antimicrobial resistance (AMR).[73,74]

General structural features of (functional) amyloid fibrils

The function of a protein in living organisms is directed by its three-dimensional (3D) structure, entailing a structure–function relationship. Every protein has to adopt its correct fold to function, which is uniquely determined by the amino acid sequence. However, one protein can adopt multiple structures if the conditions (buffer, pH, salt, concentration etc.) are manipulated towards different local/globular minima in the protein folding energy landscape. The amyloid fold derived by intermolecular contacts, rather than intramolecular contacts, is one of these possible structures and with its high stability and lower total energy, once formed they are usually trapped in this conformation.[75] Amyloid fibrils can be obtained for many protein sequences, e.g. via misfolding along the folding pathway towards a native conformation. Protein misfolding and/or fibrillation is usually associated with biological anomalies and diseases.[1,76] Neurodegenerative diseases such as Alzheimer's disease (AD) and Parkinson's disease (PD) are known examples that are caused by pathological amyloid proteins. The loss of normal protein function and subsequent gain of toxicity are key phenotypes for this type of pathology. Functional amyloids, in contrast, have a distinct biological function in vivo.[1,77,78] By applying harsh conditions such as extreme pH or denaturing conditions, many globular or unfolded proteins can be directed to amyloid fibrils.[79] Starting from a natively unfolded protein state (intrinsically disordered protein, IDP), rather than a well-defined native fold, highly stable amyloid structures can be formed as seen in AD and PD or from functional amyloids like CsgA and FapC.[20,42] Amyloid formation has been observed from proteins with all types of native structures, e.g. in an α-helical structure from serum amyloid, β-sheet rich structure from transthyretin, or α- and β-rich structure from lysozyme,[80] as well as in a more controlled manner from a globular protein fold.[63,81] When amyloid fibril formation occurs from an initial folded protein extensive conformational rearrangements are required, which may require unfolding to an intermediate that then forms the final amyloid fibril structure.[82,83] Despite many initial forms of fibril forming proteins, the final structure consists of cross-β-sheets as explained below.[84] Amyloid fibrils are insoluble protein aggregates and are rich in β-sheet content if not composed predominantly by them. For a globular protein the β-strands form intramolecular β-sheets, however, for amyloid fibrils the β-strands form intermolecular β-sheets which are arranged orthogonal to the fiber axis. The longer β-strands can be bent into a β-hairpin motif connecting two β-strands. This cross-β-sheet structure is stabilized by hydrogen bonds running parallel to the fiber axis, Fig. 4A. Steric-zippers formed from tightly packed protein sidechains between the β-sheets increases the amyloid fibril stability particularly for the small-peptide amyloid examples. One or more cross-β-sheets then form one or more protofilaments. Due to this protein architecture a typical X-ray diffraction pattern is observed, with two typical reflections at ∼4.7 Å (vertical axis) corresponding to the inter-strand spacing and at ∼10 Å (horizontal axis) corresponding to the inter-sheet spacing due to the sidechains,[85-87]Fig. 4. The kinetics of amyloid formation is typically studied using the amyloid binding dye Thioflavin T (ThT). The fluorescence intensity of the dye increases due to binding of ThT in grooves at the surface of the amyloid fibrils perpendicular to the fibril axis. This results in rotational immobilization of the ThT molecules changing the quantum yield of the dye.[88-90] Typical aggregation curves yield a sigmoidal shape comprising an initial lag-phase followed by exponential growth ending with a stationary phase which characterizes a nucleation dependent pathway,[91]Fig. 5. Later studies shed light on the long-range organization of the amyloid fibrils forming the cross-β-sheet structure and their helical symmetry.[92-94] Usually the amyloid fibrils are not straight but have a helical symmetry with a small twist angle to minimize the overall free energy of the structure,[95] which is particularly true for structures bearing several protofilaments.[96]
Fig. 4

(A) Representation of the cross-β-sheet amyloid fibril fold. The interstrand and intersheet distances of ∼4.7 and ∼10 Å are depicted. Here the β-strands form intermolecular β-sheets and are orthogonal to the fiber axis. (B) The negative-stain electron microscopy (EM) micrograph of four different biofilm forming functional amyloids. The white scale bar represents 200 nm at each micrograph. The CsgA EM is reproduced with permission.[148]

Fig. 5

(A) Schematic representation of amyloid fibril formation. The soluble protein fold (in red) can form oligomers (in blue) or amyloid fibrils (in green) depending on the system and condition. The oligomers then form amyloid fibrils. The amyloid fibril maturation can happen with different steps/mechanisms as depicted in the figure; 1° nucleation, elongation, 2° nucleation/self-replication and a final maturation. (B) Amyloid binding Thioflavin-T dye results monitoring the formation of amyloid fibrils from soluble monomers for PSMα1 and PSMα3 proteins from S. aureus (repeated three times for each protein, please note the different time scales).

Despite the unique amino acid sequences seen in functional and pathological amyloid proteins structural similarities are seen from the known atomic resolution structures.[16,97,98] For example, a comparison of the functional amyloid HET-s with α-synuclein (αSyn) or amyloid-β (Aβ)[42,99-101] indicates that these structures are stabilized by hydrophobic contacts, as well as intermolecular hydrogen-bonds along the cross-β-sheets. Hydrophobicity or polarity of the surfaces as a whole or by patches is a difference between these two classes. HET-s displays a polar surface with no hydrophobic areas, whereas the pathological amyloids typically expose hydrophobic areas. This difference may have an effect on the regulation/interactions of these proteins. Similarly the display of hydrophobic patches or lack thereof on the surface of structurally similar protein oligomers along with the degree of flexibility has been linked to the cytotoxicity where increased exposure of hydrophobic patches was linked to cell membrane disruption and toxicity.[102] Moreover, the functional amyloids do not contain frustrated structural segments, in contrast to pathological ones that are not evolutionarily optimized towards fibril formation.[16,103] Structural polymorphism is also another factor, in which functional amyloid HET-s adopts a single conformation,[99,104] whereas polymorphism is commonly observed for pathological fibrils such as αSyn and Aβ.[105-107] A quantitative comparison of the amino acid composition of functional versus pathological amyloids reveals more details on the evolutionary optimization of amino acids and their selection accordingly.[17] A comparison of four pathological and six functional amyloid sequences indicated that despite high-frequency occurrence of nonpolar residues in both, the polar residues were more dominant for functional amyloids. The polar residues can form ladders of hydrogen bonds along the fibril, with higher frequency in functional amyloids. Positively charged residues such as lysine, arginine and histidine are observed more frequently in pathological amyloids, whereas, negatively charged residues are observed for both types to a similar degree (except glutamate showing higher frequency for pathological amyloids). This phenomenon has alternatively been described as gatekeeper residues composed of arginine, aspartate, glycine, lysine or proline.[42] Overall, more frequent occurrence of glycine, serine and glutamine in functional amyloids correlates with them being more reversible with more hydrophilic and flexible interfaces, in contrast to the hydrophobic, dry and tight interfaces observed for pathological amyloids.[16] The aliphatic residues constituting highly stable steric-zipper conformations are observed often in small-peptide amyloid structures and in pathological amyloids. There is strong amino acid specificity for the formation of steric-zippers. Homo-zippers by same protein sequence fragments are observed mostly in small-peptide amyloid structures. This homo-zipper motif is observed at the fibril interfaces of different protofilaments for the full-length amyloid fibril proteins. Otherwise, for full-length protein fibrils hetero-zippers are formed from two different amino acid sequences.[20] The functional amyloids were also found to contain low-complexity aromatic-rich, kinked segments (LARKS) which are more reversible due to a weaker packing and a kinked backbone which reduces the stability of the complex.[108,109] A recent three-fold symmetric structure of the functional neuronal amyloid Orb2 revealed a hydrophilic steric-zipper interface which highlights its functional role in Drosophila in contrast to the hydrophobic core of pathological amyloids.[110] The control of fibrillation by other protein machinery is a remarkable difference between functional and pathological amyloids. For example, FapC fibrillation strongly depends on a nucleator protein FapB and others in the same operon (FapA–F) for tight regulation of the localization, secretion, and fibrillation.[45] The optimized sequence properties and accessory protein machinery of functional amyloids makes the aggregation process both fast and controlled compared to the pathological ones.[111] Along these lines, the aggregation kinetics and mechanism of the pathological and functional amyloids is noticeably different, Fig. 5. Aggregations of CsgA and FapC mostly progress via primary nucleation and elongation without self-replication, making the process more controlled and directed.[111] On the other hand, the fibrillation of pathological amyloids can be mediated by secondary processes such as secondary nucleation and fragmentation which are more stochastic.[112]

High-resolution structure determination methods

Historically, negative staining EM allowed the visualization of amyloid fibrils at low resolution, Fig. 4B. In favorable situations, it's possible to differentiate the straight filaments from twisted ones or the protofilaments, singles from multiple ones. This approach combined with mass-per-length (MPL) measurements allowed the determination of the number of protofilaments per cross-section. The first atomic insights into the amyloids came from the X-ray structures of the small spine regions of amyloids that could be crystallized unlike the full-length proteins.[84,113] High-resolution reconstruction of density maps at the atomic level came into play with the cryo-EM in the last decade or so, with new technology available such as direct-electron detectors and software packages for efficient and fast helical reconstruction.[114] In the last decade fast frozen specimens with class averaging of many images resulted in high resolution that allowed atomic modeling of the full-length protein structures, which is referred to as the “resolution revolution”.[115-117] Solution NMR spectroscopy has been applied to study functional amyloids in their monomeric soluble form.[81,118-120] Since most of the known functional amyloids are unfolded proteins, the use of this technique is limited to understanding the structural changes during amyloid fibril formation. Nevertheless, it provides valuable information, e.g. by hydrogen/deuterium exchange experiments to understand slowly exchanging sites.[121,122] Higher molecular weight oligomers or fibrils cannot be observed directly by solution NMR, due to their slower tumbling compared to monomers which decrease both the intensity of NMR signals and the resolution. For such insoluble oligomeric and fibrillar forms of functional amyloids, ssNMR is the method of choice to determine structural information and atomic resolution structures.[123,124] Several beautiful examples have been presented in the recent years, including non-biofilm forming functional and pathological fibrils. These are described in detail in recent reviews and we refer the reader to these for a full overview.[114,123,124] ssNMR has been used for understanding the molecular details and structures of amyloid fibrils since the 2000s.[96] The breakthrough in the application of this method was initially slowed down by general difficulties in sample preparation of fibrillar samples due to polymorphism. Extensive studies and progress in this field, including seeding approaches, resulted in high-resolution ssNMR spectra of amyloids composed of a single polymorph for various systems, such as αSyn.[125,126] Amyloid fibrils have local order with irregularities in long-range order due to bends or curves.[123,124] This is not a problem for ssNMR studies since the local order is sufficient to result in high-resolution NMR spectra. ssNMR spectroscopy historically relied on 2D/3D carbon-detection experiments for chemical shift dispersion and sequential assignment,[99,127-129] requiring ∼10 mg of fully/selective/sparse isotope labeled protein. More recently, proton-detection experiments relying on protein deuteration and/or ultra-fast sample spinning (MAS) have expanded the scope of ssNMR by improving the sensitivity significantly and reducing the requirement for the protein amount needed to <1 mg.[130-134] Nowadays, these proton-detecting ssNMR experiments at high magnetic fields above 1 GHz combined with MAS frequencies above 100 kHz allow a fast and sensitive structure determination of amyloids or other protein forms.[135-138] It is important to note that combining long-range structural information from cryo-EM and atomic resolution of a single unit from ssNMR can be a very powerful experimental approach. In addition, dynamic nuclear polarization based (DNP) ssNMR increases the sensitivity by several orders of magnitude, and opens up the possibility to study systems otherwise very difficult or impossible, e.g. proteins in complex environments at low concentrations.[139-143]

Structural studies on FuBAs

PSMs

PSMs are characterized as amphipathic α-helices in solution, Fig. 6.[144,145] They can adopt either a β-sheet or α-helical structure after aggregation in vitro and in biofilms, as determined from ThT-binding and other biophysical assays.[55] The atomic resolution structure of full-length PSMα3 amyloid-like fibrils (22 amino acids) was determined by X-ray crystallography in 2017,[59]Fig. 6. This unique structure is composed of a cross-α-architecture, in contrast to the “canonical” cross-β structural features of other known amyloids. This was the first determined prokaryotic cross-α amyloid fibril structure. The X-ray fibril diffraction pattern indicates reflections at ∼4.6, 9.2 and 11.5 Å. This non-canonical structure is made of amphipathic α-helices stacking perpendicular to the fiber axis forming sheets, which are tightly packed with their hydrophobic face towards each other between sheets. PSMα3 was determined to be α-helical both in solution as a monomer and in the fibrillar form, Fig. 6.[59,118] The buried surface area, aromatic residues, and cavities inducing ThT binding are properties similar to the cross-β fibrils. It was shown that the toxicity of this protein is strongly coupled to its ability to form fibrils in the cross-α form.[145]
Fig. 6

Structures of different functional amyloids discussed in this review as their full length or truncated fragments. The structure determination method and corresponding PDB code are given in parenthesis for each system. The structures determined experimentally as soluble monomers are shown in green, for PSMα1, PSMα3 and TasA as one group. Fibrillar structures of the fragment of PSMα1 and full length PSMα3 are additionally shown. There is not yet a CsgA structure from full-length protein determined experimentally, nevertheless the structures of the four different small fragments exist and are shown. Please note the parallel versus antiparallel arrangements of individual β-strands in the amyloid fragments of CsgA. The color codes on the structures indicate red for oxygen, blue for nitrogen and grey for carbon. Moreover, the two structural models exist for CsgA and are shown in the Models section in the figure. One of the models relies on amino acid contacts from a multiple sequence alignment of homologous proteins and the other one is from Alphafold (PDB-like file was kindly provided by Kresten Lindorff-Larsen and presented elsewhere earlier).[152] The structural model of FapC is also based on a multiple sequence alignment of homologous proteins, reproduced from the work by Dueholm and coworkers.[46]

Truncation mutants of PSMα3 were studied for structure determination.[61] In contrast to the cross-α structure of the full length PSMα3, the 7LFKFFK12 fragment forms two different atypical cross-β amyloid fibrils composed of anti-parallel strands, Fig. 6. One polymorph is composed of hexameric architecture with cylindrical cavities along the fiber axis, and the other one is composed of out-of-register β-sheets that are −50° off from the fibril axis in contrast to the 90° found for in-register sheets. These structures were found to be less stable compared to the classic cross-β structures containing steric-zipper elements. In contrast to this picture, the structures of truncation spine elements of PSMα1 7IIKVIK12 and PSMα4 7IIKIIK12 form cross-β fibrils with steric-zippers and have classical X-ray reflections at 4.6 and 10 Å. Full length PSMα1 and α4 undergo a structural transition from helical to β-sheet in solution during aggregation within several days, in contrast to PSMα3.[146,147] These transitions or non-transitions could be related to the functions of these proteins, which are strikingly colorful. More structural studies on full length PSMs are needed to unravel the full picture of these systems.

CsgA

The secondary structure prediction and ssNMR data suggest the formation of strand–loop–strand type of structures for the repeat units of the CsgA amyloid fibril.[35,148] Detailed structural work on CsgA was performed by Tycko and later by Ritter and coworkers, Fig. 7.[148] The earlier work comprises ssNMR characterization that was supported with X-ray diffraction, EM, MPL measurements and biophysical methods. The ssNMR results give hints about the CsgA fibril structure prepared without signal-peptides. The lyophilized and rehydrated samples showed low spectral quality in contrast to the functional amyloid HET-s with superb resolution.[99] Nevertheless, superior NMR resolution was obtained from a recombinant prepared CsgA sample.[149]
Fig. 7

NMR is a powerful technique to study protein structures of soluble and insoluble forms. (A) 2D 1H–15N HSQC NMR spectra of soluble monomeric CsgA (in DMSO), FapC (in DMSO) and TasA (in H2O) functional amyloids. The unfolded nature of the CsgA and FapC can be seen directly from the narrow chemical shift dispersion in the proton dimension in the HSQC spectra. In contrast, the TasA has a well-dispersed HSQC spectrum, which is an indication of its folded globular protein fold. (B) The 2D 13C–13C MAS NMR spectra of CsgA, FapC (R3C construct) and TasA are shown. Solid-state NMR spectroscopy is used to study structures of functional amyloids in their insoluble fibrillar form. The NMR spectra show high quality that is suitable for structure determination. Sparse/segmental labeling combined with proton-detection experiments will additionally increase the full structure determination possibilities for these kinds of difficult proteins. Such segmental labeling of only a single repeat for CsgA results in superior spectral resolution, isotope labeling of full-length is shown in black and single-repeat is shown in blue with several signal assignments. Frans Mulder kindly provided the FapC HSQC spectrum. The 2D 13C–13C ssNMR spectrum of CsgA is reproduced with permission.[149]

Typical X-ray diffraction patterns were observed for both CsgA and CsgB indicating cross-β-sheet spacings of ∼4.7 and ∼9 Å. Secondary structure contributions were determined by circular dichroism (CD) to be ∼16% α-helix, ∼40% β-sheet, ∼13% β-turn and ∼31% others. These in vitro prepared fibrils of CsgA were more resistant to proteinase-K as compared to the soluble form. A comparison of the observed chemical shifts with the random-coil values[150,151] indicates the presence of major and minor contributions for many of the selectively labeled sites. Quite surprisingly, these chemical shifts indicate both β-sheet and non-β-sheet contributions, which is a major difference between CsgA and other pathological amyloids that are composed of in-register β-sheet fibrils. This situation was similar to that of the HET-s structure which was found to adopt a unique β-solenoid structure.[99] The distance measurements based on dipolar-dephasing experiments were not definitive but indicated a non in-register β-sheet structure. This was supported by the MPL measurements and EM images showing the fibrils having a narrow (∼3–4 nm) width. Proposed flexible loops that may be connecting the rigid repeat units were not observed by NMR. A CsgA structural model based on amino acid contacts from a multiple sequence alignment of homologous proteins suggests that the hairpins stack on top of each other to form a β-helical structure, Fig. 6,[48,152] similar to HET-s and the recently determined structure of a HET-s homolog HELLF functional amyloid fibrils.[99,153] The model is consistent with the repeats identified for CsgA, Fig. 2. The Alphafold structural model is overall similar to this model, and both are consistent with the NMR data described above.[154] A more recent ssNMR study revealed more insights on the structural fold of CsgA and supports the proposed models, Fig. 7.[149] This work presents evidence via ssNMR that the biofilm-extracted and recombinant produced proteins adopt a similar homogeneous structure. The spectral resolution (∼0.35 ppm) of these preparations is superior compared to that of the lyophilized samples,[148] and similar to that of HET-s or αSyn.[99,129,155] Sample preparation of amyloid fibrils remains a challenge and needs to be optimized carefully for each system. The segmental labeling strategy of a single repeat unit R1 (21–69 amino acids) greatly simplifies the NMR spectrum and allows sequential assignment and secondary structure propensity determination.[156] These results support the strand–loop–strand structural motif with two β-strands from S43–Y50 and S55–T61 separated by a short rigid glycine loop. Moreover, the R1 repeat unit and the full CsgA adopts a similar β-solenoid structure, and most probably all the repeats are structurally equivalent. Observation of long-range distance restrains defines a tight turn between the individual β-strands of the repeats. Finally, ssNMR spectra monitoring only mobile segments contain signals from the N-terminus and the loop that connects repeats.

FapC

A detailed structural and biophysical study depicts our current understanding of the FapC functional amyloid, Fig. 7.[157] In this work, full-length (FL: NR1L1R2L2R3C) and three truncation mutants (NL1L2R3C, L2R3C and R3C) were studied in detail. The effect of loop regions on the fibrillation and fibril structure was studied using these different constructs. The X-ray diffraction analysis resulted in the typical reflections at ∼4.6 and ∼10 Å, as well as an additional ∼3.9 Å reflection for all preparations. The smaller distance reflection was assumed to arise from the twisting of the fibrils. This indicates that truncation mutants comprise similar structures to the FL FapC. EM and infrared spectroscopy (IR) provided additional proof that the truncation preparations are structurally similar to the FL FapC. Initial sequence analyses predicted three imperfect repeats separated by linker regions that are assumed to be flexible loops.[48,158] The repeats contain mostly hydrophobic amino acids, whereas the linker and N-terminal region contain charged amino acids that were the origin of the initial nomenclature. This picture is modified now by the determination of four amyloidogenic hot spots in FapC (between 37–43, 102–111, 157–163 and 215–219), present in the linkers, termini and R3 regions. The aggregation of the FL has a lag phase, which is further slowed down by the removal of R1 and R2. The shortest truncation unit R3C aggregates via secondary nucleation, whereas, FL aggregates via primary nucleation and elongation. Surprisingly, the predicted loop regions increase the aggregation propensity, highlighting the importance of the non-repeat regions for FapC. The removal of all three repeats still results in amyloid fibril formation,[53] indicating the presence of an amyloidogenic hot spot in the linker regions. A recent study by Pedersen and coworkers expands the macromolecular structural details of FapC amyloid fibrils.[159] In this work, an MPL value of ∼33 kDa nm−1 was determined by EM and small angle X-ray scattering (SAXS), indicating that the mature fibrils consisted of two-protofilaments per fibril. The protofilaments were formed without any intermediate species besides monomers and oligomers, which then mature into more compact fibrils. 2D ssNMR spectra revealed further insights about FapC.[157] First of all, the spectral quality is decent compared to the CsgA, however, it was significantly lower compared to HET-s. Nevertheless, the R3C construct gives an opportunity to extract qualitative information due to its reduced complexity. Experiments based on dipolar-coupling probe only the rigid protein parts, and the Thr, Ser and Ala regions indicate chemical shifts characteristic of β-sheet, α-helix and coil segments. This situation is similar to that of CsgA and TasA.[81,148,149] The NMR cross-peaks from the reduced-size R3C construct containing only one repeat and C-terminal resulted in the observation of more number of peaks than that present in the sequence. In R3C there are 1*I, 2*S and 6*A in the rigid repeat region, and 4*T, 3*S and 3*A in the flexible C-terminal. Ile showed more than one cross-peak in the 2D spectrum, although there is a single Ile in the repeat, indicating multiple conformations. Moreover, there are threonine cross-peaks in the spectrum, which only are in the C-terminus part. Overall, these results indicate a complex interplay of the R3C fibril forming region and its polymorphism expressing as cross-peak multiplication. To further elaborate on this picture, INEPT-based NMR was performed in order to detect only very flexible amino acids. The 2D 1H–13C spectrum of this type results in an almost full amino acid coverage indicating that most of the protein can be perceived as flexible, including the Ile from the repeat region. This is in contrast to CsgA observations, where INEPT signals indicate residues only from the highly flexible N-terminal signals.[149] Overall, these NMR findings indicate extreme polymorphism in the FapC system, resulting in reduced resolution and transient flexibility/rigidity. Lack of spectral resolution may indicate that the homogeneous fibrillation may need accessory proteins, such as FapB and/or others. As a final note, the FapC structural model obtained is similar to the CsgA model, highlighting the stacking of β-hairpins.[48,158] One major difference is that the linker/loop regions forming disordered/flexible parts are extending from the fibril core, since they are much longer compared to the CsgA hairpin turns, Fig. 6.

TasA

The first high-resolution structural investigation on TasA was performed in 2018.[149] This work establishes our understanding of TasA-mediated biofilm formation in B. subtilis by following the structural transition of TasA protein in vitro and in vivo (in live biofilm). TasA is a unique example of functional amyloids having a globular protein fold as a soluble monomer. Three structurally different TasA species were produced and trapped for NMR characterization, one soluble monomer and two insoluble high-molecular weight species. From the soluble monomeric globular fold of TasA protein, oligomeric (gel-like) and fibrillar TasA species are formed depending on the pH, temperature and incubation time, Fig. 3D. Freshly prepared TasA28–261 and TasA28–239 samples at pH 7 are mostly monomeric with a small oligomeric fraction, whereas TasA261 favors oligomer formation to a greater extent. The C-terminus of TasA (∼240–261 aa) is unstructured as determined by NMR spectroscopy. The crystal structure was obtained at 1.56 Å for the monomeric TasA239 at pH 4.6. The jellyroll-fold consists of antiparallel β-sheets separated by short helices and longer loops, with surprisingly differential rigidity observed among the proteins. To understand the high-MW components from a fresh and mostly monomeric sample at t = 0 point, two different preparations were done. First, the fresh sample was kept for a long time to obtain a gel-like sample. Second, the pH was shifted to 3 resulting in the formation of fibrils, giving typical cross-β-sheet reflections at ∼4.8 and ∼10.4 Å. From pH 7, a slow pH shift towards the acidic condition retains the folded structure, whereas a rapid pH shift results in unfolding of the monomer. ssNMR of the fibrillar TasA261 sample resulted in a well-resolved proton-detected HSQC-type spectrum, indicating high β-sheet content and a substantial structural change compared to the solution NMR spectrum. On the other hand, the ssNMR spectrum of the gel-like sample produced from TasA239 at pH 7 is remarkably different from the fibrillar ssNMR spectrum. The gel-like sample resembles the soluble monomeric HSQC spectrum with noticeable alterations as more helical/loop fractions. A later study on Bacillus biofilms shows additional aspects of the TasA fibrillar structure.[160] In this work fibrillar B. subtilis TasA samples were compared to their B. cereus homolog. The B. cereus TasA indicates an unfolded monomer, in contrast to the TasA from B. subtilis. The ssNMR analyses revealed the co-existence of β-sheets, α-helices and loops as fibrillar structural elements, as previously shown by Diehl et al.[81] The proteinase-K digestion assay indicated a shorter ∼110 amino acid long resistance part of the TasA assigned to the amyloid core between residues 35 and 144. Surprisingly, when TasA fibrils are prepared under different conditions (pH, salt and protein concentrations)[161] both fibrillar and ‘oligomeric’ samples with different morphologies are observed as seen in the EM and CD spectra. All these indicate the complex biophysics of TasA and how tightly it has to be controlled for production of different protein forms. It remains to be proven which of these in vitro forms will be in vivo relevant. As a final note here, despite the observation of both TasA oligomers,[81,162] no oligomers were captured for CsgA and FapC unless they are forced by using fibrillation inhibitors.[163,164] This is consistent with the fast-fibrillation hypothesis of functional amyloids made out of repeats for efficient self-templating and simple primary-nucleation and elongation mechanism. This is another interesting research avenue that needs more experimental demonstrations. In contrast to the three different TasA forms obtained for the recombinant protein in vitro, a pure fibrillar form is obtained in vivo.[149] The model B. subtilis system that cannot produce TasA (ΔtasA) was used for understanding insights into the native biofilm. This strain can grow healthy, but cannot produce biofilm. Externally supplied recombinant TasA protein reconstitutes the native-like biofilm, Fig. 3C.[63,81] A DCN-labeled TasA261 was used in this model system and a fingerprint HSQC-type ssNMR spectrum was obtained, Fig. 3D. This experimental setup is the first demonstration of in vivo (in the biofilm) ssNMR spectroscopy, to the best of our knowledge. The resulting NMR spectrum with decent resolution contains all the features from the spectrum of the fibrillar sample. In vivo biofilm and in vitro fibrillar spectra overlap to a large extent, however they still have small differences which indicate that additional factors within the biofilm can modulate the protein structure. This remarkable observation indicates that the biofilm contains homogeneous TasA fibrils that are resistant to proteases. In summary, biofilm forming functional amyloids and functional amyloids in general have many properties similar to pathological amyloids. These include, e.g., a common cross-β structure, X-ray diffraction pattern and limited proteolysis results. The details of how these two classes of amyloids form structures are still surprisingly unique. As unique features, functional amyloids have a control over the mechanism that regulates their in vivo properties and fibrillation, whereas pathological ones has not been evolutionary optimized in the same manner due to their lack of function. Structurally we have limited information and except from a special crystalline case of PSMα3, no high-resolution structure is available. Nevertheless, studies unraveling structural features exist as we have summarized above and comprise a solid basis for the future full structure determination studies. With the advancements in structural biology techniques, we expect a bright future with many high-resolution structures enlightening the molecular mechanisms of functional amyloids and biofilm formation in vitro and in vivo.

Interactions between functional amyloids and components present in the biofilm matrix

As mentioned earlier the functional amyloids in bacterial biofilms are formed extracellularly. The assembly process therefore happens in close proximity to both cell membranes and the biopolymers constituting the EPS of the biofilm matrix. The interactions between functional amyloids and other biofilm components during the formation of functional amyloids and maturation of biofilms are therefore crucial to understand the biogenesis of biofilms and have thus been a topic of interest.

Interactions between functional amyloids and lipids

During a study on how the cell membrane of the bacterial cell affects the formation of curli amyloids in E. coli it was found that in the presence of lipopolysaccharides (LPS), which are found in the outer membrane of Gram-negative bacteria, the ability of CsgB to nucleate and seed the formation of CsgA is enhanced.[165] This was found to be due to electrostatic interactions between the lipid bilayer and the CsgB protein. Hence the bacterial cell membrane adds a layer of cooperativity to the formation of curli amyloids ensuring that the formation of the functional amyloids occurs at the cell membrane to tether the amyloid structures to the cells and hence keeping the amyloid structures in place in the biofilm. LPS have also been found to accelerate the aggregation of FapC by bypassing the nucleation lag-phase.[164] This is achieved through reduction of the population of intermediates during aggregation in addition to rearrangement of early aggregates. A similar effect on aggregation is observed for the biosurfactant rhamnolipid which is produced in Pseudomonas species alongside the FapC protein. At high concentrations of the rhamnolipid micelles, they were found to be decorated with FapC fibrils indicating a more long-term interaction between FapC and rhamnolipids. TasA also interacts differently with bacterial cell membranes compared to eukaryotic cell membranes.[166] In the presence of bacterial cell membranes the fibers formed by TasA are disordered and have a different β-sheet signature than fibers formed in the absence of membranes. In the presence of eukaryotic cell membranes TasA fibers display the same morphology and β-sheet signature as in the absence of membranes indicating that the presence of eukaryotic cell membranes does not affect the TasA fiber formation. Similarly the bacterial membranes were found to deform significantly upon interaction with TasA. Again this effect was not observed for eukaryotic cell membranes suggesting that TasA penetrates the bacterial cell membranes but not eukaryotic cell membranes.[166] Other functional amyloids have also been found to interact with lipid cell membranes. S. aureus PSM peptides are able to cause cell lysis and hence interact with the cell membrane in a variety of eukaryotic cells, including monocytes, leukocytes, erythrocytes, epithelial cells, endothelial cells, and osteoblasts.[58,167,168] The lytic activity of the PSM peptides is closely linked to the degree of α-helicity of the peptides in the monomeric form[169] and can furthermore be counteracted by serum lipoproteins.[170] Besides the α-helicity of the peptides the presence of cholesterol also increases the susceptibility to lysis in vesicles containing between 10 and 30 mol% cholesterol.[169] However, when a shift from liquid-disordered to liquid-ordered state occurs in the lipid bilayer upon addition of higher amounts of cholesterol, a decrease in lytic activity from the PSM peptides occurs. Based on these results it was speculated that differences in cholesterol concentration in phagosomes from different cell types and different cell species could impact PSM-mediated phagosome escape and subsequent survival of S. aureus during an infection. The ability of lipid bilayers to alter the aggregation of proteins is not limited to functional amyloids as lipids have also been found to play a key role in the aggregation αSyn. Recently, Galvagnion and coworkers showed that lipid bilayers enhance the rate of heterogeneous primary nucleation of αSyn by three orders of magnitude.[171]

Interactions between functional amyloids and extracellular polymeric substances

Since the formation of functional amyloids occurs extracellularly, other biomolecules of the biofilm matrix come in close proximity with functional amyloids. Interactions with the other matrix components could therefore play a role during the formation of the functional amyloids. The EPS of the biofilm shows high biodiversity with distinct biomolecules being expressed in different bacterial species however eDNA is emerging as a nearly universal component of the biofilm across species even including fungi.[172] eDNA is found to play a role in facilitating adhesion[173] and promoting intercellular aggregation[174] along with providing structural integrity to the biofilm architecture.[175-177] During in vivo biofilm formation curli fibers have been found to bind eDNA forming curli–eDNA complexes that are highly immunogenic. The complexes caused cytokine production through activation of immune cells. When administered systemically the complexes caused immune activation and formation of autoantibodies thus triggering autoimmunity. Furthermore, the interaction with eDNA and DNA from other sources (genomic and from salmon sperm) accelerated the aggregation of CsgA in vitro.[178] Additionally, it has been shown that the presence of eDNA is required for functional amyloid formation in S. aureus biofilm formation in vivo.[179] Analogous to CsgA, eDNA promotes the aggregation of PSMα1 in vitro.

Interactions between functional amyloids and other extracellular components

The highly sulphonated glycosaminoglycan, heparin, has frequently been used as a soluble mimic of the human extracellular matrix component heparan sulphate. Additionally, heparin is used as an anticoagulant in catheters and hence it is a molecule potentially encountered by functional amyloids in bacterial biofilms. Heparin has been found to promote S. aureus biofilm formation[180,181] and it has been suggested that heparin can mimic and even act as a substitute for eDNA in S. aureus biofilm formation.[182] In an in vitro study using heparin, a complex mechanism of interaction is seen with PSM peptides. Heparin was found to accelerate the aggregation of all αPSM peptides except PSMα2 but simultaneously decelerates the aggregation of βPSM peptides. In the case of PSMβ2 heparin is even capable of altering the dominating molecular mechanism of aggregation from primary nucleation and elongation dominated to being dominated by secondary nucleation.[183] In summary, the formation of functional amyloids in the extracellular environment of the biofilm appears to be optimized ensuring that the biomolecules in the biofilm are generally beneficial for the aggregation process.

Functional amyloids and cross-seeding

Cross-seeding between different functional amyloids

The interactions between functional amyloids and other amyloids have been gaining attention in recent years due to the recent more detailed understanding. For the E. coli curli fibrillogenesis it has been investigated how the major curli subunit CsgA interacts with the minor curli subunit CsgB. In vitro, both CsgA and CsgB are capable of forming amyloid fibers which morphologically resemble the fibers found in vivo,[4,37] however in vivo CsgA alone cannot form amyloid fibers. CsgA is secreted away from the cell as a soluble protein in the absence of CsgB,[184] and CsgA is not assembled into insoluble fibers in the absence of CsgB in vivo.[185] CsgB was found in vivo to aggregate on its own as overexpression of CsgB in E. coli resulted in self-assembly of CsgB into extracellular fibers located at the bacterial surface.[186] The nucleating ability of CsgB towards CsgA was confirmed through several studies. In inter-bacterial complementation surface localized CsgB from one mutant strain can convert soluble secreted CsgA from another mutant strain into amyloid fibers and also nucleate endogenously added recombinant purified CsgA.[4,40,185] Substoichiometric concentrations of CsgB have been found to alter the mechanism of aggregation of CsgA. In the absence of CsgB, CsgA was found to aggregate through the formation of amorphous aggregates but in the presence of CsgB ordered oligomers with a β-sheet structure were seen, thus resembling the mechanism seen for CsgB in the absence of CsgA.[187] Through investigation of the individual domains in the proteins the C-terminal parts of both CsgA and CsgB are found to play a crucial role in the nucleation mechanism. Besides being amyloidogenic, the C-terminal repeat 5 (R5) in CsgA has been found to be critical for in vitro nucleation of CsgA.[40] The R4 and R5 of CsgB are required to tether the CsgA aggregates to the lipid membrane, and mutants lacking these domains are no longer localized to the membrane but are secreted into the extracellular space. The C-terminal repeat domain, R5, is required to anchor the protein to the cell membrane although this domain in CsgB is not amyloidogenic on its own.[188] In addition, CsgA nucleation by a CsgB truncation mutant lacking the C-terminal 19 amino acids is less efficient than full-length CsgB in vivo.[37] While TasA constitutes the major components of the fibers present in B. subtilis biofilm, TapA has been found to co-purify as a minor component of the fibers in a 1 : 100 ratio with TasA.[189]In vitro TapA contributes to the polymerization of TasA into fibers. More specifically, a sequence of 8 amino acids in the N-terminal of TapA has been identified as being crucial in the initiation of the self-assembly of TasA.[190] Additionally, the 5 conserved cysteine residues in TapA play a minor role in the formation of a robust biofilm. The presence of TapA has furthermore been shown to be required in order to anchor TasA fibers to the cell surface.[63,189] Incorporation of d-amino acids into the peptidoglycan results in release of TasA fibers from the cell surface and disassembly of the biofilm.[191] Besides TapA, TasA has also been found to interact with BslA during fiber formation in the biofilm. BslA acts synergistically with other matrix components to facilitate the assembly of the biofilm matrix.[70]In vivo, overexpression of TasA was not sufficient to overcome the effects of deletion of BslA. Besides aiding in the self-assembly of TasA, BslA also self-assembles on its own at interfaces to form a hydrophobic surface layer around the biofilm.[72] Although no specific nucleator protein has been identified for the PSM peptides from S. aureus, interactions between the individual peptides have been investigated though cross-seeding experiments. No specific pattern in cross-seeding was seen between αPSMs and βPSMs but PSMα1 was found to cross-seed all the other PSM peptides.[192] Based on the kinetics of the aggregation of PSM peptides along with the cross-seeding it was speculated that PSMα3 that aggregates extremely fast is able to kick-start the functional amyloid formation in biofilm. The PSMα3 aggregates are then able to cross-seed PSMα1 that normally aggregates much slower in the absence of PSMα3 aggregates. The PSMα1 aggregates can then proceed to cross-seed all the other remaining PSM peptides to ensure that even the more slowly aggregating peptides aggregate on a timescale relevant for the bacteria during biofilm formation. This ensures that βPSM peptides which have been found to have the most pronounced impact on S. aureus biofilm structuring[60] are able to aggregate in a time scale relevant for the bacteria. This mechanism of less amyloidogenic peptides/proteins being seeded by more highly amyloidogenic peptides/proteins has also been observed for other functional amyloids not involved in biofilm formation. Different members of the CRES-family which comprise part of the complex extracellular amyloid matrix in mouse spermatozoa have been found to cross-seed each other in a manner similar to the PSM peptides and these are also seeding of CsgA by CsgB.[193] The cross-seeding between the CRES subgroup members suggests that it may be an evolutionarily conserved mechanism to control the assembly of some functional amyloids.

Cross-seeding between functional amyloids and pathological amyloids

Cross-seeding between functional amyloids and pathological amyloids is an emerging interesting research area due to recent discoveries of a link between functional amyloids in the gut and progression in neurodegenerative diseases. During the progression of PD it has been observed that αSyn aggregates in the form of an αSyn-immunoreactive Lewy body and Lewy neurite pathology spreads through the brain in a progressive manner from the brainstem to the telencephalon.[194] In the earliest stages aggregates are detected in the olfactory bulb, as well as in the dorsal motor nucleus of the vagus nerve in the medulla oblongata. In later stages αSyn aggregate pathology is found more widespread in the brainstem via the pons and midbrain, in the basal forebrain and, ultimately, in the neocortex. The spreading of aggregated species of αSyn in mice has been demonstrated to happen through cell-to-cell transmission of pathologic αSyn aggregates in anatomically interconnected regions upon injections of synthetic αSyn fibers leading to nigrostriatal degeneration.[195] Furthermore, the injections of αSyn aggregates in rats and the subsequent pattern of αSyn accumulation observed suggests that the cell-to-cell spreading occurs only through retrograde transmission.[196] In rodent models the cell-to-cell spreading of αSyn aggregates has also been used to demonstrate a spreading and propagation from neurons in the gut to the brain through the vagus nerve and/or the spinal cord.[197-199] Furthermore, the bacterial composition of the gut microbiome has been observed to lead to increased neuroinflammation and αSyn aggregate deposits in the brain of αSyn expressing mice. As a link emerges between neurons in the gut and the brain and the possible spread of pathological amyloid species, the interactions between functional amyloids formed from bacteria and pathological amyloids prove to be an interesting topic highly related to bacterial biofilm forming functional amyloids.[200] In mice it was found that monomeric CsgA was able to accelerate the aggregation of αSyn even under sub-stoichiometric conditions.[200] The accelerated aggregation occurs through transient interactions between CsgA and αSyn. In the same mouse model Tau aggregation was not found to be accelerated by CsgA monomers. When αSyn is overexpressed in mice it was seen that simultaneous colonization with curli-producing E. coli in the gut promotes αSyn pathology in both the gut and the brain. Furthermore, curli expression is required for E. coli to intensify αSyn-induced behavioral deficits, including intestinal and motor impairments in the mice. Interestingly CsgA is not the only component of the curli apparatus found to interact with αSyn. CsgC and CsgE, both of which act as periplasmic chaperones and prevent aggregation of CsgA intracellularly,[4,38] also interact with αSyn although in different ways. CsgC inhibits the aggregation of αSyn while CsgE surprisingly accelerates the aggregation of αSyn.[201] PD is not the only neurodegenerative disease to be linked to functional bacterial amyloids, as CsgA pre-formed seeds have been found to also accelerate the aggregation of Aβ, and fibrillation inhibitors are shared between CsgA and Aβ.[202] Curli cross-seeding exerted a complicated concentration dependent effect on Aβ1–40 fibrillogenesis kinetics.[203] In human semen fragments of prostatic acid phosphatase (PAP248–286) increase HIV infection efficiency through increased virus adhesion to target cells. This enhancement of infection only occurs when PAP248–286 is in the form of amyloid aggregates termed SEVI (Semen Enhancer of Viral Infection), however monomeric PAP248–286 aggregates very slowly and exogenous factors have been suggested as a promoter of SEVI fiber formation in vivo.[203] Indeed it has been demonstrated that curli amyloids of both CsgA and CsgB can act as a catalyst for SEVI formation from PAP248–286. At low concentrations in vitro, cross-seeding with curli fibers results in fibers of PAP248–286 that retain the ability to enhance HIV infection. Kinetic analysis of the cross-seeding of PAP248–286 with curli moderately affects the nucleation rate while significantly enhancing the growth of fibers from existing nuclei and hence the elongation rate. During the study it was also shown that curli amyloids of both CsgA and CsgB were able to cross-seed IAPP leading to a decreased lag-time but simultaneously a strong inhibition is observed in IAPP elongation.[203] FapC has also been found to interact with proteins associated with neurodegenerative diseases, specifically αSyn and Aβ. In vitro FapC can seed Aβ while templating the structure of the resulting Aβ amyloid fibers to resemble that of FapC fibers.[204]In vivo in zebrafish larvae FapC was able to induce accelerated Aβ aggregation and Aβ pathology. In adult zebrafish FapC was found to accelerate Aβ-induced cognitive pathology. To extend the results to the bacterial biofilm it was found that co-injection of Aβ and the protein part of P. aeruginosa biofilm samples into adult zebrafish elevated Aβ pathology. This elevated pathology was not observed in zebrafish injected with only the biofilm protein sample. Based on these results Aβ can also interact with the fibers when they are present in a complex setting such as a biofilm despite the morphology of the biofilm protein sample being different from that of FapC. The changes in morphology were attributed to the presence of other structural components of the biofilm in the sample.[204] FapC is also capable of interacting with αSyn. FapC binds in vitro to αSyn monomers, oligomers, and aggregates.[205] Full-length FapC protein slightly accelerates αSyn aggregation while the FapC deletion mutant where all three repeat regions have been deleted and hence only the loop regions remain (ΔR1R2R3) increases the lag-time of αSyn aggregation. The fibrillation inhibition was found to be due to the formation of hetero-oligomers possibly enhanced by disulfide bond formation in FapC ΔR1R2R3. Other deletion mutants where only one or two repeat regions had been deleted did not increase the lag-time of αSyn. Interestingly, only the monomeric protein is responsible for the interactions between FapC and αSyn as no seeding effects were seen for FapC aggregates on αSyn aggregation. Although several links have been found between functional bacterial amyloids and neurodegenerative diseases, it cannot be concluded that biofilm formation is involved in the causative effects of such neurodegenerative diseases. For example, it has also been shown that biofilm can actually protect against protein aggregation. In C. elegans the probiotic B. subtilis has been found to inhibit αSyn aggregation by altering the host sphingolipid metabolism and thereby reduce αSyn aggregation through biofilm formation in the gut.[206] In summary, functional amyloids are found to interact with a wide variety of other proteins both from bacteria and from humans. They are even emerging as contributing factors involved in the development of the pathology of various human diseases with pathology linked to protein aggregation.

Concluding remarks

It is becoming increasingly clear that functional bacterial amyloids involved in biofilm formation are not solitary inert structural components in the biofilm. Cross-talk between different aggregated species is emerging as a common theme and not only a phenomenon observed between various functional amyloids and pathological amyloids. Self-assembled structures of various metabolites have also been found to induce aggregation of pathological amyloid species. Fibrillar aggregates of homocysteine have been found to display cross-talk with Aβ,[207] and amyloid-like aggregates of quinolinic acid have been observed to cross-seed αSyn aggregation.[208] Despite the recent focus and many new insights gained in the field, the molecular mechanisms underlining the interactions between different amyloid species are very limited. This lack of molecular insight limits possibilities to develop strategies to better combat biofilm related bacterial infections. We hope that this review will make a contribution towards a more complete understanding. Our fight against AMR and biofilm related infections relies strongly on our knowledge of the molecular details of biofilm forming functional amyloids. With the recent fast pace and exponential growth in biofilm, amyloid fibril and functional amyloid related research we believe that the future is very bright. As of today, structural and mechanistic models are predominantly still missing, along with high-resolution structures of the key elements of this complex network in biofilms. Recent developments in the field of structural biology by cryo-EM and solid-state NMR will pave the way towards our atomic level understanding of proteins in vitro and in vivo. All in all, we will be able to understand, manipulate and fight persistent infections, AMR, biofilms and amyloids forming them.

Author contributions

The manuscript was conceived, designed and written by UA and MA with equal contributions.

Conflicts of interest

There are no conflicts to declare.
  202 in total

Review 1.  Mechanisms of biofilm resistance to antimicrobial agents.

Authors:  T F Mah; G A O'Toole
Journal:  Trends Microbiol       Date:  2001-01       Impact factor: 17.079

2.  BslA is a self-assembling bacterial hydrophobin that coats the Bacillus subtilis biofilm.

Authors:  Laura Hobley; Adam Ostrowski; Francesco V Rao; Keith M Bromley; Michael Porter; Alan R Prescott; Cait E MacPhee; Daan M F van Aalten; Nicola R Stanley-Wall
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-31       Impact factor: 11.205

3.  D-amino acids trigger biofilm disassembly.

Authors:  Ilana Kolodkin-Gal; Diego Romero; Shugeng Cao; Jon Clardy; Roberto Kolter; Richard Losick
Journal:  Science       Date:  2010-04-30       Impact factor: 47.728

4.  Investigating the lytic activity and structural properties of Staphylococcus aureus phenol soluble modulin (PSM) peptide toxins.

Authors:  Maisem Laabei; W David Jamieson; Yi Yang; Jean van den Elsen; A Toby A Jenkins
Journal:  Biochim Biophys Acta       Date:  2014-09-03

5.  Solid-state NMR spectroscopy.

Authors:  Bernd Reif; Sharon E Ashbrook; Lyndon Emsley; Mei Hong
Journal:  Nat Rev Methods Primers       Date:  2021-01-14

6.  The cytotoxic Staphylococcus aureus PSMα3 reveals a cross-α amyloid-like fibril.

Authors:  Einav Tayeb-Fligelman; Orly Tabachnikov; Asher Moshe; Orit Goldshmidt-Tran; Michael R Sawaya; Nicolas Coquelle; Jacques-Philippe Colletier; Meytal Landau
Journal:  Science       Date:  2017-02-24       Impact factor: 47.728

7.  Limits of Resolution and Sensitivity of Proton Detected MAS Solid-State NMR Experiments at 111 kHz in Deuterated and Protonated Proteins.

Authors:  Kai Xue; Riddhiman Sarkar; Carina Motz; Sam Asami; Diana C Rodriguez Camargo; Venita Decker; Sebastian Wegner; Zdenek Tosner; Bernd Reif
Journal:  Sci Rep       Date:  2017-08-07       Impact factor: 4.379

8.  Designed α-sheet peptides suppress amyloid formation in Staphylococcus aureus biofilms.

Authors:  Alissa Bleem; Robyn Francisco; James D Bryers; Valerie Daggett
Journal:  NPJ Biofilms Microbiomes       Date:  2017-07-03       Impact factor: 7.290

9.  Reducing the Amyloidogenicity of Functional Amyloid Protein FapC Increases Its Ability To Inhibit α-Synuclein Fibrillation.

Authors:  Line Friis Bakmann Christensen; Kirstine Friis Jensen; Janni Nielsen; Brian Stougaard Vad; Gunna Christiansen; Daniel Erik Otzen
Journal:  ACS Omega       Date:  2019-02-22

Review 10.  Ecology and Biogenesis of Functional Amyloids in Pseudomonas.

Authors:  Sarah L Rouse; Stephen J Matthews; Morten S Dueholm
Journal:  J Mol Biol       Date:  2018-05-16       Impact factor: 5.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.