| Literature DB >> 36227846 |
Aaron Yip1, Julien Smith-Roberge2, Sara Haghayegh Khorasani1, Marc G Aucoin1, Brian P Ingalls1,2.
Abstract
Spatiotemporal models that account for heterogeneity within microbial communities rely on single-cell data for calibration and validation. Such data, commonly collected via microscopy and flow cytometry, have been made more accessible by recent advances in microfluidics platforms and data processing pipelines. However, validating models against such data poses significant challenges. Validation practices vary widely between modelling studies; systematic and rigorous methods have not been widely adopted. Similar challenges are faced by the (macrobial) ecology community, in which systematic calibration approaches are often employed to improve quantitative predictions from computational models. Here, we review single-cell observation techniques that are being applied to study microbial communities and the calibration strategies that are being employed for accompanying spatiotemporal models. To facilitate future calibration efforts, we have compiled a list of summary statistics relevant for quantifying spatiotemporal patterns in microbial communities. Finally, we highlight some recently developed techniques that hold promise for improved model calibration, including algorithmic guidance of summary statistic selection and machine learning approaches for efficient model simulation.Entities:
Mesh:
Year: 2022 PMID: 36227846 PMCID: PMC9560168 DOI: 10.1371/journal.pcbi.1010533
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.779
Fig 1Modelling frameworks commonly used for capturing the behaviour of microbial communities, with associated spatial scales.
ABM, agent-based model; ODE, ordinary differential equation; PDE, partial differential equation.
Fig 2Single-cell measurement techniques for microbial communities.
(a) Flow cytometers can measure cell fluorescence and morphology in large sample sizes. (b) Time-lapse microscopy allows for direct visualization of physical cell–cell interactions and quantitative measurement of single-cell characteristics over time. (c) End-point microscopy can illustrate large-scale spatial patterns and features of cell arrangements in 2 or 3 dimensions.
Fig 3Model calibration techniques for spatiotemporal models of microbial communities.
Manual fitting involves direct adjustment of parameter values to achieve qualitative agreement between model predictions and observations. Nonspatial calibration is often systematic (based on a goodness of fit function) but is based on experiments that do not incorporate the spatial features of the system. Spatial calibration, against spatially distributed data, can be systematic (SSE-based) but must rely on summary statistics collected from the data.
Summary statistics for quantifying features in microbial communities.
| Statistic | Goal of Statistic | Computational Details | Reference |
|---|---|---|---|
|
| |||
|
| |||
| Microcolony aspect ratio | Quantify eccentricity of developing colony | Standard image processing feature, defined in 2D (or 3D, projected to 2D) | [ |
| Dyad structure | Characterize structure of 2-cell “colony” immediately before the second division | Normalized dot product of the 2 cells’ orientation | [ |
| Biofilm base circularity | Characterize shape of the biofilm base | Unity minus aspect ratio of projection onto the horizontal plane | [ |
|
| |||
| Microcolony density | Quantify packedness of cells within developing colony | Standard image processing feature, defined locally or globally | [ |
| Order parameter | Quantify anisotropy within developing colony | Mean of projections of orientation of neighbouring cells; defined per-cell, recorded as a colony average or as a distribution | [ |
| Correlation length of scalar order parameter | Characterize “patchiness”: spatial scale over which orientation of neighbouring cells is aligned | Correlation of orientation as a function of distance; can be compared as a mean or a distribution | [ |
| Micropatch area | Quantify “patchiness”; similar to correlation length of scalar order parameter | Cells are clustered into patches based on contact and relative orientation | [ |
| Topological defect density | Characterize “patchiness”: density of topological defects (i.e., discontinuities in the order-parameter field) | Algorithm provided in [ | [ |
| Defect velocity | Characterizes the evolution of a microcolony’s internal structure | The position of topological defects is tracked over time | [ |
| Age distribution of cell poles within the developing microcolony | Characterize degree of mixing during colony development | Simple measure is distance from centre of colony to oldest cell poles. More complete measures additionally account for younger poles | [ |
|
| |||
| Orientation of cells at the microcolony boundary | Characterize tendency of boundary cells to align with the colony boundary | Colony boundary must be determined by a smoothing operation; cells on the boundary and the corresponding boundary orientation must be identified | [ |
| Gradient of cell velocity normal to microcolony boundary | Characterize growth inhibition due to pressure gradients | Measured by particle-image velocimetry | [ |
| Cell–cell distance | Characterize cell spacing | Centroid-to-centroid distance to nearest neighbour | [ |
| Vertical and radial alignment | Characterize 3D structure; identify transition from monolayer to multilayer growth | Angle formed by the z-axis and cell’s major axis | [ |
|
| |||
|
| |||
| Single-strain population counts; population fractions | Captures population abundances | Cell counts for each population | [ |
| Shannon species diversity index | Species biodiversity metric | Determined from cell counts for each population | [ |
|
| |||
| Shannon entropy | Quantify randomness of pixel identities in an image | Total population of each species, heterospecific neighbouring cell counts | [ |
| Intensity correlation quotient | Characterize colocalization or exclusion of pairs species in space | Determined by sum of pixel intensities for each fluorescence channel | [ |
| Contagion index | Quantify dispersion and intermixing of different populations; deviation from maximum entropy state | Total population of each species, heterospecific neighbouring cell counts | [ |
| Probability matrix for adjacent species identities | Global quantification for likelihood of nearest interspecific adjacencies | Identify neighbour to cell centroid | [ |
| Neighbour index | Characterize interspecific adjacencies relative to the initial adjacencies | Count physical contacts between pairs of cells of different phenotypes | [ |
|
| |||
| Proportion of conspecific neighbours | Quantify interspecies mixing from a probability distribution; | Probability that a cell is located a defined distance away from other members of its own species | [ |
| Structure factor | Quantifies characteristic length scales of spatial patterning; used to characterize transition from well-mixed to structured populations | Normalized spatial Fourier transform of image data | [ |
| Segregation index | Normalized metric to quantify population segregation | Cell neighbourhood interaction distance, heterospecific neighbouring cell counts | [ |
| Colony edge roughness | Increased in communities with antagonistic interactions | Standard deviation of microcolony radius | [ |
| Fractal dimension | “Jaggedness” of species boundaries | Distance of each pixel to nearest border of 2 different populations; algorithm provided in [ | [ |
|
| |||
| Intermixing index | Estimate spatial mixing between multiple species | Average number of species transitions along a straight line or arc | [ |
| Single-strain sector size | Indirect measurement of spatial mixing between multiple species | Length scale of single-strain patches | [ |
|
| |||
| Order parameter; phase transitions | Quantify synchrony in gene expression between populations | Spatiotemporal data are processed into kymographs; algorithm introduced in [ | [ |
Fig 4Calculating spatiotemporal summary statistics for microbial communities.
(a) Population counts over time capture the overall dynamics in a multispecies community. (b) The frequency of adjacent species in physical contact, determined by a contact network, provides a measure of intermixing between different species. (c) Summary statistics can be calculated from data averaged within a particular cell’s neighbourhood. (d) Single-species patch metrics, such as patch width and number of sectors, are useful for quantifying spatial patterns on a larger colony scale.
Fig 5Persistence homology.
Panels (a-c) illustrate the topology of a dataset changing as the length scale, L, is varied. (a) For small values of L, the balls (disks) are mostly disconnected; only 2 of the 9 intersect. (b) At an intermediate scale, all 9 balls intersect, forming a single connected component, giving rise to a loop. (c) At larger scales, there is a single connected component and no loop. (d) The progression illustrated in (a-c) is documented in the persistence barcode; the blue bars correspond to separate connected components, the ends of which corresponds to intersection (merge) events, e.g., at L = L. The red bar corresponds to the loop, which forms at L = L and which becomes filled in at L = L. (e) The same information can be represented in persistence diagram in which the (x,y) coordinates of points correspond to the right and left ends, respectively, of each bar in the barcode.
Requirements for generating training data for microbial community models.
| Model Type | Parameters | Simulations Required | Time per Simulation (h) | Serial Computing Time (h) | Est. Parallel Computing Time on 64 CPU cores (h) | Reference |
|---|---|---|---|---|---|---|
| PDE | 231 | 100,000 | 0.0972 | 9,720 | 152 | [ |
| ABM | 32 | 300 | 5–6 | 1,500–1,800 | 23–28 | [ |
| ABM | 7 | 100 | 6–8 | 600–800 | 9–13 | [ |