Literature DB >> 35041653

DeLTA 2.0: A deep learning pipeline for quantifying single-cell spatial and temporal dynamics.

Owen M O'Connor^1,2, Razan N Alnahhas^1,2, Jean-Baptiste Lugagne^1,2, Mary J Dunlop^1,2.

Abstract

Improvements in microscopy software and hardware have dramatically increased the pace of image acquisition, making analysis a major bottleneck in generating quantitative, single-cell data. Although tools for segmenting and tracking bacteria within time-lapse images exist, most require human input, are specialized to the experimental set up, or lack accuracy. Here, we introduce DeLTA 2.0, a purely Python workflow that can rapidly and accurately analyze images of single cells on two-dimensional surfaces to quantify gene expression and cell growth. The algorithm uses deep convolutional neural networks to extract single-cell information from time-lapse images, requiring no human input after training. DeLTA 2.0 retains all the functionality of the original version, which was optimized for bacteria growing in the mother machine microfluidic device, but extends results to two-dimensional growth environments. Two-dimensional environments represent an important class of data because they are more straightforward to implement experimentally, they offer the potential for studies using co-cultures of cells, and they can be used to quantify spatial effects and multi-generational phenomena. However, segmentation and tracking are significantly more challenging tasks in two-dimensions due to exponential increases in the number of cells. To showcase this new functionality, we analyze mixed populations of antibiotic resistant and susceptible cells, and also track pole age and growth rate across generations. In addition to the two-dimensional capabilities, we also introduce several major improvements to the code that increase accessibility, including the ability to accept many standard microscopy file formats as inputs and the introduction of a Google Colab notebook so users can try the software without installing the code on their local machine. DeLTA 2.0 is rapid, with run times of less than 10 minutes for complete movies with hundreds of cells, and is highly accurate, with error rates around 1%, making it a powerful tool for analyzing time-lapse microscopy data.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35041653 PMCID： PMC8797229 DOI： 10.1371/journal.pcbi.1009797

Source DB: PubMed Journal: PLoS Comput Biol ISSN： 1553-734X Impact factor: 4.475

Introduction

The automation of hardware and software for microscopy has resulted in researchers’ ability to generate massive datasets containing images of cells over time. For example, in a recent high throughput experiment Bakshi et al. imaged 108 Escherichia coli over days by acquiring 705 field of views every few minutes [1]. Additionally, recent studies have used closed-loop microscopy and optogenetic platforms to control gene expression in single cells in real time [2-4]. These improvements in microscopy have motivated the need for automated image analysis, as traditional approaches that require manual error correction cannot keep pace with the size of these new datasets or the rate at which they can be acquired. More generally, segmentation and tracking have historically required intensive user input as well as custom image processing code or experimental modifications such as the use of dedicated fluorophores [1,5-8]. These requirements limit throughput and can introduce burdensome experimental constraints. To address this, researchers need tools that are rapid, accurate, and require minimal input from the user. This combination of needs is well suited for deep learning-based approaches, and deep convolutional neural networks have enabled fast and accurate analysis of images. Specifically, the U-Net architecture has emerged as the state-of-the-art convolutional neural network for biomedical applications [9]. U-Net uses a “U”-shaped network architecture with a contraction path, where successive convolutional layers are applied to feature maps that are progressively down-sampled, followed by a symmetric expansion path where the low-resolution but high-level encoding of the input image is up-sampled back to the original resolution. In addition, skip-connections are used to concatenate finer detail feature maps used in the contraction path with up-sampled feature maps in the expansion path. Skip-connections allow the network to retain high-resolution information needed to construct the mask at the end of the network. This approach has been widely successful for segmentation of cells [10-14] and for tracking cells from frame-to-frame within time-lapse images [11,13]. Here, we focus on analysis of bacterial time-lapse microscopy data in two-dimensional settings such as agarose pads or within microfluidic chips. With rapid cell cycle times, small cell sizes, and high throughput microfluidic devices, it is possible for researchers to generate large datasets containing thousands of single cells over periods of hours or days. As a result, researchers can use statistical analysis to study the subtle and complex effects of cell-to-cell heterogeneity, gene expression dynamics, and cell-to-cell interactions in isogenic populations [1,6]. This has led to fundamental discoveries related to antibiotic resistance [8,15], and has allowed for accurate characterization of genetic parts and circuits [16]. Further, single-cell time-lapse analysis has revealed that cell division in E. coli is asymmetrical, where daughter cells receiving the ‘old’ pole grow more slowly than daughter cells receiving the ‘new’ pole [17]. However, these effects are subtle, necessitating measurements of many division events to determine statistically significant effects [18]. Until recently, such studies necessitated painstaking semi-manual analysis and curation of microscopy data. Software based on traditional image analysis techniques such as Schnitzcells [19], Oufti [20], and SuperSegger [21] all require significant user input and post-processing. A few recent studies have proposed deep learning models for bacterial cell segmentation such as MiSiC [10], DeepCell [22], and Cheetah [14], and yeast cell segmentation in Yeastnet [12] and Cell-DETR [23], however to our knowledge there is no integrated deep learning segmentation and tracking pipeline for two-dimensional time-lapse analysis of bacteria. In previous work, we developed the Deep Learning for Time-lapse Analysis (DeLTA) pipeline to analyze single-cell growth and gene expression in microscopy images [11]. DeLTA uses two instances of the U-Net model to segment and then track cells. This allows for rapid and robust analysis of time-lapse movies. The original version of DeLTA focused on segmentation and tracking of cells in the ‘mother machine’ microfluidic device [24] where bacteria are constrained to narrow chambers where they grow in a single file line. This powerful design simplifies image analysis and enables experiments that run for many hours or days. However, this constrained geometry is not well suited for the study of two-dimensional effects such as diffusion of chemical signals, proximity-based effects, or co-cultures with mixed populations of cells. Two-dimensional configurations, ranging from microcolonies growing on agarose pads to microfluidic growth chambers, can be used to measure spatial dynamics of cell-to-cell interactions. Examples include quorum sensing [25] and the effect of efflux pumps on neighboring cells in the presence of antibiotics [26]. Segmenting and tracking cells in two dimensions are more challenging than for cells constrained within the mother machine. Segmentation becomes more difficult as microcolonies grow because images can contain hundreds of cells, where any given cell may have neighbors on all sides. The complexity associated with tracking also increases dramatically. In contrast to mother machine data, where frame-to-frame assignments can be limited to the small number of cells within the chamber (typically <10 cells), two-dimensional environments need to consider hundreds of possible assignments. Further, cells can move in any direction and may move large distances, for example if there is drift over the course of the movie. In this manuscript we introduce DeLTA 2.0, a new version of DeLTA that segments and tracks cells in two dimensions. DeLTA 2.0 retains all the functionality of the original version and is fully compatible with mother machine data. In order to make our approach adaptable to different use cases, we minimized the number of pre- and post-processing steps so that most of the analysis is performed by the trainable models. The new version is available open source and uses a fully Python implementation. We have also introduced other improvements to the code to increase accessibility, such as the ability to work with images of arbitrary size and to accept many common microscopy file formats as inputs. We show that DeLTA 2.0 can segment and track co-cultures of bacteria growing on agarose pads and within microfluidic chambers. In addition to fluorescence and cell length, DeLTA 2.0 also records pole age. We use this to record replicative aging and compare the growth rate across generations. DeLTA 2.0 performs well on crowded images and requires no human intervention. The code, installation instructions, and datasets are available open source on Gitlab. We also provide a Google Colab notebook for users to rapidly test DeLTA 2.0 on their own data.

Results

DeLTA 2.0 can track cell length, growth rate, fluorescence, and progeny over time for cells growing in a two-dimensional microcolony. The algorithm takes microscopy images as inputs. It then uses two U-Net convolutional neural network models, one for segmentation and one for tracking, performs lineage reconstruction, and outputs single-cell data (Fig 1A). Segmentation generates information about cell morphology and can be used to identify both normal and filamented cells (Fig 1B and 1C). The tracking step reliably tracks cells across frames, recording division events when they occur (Fig 1D). Lineage reconstruction determines how cells are associated across generations and records features such as pole age (Fig 1E). The algorithm outputs single-cell resolution information for all cells within a field of view (Fig 1F). No human intervention is required to specify any input parameters. This is in sharp contrast to other methods, which typically require inputs, such as cell size or cell type [10,19-21,27].

Fig 1

Segmentation and tracking of cells within a microcolony.

Segmentation and tracking of cells within a microcolony.

(A) DeLTA pipeline consists of segmentation, tracking, and lineage reconstruction. (B) Segmentation example with phase contrast image containing an E. coli microcolony, which is input into a U-Net convolutional neural network to obtain segmentation results. (C) Histogram of cell lengths. Inset shows a zoomed-out version with outliers included. (D) Cell tracking between frames. Representative examples of cell tracking with and without division are shown with a phase contrast image of the ‘previous frame’ on the left, a phase contrast image of the ‘current frame’ in the middle, and a greyscale image of the ‘prediction’ on the right. The ‘current frame’ also shows the tracking prediction overlayed. The ‘prediction’ shows the U-Net output with the ground truth overlayed (S1 Fig). (E) Lineage reconstruction keeps track of cell lineages and records pole age. (F) Plot of cell lengths over time. Black line is a representative example of one cell’s length as it grows and divides; all cells in the microcolony are shown in grey. DeLTA 2.0 can process datasets of various dimensions quickly and robustly. To evaluate its speed and accuracy, we used a movie from the literature that had been segmented and tracked with manual correction that DeLTA had not been trained on (Methods). It took 8 mins and 49 secs to conduct complete analysis of this time-lapse movie containing 69 frames, where the final frame contains 232 E. coli cells (S1 Movie). This analysis was conducted on a desktop computer with a Nvidia Quadro P4000 graphics card. In addition to being fast, DeLTA also has a low error rate. For the 3,286 cells segmented in the test set, there was a segmentation error rate of 0.01%. We defined a correct segmentation prediction as any case where the cell annotation in the model prediction had more than three quarters of its pixels overlapping with the ground truth data. This allows subtle differences between the prediction and ground truth to be considered acceptable, such as slight discrepancies in the exact location of the cell perimeter. In addition, if the model made a prediction where it erroneously connected two cells together, we defined this as two errors. Erroneously predicting a split cell was counted as one error. Because it can be difficult to assess the exact frame at which a cell divides, we did not count predictions where the model determined division events to be up to three frames later or earlier than in the ground truth as an error. Other metrics for assessing segmentation errors also showed good agreement with our findings that error rates are low (S2A Fig). The segmentation model tends to slightly under-segment due to emphasis on getting the cell borders classified correctly (S2B Fig). To assess the tracking error rate, we processed 6 movies of E. coli growing on agarose pads that the model had not been trained on. Out of the 17,622 tracking events, we measured an average error rate of 1.02%. We defined a correct prediction as a case where the cell assignment from one timepoint to the next matched the ground truth. Cases where the model assigned two cells as the daughters of a cell from the previous frame when there was in fact no division event, and cases where the model assigned no cell when the cell was in fact still within the field of view, were counted as tracking errors. The DeLTA 2.0 algorithm has several improvements over the original version of DeLTA [11]. The new code is a purely Python workflow; movies do not need to be pre- and post-processed in Matlab. This transition allows the entire pipeline to exist in an open-source framework. We do provide code that can be used to convert the output to a Matlab file for users that are more comfortable working in this environment for post-processing data. In addition, in DeLTA 2.0 we take advantage of the Bio-Formats toolbox for Python [28,29]. This allows users to work directly with images in many common formats that are output via microscopy software, including nd2, czi, ome-tiff files, and many more, without the need for any preformatting. We also made updates to the code that increase its flexibility, while optimizing for performance. For example, DeLTA 2.0 can accept input images of various sizes. For large images (>512x512 pixels), DeLTA 2.0 will automatically crop the image into smaller windows for segmentation and then stitch the outputs back together. We note that large movies can cause memory issues, however the size limit at which this occurs will depend upon the configuration of the system on which it is run. For example, in our configuration the analysis of a time-lapse experiment with images of 1024x1024 pixels over 865 timepoints used 14GB of computer memory. Since the original DeLTA code was optimized for images from the mother machine, where cells are constrained to one-dimensional chambers, tracking was relatively straightforward. In two dimensions, tracking is a more complex task and the number of cells that need to be tracked simultaneously increases dramatically. It can be challenging to identify which cells are associated with which lineage. To improve tracking speed, we crop a 256x256 pixel area around the cell of interest. This approach works because a single cell is expected to remain in the immediate vicinity of where it was in the preceding frame, so it is reasonable to restrict the search for daughter cells to the local area. These coordinates are then used to crop the three other inputs (previous phase contrast image, current phase contrast image, and current segmentation). The dimensions of these cropping windows can be changed in the configuration file. To reduce overfitting, DeLTA 2.0 uses several new data augmentation operations while training. In addition to operations such as random shifting, scaling, rotation, flipping, and illumination, which were present in the original software, we added three new functions, two for segmentation and one for tracking. To help deal with an occasional out of focus frame, we added a blurring function that slides a Gaussian kernel over the image (Methods). In addition, electronic noise is another issue when dealing with biological samples where the minimization of the total exposure to excitation light decreases the signal-to-noise ratio of the camera’s sensor. To deal with this, we added a function that adds Gaussian noise (Methods). To simulate exaggerated cell movement during tracking, such as when an agarose pad dries out and causes the field of view to shift over time, we wrote a new augmentation function that introduces image translations between different timepoints (Methods). These operations help expand the training dataset and allow the model to generalize to realistic conditions. Because drift of cells within images is a common concern for some applications, we further characterized the algorithm’s performance under shifts of different sizes (S3 Fig). When the cell density is low, DeLTA can reliably handle shifts of up to ~30 pixels per timepoint in our images, which corresponds to ~4 μm. This problem is exacerbated in conditions where the frame is crowded with cells. When the cell density is high, performance begins to degrade after shifts of ~15 pixels, or ~2 μm. Thus, optimizing experimental conditions to minimize drift is important for high quality analysis. To showcase the utility of DeLTA 2.0, we performed several experiments where we grew E. coli microcolonies on agarose pads and analyzed the output. First, we used DeLTA 2.0 to distinguish differences in growth rate between antibiotic resistant and susceptible cells grown in the same field of view. In this experiment, we mixed two strains of E. coli, one containing a tetracycline resistance gene and a constitutively expressed red fluorescent protein (RFP) reporter, and the other without the resistance gene and containing a green fluorescent protein (GFP) reporter. We grew cells in a co-culture on an agarose pad containing an inhibitory concentration of tetracycline (0.5 μg/ml). DeLTA 2.0 reliably segmented cells within the image (Fig 2A). The tetracycline resistance gene allowed the RFP-expressing cells to grow well whereas the tetracycline sensitive GFP-expressing cells grew very slowly (Fig 2B). The RFP and GFP fluorescence of individual cells can be plotted over time and shows the two distinct strains (Fig 2C). By extracting mean fluorescence levels for all cells within the time-lapse images, we found that fluorescence levels for the two populations were well-separated and maintained over time, as would be expected for the constitutive reporters. We also used DeLTA 2.0 to calculate the individual cell growth rates with respect to fluorescence. We observed two distinct clusters, corresponding to RFP cells that grew normally and GFP cells that grew slowly or did not grow (Fig 2D). These results highlight the ability to track cells with different properties simultaneously within the same movie.

Fig 2

Resistant and susceptible strains of E. coli on agarose pads containing an inhibitory concentration of tetracycline.

Resistant and susceptible strains of E. coli on agarose pads containing an inhibitory concentration of tetracycline.

(A) Phase contrast images and associated fluorescence overlays. RFP expressing cells contain a tetracycline resistance gene and GFP expressing cells do not. The magenta and green cell outlines in the fluorescence overlay represent the resistant and susceptible cells, respectively. Region of interest boxes show the areas represented in (B). (B) Representative examples of antibiotic resistant and susceptible cells tracked over time. (C) RFP and GFP fluorescence tracked for individual cells over time. (D) GFP fluorescence versus RFP fluorescence for single cells plotted against growth rate. Fluorescence values are the averages over all the frames for that cell. For growth rate calculations, only cells that were present at t = 150 min were tracked, which is a time point mid-to-late in the movie. The analysis omits those cells that enter the field of view after t = 150 min since the growth rates become noisier with less data. Three resistant cell outliers with growth rates of ~1.4 1/hr are omitted from this view. DeLTA 2.0 is well suited for measuring growth and gene expression for many cells within an image. Because of this, another potential application is the study of replicative aging within bacterial microcolonies. Recently studies have shown that non-genetic differences may be passed down to offspring, causing a modest but measurable change in growth rate [7,17,18,30,31]. This can be tracked by recording pole age over time. Rod shaped bacteria have two poles, where one end of the cell is referred to as the ‘old’ pole if it was passed down from the mother. The pole formed after division is referred to as the ‘new’ pole (Fig 3A).

Fig 3

Pole age and its impact on growth rate.

Pole age and its impact on growth rate.

(A) Schematic showing how poles are passed down during a division. When a cell divides, the newly formed poles are defined as the ‘new’ poles (white dot) whereas the poles that were passed down from the mother are defined as the ‘old’ poles (black dot). Scale bar, 2 μm. (B) Pole assignment schematic. When the mother cell with known poles divides, the daughter cell that inherits the mother’s old pole is denoted ‘O’ whereas the daughter that inherits the mother’s new pole is ‘N.’ For each generation, either an O or an N is appended to the pole history. (C) Growth rate within each generation. The growth rate of an individual cell is calculated for the period right after the mother’s division until right before the cell divides again. To reduce noise, only cells present for at least three frames were included in the analysis. Daughters (n = 11,246 cells; two tailed unpaired t-test; p-value *** ≤ 0.001), granddaughters (n = 10,726 cells; a one-way ANOVA with post hoc Tukey test used for statistical analysis. Statistical significance: ‘OO’ and ‘NO’ versus ‘NN’ and ‘ON’; p-value ** ≤ 0.01), and great granddaughters (n = 10,217 cells; a one-way ANOVA with post hoc Tukey test used for statistical analysis. Statistical significance: ‘OOO’ and ‘NOO’ versus ‘ONN’,’NNN’,’NON’, and ‘OON’; p-value * ≤ 0.05). Error bars show standard error of the mean. To date, many experiments studying pole age have been conducted in the mother machine microfluidic device due to the ease of tracking cells [7,17,24]. However, a limitation of this approach is that it is only possible to track cells for a small number of generations because older generations are swept out of the imaging chamber while the mother cell’s old pole stays at the dead end of the chamber. For this reason, the original DeLTA algorithm did not track pole age information, and in a division event the algorithm simply assigned the cell closest to the dead end of the chamber to be the mother and the other cell to be the daughter. However, on two-dimensional surfaces cells can be aligned in any orientation, therefore DeLTA 2.0 assigns old and new poles after division based on the position of the septum. To highlight the ability to track pole age over time, we analyzed a movie of E. coli growing in unstressed conditions. At t = 0, we do not know the history of the cells, so the poles are initially unassigned (Fig 3B). After division, the mother’s poles that are passed down become the ‘old’ poles and the newly divided poles are the ‘new’ poles of the daughters. This proceeds over subsequent generations. To keep track of this, we denote the daughter that receives the old pole as ‘O’ and the new pole as ‘N’. When these daughter cells divide, they form cells that are granddaughters of the original mother cell. We append an O or N at the end of the pole sequence to record this. For example, the cell that inherits the original old pole is denoted OO and its sibling is ON. This continues through the generations, such that great granddaughters of the original mother have three letters in their pole sequence (e.g. OON). First, we compared the growth rate of the old and new pole daughters from time-lapse movies of E. coli growing in unstressed conditions. Consistent with prior literature [17,18,30], we found that old pole (O) daughters grew more slowly than the new pole (N) daughters (Fig 3C). Next, we used the rich generational information provided by DeLTA 2.0 to test for differences in growth rate between granddaughters and great granddaughters with different pole ages. We found that growth rate differences were dependent on which pole the cell most recently received. For instance, OO and NO had growth rates that were lower than the NN and ON (Fig 3C). Therefore, the old versus new pole influence upon growth rate is dominated by effects that extend back only one generation. This result was consistent with great granddaughters as well, where growth rates tended to be slower in cells that most recently received an old pole (OOO, NOO, ONO, NNO) than in those that received a new pole (ONN, NNN, NON, OON). These results demonstrate how pole age information can be tracked with DeLTA 2.0, enabling studies on replicative aging. Finally, to test the generality of the algorithm we analyzed a time-lapse movie of >1,000 B. subtilis cells growing in a microfluidic device. This movie was generated in a different laboratory than any of the data that was included in the training set and no data from this movie were included in training. Using the previously trained model with no modifications, we analyzed the new movie and obtained excellent results (S2 Movie). The performance of DeLTA 2.0 on conditions it has not been trained for demonstrates its adaptability. Cropping to remove chamber edges and training to ignore device features such as support posts could further improve performance. This result highlights the broad potential for impact of DeLTA 2.0 in image analysis of two-dimensional bacterial cultures.

Discussion

In this work, we developed a deep learning pipeline that can process time-lapse images of bacterial microcolonies and output single-cell data. Utilizing the U-Net convolutional neural network architecture, the model can rapidly segment and track cells frame-to-frame with a low error rate. We applied this to successfully differentiate growth rates between sensitive and resistant strains of E. coli growing on an agarose pad. This demonstrates DeLTA’s new ability to measure co-culture dynamics, which are hard to capture in devices like the mother machine. DeLTA 2.0 retains all the functionality of the original version of DeLTA and can now be used on mother machine data or microcolonies of bacteria growing in two dimensions. The analysis pipeline is centered around the two trainable models for segmentation and tracking. We avoided hard-coding ad hoc rules such as pre- and post-processing steps or lineage reconstruction rules to the greatest extent possible. The current models work very well on standard rod-shaped bacteria such as E. coli (S1 Movie) and B. subtilis (S2 Movie). This suggests that analysis of cells with similar morphologies such as other Bacillus, Pseudomonas, and Salmonella species will be straightforward. DeLTA 2.0 may require further training for cases where the appearance of the cells deviates from the data the current models were trained on. For example, although the models can handle some elongated cells (Fig 1B and 1C), they are not currently optimized for cells with highly filamented morphologies or cells undergoing stress. However, since we avoided embedding rules in the code that are specific to our use case, we anticipate that DeLTA can be adapted not only to new morphologies after re-training but also to different organisms. For example, Fox et al. used DeLTA 1.0 to train a model to segment yeast cells with high accuracy [2]. Although segmentation works efficiently and has a low error rate, the model sometimes makes incorrect predictions about distinct cells being connected (S4 Fig). In particular, if movie frames are acquired at a high frequency, for example every minute or less, the segmentation model can fluctuate in its decision to split a cell undergoing division, which then complicates tracking. Within isolated images, these errors can be difficult to catch, even for humans. However, by looking at earlier and later frames it is usually possible to identify such errors because cells cannot divide and then merge back together. As a potential future direction, deep learning architectures that use time-series information such as recurrent neural networks could be combined with our models to improve segmentation by incorporating temporal context. The tracking model runs robustly but can slow down when there are hundreds of cells in the image. Because every cell in the frame creates an input for the tracking model, this increases exponentially as the bacterial microcolony grows. At present, for movies with many cells, tracking is the current bottleneck for processing, whereas segmentation is comparatively faster. For example, S2 Movie, which has 100 frames and over 1000 cells in each frame takes a total time of ~1 hour to process with our system configuration, with over 50 minutes attributed to calculating the tracking events. Future efforts to optimize the tracking algorithm could help to address this by avoiding methods that scale linearly with the number of cells. In addition, initial tests suggest that it may be possible to decrease the size of the convolutional neural network by removing layers, though this would likely need to be customized for specific applications (S5 Fig). This could be applied to the segmentation or tracking model. Overall, DeLTA can now process two-dimensional movies accurately and capture spatial dynamics in a high throughput manner with no human intervention. It works with many common microscopy file formats and extracts single-cell features such as cell poles, length, lineage, and fluorescence levels automatically and saves data into Python and Matlab compatible formats. As many microbiology researchers work with these types of data, we envision that this software can be used to increase the throughput of microscopy image analysis.

Methods

Implementation and network architecture

Code, installation instructions, and datasets are on Gitlab: https://gitlab.com/dunloplab/delta. Documentation for the software is available at: https://delta.readthedocs.io/en/latest/. We also provide a Google Colab notebook, which allows users to test DeLTA 2.0 with their own data without installing the code on their local machine: https://colab.research.google.com/drive/1UL9oXmcJFRBAm0BMQy_DMKg4VHYGgtxZ. The U-Nets are implemented in TensorFlow/Keras. We developed a fully-integrated pipeline that can compile single-cell data from Bio-Formats compatible files or TIFF image sequences, but we also provide simpler scripts and data that illustrate the core principles of our algorithm for easy adaptation to different use cases. In S1 Table, we have listed the packages and exact versions that we use in our environment to run DeLTA 2.0. Both the segmentation and tracking models implement a U-Net neural network architecture. The tracking model inputs a phase contrast image and segmentation from the current and previous frame and outputs a greyscale image of the predicted tracking event (S1 Fig). In the original version of DeLTA [11], the tracking model uses a softmax function as the final activation layer and a categorical cross-entropy loss function to produce three greyscale output images with 1’s in each layer representing the mother cell, daughter cell, and background, respectively. In DeLTA 2.0, the tracking model uses a sigmoid function as the final activation layer and a pixelwise-weighted binary cross-entropy loss function to produce a single greyscale output image with 1’s representing tracked cells (mother and potential daughter) and 0’s representing the background and the cells that did not track to the input cell.

Loss functions and training

To train both models, we implemented a pixelwise-weighted binary cross-entropy loss function, as in the original U-Net paper [9]. This loss function was adapted from the binary cross-entropy function in Tensorflow/Keras which measures the pixelwise loss of a sample. The loss is multiplied elementwise with the weight map to magnify or reduce the loss (S6 Fig). Lastly, we normalized the loss based on the sum of the total weight map to evenly distribute how much each sample updates the model. Overall, the loss function determines the difference between the model output and the ground truth, which is then used to update the weights within the model. As in Ronneberger et al. [9], our loss function takes a weight map as an extra input to assign more importance to certain pixels in the ground truth during training. We used custom weight maps to improve segmentation on rod-shaped bacteria by increasing weights for the center of the cells and the borders between the cells. We also minimize weight on the background, where background is defined as anything in the image that is not a cell or border (S7 Fig). More specifically, we maximized the weights for the skeletons of the cells and borders, which are pixel-wide representations of binary objects in images [32]. Determining the exact borders of the cells by eye is hard and partially arbitrary. To prevent the model from learning these arbitrary cell-border interfaces, we reduced the weights in these areas (S7 Fig). In addition, the background weights were set to be variable, where the weight increased with respect to an incorrect prediction (S8 Fig). The model outputs a number between 0 and 1, with 0 representing the background and 1 representing a cell. Since we had high confidence in the ground truths for the background, we were able to set the values of the weight map for the background to be equal to the actual prediction for the background. If the model incorrectly predicted a cell for a pixel that is background, then there would be a high value for that pixel in the weight map. Alternatively, if the model correctly predicted background for a pixel that was background, then there would be a low value for that pixel in the weight map. This method allows the model to efficiently recognize and discard debris and reduce overfitting on the background. Our code includes a function to automatically generate these weight maps from the ground truth segmentations. Custom weight maps were also implemented for the tracking model, although they were found to be less critical to training a successful model. The weight map was similarly generated by applying morphological operations to the segmentation of all cells in the current frame and the ground truth. The skeleton of the ground truth cell was set to the highest weight while other cells’ pixels were set to decreasing weights depending on the distance from the tracked cell (S1B Fig). The function generating these maps is also provided in our software. In addition to the data augmentation operations described in the original version of DeLTA [11], two new data augmentations were used while training the segmentation model. We used the blurring function GaussianBlur from the OpenCV package which convolves a 5x5 Gaussian kernel over the image. For the noise function we used the random.normal function from the numpy package which outputs random samples from a Gaussian distribution into an array the same size as the image. This is added to the original image and rescaled back to a range between 0 and 1. Both the blurring and noise functions are applied to all input images with user-specified standard deviations. We set 1 and 0.03 to be the default standard deviations for the blurring and noise functions, respectively. To simulate exaggerated cell movement during tracking, such as when an agarose pad dries out and causes the field of view to shift over time, we added a function that randomly shifts the inputs containing the current frame (microscopy image of the current frame, segmentation mask of all the cells in the current frame, ground truth, and weight map) up to a user-specified number of pixels (e.g. 5 pixels). These operations help expand the training dataset and allow the model to generalize to realistic conditions. The segmentation model used to quantify the error rate was trained for 600 epochs with 300 steps per epoch and a batch size of 1. The Adam optimizer was used with a learning rate of 10−4. The tracking model used to quantify the error rate was trained for 500 epochs with 300 steps per epoch and a batch size of 2. The Adam optimizer was used with a learning rate of 10−5. In all cases, the models converged during these training period. For example, S9 Fig shows convergence results for segmentation model training.

Training set generation and testing

For the segmentation training dataset, the initial segmentations were generated semi-automatically by an expert using the interactive learning and segmentation toolkit Ilastik [33]. This accounted for 11% of the final training set. Once the segmentation model was performing well on the test data, which we defined as being more than 95% accurate, we used it to generate more training data. Incorrect DeLTA 2.0 outputs, like segmentations that connect two distinct cells together, were manually corrected. Processed DeLTA outputs accounted for 36% of the training set. Additionally, we incorporated published segmentation data from van Vliet et al. [34] where cells were segmented and tracked to measure the spatial dynamics of gene expression in bacterial microcolonies. We obtained the cell segmentation and tracking data from the ETH archive: https://doi.org/10.5905/ethz-1007-77. On this dataset, we performed operations to improve the data quality including smoothing filters, dilation, erosion, and skeletonize functions. These data accounted for the remaining 53% of the training set. The final training set had 307 training examples from sixty movies, with a maximum of 10 frames per movie to increase sample diversity. Each training example consisted of a phase contrast image as the input, the corresponding segmented ground truth, and a pre-generated weight map used in the loss function (S7 Fig). The test movie used to evaluate the segmentation model was colony 150310–05 from the trpL data zip file from van Vliet et al. [34] on the ETH archive. For the tracking training dataset, we used a modified version of the Matlab script used in DeLTA 1.0 to generate the initial training examples. Instead of showing the whole frame in the graphical user interface, the modified script showed a zoomed-in 75x75 pixel box around the cell of interest. In addition, the modified script had one output consisting of the mother and daughter cell whereas the original script had three outputs for the mother cell, daughter cell, and background. The Matlab script was used to produce 15% of the training set. Each training example consisted of four inputs, one output, and one weight map. The inputs were the phase contrast of the previous frame, segmentation of the cell of interest in the previous frame, phase contrast of the current frame, and segmentation of all the cells in the current frame (S1A Fig). The output is a segmentation mask for the cell(s) that the cell of interest in the previous frame tracked to. The weight map is used in the loss function. Once the tracking model was performing with more than 99% accuracy on test data, we used it to generate more training data. Movies with images taken 5 minutes apart were processed using DeLTA 2.0 and then new training examples were generated by tracking cells across longer time intervals. Instead of tracking from the frame immediately before the current timepoint, cells were tracked from a frame from two or three timepoints before. This allowed us to generalize to longer acquisition intervals and to situations where the cells grew faster or travelled further between frames. These processed DeLTA outputs accounted for 20% of the training set. In addition, published tracking data from van Vliet et al. was incorporated to increase the training set size, accounting for 65% of all the training examples. The final tracking training set had 23,655 examples. The test movies used to evaluate the tracking model were colony 140408–01 from the cib data zip file; colonies 151029_E1-1, 151029_E1-5, and 151101_E3-12 from the rpsM data zip file; and colonies 150309–04 and 150310–05 from the trpL data zip file from van Vliet et al. [34].

Time-lapse microscopy experiments

Overnight cultures of E. coli MG1655 were diluted 1:100 and allowed to grow for 1–2 hours in LB medium. For the co-culture experiment, we included 30 μg/mL of kanamycin for plasmid maintenance. We created the co-culture with a 1:5 dilution by mixing 0.5 μL of the resistant strain and 0.5 μL of the susceptible strain with 4 μL of LB medium. For the pole age experiment, the culture was diluted 1:100 in MGC media (M9 salts supplemented with 2 mM MgSO4, 0.2% glycerol, 0.01% casamino acids, 0.15 μg/ml biotin, and 1.5 μM thiamine). A 1:100 dilution was used to decrease cell density. For both experiments, 1–1.5 μL of the diluted samples were added to prewarmed 1.5% low melting temperature agarose pads made with MGC media. Samples were prepared and imaged as described in Young, et al. [19]. A Nikon Ti-E microscope was used with a 100x oil objective for all microscopy experiments. In the co-culture experiment, E. coli MG1655 were transformed with a single plasmid, either with tetracycline resistance or without. Both plasmids originated from the BioBrick plasmid library (pBbA7k) [35]. The plasmid for the resistant cells harbors both a tetracycline resistant gene and red fluorescent protein gene (rfp) (pBbA7k-RFP-tetA), while the sensitive cells contain only green fluorescent protein (gfp) (pBbA7k-sfGFP). The pads contained 30 μg/mL kanamycin for plasmid maintenance and 0.5 μg/mL tetracycline. Phase contrast, GFP, and RFP fluorescence images were taken every 5 minutes. In the pole age experiment, E. coli MG1655 was used and no antibiotics were present in the culture. Phase contrast images were taken every 5 minutes. We calculated the growth rate as: Where L is the cell length at time t, L is the length at time t-1, and Δt is the difference in time between t and t-1 (5 min = 0.083 hr in our movies). Growth rates were measured for one generation. For example, to measure the growth rate of OO, measurements start when O divides into OO and end when OO divides into OON and OOO. However, no information about the growth rate of O is used to calculate the growth rate of OO. The growth rate is the average across the timepoints within this generation. To reduce noise, growth rates were only recorded in the analysis if cells were present for at least three frames.

Testing impact of frame shifts on tracking

We considered three representative conditions and performed tracking under artificial frame shifts (S3 Fig). The conditions were cases of “low,” “medium,” and “high” cell density, which we defined as having <5, 20–25, and >200 cells in the field of view. We calculated the percent overlap between the model prediction and ground truth for all tracking events as the frame was artificially shifted. For each tracking event, the field of view was shifted over 11 different distances ranging from 1 to 100 pixels, which correspond to 0.129 to 12.9 μm, and each distance was applied in all four cardinal directions to generate a total of 48 shifts plus one unshifted version. The time-lapse movie used for this analysis was colony 151101_E3-12 from the rpsM zip file from van Vliet et al. [6].

Schematic showing how the tracking model uses the inputs to make predictions.

(A) The four inputs are concatenated into one array and processed by the model, which outputs a greyscale image of the cell tracked (or cells, in the case of a division event). (B) Ground truth and custom weight map for tracking. (EPS) Click here for additional data file.

Accuracy of the segmentation model.

(A) For each frame in the movie we plotted the intersection-over-union (IOU) / Jaccard index, Dice score / F1 coefficient. Error rate, as defined in the manuscript, is also shown. (B) The proportion of pixels that were True Positives (TP) + True Negatives (TN), False Positives (FP), and False Negatives (FN) for each frame in the movie. Note that these are pixelwise predictions as opposed to predictions per cell. TP and TN represent the rate of correct pixelwise predictions in the frame. FP represent the rate of erroneous predictions of pixels as part of a cell when the ground truth reports it as background. FN represent the rate of erroneous predictions of pixels as background. The test set (S1 Movie) was used to calculate these evaluation metrics. (EPS) Click here for additional data file.

Impact of frame shifts on tracking model performance.

The average prediction in the four cardinal directions to ground truth overlap for 20 representative tracking events plotted as a function of the shift distance. Tracking model performance when (A) cell density is low (<5 cells in the frame), (B) medium (20–25 cells in the frame), or (C) high (>200 cells in the frame). As the shift distance increases, the performance decreases. The model generally performs better with fewer cells to track per frame. The black line represents the mean for each shift distance. (EPS) Click here for additional data file.

Limitations of segmentation.

Two sequential phase contrast images of E. coli microcolonies with their respective segmentations. The red arrow points to an error where the model has incorrectly combined two cells into one. This type of error is very hard to correct out of context. (EPS) Click here for additional data file.

Reducing the size of the model to increase speed.

Schematics showing different network architectures. (A) Original U-Net architecture that we use throughout the paper. (B) U-Net architecture without the bottom layer. (C) U-Net architecture without the bottom two layers. The model has been trained on segmentation as well as tracking for these reduced networks. The network in (C) runs twice as fast as the original network in (A) and sacrifices very little accuracy. (EPS) Click here for additional data file.

Schematic of the pixelwise binary cross-entropy loss function used to train the segmentation and tracking models.

The inputs and outputs for the loss function are shown. The inputs include the ground truth (GT), the prediction made by the model (Pred), and the associated weight maps. The output is the pixelwise-weighted loss, which is a greyscale image. In step 1, the ground truth and prediction are used to calculate the binary cross-entropy loss. The first half of the equation measures the pixelwise loss associated with the model predicting background when the ground truth is a cell. The second half of the equation measures the pixelwise loss associated with the model predicting cell when the ground truth is background. In step 2, the weighted loss is calculated by performing an elementwise multiplication of the weight map and the loss calculated in step 1. The weight map helps the model learn the more important features, such as cell borders. This schematic of the loss function is simplified for visualization purposes. For clarity, we also show how the pixelwise-weighted loss maps onto the prediction, where cyan regions highlight areas where the loss is emphasized in order to improve the model’s performance for these regions of the example image. (EPS) Click here for additional data file.

Training the model on segmentation with custom weight maps.

Schematic showing two models trained on the same dataset with different weight maps. In this example, both U-Net models were trained for 600 epochs, 300 steps per epoch, with a batch size of 2. Inputs necessary to train the model include the phase contrast image, the associated segmented ground truth, and weight map. (A) Model trained with weight maps derived from the original U-Net paper. Green ovals show examples of errors. (B) Model trained with custom weight maps which were generated by applying morphological operations to the segmented ground truth. The model trained on the new weight maps performs better, as shown by the outputs on a test image. (C) To aid visualization, the ‘Overlay’ shows the custom weight map overlayed on the phase contrast image. The overlay shows that the weights are emphasized at the core of the cells (shown by red lines) and at the borders (shown by yellow lines). (EPS) Click here for additional data file.

Utilizing variable background weight maps for training the model on debris.

A simplified schematic showing how the loss is calculated for a single input using weight maps. (A) Schematic showing the traditional use of weight maps. (B-C) Schematics showing the use of variable weight maps. The prediction is used to update the weight map. (B) The background weight map values are replaced by the background values in the prediction. This method forces the model to quickly learn to filter out debris as the weight map values for the background increase significantly when the model predicts debris as cells. (C) Conversely, when the model correctly classifies the debris as background, the weight map values for the background remain similar to the original values. (EPS) Click here for additional data file.

Loss history of segmentation model trained over 600 epochs.

Loss of the model during training as a function of the total number of epochs. Only points where the model performance improves and loss reaches a new minimum are shown. There were no improvements in the last 85 epochs, showing convergence. (EPS) Click here for additional data file.

Time-lapse movie of a bacterial microcolony analyzed with DeLTA 2.0.

Phase contrast images containing E. coli cells outlined with different colors representing unique cells. Cells can be tracked by following their respective colors throughout the movie. White arrows indicate cell division events. White and colored dots refer to the new and old poles, respectively. This time-lapse movie was part of the test set used to calculate the error rate for tracking and segmentation. Timestamp shows time in HH:MM format. (MP4) Click here for additional data file.

Time-lapse movie of a dense bacterial microcolony growing in a microfluidic device analyzed by DeLTA 2.0.

Phase contrast images show B. subtilis cells outlined with different colors. Cells can be tracked by following their respective colors throughout the movie. White arrows indicate cell division events. White and colored dots refer to the new and old poles, respectively. Frame rate is one frame per minute. Original movie data were kindly provided by Prof. Avigdor Eldar and Dr. Jordi van Gestel. (MP4) Click here for additional data file.

Specific versions used in the environment to run DeLTA 2.0.

Package name and respective number of the version that was used for analysis presented in this manuscript, as well as for other working installations. (DOCX) Click here for additional data file. 29 Sep 2021 Dear Dr Dunlop, Thank you very much for submitting your manuscript "DeLTA 2.0: A deep learning pipeline for quantifying single-cell spatial and temporal dynamics" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. All three reviewers are generally positive about the work. However, not only do they point to several instances where there is a lack of clarity and detail, but there is also general agreement between the reviewers that additional benchmarking or contextualization of the results (what are the strengths and limitations of the approaches taken) would make the manuscript's claims more grounded. I'll also note that two reviewers independently mention issues regarding installation and suggest a google collab (or equivalent) addition to the extant documentation. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts. Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Luis Pedro Coelho Associate Editor PLOS Computational Biology Kiran Patil Deputy Editor PLOS Computational Biology *********************** All three reviewers are generally positive about the work. However, not only do they point to several instances where there is a lack of clarity and detail, but there is also general agreement between the reviewers that additional benchmarking or contextualization of the results (what are the strengths and limitations of the approaches taken) would make the manuscript's claims more grounded. I'll also note that two reviewers independently mention issues regarding installation and suggest a google collab (or equivalent) addition to the extant documentation. Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: O’Connor et al describe DeLTA 2.0, a cell-segmentation, tracking, and fluorescence data analysis pipeline which improves upon the original version of DeLTA in several significant ways. In the introduction, the authors note that increasing data throughput requires reducing time spent on manually correcting segmentation and tracking. This undersells the potential impact of this work, which does not require particularly high throughput data to save many hours. There are also experiments beyond timelapse fluorescence microscopy of bacteria that the DeLTA 2.0 data pipeline can be modified to cover. And some examples the authors cite include a constitutive fluorescence signal to assist in segmentation, which is a complication that can hopefully be avoided with DeLTA. In addition to creating an end-to-end pipeline from raw data to fluorescence for cell lineages, DeLTA 2.0 makes improvements to segmentation training that increase training speed and accuracy. The authors show that the network can be reduced in size without much impact on performance, which would increase the number of potential users with mobile GPUs, for example, or allow for larger images to be processed without tiling. As with DeLTA 1.0, in my hands DeLTA 2.0 must be trained on our particular data sets to achieve usable segmentation; despite having a similar resolution, a reduction in contrast in our images requires retraining the network. This is in contrast with MiSiC (DOI: 10.7554/eLife.65151) that applies a shape index map to all training and prediction inputs to achieve a degree of generalization. In future work, it would be interesting to see whether a DeLTA 2.0 model trained on shape index maps would be sufficient to generate initial training data. For many labs the barrier to using DeLTA will be in generating training data and adjusting training parameters. In the readme file the authors write, “We are currently working on the implementation graphical interfaces to generate training sets and to analyze and correct delta outputs,” which will be important, especially for those without Matlab. Specific comments: * Line 141: “DeLTA 2.0 can process datasets of any dimension quickly” is presumably not true at some limit. A brief discussion of system and GPU memory requirements and likely performance on a common consumer-grade processor and GPU would be useful. On a PC with 16 GB of system memory I ran into some issues for a large example file. * Line 193: The addition of noise is also a feature of MiSiC - 10.7554/eLife.65151 - which could be discussed further. With DeLTA 1.0, I found that the inverse operation also worked – applying a low-pass filter to training and test data improved performance, especially for segmenting objects thinner than E. coli cells. My impression was that this occurred because some data augmentation operations effectively do this by interpolating pixels, so most training data was unlike test data. * Figure 2 and Figure S1: I do not see any need for Fig. S1; the difference in growth rates is very clear in Fig. 2D as well as Fig. A. Also, the unlabeled cells in the “Susceptible” panel of Fig. 2B appear to be “Resistant” and perhaps it would be more clear to show only this microcolony, highlighting the green “Susceptible” cell and purple “Resistant” cells in all frames. General minor comments: * The authors could briefly compare to DeepCell - 10.1371/journal.pcbi.1005177 - which also describes E. coli segmentation and tracking. * Installation on Windows is not straightforward, but is described well enough in the readme to manage. Listing specific versions in the .yml file would have been helpful, and perhaps installation can be simplified for users not needing bioformats. * The file including models, example data, and training data is very large. Separate files would be useful--especially a small file of example data. * More people will try DeLTA if there is a Google Colab notebook or similar that eliminates needing to sort out system configuration; it would be valuable to have one example notebook for training and one example notebook for segmentation, tracking, and displaying some figures of timelapse data similar to those in the paper. Reviewer #2: Summary: The authors introduce the second version of their python package: “Deep Learning for Time-lapse Analysis (DeLTA)”, which can be used to analyse microscopic images of bacteria, including segmentation and tracking over time in 2D. They use one U-Net to segment microscopy images and one U-Net to track single cells from frame to frame. To showcase the pipeline the authors analyze mixed populations of antibiotic resistant and susceptible cells by tracking pole age and growth rate across generations. This package can prove very useful for many researchers. Major concerns: Manuscript: line 147: It would be interesting to see how the segmentation performs on an independent held out testset using common segmentation metrics, such as the dice score to evaluate how well the segmentation model performs. It is unclear to me if the segmentation model has systematic errors, such as over- or under-segmentations. As the authors claim that they quantify cell growth and cell length, the segmentation accuracy should be high. In Figure 3A it looks to me as if the model would under-segment the cells as the segmented cells look thinner than in the phase contrast image. A scatter plot (or violin) of the error rate (line147) against the segmentation overlap (line 148/149) would be interesting to gain insight into the relationship between the two parameters. Python package: The authors should meet commonly used software standards for python packages to ensure long term usability of their tool: Create a python package including version control using continuous numbering for every release Provide a small example for users in the git repository so it is easy to initially run the package and gain understanding of how it works Implement unit tests (e.g. using pytest) with their pipeline to ensure quality control of their code and of future versions. It would be helpful for many users to provide a jupyter-notebook, or google-colab-notebook, which users are guided through. (e.g. similar to: https://github.com/stardist/stardist/blob/master/examples/2D/3_prediction.ipynb) Having a pip-installable version using pypi of the package would be great for many users and ease applications. Minor concerns: line 62: “successive convolutional layers are progressively down-sampled”, not the layers are down-sampled, but the information is down-sampled using pooling layers. line 64: One major feature, which causes the UNet to perform well are the skip connections, which preserve fine details (high frequency information). I think the authors should mention this here. line 313: could computational cost be reduced by ignoring cells that have just divided, assuming a certain minimum time-distance between division events? Or by ignoring cells of a size smaller than a threshold (that is motivated biologically)? line 379: is the gaussian noise used for data augmentation only positive? When simulating microscopy noise, only additive gaussian noise should be considered. For further improvement lightning augmentations (contrast, illumination) could be added in a later release. line 397: “For the segmentation training dataset, the initial segmentations were generated”, does this mean pixel-wise annotation by an expert? Figure 1A looks redundant with Figure 1B, C, D to me. Could be shortened? All figures: Label sizes differ between subfigures, should be unified Fig 1E, could all daughter cells of the one cell highlighted be plotted as well to emphasize the ability to track cells across division events? Other remarks: Figure 1A: gray errors “Old Pole”, “New Pole” difficult to see. Figure 2: Could the authors indicate which cutout of figure 2A is shown in Figure 2B, this would make the figure more comprehensive Figure 1A: Scale bar is hardly visible line 78/79: “Click or tap here to enter text..“, should be removed Reviewer #3: The authors present an extension of their previous work on tracking and quantifying bacterial behavior via Deep Learning. The system relies on U-Nets for cell segmentation and for tracking and can be deployed for systematic large scale analyses of spatial and temporal dynamics of bacterial development in a 2D environment. The authors show two applications of their system for bacteria growing on 2D substrates. In particular they present an analysis demonstrating the impact of pole age on bacterial growth dynamics. This is a good paper in principle. My major criticism would be the lack of details in describing the methods. While I see that it makes sense to reference earlier work here it might still be better to make the paper more self-contained by explaining e.g. the U-Net training (and loss functions) in more detail. For a broader audience it would also be important to discuss if this approach can be generalized to other applications of cell tracking or where the respective limitations are. Detailed comments see below: [page 8]: (D) Cell tracking between frames. Representative examples of cell tracking with and without division are shown with a phase contrast image of the ‘previous frame’ on the left, a phase contrast image of the ‘current frame’ in the middle, and a grayscale image of the ‘prediction’ on the right. The ‘current frame’ shows the tracking prediction overlayed. Please briefly describe the U-Net setup for the tracking in the methods section at least. What exactly are the inputs and outputs here? [page 12]: (C) Growth rate within each generation. What do the connecting lines mean in the right panel (showing great granddaughters). I am not sure I am interpreting this the right way. [page 12]: (n = 10,726 cells; one-way ANOVA test performed; p-value ** ≤… How did you locate the group differences? An ANOVA would only tell you that there is a difference somewhere, or maybe I am missing something here. [page 13]: Utilizing the U-Net convolutional neural network architecture, the model can rapidly segment and track cells frame-to-frame with a low error rate. How is tracking done here? [page 13]: coli, suggesting that analysis of cells with similar morphologies such as Bacillus, Pseudomonas, and Salmonella species will be straightforward. Have you tested this claim by any chance? [page 13]: In addition, if the position of the cells within the field of view shifts dramatically from frame to frame, the current software does not perform well. Do you have an estimate at least? [page 14]: recurrent neural networks could be combined with our models to improve segmentation by incorporating temporal context. Have you tested his idea? How well does your method generalize to related tracking experiments? [page 14]: Future efforts to optimize the tracking algorithm could help to address this by avoiding methods that scale linearly with the number of cells. Any ideas here that could be worth mentioning? [page 14]: It works with many common microscopy file formats and extracts single-cell features such as cell poles, length, lineage, and fluorescence levels automatically and saves data into Python and Matlab compatible formats. How easy is it for a user (say someone who can work with data and image analysis in Python) to adapt your system to other use-cases? [page 14]: As many microbiology researchers work with these types of data, we envision that this software can be used to increase the throughput of microscopy image analysis. Given constraints on e.g. shape and frame rate it might be limited to a specific field I think. Could you comment on this or say to which scenarios you think the approach would generalize well? [page 14]: Network architecture for the segmentation and tracking model is as described in the original DeLTA publication… I think it is worth doing a short recap of the networks here. [page 14]: In DeLTA 2.0, the tracking model uses a sigmoid function as the final activation layer and a pixelwise-weighted binary cross-entropy loss function to produce a single grayscale output image with 1’s representing tracked cells (mother and potential daughter) and 0’s representing the background and the cells that did not track to the input cell. What is the rationale or idea here? [page 15]: The segmentation model used to quantify the error rate was trained for 600 epochs with 300 steps per epoch and a batch size of 1. Did you check convergence of the training? ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Zach Hensel Reviewer #2: Yes: Dominik Waibel Reviewer #3: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at . Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols 30 Nov 2021 Submitted filename: Response_to_Reviewers.docx Click here for additional data file. 25 Dec 2021 Dear Dr Dunlop, We are pleased to inform you that your manuscript 'DeLTA 2.0: A deep learning pipeline for quantifying single-cell spatial and temporal dynamics' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Luis Pedro Coelho Associate Editor PLOS Computational Biology Kiran Patil Deputy Editor PLOS Computational Biology *********************************************************** The reviewers recommend accepting the paper and agree that the manuscript and tool are valuable for the community. They have a few comments, which the authors may want to take into consideration for their future work (and perhaps there is a small coding issue that could be addressed over the short term), but none that would preclude publication. Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: I think that the authors sufficiently addressed all issues that were raised in review. DeLTA 2.0 is certainly something we will use in our lab, and changes described in this revision that make it easier to adopt mean that more labs will use it. I tested the Colab pipeline implementation and found that it worked well after commenting out two lines that were giving an error (91: elif self.reader.resfolder is not None:, and 92: self.resfolder = self.reader.resfolder). For sensitivity to global drift (lines 218-224), we have found that preprocessing after segmentation to estimate and correct drift from the center of mass of segmented objects works well. However, this approach might not be easily generalizable to data sets with cells entering/exiting the image and only works for global shifts in object positions. Reviewer #2: I thank the authors for their improvements on the manuscript and repository and think their work will be very beneficial for the community. Without further comments I recommend this manuscript for publication. No further review is uploaded as an attachment. Reviewer #3: Thanks for revising the manuscript. I particularly appreciate the improved documentation and examples. I am convinced this will be very helpful for the community. I think there was a misunderstanding regarding my question about the tracking model and binary classification loss. My question was: What is the motivating idea for posing the tracking question as a binary segmentation problem at the pixel level? But I think this is not a critical point to discuss. Sorry for not phrasing the question precisely enough. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Zach Hensel Reviewer #2: Yes: Dominik Waibel Reviewer #3: Yes: Nico Scherf 12 Jan 2022 PCOMPBIOL-D-21-01464R1 DeLTA 2.0: A deep learning pipeline for quantifying single-cell spatial and temporal dynamics Dear Dr Dunlop, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Zsofia Freund PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

27 in total

1. Ageing in Escherichia coli requires damage by an extrinsic agent.

Authors: Camilla U Rang; Annie Y Peng; Art F Poon; Lin Chao
Journal: Microbiology Date: 2012-03-15 Impact factor: 2.777

2. Biased partitioning of the multidrug efflux pump AcrAB-TolC underlies long-lived phenotypic heterogeneity.

Authors: Tobias Bergmiller; Anna M C Andersson; Kathrin Tomasek; Enrique Balleza; Daniel J Kiviet; Robert Hauschild; Gašper Tkačik; Călin C Guet
Journal: Science Date: 2017-04-21 Impact factor: 47.728

3. BglBrick vectors and datasheets: A synthetic biology platform for gene expression.

Authors: Taek Soon Lee; Rachel A Krupa; Fuzhong Zhang; Meghdad Hajimorad; William J Holtz; Nilu Prasad; Sung Kuk Lee; Jay D Keasling
Journal: J Biol Eng Date: 2011-09-20 Impact factor: 4.355

4. Periplasmic Acid Stress Increases Cell Division Asymmetry (Polar Aging) of Escherichia coli.

Authors: Michelle W Clark; Anna M Yie; Elizabeth K Eder; Richard G Dennis; Preston J Basting; Keith A Martinez; Brian D Jones; Joan L Slonczewski
Journal: PLoS One Date: 2015-12-29 Impact factor: 3.240

5. Deep Learning Automates the Quantitative Analysis of Individual Cells in Live-Cell Imaging Experiments.

Authors: David A Van Valen; Takamasa Kudo; Keara M Lane; Derek N Macklin; Nicolas T Quach; Mialy M DeFelice; Inbal Maayan; Yu Tanouchi; Euan A Ashley; Markus W Covert
Journal: PLoS Comput Biol Date: 2016-11-04 Impact factor: 4.475

6. Single-cell measurement of plasmid copy number and promoter activity.

Authors: Bin Shao; Jayan Rammohan; Daniel A Anderson; Nina Alperovich; David Ross; Christopher A Voigt
Journal: Nat Commun Date: 2021-03-05 Impact factor: 14.919

7. Short-range quorum sensing controls horizontal gene transfer at micron scale in bacterial communities.

Authors: Jordi van Gestel; Tasneem Bareia; Bar Tenennbaum; Alma Dal Co; Polina Guler; Nitzan Aframian; Shani Puyesky; Ilana Grinberg; Glen G D'Souza; Zohar Erez; Martin Ackermann; Avigdor Eldar
Journal: Nat Commun Date: 2021-04-19 Impact factor: 14.919

8. MicrobeJ, a tool for high throughput bacterial cell detection and quantitative analysis.

Authors: Adrien Ducret; Ellen M Quardokus; Yves V Brun
Journal: Nat Microbiol Date: 2016-06-20 Impact factor: 17.745

9. An Optogenetic Platform for Real-Time, Single-Cell Interrogation of Stochastic Transcriptional Regulation.

Authors: Marc Rullan; Dirk Benzinger; Gregor W Schmidt; Andreas Milias-Argeitis; Mustafa Khammash
Journal: Mol Cell Date: 2018-05-17 Impact factor: 17.970

10. CellProfiler 3.0: Next-generation image processing for biology.

Authors: Claire McQuin; Allen Goodman; Vasiliy Chernyshev; Lee Kamentsky; Beth A Cimini; Kyle W Karhohs; Minh Doan; Liya Ding; Susanne M Rafelski; Derek Thirstrup; Winfried Wiegraebe; Shantanu Singh; Tim Becker; Juan C Caicedo; Anne E Carpenter
Journal: PLoS Biol Date: 2018-07-03 Impact factor: 8.029

3 in total