Literature DB >> 33104721

Automated classification of bacterial cell sub-populations with convolutional neural networks.

Denis Tamiev¹, Paige E Furman¹, Nigel F Reuel².

Abstract

Quantification of phenotypic heterogeneity present amongst bacterial cells can be a challenging task. Conventionally, classification and counting of bacteria sub-populations is achieved with manual microscopy, due to the lack of alternative, high-throughput, autonomous approaches. In this work, we apply classification-type convolutional neural networks (cCNN) to classify and enumerate bacterial cell sub-populations (B. subtilis clusters). Here, we demonstrate that the accuracy of the cCNN developed in this study can be as high as 86% when trained on a relatively small dataset (81 images). We also developed a new image preprocessing algorithm, specific to fluorescent microscope images, which increases the amount of training data available for the neural network by 72 times. By summing the classified cells together, the algorithm provides a total cell count which is on parity with manual counting, but is 10.2 times more consistent and 3.8 times faster. Finally, this work presents a complete solution framework for those wishing to learn and implement cCNN in their synthetic biology work.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2020 PMID： 33104721 PMCID： PMC7588061 DOI： 10.1371/journal.pone.0241200

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Control of cell morphology, cycle state, and cell-to-cell interactions are some of the key design goals in synthetic biology to develop ‘living materials’ [1-4]. As one example, bacteria endospores can be incorporated into materials for revival and functional activation [5]. The phenotypic heterogeneity of cells needs to be controlled for precise functional materials such as sensors or bacterial therapies [6, 7]. Classification of heterogeneity is currently assessed via manual inspection of microscope images, as we did before with B. subtilis endospore activated as fluorescent reporters (Fig 1A) [8]. Accurate, higher throughput quantification of cell sub-populations is necessary for improved design of cell-based materials, sensors, and therapies. Such tools would enable better understanding of underlying genetic mechanisms of the cell sub-populations.

Fig 1

Classifying and counting bacterial sub-populations.

Classifying and counting bacterial sub-populations.

(A) Examples of B. subtilis cells, forespores and spores captured on microscope images, identified manually in prior work. (B) “Raw” or unprocessed fluorescence microscopy image of B. subtilis cells during vegetative growth. (C) Examples of various cell clusters found on fluorescent microscope images (i. Artifact (cell debris, impurity), ii. single cell, iii. four-cell cluster, and iv. ten-cell clusters.). (D) Flow cytometry output fluorescence data for viability assay of B. subtilis cells. (i) Gated region contains signal from live cells. Data presented in log (RFU) scale. (ii) Clusters of data on flow cytometry output from (Di) that likely associate with different cell states (vegetative, spore) but are difficult to determine from the FACS data. Current methods of classification work are insufficient to match the throughput of synthetic design in bacteria. While flow-assisted cell counting (FACS) can address larger, eukaryotic, cells the method cannot be as easily applied to sorting bacterial cells which can be up to 1,000 time smaller than eukaryotes. Moreover, bacteria cells can rapidly transition through development states, such as B. subtilis which repeatedly transitions between vegetative cell, forespore, and spore states (Fig 1A) [8]. Moreover, bacteria can form clusters of cells, especially biofilm forming B. subtilis (Fig 1B), which further confounds FACS based analysis which can misrepresent a cluster of cells (or cluster of various cell types) as a single cell (Fig 1C) [9]. However, FACS can be used for some bacteria classification where there are clear fluorescent reporters and distinct populations, such as classifying viability of B. subtilis (Fig 1Di) [10] using the permeability to green (Syto 9) dye but not red (Propidium Iodide) as an indication of cell viability [11]. However, when a mixture of cells, spores and forespores of B. subtilis are subjected to this method more than two clusters of fluorescence signal are present in the flow cytometry data (Fig 1Dii). Correlating these clusters of data to bacterial sub-types can be a challenging task; thus quantification of bacterial cell populations is typically performed with manual microscopy. However, there has been many advances in automated approaches to analysis of microscope images including the use of deep learning. Deep learning tools such as Convolutional Neural Networks (CNN) are well suited for automating classification and quantification of bacterial phenotypes present in microscope images; they have been applied in other microbiology tasks such as classification of coccoliths formed by various coccolithophores, stalked protozoa identification, and bacterial plankton classification [12-15]. CNN transform an image volume to a linear output volume (holding the class scores) using a stack of interconnected convolutional, pooling, and fully-connected layers. The convolutional and fully-connected layers transform the images using weights and biases that are tuned via gradient descent to match the CNN output (class scores) to the annotated training data set [16]. The architecture of the CNN is based on the number of these layers and how they are stacked (see Supplement 3 in S1 File for summary of CNN framework used in this work and solution steps). A successful demonstration of the capability of modern CNN during the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012 attracted interest to this field and spiked the development of a variety of more advanced, dense convolutional neural network architectures (DCNN) [17, 18]. The main disadvantage in applying these dense neural networks is that a large amount of data is required to achieve sufficient performance due to the density of these networks (e.g. large number of weights to optimize) [19]. Simpler CNN architectures such as LeNet-5, which has a 7 layer structure, is better suited for simple images and sparse training sets [20, 21]. Fluorescent microscope images are typically single channel (greyscale, unlike 3-channel RGB images) and relative low resolution; also, bacterial cells lack high level features that would necessitate a dense CNN for proper generalization. In addition, simple architectures can require significantly fewer images to achieve proper generalization [22]. One example of a CNN based on LeNet-5, modified to reflect recent progress in CNN design such as implementation of non-saturating non-linearities (ReLU), local response normalization, non-overlapping max-pooling and random dropout layers, was successfully implemented in bacterial colony counting [23-27]. Large colonies of cells present on agar plates have simple circular structure and lack high level features, similar to smaller bacterial cell clusters. Nonetheless, there are unique challenges associated with efficient pre-processing of more limited fluorescent microscope images and adapting CNN to analyze bacterial cell clusters, the scope of this work. In this work, we demonstrate classification and enumeration of bacterial (B. subtilis) sub-populations present in fluorescent microscope images with a classification-type convolutional neural network (cCNN) adapted from the previous, larger colony counting work [23]. To achieve this, we present a more efficient method for microscope image preprocessing and augmentation that improves the training efficiency from a limited set of annotated images. We assess the accuracy and confidence of the algorithm for each sub-population type, and discuss how the quality of the algorithm can be further improved. We benchmark this algorithm against one established tool, ImageJ, for total count of cells. We compare the performance of our cCNN algorithm, in terms of accuracy, speed and repeatability, to manual processing. By archiving all our cCNN code and training data, this work can also serve as a clear example to others wishing to implement cCNN for other classification tasks that are encountered when trying to identify and count the effects of synthetic biology design.

Results and discussion

Image preprocessing

While the applications for convolutional neural networks are becoming more widespread and the tools more freely available, these networks are typically designed to work with macroscopic objects with many distinct features (cars, people, animals etc.) [28]. Microorganisms captured on fluorescent microscope images are significantly different in appearance compared to images of common objects (Fig 2A) as they have no real internal features, just the unique size and outline of the cells. For this work we acquired 1,000 fluorescent microscope images (single channel) of B. subtilis cells, and evaluated image preprocessing algorithms commonly used with cCNNs. The objective was to identify data preparation methods that resulted in highest performance of common neural networks.

Fig 2

Image processing workflow.

Image processing workflow.

Raw fluorescent microscope images (a) were processed with a binary segmentation algorithm, and clusters of bacterial cells were manually annotated. All image segments of cell clusters were standardized to the same size with either (b) Null Bumper, (b) Blended or (d) Masked methods. These annotated training images were passed to the cCNN to determine optimal network weights (e). The output of the network (from image depicted in panel c) is a confidence value for each sub-class (A–artifact; I—X–single through ten cell cluster), here presented in a radar chart (F). Major tics– 20% confidence increments, minor– 10%. The raw microscope images were segmented using an adaptive binary thresholding algorithm to identify individual clusters of cells that were spatially segregated from each other (Fig 2A, Supplement 2 in S1 File). Some of these clusters were single cells, some were artifacts (dust, debris, etc.) and others were multicellular clusters of cells (nascent biofilm) with a drop off in number as the size of the cluster increased (i.e. rarer sub-population, see Supplement 2 in S1 File for population numbers of the segmented data). The varied sizes of the segmented data presented a challenge as the majority of classic neural networks operate based on fixed resolution images. As such, all images must be adjusted to a standard size prior to training the network. Generally, there are three ways to standardize sizing of the image data. Images can be (1) distorted (shrunk or stretched), (2) masked or (3) framed [29]. We chose to standardize cropped images to a 200 by 200-pixel resolution, since most of the cell cluster segments did not exceed these dimensions. Distortion of the cropped images changed the appearance of bacterial cells. Also, application of a mask on cropped images (i.e. applying a filter to remove background pixel data) altered the edges of clusters (Fig 2D). Therefore, we settled on framing approaches with two different methods of filling in the frame. In the first framed approach (referred to as “null bumper” or NB), the cropped images were placed in the center of a 200x200 canvas, and the void space was filled with 0 intensity pixels (Fig 2B). In the second, we created a novel “blended bumper” (referred to as BB) where the frames of cropped images were blended with local background pixels, which resulted in a much more consistent appearance of microscope images (Fig 2C, Supplement 2 in S1 File). The NB and BB image datasets were created by rotating captured images at right angles and in a mirrored dimension, resulting in 8x times as many images. During training of NB and BB networks, we observed that the latter was underfitting (Supplement 5 in S1 File). To reduce the magnitude of underfitting, we trained a third network with images created by rotation at finer angle which generated more training data (referred to as “advanced rotation” or AR). The AR image dataset was created by rotating images by 10 degrees across both dimensions, then blending those images with the same algorithm described for BB, resulting in 72x increase in training images. As mentioned above, the NB, BB, and AR data were used to train cCNNs (see Supplement 3 in S1 File for code and steps). For this study we selected a network architecture that was demonstrated to work well in a similar classification tasks and used it with all three image datasets [23]. The structure of this cCNN is discussed in Supplement 4 in S1 File. Briefly, the input of the cCNN is an image (2D array of pixel intensities) and the output of the network are confidence values (0%-100%) for each output class (artifact or one, two, three, and so on, through ten cell-cluster). The top confidence was selected to indicate the predicted class (Fig 2F). To evaluate the quality of the network, annotated images that were not used for training of the network were passed through the network for classification. Accuracy (comparison of prediction to annotated truth value) and confidence (% of how certain the cCNN is of its prediction) were used as metrics to identify best methods of data preparation (NB, BB, vs. AR) and overall performance of the network at cell cluster classification tasks.

Evaluation of networks

Accuracies of CNNs trained on NB, BB, and AR datasets were then evaluated with new images. These results can be tabulated in common confusion matrices (Supplements 7 and 8 in S1 File) that count the precise predictions and number of each off-target predictions. Alternatively, we can visualize this same data using a box and whisker plot (Fig 3), to show the most frequent classifications and the directionality of variance. This plot maps the true (manually annotated count, X- axis) against the algorithm predictions (Y-axis). It is observed that the NB-trained network was much more likely to misclassify A, I and II classes compared to BB- and AR-trained networks. Availability of more training data (AR-trained network), improved the network’s performance in classifying III, IV, V-cell clusters.

Fig 3

Accuracy of NB, BB and AR trained cCNNs across all output classes (A, I-X cell clusters).

Accuracy of NB, BB and AR trained cCNNs across all output classes (A, I-X cell clusters).

Standard box and whisker levels are used with the center mark (dotted line) indicating the median, the bottom and top edges of the box indicating the 25th and 75th percentiles, respectively, the whiskers extending to the most extreme data points not considered outliers. The average (across all cluster types) accuracies of the blended bumper and null bumper cCNNs were similar, 344% and 40% respectively (Supplement 8 in S1 File). The average accuracy of the AR cCNN was 58%. Upon closer examination, the networks performed best at differentiating between an artifact, single cells and two cell clusters (cells joined together), with accuracies reaching as high as 86%, 73% and 75% for the AR network (Fig 3, Supplement 8 in S1 File). As another measure of performance, we assess the relative confidences of the NB, BB, and AR trained cCNN on their predictions for each population sub-class (Fig 4). When examining mean confidences for each individual output class, it becomes evident that the NB network classified all cell clusters with lower confidence than BB and AR networks (Fig 4); the average confidences of both BB and AR networks were 12% higher than that of the NB network across all classes (Supplement 9 in S1 File). The most apparent improvement in prediction confidence can be observed for smaller objects (artifacts, single cells and II-cell clusters), and small cell clusters (III-cell cluster). At this small scale, there is a dramatic improvement in network’s performance when more data is provided for training using the AR network. We attribute this to the asymmetric features of these smaller clusters that present dramatically new images when rotated. This effect is apparently diminished as the cluster size increases, as the NB and AR perform at the same relative level. These larger clusters become more symmetric and thus rotation does not present dramatically new images on which to train.

Fig 4

Confidence of NB, BB and AR trained cCNN across all output classes (A, I-X).

Confidence of NB, BB and AR trained cCNN across all output classes (A, I-X).

Standard box and whisker levels are used with the center mark indicating the median, the bottom and top edges of the box indicating the 25th and 75th percentiles, respectively, the whiskers extending to the most extreme data points not considered outliers, and the outliers plotted individually. The apparent increase in performance at the higher cluster count may also be an artifact due to the limitation of classes imposed by the network. There are only 11 outputs available and thus anything larger would get placed in the largest feature class. However, this could also be an effect of more features present on the larger cluster; additional images of large cell clusters would have to be obtained to determine if this is a real effect or artifact. The variability in accuracy and confidence between different classes can be explained by the fact that the networks were trained on a limited number of training images which did not capture all spatial and rotational orientations of multicellular clusters. These could be further improved using more training data, although at this scale the cCNN approach provides suitable confidence in distinguishing smaller cell clusters and artifacts especially when using the AR training data. Additionally, the variability in sub-population count is less important when each class is summed together to obtain a total count of cells, which is another common application in synthetic biology applications, which we present next.

Comparing cCNN total cell count to ImageJ

As mentioned in the introduction, total count of bacterial cells, especially those prone to clustering, is difficult to do with FACS and is an important metric in quantifying effects of synthetic design. Here we benchmark our cCNN data (summing counts across classes) against two existing methods: 1) a popular open source tool ImageJ that relies on a watershed algorithm and 2) manual counting [30]. For benchmarking the automated approaches, 24 new microscope images were segmented with a binary thresholding algorithm, and cell clusters were either classified and summed with a cCNN (NB, BB and AR trained networks) or sub-segmented with a watershed algorithm and counted with ImageJ (Supplement 10 in S1 File). For truth data we performed manual counting (one person, averaging three independent counts) and compare the automated findings in a parity chart (Fig 5). In this representation a slope equal to unity would indicate that the software approach has the same evaluation as manual counting. NB-trained networks appear to over-count cells on microscope images (slope of 0.75), BB-trained networks perform significantly better (slope of 0.93) but some over counting is still present, and AR-trained networks count equally well as person (slope of 1). ImageJ undercounts images, as evident by the slope of 1.2.

Fig 5

Comparing accuracy of manual (same person, average of 3 counts) to software (cCNN or ImageJ) counting methods (Supplement 13 in ).

Each point represents an evaluated image. The units are the number of cells per image.

Comparing accuracy of manual (same person, average of 3 counts) to software (cCNN or ImageJ) counting methods (Supplement 13 in ).

Each point represents an evaluated image. The units are the number of cells per image. These results clearly demonstrate a significant improvement in average accuracy when comparing NB and BB networks. This improvement is attributed to the quality of the data used to train the network, further supporting that blending the background of images leads to a more accurate network. Some of the neural networks created for this study were overcounting cells on images that contained high cell counts. This can be explained by the fact that high cell count images also contained more multi-cell clusters compared to low cell count images. High cell count images resulted from cell cultures with high cell density, which leads to biofilm formation in B. subtilis. As observed previously, the networks developed in this study exhibited lower accuracy at predicting multi-cell clusters, compared to single or two-cell clusters. The undercounting of ImageJ is attributed to its poor capacity to accurately sub-segment images of biofilms with a watershed algorithm (Supplement 11 in S1 File). Surprisingly, while the AR trained network had an average class prediction accuracy of 58%, when these sub-population numbers are summed together the overall count is on par with manual counting.

Comparing consistency, repeatability, and speed of cCNN total cell count to human performance

While the accuracy of the AR-trained network prepared in this study is similar to that of manual for total cell counting, consistency, repeatability, and speed should vary. To quantify these metrics and benchmark manual counting against the cCNN approach, five people were recruited to count bacterial cells on printed microscope images that were used for this study. Test subjects were given 25 images, 15 unique and 5 that appeared twice in random order (Supplement 12 in S1 File). The study evaluated two parameters–(1) how the total bacterial cell count on all 25 images compared between five people, and (2) how the cell counts compared when test subjects were presented with duplicate images. Variability of human test subjects that counted cells on 25 images was, on average, as high as 39.5 cells per image (Fig 6A). As expected, larger variability was observed with images that depicted more cells (Supplement 12 in S1 File). In comparison, variability of the cCNN based algorithm on average did not exceed 3.7 cells per image. Average variability across all images for human test subjects was 10.2 times greater than for cCNN based algorithm (Fig 6A, inset).

Fig 6

Comparing consistency and repeatability of manual cell counting.

Comparing consistency and repeatability of manual cell counting.

(A) Standard deviation (STD) as a measure of variance of cell counts across 25 microscope images (ordered in increasing number of cells per image) using manual counting from five separate human counters (interoperator variability) benchmarked against the advanced rotation (AR) trained cCNN algorithm. Average variance of human subjects was 10.2 times larger than that of the cCNN algorithm (insert). (B) Standard deviation of each test subject when counting a duplicate image randomized in their set (intraoperator variability). Images 1 through 5 presented in the order of increasing cell population (Supplement 13 in ). Total time required to count cells on all images for each operator is presented in the pie chart (inset). Full circle is 1000 seconds, tick marks are set to 100 seconds. Each individual is represented with a different color (Blue–Person 1; Orange–Person 2; Yellow–Person 3; Purple–Person 4; Green–Person 5). If the data from a specific person is not present, then their variability was 0 cells/image. Variability due to human error is not uncommon in manual tasks. In some cases, differences between individuals can be normalized if there are patterns in human behavior. For example, if one person is systematically under or over-counting, a correction factor (bias) can be introduced to correct the results of a cohort. However, it is not always possible to correct results, if the inconsistencies are random. In this study we evaluated inconsistency of specific individuals on duplicate images, and compared it to that of the cCNN based algorithm. Results show that individuals tend to count a different number of cells on the same image that was duplicated (Fig 6B). The difference is reported as standard deviation between the two instances for each individual (Supplement 12 in S1 File). The variability of individuals in some cases was as high as 12.0 cells per image, or 10% (Fig 6B). In contrast, the variability of the algorithm tested on duplicate images was 2.26 cells per image or 2% on average, and it ranged from 0.7 to 4.24, or 1–4% (Supplement 12 in S1 File). It is important to note that the network performs with no variability when it is loaded in the memory of the computer. We traced algorithm variability to the process of loading the network with a computer vision library. It is possible that the global system variables of the device impact how the network is loaded (rounding error). In addition, we observed that the processing speed of the cCNN based algorithm is considerably faster than that of human test subjects. Specifically, when the algorithm was tested on a computer with an NVIDIA Quadro K620 GPU, the algorithm processed all 25 images up to 3.8 times faster than human test subjects (Supplement 12 in S1 File, Fig 6A inset). Considerably higher processing speeds can be achieved with more computational power.

Conclusions and summary

Convolutional neural networks have played a significant role in automating many complex image processing tasks in biomedical research such as histology, cellular localization, embryonic cell counting and many others [31-33]. In this paper we explored the prospect of developing a cCNN algorithm that can classify bacteria sub populations (in this case stage of biofilm development) from fluorescent microscope images. This algorithm can then be used to readily quantify the effects of synthetic genetic circuits installed in cell-based materials, therapies, and sensors. In this paper we tested the classification performance of cCNN trained on three different types of annotated training data sets: null bumper (NB), blended bumper (BB), and advanced rotation (AR). Of these we find that advanced rotation is best suited for improving accuracy and confidence of the CNN for smaller clusters (debris artifacts, single, double, and triple cells) with accuracies of 86%. The larger clusters (4 to 10 cell clusters) would require additional training data to improve the accuracy (now at 50–66%).A significant finding was that although the individual classification accuracies were lower than desired, when the total count of cells is summed, these inconsistencies balance out and near parity is found using our algorithm compared to manual counting when assessed over 24 images. Established algorithms, such as ImageJ, undercounts especially with higher cell count images. We also benchmarked this AR-trained cCNN algorithm against multiple manual counts and found 10.2x reduction in interoperator variability and 3.8X increase in processing speed. It is important to note that these results were achieved with a modest training data set (annotated segments from 81 images that had representative clusters from 1–10 cells per cluster). Acquiring additional images would be the simplest method to improve the cCNN performance (through additional training) but in this application, this does take significant time with microscopy to search for the rarer, sub-populations (high cell count clusters). Data acquisition of rare events has always been one of the central challenges of the AI industry, but some level of autonomous data acquisition can be achieved with unsupervised learning. Then, this tool could be used for synthetic biology applications, such as tracking the effect of genetic changes on the timing and heterogeneity of sporulation development stages. We anticipate there will be many other applications for automated classification in this field, and thus have provided all our data and architecture code to serve as another learning example (freely available at http://www.reuelgroup.org/resources.html).

Methods

Cell culture prep for microscope imaging

Cells of B. subtilis 168 were grown overnight until saturation in LB media using standard methods. The overnight was used as a seed culture to infect sterile LB in a microplate. The growth in the microplate reader was monitored as a measure of absorbance at 600 nm. Cells were grown at 37C, and with reasonable agitation to prevent excess clumping. Cells were harvested at various stages of growth–early growth, log phase, saturation. When necessary, the culture was diluted with fresh LB prior to microscope imaging. More detailed instructions on sample preparation are outlined in the Supplement 1 in S1 File.

Flow cytometry

Cells were diluted to the appropriate density using LB with 5% sodium azide solution. Then, cell samples were used with the BacLight bacterial viability and Counting Kit (ThermoFisher). All flow cytometry experiments were performed on the factory-direct unmodified BD FACSCanto flow cytometer (San Jose, CA). We used 488 nm laser for extinction and detected fluorescence with a 525/550 nm as well as 610/620 nm bandpass filters as described previously [8].

Fluorescent microscopy and image preprocessing

Images of B. subtilis cell culture were acquired on the Nikon Eclipse E800 microscope equipped with a FITC filter set. Cells were stained with Ethidium Bromide to promote fluorescence. Raw images (unaltered pixel intensity) were segmented with an adaptive binary thresholding algorithm. Images of individual cells or cell clusters were cropped, saved, and annotated using a custom Matlab GUI (Supplement 13 in S1 File). All cell clusters were examined, and we determined that the image segments did not exceed 200 by 200 pixel resolution. As such, we set the input layer of the neural network to accept images of 200 by 200 pixel resolution (Supplement 3 in S1 File). In the null bumper approach, cropped images of cell clusters were placed in the middle of the 200 by 200 matrix, and the pixels around the cropped image were set to 0 intensity. In the blended bumper approach, the space around the cropped image was filled with pixel intensities that matched those in the local background of the cropped image (Supplement 2 in S1 File). For this study, we acquired 1,000 fluorescent microscope images, and then manually annotated these (20,855 segments in total) and 10% of the data was used for training the network and the remainder was reserved for evaluation of performance (see Supplement 2 in S1 File for population numbers of the segmented data).

Convolutional neural network and training

The structure of the convolutional neural network used in this study is described in the Supplement section 3 [23]. Briefly, it is a feed-forward network with 4 convolutional layers. Image data was preprocessed on local computers, while the training procedures were executed with Amazon Web Service resources (Supplement 4 in S1 File).

Evaluating network

Trained networks were evaluated with reserved data (10% of the original data set). The output of the network from these annotated, evaluation images was scored with a confusion matrix (Supplement 6 in S1 File).

Counting cells

The networks were also used to perform cell counting. To achieve that, raw fluorescence microscope images were preprocessed with an adaptive binary thresholding algorithm. Cropped images of cells and cell clusters were normalized to the 200 by 200 pixel resolution, and supplied to the classification neural network. The output of the network corresponded to the number of cells in a given cell clusters. The individual outputs were then added to find the total cell count. For ImageJ adaptive binary thresholding is first used to find clusters and then these cell clusters were further sub-segmented with a watershed algorithm. The image is treated as a topological map, with highest pixel intensities representing the bottom of the “basin”. The basins are then filled with pixels until pixels from neighboring basins come in contact. The boundary is drawn in the place of the contact. The total number of sub-segmented cells was then reported as a total cell count.

Manual cell counting

Fluorescent microscope images that were used to test the efficiency of software-based cell counting were also counted manually. These human cell counting experiments were conducted under the IRB’s oversight (IRB ID 19–566) and is described in more detail in the Supplement sections 10 (for Fig 5), and 12 (for Fig 6). (DOCX) Click here for additional data file.

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present. 28 Aug 2020 PONE-D-20-24391 Automated Classification of Bacterial Cell Sub-Populations with Convolutional Neural Networks PLOS ONE Dear Dr. Reuel, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. I agreed with the comments from both reviewers and their decisions that your manuscript requires major revision. Please address their comments by supplementing your current manuscript with additional materials in your revised manuscript in order to be considered for acceptance. Please submit your revised manuscript by Oct 11 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Yan Chai Hum Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: 1. No details on the computer vision algorithms. 2. Most of the references are outdated. 3. Comparison with previous studies are not available. 4. Summarize your contribution using at least one paragraph to explain the reason why other readers should read your article, what are the significance of this study and what knowledge this article provides? 5. It looks like an application rather than a research to me. Correct me if I am wrong. Reviewer #2: problem statement and project motivation are clearly defined. However, limited explanation on their proposed technique cCNN, why is this neural network been selected? There are many types of CNN are made available for this kind of similar classification works. No benchmarks literature are available to support their proposed techniques. Presently literature are very limited with only 19 references, many state-of-arts neural networks in recent years are not discussed in the present manuscript. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Khin Wee Lai [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 22 Sep 2020 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at a. https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf b. https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf Thank you for providing us with these details on how to appropriately format the manuscript. We have made the changes to comply with the editing style of PLOS One. Please see the edited manuscript, and let us know if you see anything that requires additional stylistic edits. 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. a. Reviewer #1: No b. Reviewer #2: Yes Thank you for providing that feedback. We have strengthened the manuscript through the comments below. 3. Has the statistical analysis been performed appropriately and rigorously? a. Reviewer #1: Yes b. Reviewer #2: No Thank you for providing that feedback. We have strengthened the manuscript through the comments below. 4. Have the authors made all data underlying the findings in their manuscript fully available? a. Reviewer #1: No b. Reviewer #2: Yes Thank you for providing that feedback. We can see how it can be overlooked that the data (labeled microscope images, and trained neural networks) were made available, as they are located on our group’s website (http://www.reuelgroup.org/resources.html). We reviewed the manuscript, and made sure that the correct links are listed. Additional information, such as Matlab code, is available in the supplement section. If the reviewers feel that additional code or other details need to be included in the main manuscript or the supplement, we will gladly address those requests prior to the publication. 5. Review Comments to the Author Reviewer #1: 1. No details on the computer vision algorithms. Thank you for outlining this weakness of our paper. We revised the introduction section, and provided additional detail on the computer vision algorithm, the architecture of the neural network, and its origins. If you see any seminal review or publication missing, we would be interested in including. We described the process of image preprocessing (which is what we defined as computer vision algorithms) in the results section. Lines 103-110 discuss object segmentation. In the following paragraph, lines 111-128, we described data augmentation methods used in this paper. Specifically, we talked about size normalization, and images rotation. The rest of the paragraph was dedicated towards elaborating on the type of the network that was used in this study. Additional information our choice of the neural network was described in the introduction. We decided that the training process information better fits in the supplement section of the paper, and as such the reader was referred to the supplement sections. With that said, we conceive of the possibility that the reader might have a slightly different definition of what “computer vision algorithms” means, and, perhaps, this is what the reviewer is referring to. If there are additional details you would like us to add to our description of computer vision algorithms used for this paper, please provide specifics and we can make those edits. 2 Most of the references are outdated. Thank you for expressing your concern about the age of the cited literature. We revisited the reference literature used in our manuscript, and cited recent reviews describing architectures of CNN, and use cases of CNN in microbiology imaging. This was done to also satisfy response to Reviewer #2’s similar request to update the reference literature. Please see below. 3. Comparison with previous studies are not available. Thank you for pointing this out. We would like to compare our approach to other classification architectures, but we cannot find other studies describing the use of CNN architectures to classify and enumerate cells from fluorescent microscope images. As such, we benchmarked our method with current go-to methods used by biologists such as manual image analysis and feature based recognition, like ImageJ. If there is a specific algorithm or paper to which you want us to benchmark our approach, please point us to the reference. 4. Summarize your contribution using at least one paragraph to explain the reason why other readers should read your article, what are the significance of this study and what knowledge this article provides? Our primary target audience consists of cell researchers who routinely use microscopy to classify and enumerate cells present on a slide. Our paper shows how to prepare such images for regression and expand a limited image data set using the rotation algorithm. It then presents a thorough guide of using a simple CNN architecture to determine network weights and classify images. For the cell type presented in this work (B. subtilis) we show superior accuracy and speed than existing methods such as manual counting and feature based recognition (ImageJ). We provide all code, and thus this technique can be readily adopted by others for other cell types, providing more accurate and timely results in their work. Our secondary audience is all those learning to use CNN type architectures in their work. Our paper and accompanying data sets (all labeled images) presents a useful learning module for those wanting to go through the design steps of classification with CNN before applying to their problem. 5. It looks like an application rather than a research to me. Correct me if I am wrong. This work is a research application. It is true we are not a group that works on new CNN architectures. Instead we are applying an existing architecture to a new problem, one that we (and others to date) have had to rely on manual counting to achieve. In doing so, we had to determine how to expand a limited data set for training and evaluate the performance against existing methods. Many papers that we routinely read are similar in scope (see examples below), using an established technique in a new application area. We posit that this has merit in building the foundation of science, as it expands the utility of a new method/tool. Without such application build out, there never would be dissemination of a new method.1–5 Examples: (1) Derrien, T.; Estellé, J.; Sola, S. M.; Knowles, D. G.; Raineri, E.; Guigó, R.; Ribeca, P. Fast Computation and Applications of Genome Mappability. PLOS ONE 2012, 7 (1), e30377. https://doi.org/10.1371/journal.pone.0030377. (2) Lu, H.-Y. Application of Optimal Designs to Item Calibration. PLOS ONE 2014, 9 (9), e106747. https://doi.org/10.1371/journal.pone.0106747. (3) Faria, B.; Abreu, F. V. de. Cellular Frustration Algorithms for Anomaly Detection Applications. PLOS ONE 2019, 14 (7), e0218930. https://doi.org/10.1371/journal.pone.0218930. (4) Onsorodi, A. H. H.; Korhan, O. Application of a Genetic Algorithm to the Keyboard Layout Problem. PLOS ONE 2020, 15 (1), e0226611. https://doi.org/10.1371/journal.pone.0226611. (5) Mehta, A. S.; Lau, D. T.-Y.; Wang, M.; Islam, A.; Nasir, B.; Javaid, A.; Poongkunran, M.; Block, T. M. Application of the Doylestown Algorithm for the Early Detection of Hepatocellular Carcinoma. PLOS ONE 2018, 13 (8), e0203149. https://doi.org/10.1371/journal.pone.0203149. Reviewer #2: Problem statement and project motivation are clearly defined. However, limited explanation on their proposed technique cCNN, why is this neural network been selected? There are many types of CNN are made available for this kind of similar classification works. Thank you for pointing out this weakness in our manuscript. We agree that a thorough description of the justification for selecting the CNN architecture that we used was lacking. To address, we reworked the introduction section, and included that discussion. There are many types of CNN are made available for this kind of similar classification works. No benchmarks literature are available to support their proposed techniques. Presently literature are very limited with only 19 references, many state-of-arts neural networks in recent years are not discussed in the present manuscript. We greatly appreciate this feedback, and agree that additional references can help the reader gain a better understanding of the background. We revised the introduction to contain a description of the current strategy for selecting networks, as well as a brief history of the field of deep learning. We backed up that information with additional, relevant citations. If we have missed a seminal work, please bring this to our attention, and we will be happy to add it to the list of references. Submitted filename: AI paper Cover Letter PLOS oNE_Revisions.docx Click here for additional data file. 12 Oct 2020 Automated Classification of Bacterial Cell Sub-Populations with Convolutional Neural Networks PONE-D-20-24391R1 Dear Dr. Reuel, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Yan Chai Hum Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: all comments have been addressed accordingly. I would recommend for acceptance of this article in current form. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: No 16 Oct 2020 PONE-D-20-24391R1 Automated Classification of Bacterial Cell Sub-Populations with Convolutional Neural Networks Dear Dr. Reuel: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Yan Chai Hum Academic Editor PLOS ONE

16 in total

1. Stalked protozoa identification by image analysis and multivariable statistical techniques.

Authors: A L Amaral; Y P Ginoris; A Nicolau; M A Z Coelho; E C Ferreira
Journal: Anal Bioanal Chem Date: 2008-03-08 Impact factor: 4.142

2. Formation and functionalization of membraneless compartments in Escherichia coli.

Authors: Shao-Peng Wei; Zhi-Gang Qian; Chun-Fei Hu; Fang Pan; Meng-Ting Chen; Sang Yup Lee; Xiao-Xia Xia
Journal: Nat Chem Biol Date: 2020-06-29 Impact factor: 15.040

3. Transcription factor-based biosensor for detection of phenylalanine and tyrosine in urine for diagnosis of phenylketonuria.

Authors: Chieh Lin; Yung-Cheng Jair; Yi-Chieh Chou; Pai-Shan Chen; Yi-Chun Yeh
Journal: Anal Chim Acta Date: 2018-08-29 Impact factor: 6.558

Review 4. Engineering advanced cancer therapies with synthetic biology.

Authors: Ming-Ru Wu; Barbara Jusiak; Timothy K Lu
Journal: Nat Rev Cancer Date: 2019-04 Impact factor: 60.716

Review 5. Biofilms: an emergent form of bacterial life.

Authors: Hans-Curt Flemming; Jost Wingender; Ulrich Szewzyk; Peter Steinberg; Scott A Rice; Staffan Kjelleberg
Journal: Nat Rev Microbiol Date: 2016-08-11 Impact factor: 60.633

6. Resilient living materials built by printing bacterial spores.

Authors: Lina M González; Nikita Mukhitov; Christopher A Voigt
Journal: Nat Chem Biol Date: 2019-12-02 Impact factor: 15.040

7. Controlling Heterogeneity and Increasing Titer from Riboswitch-Regulated Bacillus subtilis Spores for Time-Delayed Protein Expression Applications.

Authors: Denis Tamiev; Alyssa Lantz; Grace Vezeau; Howard Salis; Nigel F Reuel
Journal: ACS Synth Biol Date: 2019-09-19 Impact factor: 5.110

8. Genetic Engineering of Oligotropha carboxidovorans Strain OM5-A Promising Candidate for the Aerobic Utilization of Synthesis Gas.

Authors: Daniel Siebert; Tobias Busche; Aline Y Metz; Medina Smaili; Bastian A W Queck; Jörn Kalinowski; Bernhard J Eikmanns
Journal: ACS Synth Biol Date: 2020-05-19 Impact factor: 5.110

9. Propidium iodide staining underestimates viability of adherent bacterial cells.

Authors: Merilin Rosenberg; Nuno F Azevedo; Angela Ivask
Journal: Sci Rep Date: 2019-04-24 Impact factor: 4.379

10. Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides.

Authors: Arkadiusz Gertych; Zaneta Swiderska-Chadaj; Zhaoxuan Ma; Nathan Ing; Tomasz Markiewicz; Szczepan Cierniak; Hootan Salemi; Samuel Guzman; Ann E Walts; Beatrice S Knudsen
Journal: Sci Rep Date: 2019-02-06 Impact factor: 4.379

2 in total

1. A comprehensive review of image analysis methods for microorganism counting: from classical image processing to deep learning approaches.

Authors: Jiawei Zhang; Chen Li; Md Mamunur Rahaman; Yudong Yao; Pingli Ma; Jinghua Zhang; Xin Zhao; Tao Jiang; Marcin Grzegorzek
Journal: Artif Intell Rev Date: 2021-09-29 Impact factor: 9.588

2. Applications of artificial neural networks in microorganism image analysis: a comprehensive review from conventional multilayer perceptron to popular convolutional neural network and potential visual transformer.

Authors: Jinghua Zhang; Chen Li; Yimin Yin; Jiawei Zhang; Marcin Grzegorzek
Journal: Artif Intell Rev Date: 2022-05-04 Impact factor: 9.588

2 in total