| Literature DB >> 29203784 |
Takashi Kamatani1, Koichi Fukunaga2, Kaede Miyata3, Yoshitaka Shirasaki3, Junji Tanaka4, Rie Baba1, Masako Matsusaka1, Naoyuki Kamatani4, Kazuyo Moro5, Tomoko Betsuyaku1, Sotaro Uemura3.
Abstract
For single-cell experiments, it is important to accurately count the number of viable cells in a nanoliter well. We used a deep learning-based convolutional neural network (CNN) on a large amount of digital data obtained as microscopic images. The training set consisted of 103 019 samples, each representing a microscopic grayscale image. After extensive training, the CNN was able to classify the samples into four categories, i.e., 0, 1, 2, and more than 2 cells per well, with an accuracy of 98.3% when compared to determination by two trained technicians. By analyzing the samples for which judgments were discordant, we found that the judgment by technicians was relatively correct although cell counting was often difficult by the images of discordant samples. Based on the results, the system was further enhanced by introducing a new algorithm in which the highest outputs from CNN were used, increasing the accuracy to higher than 99%. Our system was able to classify the data even from wells with a different shape. No other tested machine learning algorithm showed a performance higher than that of our system. The presented CNN system is expected to be useful for various single-cell experiments, and for high-throughput and high-content screening.Entities:
Mesh:
Year: 2017 PMID: 29203784 PMCID: PMC5715092 DOI: 10.1038/s41598-017-17012-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Stepwise change in loss and accuracy in the training process. Loss denotes the value of cross entropy at each step. The process of training with images of square-shaped wells is assessed by reference to changes in training loss (a), training accuracy (b), validation loss (c), and validation accuracy (d) with each iteration step. The process of training with images of round-shaped wells is assessed by reference to changes in training loss (e), training accuracy (f), validation loss (g), and validation accuracy (h) with each iteration step.
Proportions of accurately classified samples for different cell numbers determined by technicians.
| Correct number of cells per well | Overall | 0 | 1 | 2 | >2 |
|---|---|---|---|---|---|
| Number of wells according to trained technicians | 7,920 | 6,458 | 1,225 | 209 | 28 |
| Number of concordant wells* | 7,787 | 6,420 | 1,150 | 193 | 24 |
| Proportion | 0.98321 | 0.99412 | 0.93878 | 0.92344 | 0.85714 |
*Number of wells in which answers from trained technicians and trained CNN were the same.
Proportions of accurately classified samples for different predicted cell numbers by trained CNN.
| Predicted number of cells per well | Overall | 0 | 1 | 2 | >2 |
|---|---|---|---|---|---|
| Number of wells according to trained CNN | 7,920 | 6,467 | 1,203 | 225 | 25 |
| Number of concordant wells | 7,787 | 6,420 | 1,150 | 193 | 24 |
| Proportion | 0.98321 | 0.99273 | 0.95594 | 0.85778 | 0.96 |
Discordance matrix showing the numbers of samples (wells) for which answers from technicians and CNN were different.
| Number of cells by trained CNN | ||||||
|---|---|---|---|---|---|---|
| 0 | 1 | 2 | over 2 | Total | ||
| Number of cells by trained technicians | 0 | — | 38* | 0 | 0 | 38 |
| 1 | 47 | — | 28 | 0 | 75 | |
| 2 | 0 | 15 | — | 1 | 16 | |
| over 2 | 0 | 0 | 4 | — | 4 | |
| total | 47 | 53 | 32 | 1 | 133 | |
*Number of wells for which decisions by trained technicians and CNN were different (discordant wells).
Figure 2Images of nanoliter wells. (a) An example of a grayscale, 511 × 511 pixel digital image used as a sample. In this example, one cell is present in the well. (b) Examples of discordant well images. (c) An example 511 × 511-pixel image for the circular shaped nanoliter wells.
Concordance rate between the majority answers by 10 people and decisions by two technicians or those by trained CNN.
| Majority of 10 people vs | Two technicians | Trained CNN | p value* |
|---|---|---|---|
| Number of concordant wells | 87** | 36*** | |
| Number of discordant wells | 46 | 97 | |
| Total | 133 | 133 | |
| Concordance rate (%) | 0.65 | 0.27 | <0.0001 |
*Chi square test.
**Number of wells in which the decisions were concordant between the majority answers by 10 people and two technicians.
***Number of wells in which the decisions were concordant between the majority answers by 10 people and trained CNN.
Figure 3Defining ambiguous samples increases prediction accuracy. Ambiguous samples were defined by their highest outputs being lower than a threshold, and subsequently assessed by trained technicians. (a) A box plot comparison of log transformed highest outputs of concordant (7835) and discordant (133) samples. The results clearly indicate that discordant samples had much lower highest outputs as compared with concordant samples (P < 2 × 10−16, Mann-Whitney’s U test). (b–d) Change in the concordance rate between the results from the trained CNN and the trained technicians by changing the threshold value for the highest output from CNN for each sample. The samples with the highest outputs lower than the threshold value were classified as ambiguous samples. In this analysis, the ambiguous samples are assumed to be subsequently judged by technicians and all will become concordant. When the proportions of the ambiguous samples are 0 (b), 1 (c), and 5% (d), the concordance rates will become 98.3%, 99.0%, and 99.5% respectively.
Architecture of the network.
| Layer description | No of nodes | No of Weights |
|---|---|---|
| Input | 511 × 511 | |
| Local respose normalization 1 | 511 × 511 | |
| Max pooling 1 | 256 × 256 | |
| Convolution 1 | 32 × 256 × 256 | 5 × 5 × 1 × 32 |
| Convolution 2 | 32 × 256 × 256 | 5 × 5 × 32 × 32 |
| Max pooling 2 | 32 × 128 × 128 | |
| Local respose normalization 2 | 32 × 128 × 128 | |
| Convolution 3 | 64 × 64 × 64 | 5 × 5 × 32 × 64 |
| Convolution 4 | 64 × 64 × 64 | 5 × 5 × 64 × 64 |
| Convolution 5 | 64 × 64 × 64 | 5 × 5 × 64 × 64 |
| Max pooling 3 | 64 × 32 × 32 | |
| Local respose normalization 3 | 64 × 32 × 32 | |
| Convolution 6 | 32 × 16 × 16 | 5 × 5 × 64 × 32 |
| Convolution 7 | 32 × 16 × 16 | 5 × 5 × 32 × 32 |
| Convolution 8 | 32 × 16 × 16 | 5 × 5 × 32 × 32 |
| Max pooling 4 | 32 × 8 × 8 | |
| Local respose normalization 4 | 32 × 8 × 8 | |
| Totally connected 1 | 32 | 32 × 32 × 8 × 8 |
| Totally connected 2 | 512 | 512 × 32 |
| Output | 4 | 4 × 512 |
| Total | 468,768 |