| Literature DB >> 34290253 |
Godai Suzuki1, Yutaka Saito1,2,3, Motoaki Seki4, Daniel Evans-Yamamoto4,5,6, Mikiko Negishi4, Kentaro Kakoi4, Hiroki Kawai7, Christian R Landry8,9,10,11,12, Nozomu Yachie13,14,15,16, Toutai Mitsuyama17.
Abstract
Morphological profiling is a combination of established optical microscopes and cutting-edge machine vision technologies, which stacks up successful applications in high-throughput phenotyping. One major question is how much information can be extracted from an image to identify genetic differences between cells. While fluorescent microscopy images of specific organelles have been broadly used for single-cell profiling, the potential ability of bright-field (BF) microscopy images of label-free cells remains to be tested. Here, we examine whether single-gene perturbation can be discriminated based on BF images of label-free cells using a machine learning approach. We acquired hundreds of BF images of single-gene mutant cells, quantified single-cell profiles consisting of texture features of cellular regions, and constructed a machine learning model to discriminate mutant cells from wild-type cells. Interestingly, the mutants were successfully discriminated from the wild type (area under the receiver operating characteristic curve = 0.773). The features that contributed to the discrimination were identified, and they included those related to the morphology of structures that appeared within cellular regions. Furthermore, functionally close gene pairs showed similar feature profiles of the mutant cells. Our study reveals that single-gene mutant cells can be discriminated from wild-type cells based on BF images, suggesting the potential as a useful tool for mutant cell profiling.Entities:
Mesh:
Year: 2021 PMID: 34290253 PMCID: PMC8295336 DOI: 10.1038/s41540-021-00190-w
Source DB: PubMed Journal: NPJ Syst Biol Appl ISSN: 2056-7189
List of the target genes for creating mutant cells.
| Target gene | Functional information |
|---|---|
| PSMA2 | Subunit alpha-2 of the alpha ring of the proteasome core complex. Paralog of PSMA7. |
| PSMA7 | Subunit alpha-4 of the alpha ring of the proteasome core complex. Paralog of PSMA2. |
| PSMB5 | Subunit beta-5 of the beta ring of the proteasome core complex. Paralog of PSMB6. |
| PSMB6 | Subunit beta-1 of the beta ring of the proteasome core complex. Paralog of PSMB5. |
| PSME1 | Alpha subunit of the PA28 complex involved in the immunoproteasome. Paralog of PSME2. |
| PSME2 | Beta subunit of the PA28 complex involved in the immunoproteasome. Paralog of PSME1. |
| UBQLN1 | Ubiqulin that shuttles a protein to the proteasome. Paralog of UBQLN2. |
| UBQLN2 | Ubiqulin that shuttles a protein to the proteasome. Paralog of UBQLN1. |
Fig. 1Workflow for the image analysis of single-gene mutant cells.
a HEK293Ta was used as parental cells (wild type), and single-gene knockout mutant cells were produced by CRISPR-Cas9 genome editing. b Cell imaging. Using an automated image acquisition system, the BF image and the fluorescent image (Hoechst33342) of over 670 cells were obtained for each mutant and the wild type. c Quantification of texture features for each single cell. Each single-cell region was identified based on the position of a nucleus, and texture features were quantified from the BF images.
Fig. 2Evaluation of the discriminative models for mutant cells.
a Receiver operating characteristic (ROC) curves in the discrimination of PSMB5 mutant cells from wild-type cells. Thin blue lines represent ROC curves calculated with tenfold cross-validation. Thick blue line and gray area represent the mean and the quartile of the ROC curves, respectively. b Relationship between posterior probabilities and linear predictors of the discriminative model for PSMB5 mutant cells. Upper and middle plots show the distributions of linear predictors of wild-type cells and mutant cells, respectively. Lower plot shows a sigmoid curve between posterior probabilities and linear predictors. c AUC of the discriminative model for each mutant cell type. The error bar shows the standard deviation in tenfold cross-validation. Red dashed line represents AUC = 0.59.
Fig. 3Examples of the large-weight features in the discriminative model for PSMB5 mutant cells.
a Schematics of the feature extraction procedure from BF images. Tau indicates a threshold of pixel value to identify clumps within a cellular region as described previously[15]. b Distribution of PSMB5 mutant cells and wild-type cells on the feature “sample mean of number of clumps”. Blue and red histograms represent mutant and wild-type cells, respectively. P value of one-sided U test between mutant and wild-type cells, and the regression coefficient (RC) in the logistic regression model are shown. c Distribution of PSMB5 mutant cells and wild-type cells on the feature “sample mean of average size of clump areas” shown in the same way as in b. d Distribution of PSMB5 mutant cells and wild-type cells on the feature “sample mean of average value of noncircularity” shown in the same way as in b. e Example of clump detection using an image of a wild-type cell. The leftmost panel is an input cellular image, and the right panels are those processed with different thresholds. In the magnified panel, each clump is shown in a different color.
Fig. 4Comparison of the mutants based on morphological profiles.
Clustering of mutants (vertical) and features (horizontal) using the regression coefficients in the discriminative models. Red and blue colors in the heatmap represent positive and negative regression coefficients, respectively. Features not selected by L1 regularization were colored in white. Numbers described on the right side of gene names show the number of selected features.