| Literature DB >> 32615917 |
Minhua Qiu1, Bin Zhou2, Frederick Lo2, Steven Cook2, Jason Chyba2, Doug Quackenbush2, Jason Matzen2, Zhizhong Li2,3, Puiying Annie Mak2,4, Kaisheng Chen2, Yingyao Zhou5.
Abstract
BACKGROUND: Image-based high throughput (HT) screening provides a rich source of information on dynamic cellular response to external perturbations. The large quantity of data generated necessitates computer-aided quality control (QC) methodologies to flag imaging and staining artifacts. Existing image- or patch-level QC methods require separate thresholds to be simultaneously tuned for each image quality metric used, and also struggle to distinguish between artifacts and valid cellular phenotypes. As a result, extensive time and effort must be spent on per-assay QC feature thresholding, and valid images and phenotypes may be discarded while image- and cell-level artifacts go undetected.Entities:
Keywords: Cell-level quality control; CellProfiler; High throughput image analysis; Image quality measurement; Machine learning
Mesh:
Year: 2020 PMID: 32615917 PMCID: PMC7333376 DOI: 10.1186/s12859-020-03603-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Schematic diagram of QC workflow. a A phenotype sampler to collect images that retain phenotypic variety. Grouped by their phenotypes, sampled images provide representative cells to train one one-class SVM per phenotype. (Dashed inset in (i) expanded in (ii-iv)). b A set of one-class SVMs to identify artifacts from each image. (‘!’ denotes a phenotype is excluded through visual inspection.) In all cell QC masks, artifacts are marked in green and good quality cells are labeled in blue, with the percentage of artifacts listed in white
Fig. 2Performance of cell-level QC, assay α. a Images sampled by our phenotype sampler are grouped into 7 phenotypes with 5 examples shown for each. b-c A comparison between conventional image quality measurements and AR. Each dot represents one image with the size and intensity proportional to AR. For the conventional QC metrics, FocusScore (FS) and PowerLogLogSlope (PLLS), the dashed line indicates the median value and the solid lines show 1st quartile - IQR*1.5 and 3rd quartile + IQR*1.5 respectively (IQR = interquartile range). Examples of cell-level QC result (colored arrows) are listed with their FS, PLLS and AR (AR) in (C). In all QC masks, artifacts are marked in green and good quality cells are labeled in blue. d Cell-level QC improves the accuracy of well-level phenotype summary. Top, an example of inconsistent dose response: out of six replicates (grey curves), the outlier curve (marked by red squares) contains images with AR between 3.8 and 63.0% as indicated by marker size. Bottom, comparison of drug response consistency before and after cell-level QC at concentration equal to 10uM (see Supplementary Fig. 7 for other concentrations). Each circle represents one compound
Fig. 3Performance of cell-level QC, assay β. a Examples of sampled images grouped by their phenotypes. Sampled images were grouped into seven phenotypes with five examples shown for each. For visualization purposes, only a representative region of the original well image is displayed. b A comparison of phenotype score before and after cell-level QC for different cell lines. From top to bottom are examples of raw images and their phenotype masks before and after QC. Red dash line box shows a well dominated by artifacts (AR > 70%) and removed before downstream analysis. c Examples of wells containing heterogeneous subpopulations to show minority phenotypes (e.g., cells labeled in magenta) were not mistakenly labeled as artifacts by our cell-level QC approach. In all masks, nuclei with phenotype of interest are labeled in magenta, otherwise blue; detected artifacts are marked in green
Fig. 4Detection of staining artifacts, a comparison between our cell-level QC and a patch-level QC application recently integrated into common image analysis platforms. a From left to right, four different types of artifacts are displayed with their: top - raw image, middle - single cell QC result, and bottom - patch-level defocus score generated by a deep learning approach (Yang 2018). In the cell QC mask, artifacts are marked in green and good quality cells are labeled in blue. In the results generated by the patch-level approach, the patch outlines denote the predicted defocus level by hue (red for least defocus) and prediction certainty by lightness (increased lightness for increased certainty). b A comparison between the ratio of defocused patches (y-axis) and AR (x-axis) for images collected from assay α and β. Each dot represents an image with both the size and intensity proportional to the prediction certainty of defocus level. Patches with defocus level greater than 5 are considered out of focus. Images within the red dashed circle show high defocus patch ratio but low prediction certainty. More comparisons using different defocus level cutoffs can be found in Supplementary Fig. 8 along with labeled examples