| Literature DB >> 30310752 |
Jeffrey C Berry1, Noah Fahlgren1, Alexandria A Pokorny1, Rebecca S Bart1, Kira M Veley1.
Abstract
High-throughput phenotyping has emerged as a powerful method for studying plant biology. Large image-based datasets are generated and analyzed with automated image analysis pipelines. A major challenge associated with these analyses is variation in image quality that can inadvertently bias results. Images are made up of tuples of data called pixels, which consist of R, G, and B values, arranged in a grid. Many factors, for example image brightness, can influence the quality of the image that is captured. These factors alter the values of the pixels within images and consequently can bias the data and downstream analyses. Here, we provide an automated method to adjust an image-based dataset so that brightness, contrast, and color profile is standardized. The correction method is a collection of linear models that adjusts pixel tuples based on a reference panel of colors. We apply this technique to a set of images taken in a high-throughput imaging facility and successfully detect variance within the image dataset. In this case, variation resulted from temperature-dependent light intensity throughout the experiment. Using this correction method, we were able to standardize images throughout the dataset, and we show that this correction enhanced our ability to accurately quantify morphological measurements within each image. We implement this technique in a high-throughput pipeline available with this paper, and it is also implemented in PlantCV.Entities:
Keywords: Image analysis; Image correction; Large-scale biology; Least-squares regression; Phenotyping
Year: 2018 PMID: 30310752 PMCID: PMC6174877 DOI: 10.7717/peerj.5727
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Effect of temperature on light intensity and image variability.
(A) Graph of calculated deviance (y-axis) within the plant image set and the hour the image was taken in ZT time (x-axis). Each dot represents a single image in the plant image set. The deviance from reference was calculated in Eq. (7). (B and C) Graphs of temperature readings (y-axis) taken at hours at which imaging was done in ZT time (x-axis) in either diurnal (B) or constant (C) conditions. (D and E) Graphs of calculated deviance (y-axis) within the temperature image set and the hour the image was taken in ZT time (x-axis) under either diurnal (D) or constant (E) conditions. Each dot represents a mean of two images in the “temperature image set”. The deviance from reference was calculated in Eq. (7). (A–C). Daytime ZT 0–14 = hr 0,700–2,100 and nighttime ZT 14–24 = hr 2,100–0,700 (A–E). Temperature settings at particular times of day are indicated, either 22 °C (blue) or 32 °C (gold).
Figure 2Standardization effects on shape measurements.
(A–D) Example of the effect of standardization. (A) original unedited image. (B) same image after applying standardization. Plant-isolation masks generated without (C) and with (D) standardization. The magenta coloring in the unstandardized mask indicates pixels that are not contained in the standardized mask. Scale bar = 15 cm. (E) Effect size (y-axis) of each of the shapes (x-axis) before and after standardization grouped by experimental day and night. Values of each shape per image are normalized by subtracting the mean and dividing by the variance. All shapes except ellipse major axis, ellipse center x-coordinate, aspect ratio, and ellipse center y-coordinate are significantly different between daytime and nighttime temperatures (Unequal variance t-test, p-value < 0.05). Shapes are sorted by increasing p-values and are: area, convex hull area, width, perimeter, circularity, solidity, center of mass x-coordinate, height, ellipse minor axis, fractal dimension, roundness, eccentricity, center of mass y-coordinate, ellipse angle, convex hull vertices, ellipse minor axis, ellipse center x-coordinate, aspect ratio, and ellipse center y-coordinate. (F) Association of biomass (y-axis) to area (x-axis) before and after standardization. Every point is an image taken on the last day within the plant image set for which weight was recorded. Linear fit (green line: y = − 0.69 + 0.126 ×; pink line: y = − 0.85 + 0.124 ×) and R2 is displayed for each condition. (G and H) Effect of standardization on perimeter (G) and area (H). Every point is an image in the entire plant image set. Displayed are the measured perimeter and area from before standardization (x-axis) and after standardization (y-axis). Temperature settings at particular times of day are indicated, either 22 °C (blue) or 32 °C (gold). Black line indicates y = x.
Figure 3Standardization effects on color measurements.
(A–D) Average hue histograms of plants 19 days after planting. Shown are the average histograms of unstandardized daytime and nighttime images as well as standardized daytime and nighttime images. For each facet, the x-axis is the hue channel from 0 to 140 degrees and the y-axis is the percentage of the mask at a particular degree. The color of each degree is shown as the color of the line. Gray areas are the 95% confidence interval. (E) Using the same histograms as in (A–D), cumulative distributions of each are shown on the y-axis, and the hue range from 0 to 140 are on the x-axis. As the legend indicates, each line is given a number and the Kolmogorov–Smirnov test as described in Eqs. (8), (9) and (10) are displayed using the numbers that are assigned to each condition. (F–I) Constrained analysis of principal coordinates conditioning on time, in days, for both unstandardized and standardized images colored by nitrogen treatment for both daytime and nighttime images. Constrained axis 1 (CAP1, x-axis) is conditioned on time while multidimensional scaled axis 1 (MDS1, y-axis) is not conditioned on any variables.