Literature DB >> 33986091

sparse-growth-curve: a Computational Pipeline for Parsing Cellular Growth Curves with Low Temporal Resolution.

Chuankai Cheng1, J Cameron Thrash2.   

Abstract

Here, we introduce a Python-based repository, sparse-growth-curve, a software package designed for parsing cellular growth curves with low temporal resolution. The repository uses cell density and time data as the input, automatically separates different growth phases, calculates the exponential growth rates, and produces multiple graphs to aid in interpretation.
Copyright © 2021 Cheng and Thrash.

Entities:  

Year:  2021        PMID: 33986091      PMCID: PMC8142577          DOI: 10.1128/MRA.00296-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The cellular growth curve arising from cell density variation over time is an important phenotype for cell physiology and systems biology studies (1). Improving the throughput of growth rate calculations from growth curves benefits from automated curve-fitting algorithms, but most existing software has been developed for data with high temporal resolution (2–8). Such data are usually generated by automated optical density measurements (5), which apply only to a limited set of organisms that can achieve high cell densities. Absolute cell count quantification (cells per milliliter) can be labor-intensive for high-temporal-resolution growth curves when methods like flow cytometry or microscopy are used. Therefore, we have developed a growth-curve-fitting pipeline specifically for characterizing sparse data. The sparse-growth-curve method takes an array of cell density X and the corresponding acquisition time t (see the template Excel file at GitHub, https://github.com/thrash-lab/sparse-growth-curve) as the input (i.e., typical growth data) (Fig. 1A). The method differentiates growth phases and returns a piecewise linear fit on a logarithmic scale (Fig. 1B). The growth phases are determined through decision tree regression (Fig. 1C), based on defining each growth phase as a period with similar instantaneous growth rates (see https://github.com/thrash-lab/sparse-growth-curve/blob/main/detailed_method.md for detailed methods). The result is automated detection of exponential phase in sparse data and calculation of growth rates according to linear regression.
FIG 1

Overview of the process within sparse-growth-curve. (A) The input for the method (red open circles connected by dashed lines) is cell density X versus time t. Here, we use an imperfect and very sparse curve to illustrate the functionality of the software. (B) The output of the method is piecewise linear regression for each phase. The exponential phase is highlighted in red. The doubling rates are labeled. (C) The subdomains of the piecewise fit in plot B are determined through decision tree regression. Here, gray open circles and squares represent the instantaneous doubling rates calculated from the input data. Red solid squares represent the noise-removed rates versus time for decision tree regression. The state prediction (dashed line) ends up as a step function, and each step corresponds to a growth phase. (D) An extended feature of the sparse-growth-curve package is that the package imports experimental growth curves of multiple strains under multiple conditions. Exponential growth rates for all of the growth curves are calculated. The conversion of doubling rate γ to specific growth rate λ is λ = ln(2)γ. The solid lines are the interpolated connection mean growth rates, and the shaded areas indicate the range between the maximum and minimum growth rates.

Overview of the process within sparse-growth-curve. (A) The input for the method (red open circles connected by dashed lines) is cell density X versus time t. Here, we use an imperfect and very sparse curve to illustrate the functionality of the software. (B) The output of the method is piecewise linear regression for each phase. The exponential phase is highlighted in red. The doubling rates are labeled. (C) The subdomains of the piecewise fit in plot B are determined through decision tree regression. Here, gray open circles and squares represent the instantaneous doubling rates calculated from the input data. Red solid squares represent the noise-removed rates versus time for decision tree regression. The state prediction (dashed line) ends up as a step function, and each step corresponds to a growth phase. (D) An extended feature of the sparse-growth-curve package is that the package imports experimental growth curves of multiple strains under multiple conditions. Exponential growth rates for all of the growth curves are calculated. The conversion of doubling rate γ to specific growth rate λ is λ = ln(2)γ. The solid lines are the interpolated connection mean growth rates, and the shaded areas indicate the range between the maximum and minimum growth rates. The software package can analyze multiple growth curves from different replicates, strains, or conditions (e.g., salinity, temperature, or substrate). The output includes multiple products, including (i) plots of the raw growth data on a logarithmic scale, including the best-fit linear regressions for exponential phase; (ii) box plots of growth rates separated by treatment and colored by replicates; (iii) growth rate calculations in both doubling and specific growth rates, with output in an Excel file; and (iv) multistrain/replicate plots of growth rates for experimental designs with continuous variables (e.g., temperature) (Fig. 1D).

Data availability.

sparse-growth-curve is available on GitHub (https://github.com/thrash-lab/sparse-growth-curve) under MIT license. The code can be run directly in the browser through Google Colaboratory (https://colab.research.google.com/notebooks/intro.ipynb) without any configuration required on a local machine. The code in iPython Notebook v5.5.0 format illustrates the detailed steps of the methods. There are three separate notebooks, i.e., (i) a notebook showing how a single growth curve (X versus t) (Fig. 1A) is fitted (Fig. 1B); (ii) a notebook showing how a single Excel file with multiple growth curves is parsed and characterized; and (iii) a notebook that can handle multiple Excel files (people may want to put growth curves collected from different experimental runs in separate files) and analyze them all in a batch. The Python v3.7.10 (9) code relies on multiple packages (which are automatically installed by running the script in the Google Colab notebooks), including NumPy v1.19.5 (10) for numerical analysis, SciPy v1.4.1 (11) and Scikit-learn v0.22.2 (12) for statistical analysis, Pandas v1.1.5 (13) for handling table format data, Matplotlib v3.2.2 (14) for data visualization, and XlsxWriter v1.3.9 (https://github.com/jmcnamara/XlsxWriter) for writing Excel files. Note that, as time goes on, the package versions will be updated within the Google Colab environment.
  9 in total

Review 1.  From growth physiology to systems biology.

Authors:  Moselio Schaechter
Journal:  Int Microbiol       Date:  2006-09       Impact factor: 2.479

2.  Detecting differential growth of microbial populations with Gaussian process regression.

Authors:  Peter D Tonner; Cynthia L Darnell; Barbara E Engelhardt; Amy K Schmid
Journal:  Genome Res       Date:  2016-11-18       Impact factor: 9.043

3.  PMAnalyzer: a new web interface for bacterial growth curve analysis.

Authors:  Daniel A Cuevas; Robert A Edwards
Journal:  Bioinformatics       Date:  2017-06-15       Impact factor: 6.937

Review 4.  Array programming with NumPy.

Authors:  Charles R Harris; K Jarrod Millman; Stéfan J van der Walt; Ralf Gommers; Pauli Virtanen; David Cournapeau; Eric Wieser; Julian Taylor; Sebastian Berg; Nathaniel J Smith; Robert Kern; Matti Picus; Stephan Hoyer; Marten H van Kerkwijk; Matthew Brett; Allan Haldane; Jaime Fernández Del Río; Mark Wiebe; Pearu Peterson; Pierre Gérard-Marchant; Kevin Sheppard; Tyler Reddy; Warren Weckesser; Hameer Abbasi; Christoph Gohlke; Travis E Oliphant
Journal:  Nature       Date:  2020-09-16       Impact factor: 49.962

5.  Growthcurver: an R package for obtaining interpretable metrics from microbial growth curves.

Authors:  Kathleen Sprouffske; Andreas Wagner
Journal:  BMC Bioinformatics       Date:  2016-04-19       Impact factor: 3.169

6.  PRECOG: a tool for automated extraction and visualization of fitness components in microbial growth phenomics.

Authors:  Luciano Fernandez-Ricaud; Olga Kourtchenko; Martin Zackrisson; Jonas Warringer; Anders Blomberg
Journal:  BMC Bioinformatics       Date:  2016-06-23       Impact factor: 3.169

Review 7.  SciPy 1.0: fundamental algorithms for scientific computing in Python.

Authors:  Pauli Virtanen; Ralf Gommers; Travis E Oliphant; Matt Haberland; Tyler Reddy; David Cournapeau; Evgeni Burovski; Pearu Peterson; Warren Weckesser; Jonathan Bright; Stéfan J van der Walt; Matthew Brett; Joshua Wilson; K Jarrod Millman; Nikolay Mayorov; Andrew R J Nelson; Eric Jones; Robert Kern; Eric Larson; C J Carey; İlhan Polat; Yu Feng; Eric W Moore; Jake VanderPlas; Denis Laxalde; Josef Perktold; Robert Cimrman; Ian Henriksen; E A Quintero; Charles R Harris; Anne M Archibald; Antônio H Ribeiro; Fabian Pedregosa; Paul van Mulbregt
Journal:  Nat Methods       Date:  2020-02-03       Impact factor: 28.547

8.  Gene Dispensability in Escherichia coli Grown in Thirty Different Carbon Environments.

Authors:  Madeline Tong; Shawn French; Sara S El Zahed; Wai Kit Ong; Peter D Karp; Eric D Brown
Journal:  mBio       Date:  2020-09-29       Impact factor: 7.867

9.  A Bayesian non-parametric mixed-effects model of microbial growth curves.

Authors:  Peter D Tonner; Cynthia L Darnell; Francesca M L Bushell; Peter A Lund; Amy K Schmid; Scott C Schmidler
Journal:  PLoS Comput Biol       Date:  2020-10-26       Impact factor: 4.475

  9 in total
  2 in total

1.  Draft Genome Sequence of the Marine Flavobacteriaceae sp. Strain LSUCC0859.

Authors:  Holly R D Stapelfeldt; Shelby J Barnes; Michael W Henson; J Cameron Thrash
Journal:  Microbiol Resour Announc       Date:  2022-05-02

2.  Ecophysiology of the Cosmopolitan OM252 Bacterioplankton (Gammaproteobacteria).

Authors:  Emily R Savoie; V Celeste Lanclos; Michael W Henson; Chuankai Cheng; Eric W Getz; Shelby J Barnes; Douglas E LaRowe; Michael S Rappé; J Cameron Thrash
Journal:  mSystems       Date:  2021-06-29       Impact factor: 6.496

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.