Literature DB >> 35687595

SALARECON connects the Atlantic salmon genome to growth and feed efficiency.

Maksim Zakhartsev^1,2, Filip Rotnes^1,2, Marie Gulla^1,2, Ove Øyås^1,2, Jesse C J van Dam³, Maria Suarez-Diez³, Fabian Grammes², Róbert Anton Hafþórsson², Wout van Helvoirt², Jasper J Koehorst³, Peter J Schaap³, Yang Jin², Liv Torunn Mydland², Arne B Gjuvsland², Simen R Sandve², Vitor A P Martins Dos Santos³, Jon Olav Vik^1,2.

Abstract

Atlantic salmon (Salmo salar) is the most valuable farmed fish globally and there is much interest in optimizing its genetics and rearing conditions for growth and feed efficiency. Marine feed ingredients must be replaced to meet global demand, with challenges for fish health and sustainability. Metabolic models can address this by connecting genomes to metabolism, which converts nutrients in the feed to energy and biomass, but such models are currently not available for major aquaculture species such as salmon. We present SALARECON, a model focusing on energy, amino acid, and nucleotide metabolism that links the Atlantic salmon genome to metabolic fluxes and growth. It performs well in standardized tests and captures expected metabolic (in)capabilities. We show that it can explain observed hypoxic growth in terms of metabolic fluxes and apply it to aquaculture by simulating growth with commercial feed ingredients. Predicted limiting amino acids and feed efficiencies agree with data, and the model suggests that marine feed efficiency can be achieved by supplementing a few amino acids to plant- and insect-based feeds. SALARECON is a high-quality model that makes it possible to simulate Atlantic salmon metabolism and growth. It can be used to explain Atlantic salmon physiology and address key challenges in aquaculture such as development of sustainable feeds.

Entities: Chemical

Mesh：

Substances：
Amino Acids

Year: 2022 PMID： 35687595 PMCID： PMC9223387 DOI： 10.1371/journal.pcbi.1010194

Source DB: PubMed Journal: PLoS Comput Biol ISSN： 1553-734X Impact factor: 4.779

Introduction

Salmonid aquaculture has grown in volume and economic importance over the past several decades, and Atlantic salmon (Salmo salar) has become the world’s most valuable fish commodity [1]. This is largely thanks to selective breeding, which has improved both growth rate and feed efficiency [2]. The increase in fish farming has also increased demand for feed, and insufficient marine resources has led to a switch to plant-based ingredients [3]. This has reduced production costs and exploitation of fish stocks, but salmon are not adapted to eating plants and current plant-based feeds have a negative impact on fish health and the environment [4, 5]. Also, plant-based feeds are complex, the ingredient market is fluctuating, and feeding trials are demanding. Thus, developing feeds that minimize cost and environmental impact while providing necessary nutrients to the fish is an important challenge [6]. The metabolic network of a cell or organism converts nutrients that are present in the environment to the energy and building blocks that are required to live and grow. It consists of metabolites that are interconverted by metabolic reactions, most of which are catalyzed by enzymes that are encoded by the genome, and it can be translated to a metabolic model, which allows mathematical analysis of network functionality through methods such as flux balance analysis (FBA) [7]. Specifically, metabolic models allow prediction of growth and metabolic fluxes (steady-state reaction rates) that are linked to the genome through logical gene-protein-reaction (GPR) associations, making them promising tools for addressing challenges in aquaculture such as breeding for feed efficiency and sustainable feed development [8]. Large databases of metabolic reactions and models [9-11] and methods for metabolic network reconstruction from annotated genomes [12, 13] have made such models available for organisms ranging from microbes to animals [14]. However, there are still very few metabolic models of fish available [15-18] and none of Atlantic salmon or other important farmed fish species. Here, we present SALARECON: a metabolic model built from the Atlantic salmon genome [19] that predicts growth and metabolic fluxes. It has been manually curated to ensure flux consistency and focuses on energy, amino acid, and nucleotide metabolism. SALARECON is a high-quality model according to community-standardized tests, and it captures expected metabolic (in)capabilities such as amino acid essentiality. Using oxygen-limited growth under hypoxia as an example, we show that model predictions can explain salmon physiology in terms of metabolic fluxes that are, in turn, tied to genes and pathways. Furthermore, we demonstrate an important application for aquaculture by predicting growth-limiting amino acids and feed efficiencies for commercial feed ingredients in agreement with data.

Methods

Building the metabolic model

We manually built a draft model focusing on Atlantic salmon energy, amino acid, and nucleotide metabolism using the genome [19] with annotations from KEGG [11] and the software Insilico Discovery (Insilico Biotechnology, Stuttgart, Germany). Pathways were added or edited one by one with information about reactions obtained from databases and literature (S1 Fig). After adding or editing a pathway, the energy and redox balances and topological properties of the model, e.g. flux consistency, were checked. Based on the results from these analyses, the pathway was either kept or modified. Before final acceptance of a pathway, FBA was performed to ensure that the model was able to predict growth and metabolic fluxes. We used WoLF PSORT [20] with default settings through SAPP [21] to assign metabolites and reactions to six different compartments (cytosol, mitochondrion, inner mitochondrial membrane, extracellular environment, peroxisome, and nucleus). Exchange reactions were added to allow metabolite import (negative flux) and export (positive flux). After finishing the draft model, we converted the model to the BiGG [10] namespace and used COBRApy [22] to iteratively curate it. We added and removed metabolites, reactions, and genes, mapped genes to reactions using AutoKEGGRec [23], and added a salmon-specific biomass reaction. We also added annotations from MetaNetX [9], KEGG [11], UniProt [24], and NCBI [25]. To infer GPR associations for reactions, we mapped Atlantic salmon genes to human homologs and copied GPR associations from the most recent human model [26]. If no GPR association could be inferred for a reaction, we used an OR relation between genes mapped to that reaction. To build the biomass reaction, we estimated the fractional composition of macromolecules in 1 g dry weight biomass (gDW) from Atlantic salmon whole-body composition [27]. We mapped macromolecules to metabolites and estimated the fractional composition of amino acids in proteins and nucleoside triphosphates in nucleic acids from proteome and genome sequences [19], respectively. We finalized the model by alternating semi-automated annotation and curation with quality evaluation (as decribed below), iterating until we saw no further opportunities to improve the model without expanding its scope beyond energy, amino acid, and nucleotide metabolism. The final model was exported to Systems Biology Markup Language (SBML) format [28].

Evaluating the quality of the metabolic model

First, we compared the reaction contents in SALARECON to other models of multicellular eukaryotes available in the BiGG [10] namespace (Danio rerio [17], Mus musculus [29], Cricetulus griseus [30], Homo sapiens [26], and Phaeodactylum tricornutum [31]). Considering only intracellular metabolic reactions in compartments shared by all models (cytoplasm, mitochondrion, and peroxisome), we clustered the models based on their reaction contents using all suitable dissimilarity measures (16) and agglomerative hierarchical clustering methods (5) available through SciPy [32]. For each measure and method, we evaluated the resulting dendrogram by computing the cophenetic correlation coefficient (CCC) [33]: where x(i, j) and t(i, j) are Euclidean and dendrogrammatic distance between observations i and j, respectively, with averages and . The CCC indicates how well the dendrogram preserves pairwise dissimilarities. Second, we tested SALARECON’s consistency and annotation using the community standard MEMOTE [34] and its metabolic (in)capabilities using tasks defined for mammalian cells [35]. We adapted tasks to Atlantic salmon by moving metabolites from compartments not included in SALARECON to the cytoplasm and by modifying the expected outcomes of amino acid synthesis tests to match known essentiality [27, 36]. Third, we used the model to predict growth in the absence of individual amino acids. We allowed both uptake and secretion of all extracellular metabolites, disabled uptake of each amino acid separately, and maximized growth rate using FBA. Amino acids were classified as essential if they were required for growth and non-essential otherwise, and the predicted essentiality was compared to experimental data [27, 36]. Finally, we evaluated the ability of SALARECON to capture fish-specific metabolism by comparing metabolite uptake and secretion to the most recent human model [26]. Specifically, we enumerated minimal growth-supporting uptake and secretion sets for both models using scalable metabolic pathway analysis [37]. We required non-zero growth rate, uptake of oxygen and essential amino acids, and secretion of carbon dioxide as well as ammonia, urea, or urate. We enumerated minimal uptake sets first and then allowed uptake of all metabolites found in uptake sets before enumerating minimal secretion sets.

Analyzing oxygen-limited growth

We used parsimonious FBA (pFBA) [38] to find maximal growth rates and minimal flux distributions for 1,000 randomized conditions and 50 logarithmically spaced oxygen uptake rates in the range r ∈ (0, rmax) where r is oxygen uptake rate and rmax is the minimal oxygen uptake rate at maximal growth. For each condition, we uniformly sampled random ratios (1–100) of nutrients in a minimal feed (essential amino acids and choline) that were used as coefficients in a boundary reaction representing feed uptake. We always normalized feed uptake to the same total mass (g gDW−1 h−1) to ensure that conditions were comparable. The absolute value was selected to be large enough to ensure feed uptake was not limiting but otherwise arbitrary as only relative predictions were needed. We allowed unlimited uptake of phosphate and disabled all other uptakes as well as secretion of feed nutrients. We did not allow uptake of any other compounds than essential amino acids, choline, phosphate, and oxygen under any condition. To account for uncertainty in relative flux capacities and ensure that no single set of reactions was always growth-limiting, we also sampled random bounds for all reactions for each condition. The flux bound b of an enzymatic reaction is determined by the turnover number kcat and total enzyme concentration [E]: Approximately lognormal distributions have been observed for both kcat [39] and [E] [40], and the product of two lognormal random variables is also lognormal. We therefore sampled b from a lognormal distribution with mean 0 and standard deviation 2 for the natural logarithm of b. We kept the original reaction reversibilities and sampled bounds for reversible reactions separately for each direction. For each oxygen uptake rate, we computed mean growth rate with 95% confidence band from bootstrapping with 1,000 samples. We fit the means to experimental data [41-44] by assuming a simple piecewise linear relationship between water oxygen saturation (x) and relative oxygen uptake rate: where x0 and x1 are the oxygen saturations at which the relative growth rate is 0 and 1, respectively. We estimated x0 and x1 by least-squares fitting of where μ is growth rate, μmax is maximal growth rate when oxygen is not limiting, and f is a function that linearly interpolates the metabolic model predictions. We also fit a logistic model with asymptotes -1 and 1, where k is the logistic growth rate, and a Monod model extended with an x-intercept, where K + x0 is the saturation at which . To test the effect of random sampling on predictions and parameter estimates, we also repeated the analysis above with 100 randomly sampled feeds and default flux bounds as well as with a fish meal feed (Table 1) and default flux bounds. The fish meal feed includes non-essential amino acids that were not present in the randomly sampled minimal feeds and we also allowed unlimited uptake of the lipid precursor choline. This gave a non-zero maximal growth rate in the absence of oxygen, which we subtracted from predicted growth rates in the presence of oxygen to get purely aerobic growth rates suitable for comparison to the other growth rate predictions. We identified limiting reactions with and without random sampling of feeds and flux bounds by comparing predicted fluxes to their corresponding non-zero lower and upper flux bounds. A reaction was identified as limiting if the distance between predicted flux and one of its non-zero flux bounds was within the numerical tolerance of the solver.

Table 1

Amino acid compositions of feed ingredients.

Mass percentage of each amino acid relative to total mass of amino acids in feed ingredients used in simulations [57].

Amino acid	Fish meal	Soybean meal	Insect meal
Ala	6.82	4.43	7.05
Arg	7.19	7.54	5.34
Asn/Asp	10.02	11.87	10.07
Cys	0.93	1.74	0.62
Gln/Glu	13.98	18.74	11.12
Gly	6.88	4.19	6.67
His	2.62	2.69	3.32
Ile	4.64	4.61	4.86
Leu	7.91	8.02	7.76
Lys	8.31	6.44	6.19
Met	3.07	1.45	2.06
Phe	4.29	5.22	4.31
Pro	4.45	5.08	6.39
Ser	4.29	4.13	4.71
Thr	4.57	3.67	4.29
Trp	1.13	1.58	1.61
Tyr	3.40	3.60	6.85
Val	5.48	5.00	6.79

Amino acid compositions of feed ingredients.

Mass percentage of each amino acid relative to total mass of amino acids in feed ingredients used in simulations [57]. To identify reaction contributions to oxygen-limited growth, we took the absolute value of the pFBA fluxes (with randomly sampled feeds and flux bounds), normalized each flux by its maximum value within each condition, and used Ward’s minimum variance method to cluster the resulting absolute relative fluxes by Euclidean distance. We mapped reactions from the top eight clusters to genes and used g:Profiler [45] to identify enriched pathways from KEGG [11]. We used the genes in the model as background, considered pathways with adjusted p ≤ 0.05 to be enriched, and discarded pathways outside the model’s scope (xenobiotics and drug metabolism).

Predicting growth-limiting amino acids in feeds

We obtained ratios of amino acids in three commercial feed ingredients: fish, soybean, and black soldier fly larvae meal (Table 1). For each feed, these ratios were used as coefficients for amino acids in a boundary reaction representing feed consumption. Mass was divided equally between amino acids that were combined in the feed formulation (Asn/Asp and Gln/Glu). For each feed ingredient, we deactivated import of amino acids via other boundary reactions, fixed the growth rate to the same value (arbitrary, as we were interested in generated biomass relative to consumed feed), and normalized feed uptake to the same total mass (g gDW−1 h−1) before minimizing feed uptake flux. To simulate growth limitations from protein synthesis rather than energy generation, we also allowed unlimited uptake of glucose. This is supported by evidence that reducing feed amino acid levels has a negative effect on feed intake regardless of dietary energy level [46]. We multiplied molecular mass with reduced cost in the optimal solution for each amino acid exchange reaction and identified the one with largest negative value as limiting [47]. To supplement the feed with the limiting amino acid, we set the bounds of its exchange reaction to only allow import, and we penalized supplementation by adding the exchange reaction to the objective with coefficient equal to molecular mass (S2 Fig). We repeated the steps above until all limiting amino acids had been found for each feed.

Results

We built a metabolic model of Atlantic salmon (SALARECON) from its genome [19], metabolic reaction and model databases, and literature (Fig 1). The model focuses on energy, amino acid, and nucleotide metabolism and covers 1,133 genes, which amounts to 2% of the 47,329 annotated genes in the genome and 50% of the 2,281 Atlantic salmon genes that are associated with metabolic reactions in KEGG [11]. The genes are mapped through gene-protein-reaction (GPR) associations to a metabolic network of 718 reactions and 530 metabolites (Fig 2a) with node degree distributions that are typical for metabolic and other biological networks [48] (S3 Fig). Reactions and metabolites are divided between six compartments: cytosol, mitochondrion, inner mitochondrial membrane, extracellular environment, peroxisome, and nucleus (Fig 2b). The compartments are connected by 175 transport reactions that allow metabolite exchange through the cytosol, and 86 boundary reactions allow metabolites to move in and out of the system through the extracellular environment. There are 357 unique metabolites when those occurring in multiple compartments are counted once. A salmon-specific biomass reaction based on whole-body composition [27] allows growth rate prediction by accounting for production of the proteins, lipids, carbohydrates and nucleic acids that consitute biomass from metabolites supplied by the metabolic network (Fig 2c).

Fig 1

Model construction.

SALARECON was built from the annotated Atlantic salmon genome, metabolic reaction and model databases, and literature. The procedure involved (1) manual metabolic network reconstruction using Insilico Discovery (Insilico Biotechnology, Stuttgart, Germany), (2) semi-automated annotation and curation using COBRApy [22], and (3) quality evaluation using the standardized metabolic model testing tool MEMOTE [34] and metabolic tasks [35]. Steps 2 and 3 were iterated until quality criteria were satisfied. Illustration of metabolic tasks from Richelle et al. [35].

Fig 2

Model contents.

Model construction.

Model contents.

(a) SALARECON contains 1,133 genes (2% of all genes and 50% of Atlantic salmon genes mapped to reactions in KEGG [11]), 718 reactions (175 transporting metabolites between compartments and 86 exchanging metabolites with the extracellular environment), and 530 metabolites (357 when metabolites occuring in multiple compartments are only counted once). (b) Metabolites and reactions are divided between five compartments (mitochondrion includes the inner mitochondrial membrane). Transport reactions are counted multiple times (once for each compartment of exhanged metabolites). Boundary reactions in cytosol are sink or demand reactions [12]. The inset shows how many unique metabolites can be transported between the cytosol and the other compartments (indicated by their initials). (c) Biomass composition of Atlantic salmon estimated from measured whole-body composition [27]. The inset summarizes each class of macromolecules. Carbohydrates and lipids are represented by glycogen and phosphatidylcholine (PC), respectively. ATP serves both as energy for protein synthesis and as a building block in RNA synthesis. To investigate whether SALARECON is likely to be an accurate representation of Atlantic salmon metabolism, we first compared it to the only existing high-quality metabolic model of a fish as well as all models of multicellular eukaryotes currently available in the BiGG database [10] (Fig 3a, S4 and S5 Figs). Specifically, we hierarchically clustered the reaction contents of SALARECON and models of zebrafish (Danio rerio) [17], mouse (Mus musculus) [29], chinese hamster ovary (CHO; Cricetulus griseus) [30], human (Homo sapiens) [26], and the diatom Phaeodactylum tricornutum [31]. We combined 16 different dissimilarity measures with five different methods for agglomerative hierarchical clustering, and we evaluated the agreement between the resulting dendrograms and the underlying dissimilarities using the cophenetic correlation coefficient (CCC) [33] (S4 Fig). Across measures and methods, we found that models tended to cluster by phylogeny with fish and, to some extent, mammals forming distinct clusters and the diatom being an outlier. As shown in S4 Fig, salmon and zebrafish formed a cluster in 57/80 trees (71%), the mammals formed a cluster in 36/80 trees (45%), and the diatom was an outlier in 69/80 trees (86%). This is largely consistent with the hypothesis that the models capture organism-specific metabolism, suggesting that SALARECON captures salmon- or at least fish-specific metabolism. However, there are also significant discrepancies between trees built with different measures and methods, and it is important to note that the clusters likely reflect large differences in model scope as well as organism specificity (S5 Fig). Fig 3a shows the tree obtained for Jaccard distance, which is the most common metric for measuring metabolic model dissimilarity [49], with the “average” method (CCC = 0.95).

Fig 3

Model quality evaluation.

Model quality evaluation.

(a) Hierarchical clustering of SALARECON and metabolic models of other multicellular eukaryotes based on Jaccard distance between reaction contents and the “average” method. Atlantic salmon (Salmo salar) is closer to zebrafish (Danio rerio) [17] than mouse (Mus musculus) [29], chinese hamster ovary (CHO; Cricetulus griseus) [30], human (Homo sapiens) [26], and the diatom Phaeodactylum tricornutum [31]. (b) Model score and subscores from MEMOTE [34]. Subscores evaluate Systems Biology Ontology (SBO) annotation, model consistency, and database mappings for metabolites, reactions, and genes. (c) Ability of SALARECON to perform metabolic tasks [35]. Tasks are grouped by metabolic system and classified as successful if model predictions reflected expected metabolic (in)capabilities. (d) Essential amino acids predicted by SALARECON match observations [27, 36]. (e) Minimal growth-supporting sets of metabolite uptakes and secretions for RECON3D [26] and SALARECON. Arrows indicate uptake and secretion. Metabolites that are only used or produced by human or salmon are indicated by blue and red, respectively, and metabolites that are used or produced by both are indicated in purple. Uptake of oxygen and essential amino acids was required, as well as secretion of carbon dioxide and ammonia, urea, or urate. Ammonia secretion can be replaced by urea or urate secretion in human but not in salmon. The number of alternative metabolites is given in parentheses where applicable. SALARECON performed well in community-standardized MEMOTE tests [34], which evaluate model consistency and annotation (Fig 3b). It achieved an overall MEMOTE score of 96% (best possible score is 100%) with subscores of 100% for Systems Biology Ontology (SBO) annotation, 98% for model consistency, 94% for metabolite annotation, 87% for reaction annotation, and 71% for gene annotation. We also evaluated the ability of SALARECON to perform 210 metabolic tasks grouped into seven metabolic systems (Fig 3c) and 73 metabolic subsystems (S6 Fig). These tasks were originally defined for mammalian cells [35] but we changed the expected outcomes of amino acid synthesis tests to match known essentiality in Atlantic salmon [27, 36]. SALARECON correctly captured all expected metabolic (in)capabilities for the three metabolic systems within the scope of the model (energy, amino acid, and nucleotide metabolism). It also succeeded in 44% of vitamin and cofactor tasks, 43% of carbohydrate tasks, and 15% of lipid tasks, reflecting the fact that these parts of metabolism are simplified in the model. The only system completely outside the scope of SALARECON was glycan metabolism, in which no tasks were successfully performed. In total, SALARECON succeeded in 66% of all metabolic tasks, notably all tasks related to amino acid essentiality (Fig 3d). Finally, to test the ability of SALARECON to capture basic fish physiology, we compared it to one of the latest human models, RECON3D [26], by computing minimal sets of metabolite uptakes and secretions that allow growth [37] (Fig 3e). In addition to oxygen and essential amino acids, SALARECON required uptake of choline, a lipid precursor, and phosphate, an essential nutrient for fish that is supplemented in salmon feeds [50]. The only secretions needed to support growth were carbon dioxide and ammonia. Notably, we found that secretion of urea was also possible, but not sufficient to support growth without secretion of ammonia. In line with this, ammonia is the major nitrogenous waste product in fish and urea is a comparatively minor contributor [51]. RECON3D is much larger than SALARECON and therefore allowed for a wider range of lipid precursors (27 options). It also required secretion of a carboxylic acid (11 options) and a lipid byproduct (132 options) in addition to carbon dioxide and a nitrogenous waste product. Urea is the major nitrogenous waste in mammals, but RECON3D could grow while secreting only ammonia or urate. In general, RECON3D captures a much larger space of possible growth-associated metabolic activities than SALARECON due to the large difference in model scope (S5 Fig). However, SALARECON specifically captures the key metabolic activities of a fish. In our first application of SALARECON, we predicted oxygen-limited growth rates under hypoxia on a minimal feed containing essential amino acids and choline, using random sampling to account for uncertainty in feed nutrient ratios and flux capacities (Fig 4a and S7 Fig). Supporting the hypothesis that SALARECON captures fish-specific metabolism, we found that the major secretion products across all oxygen levels and sampled conditions were CO2 and NH3 (S8 Fig). Urea had the third highest secretion flux but this was much smaller than the secretion flux for NH3, and the secretion fluxes of all other secreted metabolites combined was vanishingly small. Assuming that relative oxygen uptake rate is a linear function of water oxygen saturation (percent air saturation), we fit our predictions to experimental data [41-44] along with a logistic model and an extended Monod model (Fig 4b). The choice of a linear model for the metabolic fit was motivated by the fact that diffusive oxygen uptake in fish gills is governed by Fick’s law and therefore proportional to the oxygen gradient [52]. Also, replacing the linear model by a Michaelis-Menten model would make the metabolic and Monod fits virtually identical because the Monod and Michaelis-Menten equations have the same form. We found that the metabolic, logistic, and extended Monod models fit the data about equally well (R2 ≈ 0.6) but they differed in their parameter estimates (Fig 4b). All the models estimated the minimal oxygen saturation required for growth, but the logistic estimate was low with high standard error (x0 = 0.11 ± 0.16) and the Monod fit was high with low standard error (x0 = 0.45 ± 0.04). The metabolic model gave an intermediate estimate and standard error (x0 = 0.31 ± 0.10), and it also allowed estimation of the minimal oxygen saturation required for maximal growth (x1 = 1.37 ± 0.15). The metabolic fit was closer than the two other fits to the expected relationship between water oxygen saturation and growth rate [52], both in terms of the shape of the fitted curve and the estimated parameter values. The SALARECON estimates were within one and two standard errors, respectively, of the values x0 ≈ 0.3 and x1 ≤ 1.2 suggested by Thorarensen et al. [52]. The logistic estimate was within two standard errors of the suggested x0, but this confidence interval also included zero.

Fig 4

Oxygen-limited growth analysis.

Oxygen-limited growth analysis.

(a) SALARECON predictions of relative growth rate under oxygen limitation as a function of relative oxygen uptake rate. Feed composition and flux capacities were randomized 1,000 times (light blue) and the mean across conditions is shown with 95% confidence band from bootstrapping with 1,000 samples (dark blue). (b) Metabolic, logistic, and extended Monod model fits to experimental data from Berg and Danielsberg [41] (circles), Bergheim et al. [42] (triangles), Hosfeld et al. [43] (squares), and Hosfeld et al. [44] (diamonds). SALARECON predictions were fit by assuming a linear relationship between relative oxygen uptake rate and water oxygen saturation. (c) Coefficient of determination (R2), minimal oxygen saturation required for growth (x0), and minimal oxygen saturation required for maximal growth (x1) from fitted models. Error bars indicate two standard errors of the estimates. (d) Minimal flux distributions for metabolic model predictions from parsimonious flux balance analysis (pFBA) [38]. Rows are reactions, columns are flux distributions sorted by relative oxygen uptake rate, and each cell shows absolute flux normalized by maximum value for each condition. Rows are clustered by Euclidean distance using Ward’s minimum variance method and divided into eight clusters indicated by colors. (e) Mean absolute relative flux with 95% confidence bands from bootstrapping with 1,000 samples for the eight clusters. Relative growth rate is indicated by a dashed line. (f) Enrichment of metabolic pathways from KEGG [11] for the eight clusters with size reflecting the fraction of genes in each pathway that are found in a cluster (recall). Repeating our oxygen-limited growth analysis with default rather than randomly sampled flux bounds, we found similar growth predictions and parameter estimates for 100 randomly sampled feeds as well as a feed based on fish meal (S9 Fig). However, random sampling of feeds allowed us to account for a much larger selection of potentially limiting reactions, showing that our results were robust to uncertainty in flux capacities as well as feed compositions (S10 Fig). With randomly sampled feeds and bounds, 310 different reactions were limiting in at least one solution, compared to 25 with the default bounds (for both randomly sampled and fish meal feeds). Growth was always limited by the flux capacities of internal reactions (and oxygen uptake) rather than by the feed uptake reaction. In contrast to the simple growth models, SALARECON is mechanistic and makes it possible to explain predictions in terms of metabolic fluxes (Fig 4d). Assuming that organisms have generally evolved to grow as efficiently as possible, we used parsimonius flux balance analysis (pFBA) [38] to minimize overall flux through the metabolic network while requiring maximal growth rate for each randomly sampled condition and oxygen level. We identified eight clusters of reactions whose pFBA fluxes made distinct contributions to oxygen-limited growth (Fig 4e). Connecting clusters to the Atlantic salmon genome and databases through GPR associations and their annotation, we found enriched metabolic pathways among the genes associated with each cluster (Fig 4f). In one cluster, fluxes were perfectly correlated with relative growth rate, indicating that they contained reactions that were always necessary for growth. Indeed, this cluster was enriched in lipid metabolism, which directly produces a biomass precursor, and pathways related to NAD(P)H metabolism. The fluxes of two other clusters both increased rapidly at the very lowest oxygen levels before plateauing at higher oxygen levels, in one case decreasing slightly after the initial increase. These clusters were enriched in pathways such as the tricarboxylic acid (TCA) cycle, glycolysis, oxidative phosphorylation, pyruvate, and thiamine metabolism, indicating that energy generation from glucose was maximized at low oxygen levels while other energy-generating pathways were activated at higher oxygen levels. Four of the five remaining clusters increased slightly less than the clusters enriched in energy generation from glucose at low oxygen levels but kept increasing at higher oxygen levels. These clusters were enriched in pathways related to metabolism of fatty acids and amino acids, suggesting that these compounds become important energy sources after saturation of glucose catabolism at low oxygen levels. Nitrogen metabolism, which includes amino acid biosynthesis and disposal of nitrogenous waste products, was also overrepresented. The final cluster consisted of reactions with no or very little flux, even at the highest oxygen levels, and was enriched in metabolism of pyrimidines, β-alanine, and essential amino acids. Finally, to demonstrate the potential of SALARECON to address key challenges in aquaculture, we used it to predict growth-limiting amino acids and feed efficiencies for three commercial feed ingredients: fish, soybean, and insect meal (Table 1 and Fig 5a). For each feed ingredient, we iteratively identified and supplemented the most limiting amino acid until all amino acid limitations had been lifted, computing feed efficiency at each iteration (S2 Fig). Comparing predicted limiting amino acids in fish meal to soybean and insect meal, we found that lysine and threonine were more limiting in both soybean and insect meal, methionine was more limiting in soybean meal, and arginine was more limiting in insect meal (Fig 5a and 5c, and S11 Fig). The feed efficiency predictions suggest that the baseline feed efficiency of fish meal can be achieved by supplementing one and three amino acids for soybean and insect meal, respectively (Fig 5d). For soybean meal, major increases in feed efficiency were predicted for lysine, threonine, and methionine supplementation, while lysine had the largest impact on insect meal (S11 Fig). The predictions from SALARECON agree well with expected baseline feed efficiencies [53, 54] as well as reports that lysine, methionine, threonine, and arginine are more limiting in plant-based feeds than in marine feeds [55, 56].

Fig 5

Growth-limiting amino acids in commercial feed ingredients.

(a) Amino acid composition of SALARECON biomass, fish meal, soybean meal, and insect meal [57]. (b) Order of amino acid limitations in feed ingredients based on soybean and fish meal. Amino acids that are closer to the top left and bottom right corners were more limiting in soybean meal and fish meal, respectively, as indicated by size. (c) Order of amino acid limitations in feed ingredients based on insect and fish meal. Amino acids that are closer to the top left and bottom right corners were more limiting in insect meal and fish meal, respectively, as indicated by size and color. (d) Feed efficiency after successive supplementation of the most limiting amino acid for fish, soybean, and insect meal. The baseline feed efficiency of fish meal is indicated by a dashed blue line, and ranges observed by Kolstad et al. [53] and Dvergedal et al. [54] are highlighted in gray.

Growth-limiting amino acids in commercial feed ingredients.

Discussion

SALARECON is the first metabolic model of a production animal, bridging the gap between production and systems biology and initiating a framework for adapting Atlantic salmon breeding and nutrition strategies to modern feeds. By explicitly representing connections between metabolites, reactions, and genes, it connects the genome to metabolism and growth in a way that can be tuned to specific genetic and environmental contexts by integration of domain knowledge and experimental data [8]. Thus, SALARECON forms a transdiciplinary framework for diverse disciplines and data sets involved in Atlantic salmon research and aquaculture. Tools developed for constraint-based modeling of microbes and well-studied plants and animals can now be applied in production biology, providing a sharper lens through which to interpret omics data by requiring consistency with flux balances and other known constraints. This enables clearer analysis than classical multivariate statistics, which does not incorporate such mechanistic knowledge. Although laborious and time-consuming, our bottom-up manual reconstruction of the Atlantic salmon metabolic network was necessary to make SALARECON a high-quality predictive model. Automatically built models work well for microbes but are still outperformed by models that are built by manual iteration, and reconstruction of eukaryotes is more challenging due to larger genomes, less knowledge, and compartmentalization [12, 13]. However, semi-automated annotation and curation combined with automated MEMOTE tests [34] and metabolic tasks [35] allowed faster iteration, and future reconstructions of related species [58] can benefit from our efforts by using SALARECON as a template. MEMOTE and metabolic tasks were instrumental in the development of SALARECON, and we highly recomend integrating testing in model development. Tests help catch mistakes that arise when modifying a model and do triple duty by specifying what it should be capable of, identifying broken functionality, and forming a basis for comparison with other models, e.g. new versions or models of different tissues or species. Clearly formulated tests also make the model more accessible to non-modelers, speaking the same language as nutritionists or physiologists. Such experts can point out missing or ill-formulated tests, which in turn contribute to improvement. We have strived to make SALARECON an accurate model of Atlantic salmon metabolism and growth, but it does not aim to capture salmon physiology exhaustively or perfectly. It covers 2% of the genes in the genome, which amounts to 50% of Atlantic salmon genes mapped to reactions in KEGG [11], and its focus is on core metabolism generating energy and biomass. This covers pathways that connect feed to fillet, which is a primary focus of research and aquaculture, but obviously excludes many other interesting processes such as synthesis of long-chain polyunsaturated fatty acids. Still, SALARECON performs very well according to all of our metrics: it is more similar to the latest zebrafish model [17] than to any other multicellular eukaryote for which a model is available in BiGG [26, 29–31], achieves a MEMOTE score of 96%, which is better than all models in BiGG [10] (although many BiGG models could presumably be annotated and curated to reach a comparable score with reasonable effort), and performs all metabolic tasks within the scope of the model (amino acid, nucleotide, and energy metabolism). It also correctly classifies amino acids as essential [27, 36] and captures basic fish physiology, e.g. aerobic growth with uptake of essential amino acids, choline, and phosphate, and secretion of carbon dioxide and ammonia. The extensive annotation of genes, metabolites, and reactions is a key strength of SALARECON that facilitates use with existing models, tools, and data. In particular, identifiers from BiGG [10] make it easy to compare and combine SALARECON with state-of-the-art models [59, 60], e.g. to predict interactions between Atlantic salmon and its gut microbiota. It also allows direct application of implemented methods such as evaluation of metabolic tasks [35]. The salmon-specific biomass reaction enables prediction of growth and related fluxes and is based on organism-specific data [27], making SALARECON a more realistic representation of salmon metabolism than a network reconstruction [12]. As demonstrated for Atlantic cod [18], even getting to this stage is challenging for non-model animals. Our analysis of growth under oxygen limitation shows that phenotypes predicted by SALARECON can be fit to experimental data and produce detailed mechanistic explanations of Atlantic salmon physiology. Specifically, SALARECON explained hypoxic metabolism and growth in terms of metabolic fluxes with implications for fish welfare and productivity in aquaculture. The growth predictions depend on unknown environmental conditions and flux capacities, but SALARECON can be used to account for such uncertainty through random sampling. Average growth predictions from SALARECON fit the available data [41-44] as well as simple growth models and gave accurate estimates of critical water oxygen saturations in agreement with observations [52]. The predicted metabolic fluxes defined clusters of reactions with distinct pathway enrichments and contributions to hypoxic growth, notably suggesting that energy generation from glucose becomes saturated at low oxygen levels and that amino and fatty acids become more important energy sources with increasing oxygen. Predictions contrasting growth-limiting amino acids in three commercial feed ingredients also agreed well with data [55, 56] and showed that SALARECON can be used to evaluate the efficiency of sustainable feeds, a key challenge for modern aquaculture. Feed efficiencies predicted by SALARECON lie within reported ranges [53, 54] and suggest that the feed efficiency of fish meal can be achieved by supplementing one amino acid for insect meal and three for soybean meal. This shows that SALARECON can be used to evaluate both current and novel feeds, potentially reducing the need for expensive fish experiments in vitro or in vivo. In future work, we will expand SALARECON to cover more processes such as lipid and carbohydrate metabolism in full detail, and we will tailor it to gut, liver, muscle, and other tissues using omics data and metabolic tasks [35]. We will also leverage automated metabolic reconstruction tools for microbes to build models of the Atlantic salmon gut microbiota [59]. By coupling tissue-specific models to each other and to gut microbiota models, we can make detailed and partially dynamic whole-body models [61]. This would be a major leap from available dynamic models [62] and provide a mechanistic alternative to state-of-the-art bioenergetics models [63], opening up new possibilities for understanding fish physiology and rational engineering of feeds, conditions, and genetics.

Conclusion

SALARECON covers half of the annotated metabolic genes in the Atlantic salmon genome and can predict metabolic fluxes and growth with a salmon-specific biomass reaction. It has been extensively annotated, curated, and evaluated, and it can be used to tackle research questions from fish physiology to aquaculture. In particular, SALARECON is a promising new tool for predicting breeding strategies and novel feeds that optimize for production parameters such as feed efficiency and impact on fish health and environment. Future work will expand SALARECON and integrate it with omics data to make tissue-specific and partially dynamic whole-body models. SALARECON should facilitate systems biology studies of Atlantic salmon and other salmonids, and we hope that it will be widely used by modelers as well as biologists.

Draft model construction.

Flowchart showing the procedure used to add new pathways to the draft model or edit pathways already in the draft model. Pathways were added or edited one by one with information about reactions obtained from databases and literature. After adding or editing a pathway, the energy and redox balances and topological properties of the model, e.g. flux consistency, were checked. Based on the results from these analyses, the pathway was either kept or modified. Before final acceptance of a pathway, FBA was performed to ensure that the model was able to predict growth and metabolic fluxes. (TIFF) Click here for additional data file.

Adding nutritional supplements to a feed uptake reaction.

Feed uptake reactions are similar to biomass reactions, but supply metabolites rather than consuming them. The ratios between feed components are represented stoichiometrically, and scaled to sum to 1 g feed per mol uptake, so that one gram of the feed in the figure is equivalent to 2 mol A, 3 mol B and 1.4 mol C. With a fixed growth rate, the minimization of feed uptake is used as the objective of FBA. Surplus of metabolites in the feed uptake reactions are allowed to be exported via exchange reactions to avoid blocking the feed uptake reaction. Limiting metabolites can be identified from the reduced costs of the FBA solution. To avoid large molecules being favored, the reduced cost should be multiplied by the molecular mass (M) of the metabolite. Other factors such as price, CO2 equivalents, or environmental cost could be taken into account in this step. The boundaries of the limiting exchange reaction are reversed to allow uptake, and the reaction is scaled by molecular mass and added to the objective. In this case, the cost of supplements is assumed to be equivalent to mass, but the cost could also be set to be higher than the other feed ingredients, which could be more realistic. (TIFF) Click here for additional data file.

Model degree distributions.

(a) Distribution of number of metabolites converted by reactions. Boundary reactions exchange one metabolite with the extracellular environment and transport reactions usually exchange an even number of metabolites between compartments. (b) Distribution of number of genes associated with reactions. Transport and boundary reactions lack annotation and are not associated with any genes. Most metabolic reactions (95%) are associated with one or more genes. (c) Cumulative distribution of number of reactions associated with genes and metabolites (number of genes or metabolites associated with k or more reactions for all k). Most genes and metabolites are associated with a few reactions but some metabolites are highly connected hubs. Power law fits are shown for genes and metabolites. (TIFF) Click here for additional data file.

Dendrograms for models of multicellular eukaryotes.

Dendrograms from agglomerative hierarchical clustering of reaction contents of metabolic models of Salmo salar (SS), Danio rerio (DR) [17], Mus musculus (MM) [29], Cricetulus griseus (CG) [30], Homo sapiens (HS) [26], and Phaeodactylum tricornutum (PT) [31]. We combined 16 different dissimilarity measures with five different clustering methods and computed the cophenetic correlation coefficient (CCC) [33] for each measure and method. SS and DR are highlighted in red. (TIFF) Click here for additional data file.

Reaction contents of models of multicellular eukaryotes.

Reaction contents of metabolic models of Salmo salar, Danio rerio [17], Mus musculus [29], Cricetulus griseus [30], Homo sapiens [26], and Phaeodactylum tricornutum [31]. Each row is an organism, each column is a reaction, and a dark cell indicates a reaction that is found in the model of that organism. Rows are clustered by Jaccard distance using the “average” method and the number of reactions is given for each organism. (TIFF) Click here for additional data file.

Metabolic task results by subsystem.

Ability of SALARECON to perform metabolic tasks [35]. Tasks are grouped by metabolic subsystem and classified as successful if model predictions reflected expected metabolic (in)capabilities. (TIFF) Click here for additional data file.

Conditions and growth rates from oxygen-limited growth analysis.

(a) Feed coefficients of amino acids and choline in conditions used to predict oxygen-limited growth (1,000 samples). The coefficients were randomly sampled from a uniform distribution. (b) Pairwise Pearson correlations between metabolites of feed coefficents shown in a. (c) Flux bounds for conditions used to predict oxygen-limited growth (1,000 samples). Flux bounds were randomly sampled from a lognormal distribution. (d) Pairwise Pearson correlations of flux bounds shown in a. (e) Predicted absolute growth rates as a function of absolute oxygen uptake rates for the 1,000 randomly sampled conditions. The absolute growth rates were not intended to be realistic and only relative growth rates were used in the analysis (normalized by maximum growth rate without oxygen limitation). (TIFF) Click here for additional data file.

Secreted metabolites in oxygen-limited growth analysis.

Secretion flux relative to growth rate from oxygen-limited growth simulations. Fluxes are shown for CO2, NH3, urea, and all other secreted metabolites combined. Mean relative flux across 1,000 randomly sampled conditions is shown with 95% confidence bands from bootstrapping with 1,000 samples. (TIFF) Click here for additional data file.

Effect of sampling on results from oxygen-limited growth analysis.

Results from oxygen-limited growth analysis with (a–c) 1,000 randomly sampled feeds and flux bounds, (d–f) 100 randomly sampled feeds with default flux bounds, and (g–i) a fish meal feed (Table 1) with default flux bounds. See legend for Fig 4a–4c. (TIFF) Click here for additional data file.

Effect of sampling on limiting reactions in oxygen-limited growth analysis.

Limiting reactions in oxygen-limited growth analysis with (a) 1,000 randomly sampled feeds and flux bounds, (b) 100 randomly sampled feeds with default flux bounds, and (c) fish meal feed (Table 1) with default flux bounds. Rows are reactions, columns are flux distributions sorted by condition and relative oxygen uptake rate, and a dark cell indicates that a reaction is limiting in a solution (i.e. has flux equal to a non-zero flux bound). Rows are clustered by Euclidean distance using Ward’s minimum variance method. (TIFF) Click here for additional data file. Feed efficiency as a function of number of supplemented amino acids, measured in mg feed ingredient and supplemented amino acids consumed / gDW biomass produced for (a) fish meal, (b) soybean meal, and (c) black soldier fly larvae meal. Amino acids are indicated by color and ordered from most limiting (left) to least limiting (right). Each bar represents the fed amount of amino acid sources, with one amino acid supplemented per step towards the right. Limiting amino acids were supplemented until all feed protein had been replaced. (TIFF) Click here for additional data file. 19 Nov 2021 Dear Dr. Øyås, Thank you very much for submitting your manuscript "SALARECON connects the Atlantic salmon genome to growth and feed efficiency" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. All reviewers unanimously raised concerns about the lack of sufficient details and quite critical about the potential physiological concerns existing in the model. Thus, it would be absolutely crucial to improve the clarity of the manuscript and provide the requested details including the results of simulations asked by reviewers. Furthermore, it is important that the model and the process of curation is available to everyone using mentioned version control according to the best practices in the community. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts. Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Aleksej Zelezniak Guest Editor PLOS Computational Biology Kiran Patil Deputy Editor PLOS Computational Biology *********************** All reviewers unanimously report the lack of sufficient details and quite critical concerns about the model, the manuscript cannot proceed to publication unless all comments are sufficiently addressed and supported. Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Salmon is indeed a commercially very valuable fish in the market and it has high global demand. Therefore, in order to meet the global demand, it is produced at a higher scale. However, the challenge is to get the feed correct (similar to the level of marine feed) so that good fish quality is maintained. In this work, the authors have developed the genome-scale metabolic model of salmon fish and then used the model as a tool to examine a different type of commercial fish feed. The model predictions with regards to different feed and supplementation of few amino acids are in line with other experimental observations. Furthermore, the model is capable of mimicking hypoxic growth. Overall, the procedures followed to obtain the model are standard. I believe that the model will be used as a valuable tool to study salmon metabolism and growth. In my view, the manuscript (the main text) is written very well but the quality of the figures is very poor. In most of the figures, I cannot read what is written in x and y-axis labels. Other texts on the figures are unclear, therefore, I am afraid, it is difficult to judge the quality of the paper. I am happy to go through the paper once the good-quality figures are provided. Other than figures (results), which need additional review, I have few more comments. Line 47-50: Authors mention that due to lack of information, GPR is converted to OR type for all reactions. Is it normal to do so? I believe such detailed experimental information is not available in zebrafish as well as other higher organisms, so how do those models keep GPR relations? Analysis of oxygen-limited growth is not clear to me. I did not understand what is 100 randomized conditions? Is it 100 randomly chosen nutrients from the list of all exchange metabolites? Or do you have a media setup and then you chose the reaction bound randomly 100 times? In minimal feed, authors mention amino acids and choline. Which amino acids are essential here? Line 117: Authors minimize the feed uptake. In my opinion, it needs a more detailed explanation. Is it one of the objective function reactions where all feed constituents are lumped into a feed reaction, similar to one in the biomass? Or do you minimize uptake reactions one by one iteratively while keeping the rest constant? I did not fully understand this part. Line 129: I am surprised to see this low percentage of metabolic gene coverage in the model. Only 48% of kegg metabolic genes are covered in the model. Why are the rest 52% of metabolic genes from kegg not included? I think this is a very big percent of genes to ignore -- and they are mostly metabolic genes as given in the KEGG. I am not saying all kegg metabolic genes should be included but leaving 52 % is too much I think. I think this low coverage of metabolic genes is the main reason why the model achieved only 53% MEMOTE score. If the aim is to understand the specific tasks so, in that case, can we not say that the reconstructed model is a core model considering only central pathways/reactions? Reviewer #2: The authors have 1) reconstructed a metabolic model of Atlantic salmon, 2) conducted a number of tests of the model both standardized and custom, the later including its Jaccard distance to other models; essentiality of amino acids; relative growth rate with randomly sampled nutrient uptake and flux capacities, 3) used the model to evaluate the effect of amino acid supplements to different feeds. While new metabolic models are always useful and needed, the paper seems too limited in biological insight and in motivation of its methodology, as detailed in the following specific comments. Major comments The authors interpret the Jaccard distance as phylogenic distance. However, other factors may also influence the metric, e.g. the size or focus of the reconstruction, in figure S2 it is reported that the different models span a range of 456-3554 reactions. The model constitutes a unique opportunity to compare Salmon metabolism to that of other organisms, but no fish specific biological conclusions seem to be drawn, e.g. differences in nitrogen excretion or oxygen utilization. It is not clear how the results would differ if a model from another organism was used, assuming that the biomass equation was amended. Also differences in arginine metabolism between fish and mammals would be interesting to discuss, arginine is only considered a conditionally essential amino acid in human. The reference cited for its essentiality in salmon does not seem to be the original source of the claim. The method to randomly sample input and flux constraints appears to be novel since no references are given, but it is not thoroughly motivated, described, or analyzed, e.g. what would the effect be of only sampling inputs or constraints? Is it reasonable to assume that only essential amino acids are provided and that they are uniformly distributed? How is the sampling of constraints motivated? How is this approach expected to fair with respect to well-known biases from sampling of highly interdependent fluxes (Heinonen 2019). The results of this method would require careful study, to rule out unintended effects of the imposed constraints, e.g. in the analysis of the flux distributions the flux through glycolysis is implicitly assumed to be energy-generating, however, since uptake of glucose is blocked, and glycogen is in the biomass equation, the flux is likely reversed (gluconeogenesis). The manuscript mentions blockage of nutrient secretion of feed nutrients, does this mean that the constraint only applies to the amino acids, what would be the consequence of not doing so? A set of fitting-parameters and relative rates are used to fit the model to experimental data, this, however, seems unnecessary, since the minimum oxygen uptake should follow from the oxygen uptake required for maintenance ATP expenditure and the maximum oxygen uptake should depend on the maximum specific growth rate together with the growth associated ATP expenditure, i.e. more explanatory parameters that could be estimated from the data. Fish meal is used as reference in Figure 5, but presumably a similar conclusion could more easily be drawn by comparing the optimal amino acid composition identified by the model to the to the composition in the feed. Since only essential amino acids are considered, the interconversion capacity of the model will likely not be utilized, and the optimal composition is therefore presumably highly similar to the amino acid composition in biomass, in particular since unlimited glucose is supplied so that the possibility for ATP constrained growth is neglected. This, btw, does seem to be a reasonable assumption under optimized growth conditions (El-Mowafi 2010). On a more general level the feed optimization assumes no effects of gut microbiota or obligate losses, which perhaps should be discussed. Minor comments There seems to be no reason to use the arbitrary specific growth rate of 1 h-1, which is unphysiologically high. Similar to elsewhere in the manuscript yields, gdw/gdw/mmol O2 could be reported. Figure 4d could be improved for interpretability. How sensitive are the clusters to the 100 random samples, if 1000 samples were used, would the figure look the same? Due to differences in specific activity between enzymes, minimized flux is not the same as minimal enzymatic cost (line 196). On line 129 it is stated that the model focuses on energy metabolism, but on line 279 it is stated that it focuses on amino acid, nucleotide, and energy metabolism. This a more accurate description, since most of the results relate to amino acids. It is unusual to see the Monod model with an intercept term as in figure 4b. References Markus Heinonen, Maria Osmala, Henrik Mannerström, Janne Wallenius, Samuel Kaski, Juho Rousu, Harri Lähdesmäki, Bayesian metabolic flux analysis reveals intracellular flux couplings, Bioinformatics, Volume 35, Issue 14, July 2019, Pages i548–i557 El-Mowafi, A., Ruohonen, K., Hevrøy, E.M. and Espe, M. (2010), Impact of digestible energy levels at three different dietary amino acid levels on growth performance and protein accretion in Atlantic salmon. Aquaculture Research, 41: 373-384. Reviewer #3: The authors present a metabolic model of atlantic salmon. They compare the model to genome-scale models of other organisms, and use the model for looking into areas of metabolism that change under varying feed/oxygen uptakes, and detecting specific amino acids that are limiting in different feed sources. Overall the paper is well written and is a good foundation stone for salmon modeling work to come. I commend the authors for using several state of the art practices in model development, such as unit testing and memote validation. However, there are a few major points that should be addressed in the manuscript and model before publication. Major comments: * L37: The description of how the draft model was generated is extremely brief; a more thorough explanation of the methods used is required, especially considering that a) the most important result of this manuscript is the model itself, and b) the authors used proprietary software for it, which hinders reproducibility. Was a template model used to build this reconstruction, or was it built from scratch? Which alignment method was use to map the genes to KEGG, or was that information directly taken from the annotated genome? Which criteria were used for adding reactions to the draft model? Which criteria were used to finish adding reactions? Was there any gap-filling performed? How did the authors ensure the model could produce biomass? * L47: Assuming every gene-reaction-relationship is an OR relationship is a quite strong assumption, that will lead to many false negatives when using the model for gene essentiality predictions (one of the most common uses for metabolic models). Most reconstruction tools are able to account for complexes; if insilico discovery cannot do it, I would suggest, for instance, using a reference model for retrieving this information: if the same reaction is present in both models, and there is homology between the corresponding genes, the AND/OR relationship from the reference can be copied. * L78: This section reads quite confusing and it took me a few reads to understand what the authors did. Considering that most of the result section is about the oxygen simulations, the authors should either make this section clearer to read, or remove focus from it to highlight the (much clearer) next section "Predicting growth-limiting amino acids in feeds". * L141: It is misleading to claim that this model accounts for the metabolic production of lipids, as the only lipid accounted for in the biomass reaction is PC 16:0/16:0, i.e. most of the rich lipid metabolism that fish has is ignored in the model, and the uptake costs of choline are largely over-estimated. Similarly, carbohydrates are only represented with glycogen. Considering the many published semi-automated tools for model generation, have the authors considered adding some additional metabolic content to their model so that it can produce some additional biomass constituents, e.g. lipids and/or carbohydrates, but also vitamins, cofactors, trace minerals? If not, then at the very least the authors should consider removing mentions to those pathways as being fully accounted for in the model. * L163: Only at this point in the manuscript I understood the scope of the model (energy, amino acid, and nucleotide metabolism). I believe the abstract, introduction and any other summary section of the manuscript should clearly highlight this scope and mention that this is a smaller-scale / highly-curated model and not genome-scale model, otherwise it reads misleading, making the reader think that this model is part of the genome-scale family. Furthermore, the authors should provide a richer discussion about what are the advantages/disadvantages of generating such a model instead of using the (more prevalent) full genome-scale model approach. * The model repository is quite unorganized and could benefit from some cleanup: I would recommend following a standard (e.g. https://github.com/SysBioChalmers/Human-GEM or the broader https://github.com/drivendata/cookiecutter-data-science). Specifically, the following ideas would increase readability of the repo: * Group scripts/notebooks in a separate folder. * Store the S. salar model in a simpler path than models/sasa/salarecon_bigg_curated.xml * It is confusing to find the model within a plethora of genome-scale models from different species. Furthermore, is it even needed to store models from other studies in this repo? Providing the links from BiGG should be enough. * Having a folder with old versions of the model defeats the purpose of using git: Shouldn't those models be accessible through specific commits in the git history? Labels can be provided to those commits for quick reference. Minor comments: * L39: Please detail (in methods or in supplementary material) the settings that were chosen for WoLF PSORT and SAPP. * L44: Unclear to me what "We iteratively converted the model to the BiGG namespace" means: why would the authors need to convert the model ids more than once to BiGG? * L56: It is unclear to me what was the exact criteria for stopping model annotation/curation. * L80: For a randomized approach like the one the authors present, 100 randomized conditions is probably not enough for properly sampling the solution space. I would recommend the authors increasing this number to at least 1000. On the other hand, 100 linearly spaced oxygen uptake rates is probably redundant and could be replaced with a smaller number, considering that the response of the model seems to be more or less linear (Fig. 4d). * L89: Here I am still confused: as far as I understand there were 100 (feed random uptake combinations) * 100 (O2 linearly spaced O2 uptakes) = 10,000 simulations performed. Are there also other reaction bounds randomly sampled? * L133: If the authors claim that the node degree distributions of the network "are typical for metabolic and other biological networks", then I would expect Figure S1 to show numbers for those networks, however only the information for the salmon model is presented. * Figures 3a, 3d and 4c are not referenced anywhere in the manuscript as far as I can tell. * Fig. 3d: Amino acid essentiality data should be shown together with the predictions here, otherwise the statement in the caption "Essential amino acids predicted by SALARECON match data" is not proven. * L205: When performing pathway enrichment, it is standard practice to show in the plot (Fig. 4f) the number of reactions that each pathway contains, as a pathway with less than e.g. 5 reactions in the model should probably be filtered out, given that already a couple of reactions showing in a cluster could be counted as significantly over-represented. Note that the filtering here should be done by reactions, not genes (to not account for the same flux value multiple times). * L263: "Automatically built models work well for microbes but are still outperformed by models that are built by manual iteration" This is a strong claim that I believe requires at least some references and explaining what is meant by "outperforming". * L277: Comparing the memote score of this model to models in BiGG is in my opinion unfair, as all those models were published before the memote score was introduced, and therefore did not account for it in their development. Furthermore, by adding a few things to those models, such as annotation and/or SBO terms, many of those models would increase their score to close to 95% as well. * L295: This is achieved only because the biomass reaction is relatively simple (e.g. one type of carbohydrate and one type of PC), so I'm not sure it should be highlighted as a great advantage of the model. * L320: Check grammar of sentence. * L356: Should say S3 Fig. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Dr Mohammad Tauqeer Alam, Department of Biology, United Arab Emirates University, UAE Reviewer #2: No Reviewer #3: Yes: Benjamín J. Sánchez Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at . Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols 28 Jan 2022 Submitted filename: salarecon_revision_response_reviewers.pdf Click here for additional data file. 6 Apr 2022 Dear Dr. Øyås, Thank you very much for submitting your manuscript "SALARECON connects the Atlantic salmon genome to growth and feed efficiency" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely considering this manuscript for publication, providing that you address all reviewers comments. Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Aleksej Zelezniak Guest Editor PLOS Computational Biology Kiran Patil Deputy Editor PLOS Computational Biology *********************** A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately: [LINK] Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: I have gone through the comments and rebuttal. All of my concerns were addressed by the reviewers. They have revised the manuscript accordingly, and I have no further comments. Brilliant job! Congratulations to the authors for a nice piece of work. Reviewer #2: For this revision of their paper on metabolic modeling of Atlantic salmon the authors have developed the description of their methodology and have expanded the biological analysis. However, some questions remain to be addressed as detailed in the following specific comments. Major comments 1. The authors maintain that Jacard distance is indicative of similarity between fish metabolism and a metric of salmon-specificity, “The models clustered by phylogeny with fish and mammals forming distinct groups and the diatom as an outlier, indicating that SALARECON captures fish- and likely salmon-specific metabolism.”. If these models were all genome-scale reconstructions, this may be a correct conclusion, however, since many reactions are missing from SALARECON due to its limited scope, it is not apparent if the clustering is driven by similarity in scope or in content. As an illustration: if every reaction in the salmon reconstruction is also present in the human reconstruction, the Jaccard distance would still be very large ~0.85 (1 - 456/3554), due to difference in reconstruction size, while the corresponding value for zebra fish would be 0.65, and in such case they would thus cluster due to similarity in scope. At best the results are consistent with the hypothesis that SALARECON captures fish and salmon-specific metabolism, but Jaccard distance does simply not seem like an appropriate metric when comparing reconstructions of different scope. 2. Similarly, in Figure 2e it is not clear if the observed differences are due to difference in species or reconstruction size, all differences seem to suggest that the Human model has additional capacity, e.g. for the human model, the phosphate likely originates from phospholipid metabolism, which is not reconstructed in SALARECON. It is not clear if SALARECON has any metabolites or reactions that are not also present in the human reconstruction, and it is not clear if the reactions or metabolites that are missing in SALARECON are due to limited reconstruction scope or genuinely missing. 3. The Monod model with an intercept term is not the Monod model. It is perhaps an “Extended Monod model”. However, I the r/r _max term was used instead of x in the Monod equation, there would likely be no need for an intercept and the results would be more comparable to the model as they receive the same input. 4. Regarding the choice of uniform distribution for nutrient sampling, the Authors write in their point-by-point response “we do not have prior information and therefore choose to sample from a uniform distribution”, however, is not the fish meal in Fig.5a such prior knowledge? 5. The capacity constraints are sampled from a log normal distribution, spanning 6 orders of magnitude. How do the authors ensure that the uptake constrains are of a corresponding magnitude? The manuscript states that uptake is normalized to 1 g/gdw/h, and that this value is arbitrary, however, it will not be arbitrary if the flux capacities are constrained, e.g. if uptake is much larger than capacity, then capacity will determine growth, and if uptake rates are much lower than capacity, then uptake will determine growth. To ensure that both constraints are active the authors could compare the curve in Figure 4a with and without uptake constraints. Alternatively, the predicted fluxes could be compared with the constraints to determine the frequency of constraining growth (flux==constraint) for each internal- and uptake reaction. 6. The revised sampling method resembles the one used by Beg et al 2007, further described by Adadi et al 2012, it may be worth checking if they are mathematically the same as the proposed method. 7. The authors state that the purpose of figure 4 is to “show that SALARECON can give good estimates of physiological parameters (minimal and maximal water oxygen saturation)”. However, the minimal water oxygen saturation is a fitting parameter (x0) and the maximum is another fitting parameter (x1), so it is not clear that the results achieve this purpose. Minor comments: 1. The definition of Jaccard distance is in equation 1 seems to be the Jaccard similarity, for distance: 1 – J(A, B). 2. It may be contested that Robinson et al. 2020 have reconstructed a more recent human model than Recon3D, but perhaps this is the most recent model in the BIGG database. 3. The authors sample the upper- and lower bounds of reversible fluxes independently. Since the [E] term is present in both the forward and backward reaction, they are in principle not independent. However, in practice this will not matter since pFBA ensures that each reaction will be either in the forward or backward direction. 4. Figure 3c says “fitted value” on y axis, but R2 is the calculated coefficient of determination, not a fitted value. 5. Figure S6F shows absolute growth rate and uptake rates. The growth rate is 1% per hour, which is approximately 24 times higher than observed in salmon Cook et al 2000. 6. In figure 2e the figure legend could perhaps be more clearly state that slash (“/”) indicates sets of metabolites, e.g. NH3 is present in both the red and blue box, but this is presumably because it is compared against the whole set NH3/Urea/Urate. References Beg, Q. K. et al. Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity. Proc. Natl. Acad. Sci. U. S. A. 104, 12663–12668 (2007). Adadi, R., Volkmer, B., Milo, R., Heinemann, M. & Shlomi, T. Prediction of Microbial Growth Rate versus Biomass Yield by a Metabolic Network with Kinetic Parameters. PLoS Comput. Biol. 8, e1002575 (2012). Robinson, J et al. An atlas of human metabolism. Sci. Signal. 13, eaaz1482 (2020). Adadi, R., Volkmer, B., Milo, R., Heinemann, M. & Shlomi, T. Prediction of Microbial Growth Rate versus Biomass Yield by a Metabolic Network with Kinetic Parameters. PLoS Comput. Biol. 8, e1002575 (2012). Cook, J. T., McNiven, M. A., Richardson, G. F. & Sutterlin, A. M. Growth rate, body composition and feed digestibility/conversion of growth-enhanced transgenic Atlantic salmon (Salmo salar). Aquaculture 188, 15–32 (2000). ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Mohammad Tauqeer Alam, Department of Biology, United Arab Emirates University, Al Ain, Abu Dhabi. Reviewer #2: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols References: Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. 7 May 2022 Submitted filename: salarecon_revision_response_reviewers.pdf Click here for additional data file. 10 May 2022 Dear Dr. Øyås, We are pleased to inform you that your manuscript 'SALARECON connects the Atlantic salmon genome to growth and feed efficiency' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Aleksej Zelezniak Guest Editor PLOS Computational Biology Kiran Patil Deputy Editor PLOS Computational Biology *********************************************************** 7 Jun 2022 PCOMPBIOL-D-21-01635R2 SALARECON connects the Atlantic salmon genome to growth and feed efficiency Dear Dr Øyås, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Agnes Pap PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

41 in total

1. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities.

Authors: Daniel Machado; Sergej Andrejev; Melanie Tramontano; Kiran Raosaheb Patil
Journal: Nucleic Acids Res Date: 2018-09-06 Impact factor: 16.971

2. A protocol for generating a high-quality genome-scale metabolic reconstruction.

Authors: Ines Thiele; Bernhard Ø Palsson
Journal: Nat Protoc Date: 2010-01-07 Impact factor: 13.491

Review 3. Functional amino acids in fish nutrition, health and welfare.

Authors: Synne M Andersen; Rune Waagbø; Marit Espe
Journal: Front Biosci (Elite Ed) Date: 2016-01-01

4. Functional Annotation of All Salmonid Genomes (FAASG): an international initiative supporting future salmonid research, conservation and aquaculture.

Authors: Daniel J Macqueen; Craig R Primmer; Ross D Houston; Barbara F Nowak; Louis Bernatchez; Steinar Bergseth; William S Davidson; Cristian Gallardo-Escárate; Tom Goldammer; Yann Guiguen; Patricia Iturra; James W Kijas; Ben F Koop; Sigbjørn Lien; Alejandro Maass; Samuel A M Martin; Philip McGinnity; Martin Montecino; Kerry A Naish; Krista M Nichols; Kristinn Ólafsson; Stig W Omholt; Yniv Palti; Graham S Plastow; Caird E Rexroad; Matthew L Rise; Rachael J Ritchie; Simen R Sandve; Patricia M Schulte; Alfredo Tello; Rodrigo Vidal; Jon Olav Vik; Anna Wargelius; José Manuel Yáñez
Journal: BMC Genomics Date: 2017-06-27 Impact factor: 3.969

5. Automated generation of genome-scale metabolic draft reconstructions based on KEGG.

Authors: Emil Karlsen; Christian Schulz; Eivind Almaas
Journal: BMC Bioinformatics Date: 2018-12-04 Impact factor: 3.169

6. UniProt: a worldwide hub of protein knowledge.

Authors:
Journal: Nucleic Acids Res Date: 2019-01-08 Impact factor: 16.971

7. New approach for understanding genome variations in KEGG.

Authors: Minoru Kanehisa; Yoko Sato; Miho Furumichi; Kanae Morishima; Mao Tanabe
Journal: Nucleic Acids Res Date: 2019-01-08 Impact factor: 16.971

8. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update).

Authors: Uku Raudvere; Liis Kolberg; Ivan Kuzmin; Tambet Arak; Priit Adler; Hedi Peterson; Jaak Vilo
Journal: Nucleic Acids Res Date: 2019-07-02 Impact factor: 16.971

9. BiGG Models 2020: multi-strain genome-scale models and expansion across the phylogenetic tree.

Authors: Charles J Norsigian; Neha Pusarla; John Luke McConn; James T Yurkovich; Andreas Dräger; Bernhard O Palsson; Zachary King
Journal: Nucleic Acids Res Date: 2020-01-08 Impact factor: 16.971

10. Model-based assessment of mammalian cell metabolic functionalities using omics data.

Authors: Anne Richelle; Benjamin P Kellman; Alexander T Wenzel; Austin W T Chiang; Tyler Reagan; Jahir M Gutierrez; Chintan Joshi; Shangzhong Li; Joanne K Liu; Helen Masson; Jooyong Lee; Zerong Li; Laurent Heirendt; Christophe Trefois; Edwin F Juarez; Tyler Bath; David Borland; Jill P Mesirov; Kimberly Robasky; Nathan E Lewis
Journal: Cell Rep Methods Date: 2021-06-30