| Literature DB >> 28545559 |
Daniel B Goodman1,2, Gleb Kuznetsov1,2,3, Marc J Lajoie1,2, Brian W Ahern4, Michael G Napolitano1,2,5, Kevin Y Chen4, Changping Chen4, George M Church6,7.
Abstract
Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. We describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.Entities:
Keywords: Bioinformatics; Experimental evolution; Genome engineering; Laboratory evolution; Microbial evolution; Synthetic biology; Synthetic genomics; Whole-genome sequencing
Mesh:
Substances:
Year: 2017 PMID: 28545559 PMCID: PMC5445467 DOI: 10.1186/s13059-017-1223-1
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Millstone enables rapid iterative multiplex genome analysis and engineering. a To use Millstone, a researcher uploads a reference genome and next-generation sequencing reads for many individual genomic clones, for example from long-term evolution or targeted genome editing. Millstone performs alignment and variant calling for both single nucleotide variants and structural variation and then assigns predicted effects based on reference genome annotations. A unified data model stores sample genotype, phenotype, and variant annotation data. Variants can then be queried, filtered, and grouped into sets for export, triage, and analysis. These variant sets can be used to design oligonucleotides to recreate or revert mutations of interest, or used to generate new versions of the reference genome. b A combined screenshot of the Millstone analysis and alignment visualization views (condensed and cropped for clarity). A custom query language and a corresponding query form in the user interface allow searching and filtering over the data. As variant calls sometimes require visual inspection and comparison, Millstone’s variant analysis view provides programmatically generated links to visualizations of the relevant read alignments in JBrowse [18]
Fig. 2Millstone accurately detects genomic variants and can iteratively version genomes. a Millstone was used to analyze genomic clones involved in generating and rationally optimizing a genomically recoded organism. MAGE [2] and CAGE [3] were used to generate the C321. ΔA strain of Escherichia coli [7]. With sequencing data from these strains, Millstone confirmed the designed mutations, identified and annotated off-target mutations, and generated a new reference genome. Further reversion of variants was performed with MAGE to improve the strain’s fitness [8], and a final reference genome was generated. b Analysis of 11 escapee clones from a biocontainment selection with a synthetic non-standard amino acid (nsAA) auxotrophy [9] identified two escape mechanisms, either mutation of tyrS or disruption of lon. c Millstone can also be used for adaptive laboratory evolution studies. We employed Millstone to analyze mutations across 115 clones in the Tenaillon et al. [10] high-temperature evolution experiment. Millstone was used to create a new reference genome for the ancestral strain from REL606, the closest available reference genome, and called variants against this new reference. Millstone reports 99.2% of SNVs, deletions, and mobile elements found by the Tenaillon pipeline, as well some not identified in the original study (Additional file 1: Table S2). GRO genomically recoded organism, Ref. reference, SNV single-nucleotide variant