| Literature DB >> 29945570 |
Thomas H Clarke1, Lauren M Brinkac2,3, Jason M Inman2, Granger Sutton2, Derrick E Fouts2.
Abstract
BACKGROUND: Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns are limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or by not accounting for pan-chromosome positioning in the viewer.Entities:
Keywords: Pan-chromosome; Pan-genome; PanOCT; Viewer; Visualization; fGI; fGR
Mesh:
Year: 2018 PMID: 29945570 PMCID: PMC6020400 DOI: 10.1186/s12859-018-2250-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1PanACEA Pipeline Flowchart. The PanACEA pipeline with the initial files shown in dark gray, the PanACEA PERL scripts shown in blue font, the resulting PanACEA intermediate files shown in light gray, and the final files shown in yellow. The final PanACEA output includes all the HTML pages, JSON files, and Javascripts scripts necessary to run the viewer. The RGI output referenced is generated by the RGI software package. Additional information on the requirements for the input files can be found in the user manual located on the GitHub page
Fig. 2PanACEA Views of E. hormaechei gat and aga Operons. The PanACEA pan-chromosome images (a), fGR view (b), and phylogeny (c) showing the gat operon that can differentiate E. hormaechei subsp. hormaechei from other subsp. [12]. The location of the fGI in b and c is highlighted with the orange box. The default coloring scheme is shown in (a) with variable regions in dark gray and core regions in light gray. The variable regions are also shown at 0.75 height and on alternating sides of the chromosome to help differentiate small neighboring regions. The bounding core region that contains the aga operon is shown in the preview panel highlighted by the light blue box in a. The cluster of genomes containing the gat operon fGI are annotated as E and are highlighted in the genome phylogeny in c using the pink box. The images in b and c are derived from PNGs downloaded directly from the website. Additional information about the visualization can be found in the user manual located on the GitHub page