| Literature DB >> 34632414 |
Ignacio Ferrés1,2, Gregorio Iraola1,2,3,4.
Abstract
Multiple downstream analyses are necessary to interpret the output of bacterial pangenome reconstruction software. This requires integrating diverse kinds of genetic and phenotypic data, which to date are left to each user's criterion. To fill this gap, we created Pagoo, a pangenome post-processing tool that leverages a standardized but flexible and extensible framework for data integration, analysis, and storage. Here, we provide the protocol for running Pagoo and performing from simple to more complex comparative analyses on bacterial pangenome data. For complete details on the use and execution of this protocol, please refer to Ferrés and Iraola (2021).Entities:
Keywords: Bioinformatics; Genomics; Microbiology
Mesh:
Year: 2021 PMID: 34632414 PMCID: PMC8487088 DOI: 10.1016/j.xpro.2021.100802
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Principal components analysis
A PCA is generated directly from the gene presence/absence matrix and in this case organisms are colored by host of origin.
Figure 2Pangenome curves
Pangenome curves show the accessory and core genome size and are indicative of the gene pool size in a certain dataset.
Figure 3Visualization of pangenome features
Pagoo can be integrated with other R packages to produce publication-quality figures in a simple way. In this case, the figure shows an assembly of different analyses that summarize general features of this example pangenome: (A) pangenome curves, (B) gene frequency plots, (C) Accessory genes PCA and (D) pie chart with gene subsets.
Figure 4Core genome phylogenies
This shows the output of the above-described recipe aiming to generate a core genome phylogeny directly from the pangenome object. Panel (A) shows the three colored by lineage defined in the same recipe through a population structure analysis and panel (B) shows the tree colored by host.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| GFF3 input files for pangenome reconstruction | Figshare | |
| Pagoo | ||
| Roary | ||
| $core_∗ | $shell_∗ | $cloud_∗ | |
|---|---|---|---|
| $∗_genes | $core_genes | $shell_genes | $cloud_genes |
| $∗_clusters | $core_clusters | $shell_clusters | $cloud_clusters |
| $∗_sequences | $core_sequences | $shell_sequences | $cloud_sequences |