| Literature DB >> 29277862 |
Tina Zekic1,2,3, Guillaume Holley1,2,3, Jens Stoye4,5,6.
Abstract
Computational pan-genome analysis has emerged from the rapid increase of available genome sequencing data. Starting from a microbial pan-genome, the concept has spread to a variety of species, such as plants or viruses. Characterizing a pan-genome provides insights into intra-species evolution, functions, and diversity. However, researchers face challenges such as processing and maintaining large datasets while providing accurate and efficient analysis approaches. Comparative genomics methods are required for detecting conserved and unique regions between a set of genomes. This chapter gives an overview of tools available for indexing pan-genomes, identifying the sub-regions of a pan-genome and offering a variety of downstream analysis methods. These tools are categorized into two groups, gene-based and sequence-based, according to the pan-genome identification method. We highlight the differences, advantages, and disadvantages between the tools, and provide information about the general workflow, methodology of pan-genome identification, covered functionalities, usability and availability of the tools.Keywords: Accessory genome; Comparative genomics; Core genome; Pan-genomics
Mesh:
Year: 2018 PMID: 29277862 DOI: 10.1007/978-1-4939-7463-4_2
Source DB: PubMed Journal: Methods Mol Biol ISSN: 1064-3745