| Literature DB >> 28053165 |
Daniel R Mende1,2, Ivica Letunic3, Jaime Huerta-Cepas1, Simone S Li1,4, Kristoffer Forslund1, Shinichi Sunagawa1,5, Peer Bork6,7,8,9.
Abstract
The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de.Entities:
Mesh:
Year: 2016 PMID: 28053165 PMCID: PMC5210662 DOI: 10.1093/nar/gkw989
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Availability of sequenced genomes and species clusters availability over time. Colors represent the habitat annotation of the genomes/species clusters.
Figure 2.Workflow to generate the underlying data of the database.
Figure 3.Overview of the representative genome set according to the NCBI Taxonomy. GC content, habitat information, genome size and antibiotic resistance gene carriage are displayed as additional datasets. Different Phyla are displayed as alternating light and dark gray clades within the tree (28).
Figure 4.Clade/specI species cluster view on the proGenomes website. All sequences and annotations for the genomes within a clade/specI species cluster can be downloaded directly. Individual member genomes can be accessed at the bottom of the page.