| Literature DB >> 27634291 |
Wenling E Chang1, Matthew W Peterson2, Christopher D Garay2, Tonia Korves3.
Abstract
BACKGROUND: Pathogen metadata includes information about where and when a pathogen was collected and the type of environment it came from. Along with genomic nucleotide sequence data, this metadata is growing rapidly and becoming a valuable resource not only for research but for biosurveillance and public health. However, current freely available tools for analyzing this data are geared towards bioinformaticians and/or do not provide summaries and visualizations needed to readily interpret results.Entities:
Keywords: BioSample; Biosurveillance; Geocoding; Java; LabKey; Metadata; Pathogen; PostgreSQL
Mesh:
Year: 2016 PMID: 27634291 PMCID: PMC5025631 DOI: 10.1186/s12859-016-1231-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Overview of the tables in the sample metadata database
| Database table | Content |
|---|---|
| Sample | Identity of a sample, including strain name, serovar, and submission date |
| Collection | Where, when and what type of environment the sample was collected from |
| Human_Host | Information about the human host for clinical samples, such as age and gender |
| Non_Human_Host | Information about non-human hosts for environmental samples |
| Study_Method | Methods used for obtaining and identifying a sample |
| Project | Information about the project associated with the strain |
| Project_Sample | Links projects to samples |
| Owner | Information about the organization that submitted the information about a strain |
| Collection_Owner | Information about the organization that collected the strain |
| Project_Publication | Links a sample to publications by PubMed Identifier |
| Cross_Reference | Stores source and id pairs for documents and databases that reference a sample |
Fig. 1Obtaining information about Listeria strains using the Pathogen Metadata Platform. a Querying for a pathogen name. b Filtering query results. c Selecting metadata types for a 2D histogram. d 2D histogram of counts for two metadata types
Fig. 2Obtaining and visualizing metadata for Salmonella strains. a Querying for a set of sequence read archive ids. b 2D histogram of counts for isolation source and year. c Table of query results. d Map of collection locations of the strains