| Literature DB >> 21169373 |
Mikhail Gostev1, Julio Fernandez-Banet, Johan Rung, Joern Dietrich, Inga Prokopenko, Samuli Ripatti, Mark I McCarthy, Alvis Brazma, Maria Krestyaninova.
Abstract
SUMMARY: The Sample avAILability system-SAIL-is a web based application for searching, browsing and annotating biological sample collections or biobank entries. By providing individual-level information on the availability of specific data types (phenotypes, genetic or genomic data) and samples within a collection, rather than the actual measurement data, resource integration can be facilitated. A flexible data structure enables the collection owners to provide descriptive information on their samples using existing or custom vocabularies. Users can query for the available samples by various parameters combining them via logical expressions. The system can be scaled to hold data from millions of samples with thousands of variables. AVAILABILITY: SAIL is available under Aferro-GPL open source license: https://github.com/sail.Entities:
Mesh:
Year: 2010 PMID: 21169373 PMCID: PMC3035801 DOI: 10.1093/bioinformatics/btq693
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.A high level data structure in SAIL. (See Supplementary Materials for a complete database schema).
Fig. 2.Sample availability matrix for three collections, where collection 1 and 2 are annotated with one vocabulary and collection 3 with a different vocabulary. As Glucose and Glu_con are tagged as synonymous, the result of a query for Glucose will show samples available from the three collections.