| Literature DB >> 19761579 |
Erik Pitzer1, Ronilda Lacson, Christian Hinske, Jihoon Kim, Pedro Af Galante, Lucila Ohno-Machado.
Abstract
BACKGROUND: Large repositories of biomedical research data are most useful to translational researchers if their data can be aggregated for efficient queries and analyses. However, inconsistent or non-existent annotations describing important sample details such as name of tissue or cell line, histopathological type, and subject characteristics like demographics, treatment, and survival are seldom present in data repositories, making it difficult to aggregate data.Entities:
Mesh:
Year: 2009 PMID: 19761579 PMCID: PMC2745696 DOI: 10.1186/1471-2105-10-S9-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Screenshot of annotation explorer. All probes have been re-annotated to the AceView transcript database [22] and normalized using a modified version of quantile normalization [23] that has been adapted for very large datasets. The screenshot shows the distribution of normalized measurement values of BRCA1 for different disease states.
Variable definition file
This example shows the definition of a small annotation form Colon Cancer with the two groups Patient and Sample that contain the variables age, genetically modified and tumor size which reference previously defined variables and one new variable tissue with the predefined choices normal, benign, cancerous and non-colon.
Figure 2Screenshot of annotation tool. Left: Several links lead the annotators to additional information. The samples within one study can be searched, grouped and organized to accelerate the annotation of similar samples. Right: Predefined variables and values facilitate the annotation. The freedom of defining new variables or new values for existing variables and a way to find them help in capturing as much information as possible in an organized way.