Crysten E Blaby-Haas, Valérie de Crécy-Lagard.
Abstract
Nearly 2200 genomes that encode around 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function, even when function is loosely and minimally defined as 'belonging to a superfamily'. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these unknowns with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function.
Year: 2011 PMID: 21310501 PMCID: PMC3073767 DOI: 10.1016/j.tibtech.2011.01.001
Source DB: PubMed Journal: Trends Biotechnol ISSN: 0167-7799 Impact factor: 19.536