| Literature DB >> 21943367 |
Lili Wang1, Pouya Khankhanian, Sergio E Baranzini, Parvin Mousavi.
Abstract
BACKGROUND: The speed at which biological datasets are being accumulated stands in contrast to our ability to integrate them meaningfully. Large-scale biological databases containing datasets of genes, proteins, cells, organs, and diseases are being created but they are not connected. Integration of these vast but heterogeneous sources of information will allow the systematic and comprehensive analysis of molecular and clinical datasets, spanning hundreds of dimensions and thousands of individuals. This integration is essential to capitalize on the value of current and future molecular- and cellular-level data on humans to gain novel insights about health and disease.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21943367 PMCID: PMC3190406 DOI: 10.1186/1471-2105-12-380
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Data sources and distribution
| Genes (proteins) | Diseases | Drugs | Tissues | |
|---|---|---|---|---|
| Genes (proteins) | 39,137a | 12,539b | 6,053c | 73,131d |
| Diseases | 12,539b | NA | NA | 285e |
| Drugs | 6,053c | NA | NA | NA |
| Tissues | 73,131d | 285e | NA | NA |
a HPRD/UCSD;b GWAS Catalog;c DrugBank;d Unigene;e manual curation.
Number of disease-gene associations at different GWAS cutoff values
| Cutoff | Disease-gene associations |
|---|---|
| No filter | 12,539 |
| 10-4 | 6,284 |
| 10-5 | 4,626 |
| 10-6 | 3,163 |
| 10-7 | 2,459 |
| 10-8 | 2,026 |
| 10-9 | 1,696 |
| 10-10 | 1,489 |
Figure 1Screenshot of iCTNet. Genetic association data for more than 200 traits and diseases are available to download from the iCTNet database at a user-selectable significance threshold (-Log10(p)). In addition, the user can choose to download disease-tissue, tissue-gene, and drug-gene interactions by simply ticking a checkbox. Protein-protein and protein-DNA interactions can also be downloaded at different degrees of separation (ds) or distance. Choosing a distance ds = 0 only downloads direct disease-gene associations, and any existing interaction among associated gene products (protein-protein). A distance ds = 1 will also include the first neighbors of genes directly associated.
Figure 2A network of five common autoimmune diseases. A. Disease-gene interaction network (ds = 0) for five common autoimmune diseases. Each disease has unique and shared associations. RA, T1D, and MS are closely related both through HLA and non-HLA associated genes. B. A simplified version of the network shown in A, using the "create similarity net" feature of iCTNet. In this representation, diseases are connected by edges of a color proportional to the number of shared genes. C. Same network as in A with drug-target interactions. Colored circles represent diseases (MS = yellow, T1D = red, RA = green, Ps = magenta, CD = teal), white triangles represent genes, and cyan round squares represent drugs. Disease-gene interactions are colored according to the disease. Protein-protein and DNA-protein interactions are shown as white edges. Drug-gene interactions are represented as cyan dashed edges.