| Literature DB >> 18560577 |
Laura Miozzi1, Rosario Michael Piro, Fabio Rosa, Ugo Ala, Lorenzo Silengo, Ferdinando Di Cunto, Paolo Provero.
Abstract
BACKGROUND: High-throughput gene expression data can predict gene function through the "guilt by association" principle: coexpressed genes are likely to be functionally associated. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2008 PMID: 18560577 PMCID: PMC2409962 DOI: 10.1371/journal.pone.0002439
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Predicted annotations.
| dataset | measure | annotations | PFP(%) |
| AFFY | P | 1882 | 0.26 |
| AFFY | E | 925 | 0.61 |
| SAGE | P | 738 | 1.26 |
| SAGE | E | 374 | 2.53 |
| SAGE | D | 414 | 2.33 |
| SAGE | O | 539 | 1.70 |
Number of non-redundant predicted annotations obtained with the various dataset/measure combinations, and the corresponding PFP. The coexpression measures are: E: Euclidean; P: Pearson; D: Poisson “distinct”; O: Poisson “overrepresentation”.
Overlap.
| AFFY P | AFFY E | SAGE P | SAGE E | SAGE D | SAGE O | |
| AFFY P |
| 168 | 38 | 20 | 11 | 21 |
| AFFY E |
| 16 | 13 | 6 | 15 | |
| SAGE P |
| 58 | 46 | 65 | ||
| SAGE E |
| 45 | 69 | |||
| SAGE D |
| 201 | ||||
| SAGE O |
|
Overlap between the predicted annotations found by the various dataset/measure combinations. E: Euclidean, P: Pearson, D: Poisson-distinct, O: Poisson-overrepresentation.
Figure 1Enrichment of RCGs in genes involved in similar phenotypes.
For each dataset/coexpresison measure combination we show the fraction of RCGs associated to an OMIM phenotype as described in the text. The fraction is shown for functionally characterized RCGs (grey), all RCGs (purple), and randomized RCGs (orange). The coexpression measures are: E: Euclidean; P: Pearson; D: Poisson “distinct”; O: Poisson “overrepresentation”.
Predicted disease genes.
| Gene | Phenotype | Status |
| PCOLCE | Ehlers-Danlos syndrome, type VIII | A |
| KCNQ2 | Electroencephalogram, low-voltage | B* |
| CACNA1H | Microphthalmia, isolated, with cataract 1 | A |
| MYF6 | Scapuloperoneal myopathy | A |
| COL2A1 | Spondylometaphyseal dysplasia, Kozlowski type | B# |
| PSMB5 | Basal ganglia calcification, idiopathic, 1 | A |
| NEDD8 | Basal ganglia calcification, idiopathic, 1 | A |
| C14orf2 | Basal ganglia calcification, idiopathic, 1 | A |
| ENSG00000196992 | Basal ganglia calcification, idiopathic, 1 | A |
| RSHL3 | Craniometaphyseal dysplasia, autosomal recessive | A |
| SC5DL | Rosselli-Gulienetti syndrome | A |
| DKFZp762E1312 | Holoprosencephaly | A |
| TULP1 | Spinocerebellar ataxia, autosomal recessive 3 | A |
| NRSN1 | Spinocerebellar ataxia, autosomal recessive 3 | A |
| OPN1MW | Microphthalmia, isolated, with coloboma 1 | B |
| OPN1MW2 | Microphthalmia, isolated, with coloboma 1 | B |
| CACNA1F | Albinism, ocular, type II | B# |
| OPN1LW | Colorblindness, blue-mono-cone-monochromatic type | B |
| OPN1MW | Colorblindness, blue-mono-cone-monochromatic type | B |
| OPN1MW2 | Colorblindness, blue-mono-cone-monochromatic type | B |
| NRK | Megalocornea | A |
| MYL2 | Spinal muscular atrophy, distal, congenital nonprogressive | A |
| TDG | Spinal muscular atrophy, distal, congenital nonprogressive | A |
| CAMK1D | Refsum disease with increased pipecolic acidemia | A |
| AIPL1 | Cone-rod dystrophy 5 | C$ |
| LOC257039 | Glaucoma 1, open angle | A |
| BFSP2 | Glaucoma 1, open angle | A |
| SLC35A1 | Retinitis pigmentosa 25 | A |
| IMPG1 | Cone-rod dystrophy 7 | A |
| ELOVL4 | Cone-rod dystrophy 7 | B$ |
| CNGA1 | Stargardt disease 4 | C |
| SAA4 | Hyperlipidemia, combined, 2 | A |
| CYP4F22 | Ichthyosis, nonlamellar and nonerythrodermic, congenital, autosomal | A |
| ACTA1 | Muscular dystrophy, congenital, 1b | B |
| CABC1 | Muscular dystrophy, congenital, 1b | A |
| APOA5 | High density lipoprotein deficiency, 3 | B |
| APOA4 | High density lipoprotein deficiency, 3 | B |
| BFSP1 | Cataract, posterior polar, 3 | B |
| PCP4L1 | Cone-rod dystrophy, 8 | A |
| ABCF3 | Abdominal obesity-metabolic syndrome | A |
| FOXE1 | Cataract, autosomal recessive, early-onset, pulverulent | A |
| MYOT | Myopathy, distal 2 | B% |
| LOC493869 | Myopathy, distal 2 | A |
| CMYA5 | Myopathy, distal 2 | A |
| KRT1 | Exfoliative ichthyosis, autosomal recessive | B |
| KRT75 | Exfoliative ichthyosis, autosomal recessive | A |
| KRT2 | Exfoliative ichthyosis, autosomal recessive | B |
| KRT5 | Exfoliative ichthyosis, autosomal recessive | B |
| KRT77 | Exfoliative ichthyosis, autosomal recessive | A |
| KRT6A | Exfoliative ichthyosis, autosomal recessive | A |
| FLNC | Muscular dystrophy, limb-girdle, type 1f | A |
| CRYBB1 | Myopia 6 | A |
| CRYBA4 | Myopia 6 | A |
The gene name is the official HUGO name where available, or the Ensembl ID. The third column summarizes the available information about the association of candidates with the disease. In particular, genes annotated with A have to our knowledge not been associated to the disease or to similar phenotypes, genes annotated with B are involved in MIM phenotypes with phenomap scores of 0.4 or higher, genes annotated with C have been associated to similar phenotypes, but display a phenomap score lower than 0.4. Moreover, genes annotated with # represent the actual disease gene, because mutations have been found in patients (but the association with the disease is not reported by Ensembl, version 45); genes annotated with have been excluded on clinical basis; genes annotated with $ can be excluded because mutations have been found in a different gene; genes annotated % are at the moment excluded because they have been screened but mutations have not been found. In all cases labeled by special characters we also provide a reference to the corresponding supporting evidence. More details are available in Supporting Information Text S4.