| Literature DB >> 16757574 |
Nicki Tiffin1, Euan Adie, Frances Turner, Han G Brunner, Marc A van Driel, Martin Oti, Nuria Lopez-Bigas, Christos Ouzounis, Carolina Perez-Iratxeta, Miguel A Andrade-Navarro, Adebowale Adeyemo, Mary Elizabeth Patti, Colin A M Semple, Winston Hide.
Abstract
Genome-wide experimental methods to identify disease genes, such as linkage analysis and association studies, generate increasingly large candidate gene sets for which comprehensive empirical analysis is impractical. Computational methods employ data from a variety of sources to identify the most likely candidate disease genes from these gene sets. Here, we review seven independent computational disease gene prioritization methods, and then apply them in concert to the analysis of 9556 positional candidate genes for type 2 diabetes (T2D) and the related trait obesity. We generate and analyse a list of nine primary candidate genes for T2D genes and five for obesity. Two genes, LPL and BCKDHA, are common to these two sets. We also present a set of secondary candidates for T2D (94 genes) and for obesity (116 genes) with 58 genes in common to both diseases.Entities:
Mesh:
Year: 2006 PMID: 16757574 PMCID: PMC1475747 DOI: 10.1093/nar/gkl381
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Loci associated with T2D through linkage and association studies
| Locus | Reference |
|---|---|
| 1q21–25 | (74–82) |
| 1p31 | (83) |
| 2p11 | (84) |
| 2p22–2p13 | (75,79,85) |
| 2p25 | (83) |
| 2q12 | (86) |
| 2q24 | (87) |
| 2q33–2q37 | (81,88–91) |
| 3p12–3p13 | (83,92) |
| 3p24-22 | (74,84,93) |
| 3p26 | (83) |
| 3q11 | (75) |
| 3q27–29 | (79,87) |
| 4q27–4q28 | (74,92,94) |
| 4q32–34 | (84,95) |
| 5q13 | (82,93) |
| 5q31–5q32 | (74,82) |
| 6p21–6p22 | (88) |
| 6q12 | (75) |
| 6q15–6q27 | (74,78,85,96– |
| 7p15 | ( |
| 7p21–7p22 | ( |
| 7q22 | ( |
| 7q36 | ( |
| 8p21–8p22 | ( |
| 8p11–8p12 | ( |
| 8q11 | ( |
| 8q24 | ( |
| 9p13–p24 | ( |
| 9q31 | ( |
| 9q33 | ( |
| 10p13 | ( |
| 10q23 | ( |
| 10q26 | ( |
| 11p12–p14 | ( |
| 11q23 | ( |
| 12p11 | ( |
| 12q15–12q21 | ( |
| 12q24 | ( |
| 14q11–14q13 | ( |
| 14q23–14q24 | ( |
| 14q32 | ( |
| 15q11 | ( |
| 15q13–q21 | ( |
| 16p12–16q11 | ( |
| 17p13–17q22 | ( |
| 18p11 | ( |
| 18q21–18q23 | ( |
| 19p13 | ( |
| 19q13 | ( |
| 20p11–20p13 | ( |
| 20q12–13 | ( |
| Xq23–27.3 | ( |
Comparison of gene sets selected by 5/7 methods, with pathways defined by Mootha et al. (32)
| Pathway with total number of assigned genes | No. of genes | T2D | % of gene set | % of pathway | Obesity | % of gene set | % of pathway | T2D and Obesity | % of gene set | % of pathway |
|---|---|---|---|---|---|---|---|---|---|---|
| Fatty acid metabolism | 24 | 1 | 1.1 | 4.2 | 4 | 3.4 | 16.7 | 1 | 1.7 | 4.2 |
| Gluconeogenesis | 32 | 3 | 3.2 | 9.4 | 2 | 1.7 | 6.3 | 2 | 3.4 | 6.3 |
| Glycolysis | 31 | 1 | 1.1 | 3.2 | 2 | 1.7 | 6.5 | 1 | 1.7 | 3.2 |
| Glycogen metabolism | 24 | 2 | 2.1 | 8.3 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 |
| Insulin signaling | 50 | 1 | 1.1 | 2.0 | 2 | 1.7 | 4.0 | 1 | 1.7 | 2.0 |
| Ketogenesis | 9 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 |
| Pyruvate metabolism | 435 | 11 | 11.7 | 2.5 | 16 | 13.8 | 3.7 | 8 | 13.8 | 1.8 |
| Reactive oxygen species homeostasis | 7 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 |
| Krebbs cycle | 16 | 1 | 1.1 | 6.3 | 1 | 0.9 | 6.3 | 1 | 1.7 | 6.3 |
| Oxidative phosphorylation | 87 | 1 | 1.1 | 1.1 | 0 | 0.0 | 0.0 | 0 | 0.0 | 0.0 |
| Mitochondria | 457 | 14 | 14.9 | 3.1 | 17 | 14.7 | 3.7 | 9 | 15.5 | 2.0 |
| Total Candidate Genes selected | 94 | 116 | 58 |
No. of genes, the total number of genes assigned to each pathway; T2D/Obesity, the number of T2D or obesity 5/7 candidates found in this pathway; T2D and Obesity, the number of genes in common to T2D and obesity found in this pathway; % of gene set, the percentage of T2D and/or obesity candidates that are found in this pathway; % of pathway, the percentage of total genes assigned to this pathway found in the T2D and/or obesity candidate set.
(A) Sources of input data for each method and (B) number of genes in the starting candidate set, and number of genes selected by each method
| Methods | |||||||
|---|---|---|---|---|---|---|---|
| GeneSeeker | eVOC system | DGP | PROSPECTR | SUSPECTS | G2D | POCUS | |
| (A) Input | |||||||
| PubMed abstracts | X | X | X | ||||
| eVOC annotation | X | ||||||
| Sequence data | X | X | X | X | |||
| GO annotation | X | X | X | ||||
| Protein data | X | X | X | X | |||
| Expression libraries | X | X | X | ||||
| Orthologous mouse genes | X | ||||||
| OMIM | X | ||||||
| (B) Number of genes selected | |||||||
| Starting set of candidates | 9556 | 9556 | 9556 | 9556 | 9556 | 9556 | 562 |
| T2D | 642 | 2504 | 1174 | 791 | 956 | N/Aa | 2 |
| Obesity | 281 | 3046 | 1174 | 791 | 995 | N/Aa | 2 |
aG2D ranks all candidate genes, rather than selecting a subset of candidates.
Properties of candidate genes selected by 6/7 methods for both T2D and obesity, for T2D only and for obesity only
| Name | HUGO ID | Ensembl_ID | Locus | LOD score | Disease affiliation | Gene function |
|---|---|---|---|---|---|---|
| Lipoprotein lipase precursor | ENSG00000175445 | 8p21.3 | 2.55 ( | Hyperlipoproteinemia type I | Hydrolysis of triglycerides | |
| Branched-chain alpha-keto acid dehydrogenase | ENSG00000142046 | 19q13.2 | 1.50 ( | Maple syrup urine disease | Amino acid metabolism | |
| Ornithine aminotransferase | ENSG00000065154 | 10q26.13 | 2.88 ( | Ornithine aminotransferase deficiency. | Amino acid metabolism | |
| Macrophage colony stimulating factor I | ENSG00000182578 | 5q32 | 1.22 ( | Myeloid malignancy | Receptor for colony stimulating factor 1, a cytokine which controls the production, differentiation, and function of macrophages. | |
| Glucosidase II beta subunit precursor | ENSG00000130175 | 19p13.2 | 1.81 ( | Polycystic liver disease | Beta-subunit of glucosidase II, an N-linked glycan-processing enzyme in the endoplasmic reticulum | |
| TGFβ receptor type II | ENSG00000163513 | 3p24.1, 3p22 | 1.27 ( | Colon cancer, Marfan syndrome type II | Receptor for TGFβ, with a protein kinase domain that activates transcription factors | |
| Phospho-glucomutase 1 | ENSG00000079739 | 1p31.3 | 1.50 ( | OMIM 171900 | Catalyses the transfer of a phosphate group between the 1- and 6-positions of glucose. | |
| Acetyl-CoA acyltransferase 2 | ENSG00000167315 | 18q21.1 | 2.62 ( | OMIM 604770 | Mitochondrial fatty acid metabolism (β-oxidation) | |
| Enoyl-CoA hydratase | ENSG00000127884 | 10q26.3 | 1.69 ( | OMIM 602292 | Mitochondrial fatty acid metabolism (β-oxidation) | |
| Catalase | ENSG00000121691 | 11p13 | 3.08 ( | Acatalasia, | Catalysis of hydrogen peroxide to water and oxygen in the peroxisome | |
| Sialidase 1 | ENSG00000184494 | 6p21.33 | 2.07 ( | Neuraminidase deficiency | Cleavage of terminal sialic acid residues from substrates such as glycoproteins and glycolipids in the lysosome | |
| VLDLR | ENSG00000147852 | 9p24.2 | 2.38 ( | OMIM 192977 | Transport of triacylglycerol from the liver to adipose tissue | |
Figure 1Involvement of candidate T2D genes in cellular metabolism pathways, selected candidates are shown in red.
Analysis of previously proposed T2D and obesity disease genes from loci studied
| Symbol | Ensembl ID | Locus | G2D | eVOC | SUSPECTS | DGP | GeneSeeker | POCUS | PROSPECTR | Total |
|---|---|---|---|---|---|---|---|---|---|---|
| ENSG00000104812 | 19q13 | X | X | X | X | X | 5 | |||
| ENSG00000169047 | 2q33–2q37 | X | X | 2 | ||||||
| ENSG00000142330 | 2q33–2q37 | X | 1 | |||||||
| ENSG00000160789 | 1q21.2–q21.3 | X | 1 | |||||||
| ENSG00000197594 | 6q22–q23 | X | X | 2 | ||||||
| ENSG00000101076 | 20q12–q13.1 | X | X | 2 | ||||||
| ENSG00000111725 | 12q24 | X | X | X | 3 | |||||
| ENSG00000131791 | 1q21 | X | X | 2 | ||||||
| ENSG00000118217 | 1q23 | X | X | 2 | ||||||
| Total | 6 | 5 | 2 | 5 | 2 | 0 | 0 | |||
| ENSG00000181092 | 3q27–29 | X | X | X | 3 | |||||
| ENSG00000169252 | 5q31–5q32 | X | X | X | 3 | |||||
| ENSG00000188778 | 8p11–8p12 | X | X | 2 | ||||||
| ENSG00000116678 | 1p31 | X | X | X | 3 | |||||
| ENSG00000113580 | 5q31–5q32 | X | X | X | X | X | 5 | |||
| ENSG00000111956 | 6p21.3 | X | X | X | 3 | |||||
| ENSG00000110244 | 11q23 | X | 1 | |||||||
| ENSG00000130164 | 19p13.3 | X | X | X | X | X | 5 | |||
| Total | 5 | 4 | 4 | 7 | 3 | 0 | 2 | |||
aGenes used in PROSPECTR training set (68–70).
bGenes from the Genetic Association Database, located in regions studied and with more than one independent non-negative association study ().