| Literature DB >> 18263617 |
Daniel Shriner1, Tesfaye M Baye, Miguel A Padilla, Shiju Zhang, Laura K Vaughan, Ann E Loraine.
Abstract
Linkage studies of complex traits frequently yield multiple linkage regions covering hundreds of genes. Testing each candidate gene from every region is prohibitively expensive and computational methods that simplify this process would benefit genetic research. We present a new method based on commonality of functional annotation (CFA) that aids dissection of complex traits for which multiple causal genes act in a single pathway or process. CFA works by testing individual Gene Ontology (GO) terms for enrichment among candidate gene pools, performs multiple hypothesis testing adjustment using an estimate of independent tests based on correlation of GO terms, and then scores and ranks genes annotated with significantly-enriched terms based on the number of quantitative trait loci regions in which genes bearing those annotations appear. We evaluate CFA using simulated linkage data and show that CFA has good power despite being conservative. We apply CFA to published linkage studies investigating age-of-onset of Alzheimer's disease and body mass index and obtain previously known and new candidate genes. CFA provides a new tool for studies in which causal genes are expected to participate in a common pathway or process and can easily be extended to utilize annotation schemes in addition to the GO.Entities:
Mesh:
Year: 2008 PMID: 18263617 PMCID: PMC2275105 DOI: 10.1093/nar/gkn007
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
QTL analyzed for Alzheimer's disease
| Chromosome | Nucleotide start | Nucleotide end | Reference |
|---|---|---|---|
| 6q27 | 136356912 | 156040884 | ( |
| 11q25 | 119605730 | 134256682 | ( |
| 14q22 | 35284307 | 77866250 | ( |
QTL analyzed for body mass index
| Chromosome | Nucleotide start | Nucleotide end | Reference |
|---|---|---|---|
| 1p36 | 1 | 13368006 | ( |
| 1p22 | 81686191 | 101686191 | ( |
| 2q14 | 114345119 | 134345119 | ( |
| 3q22 | 138413583 | 158413583 | ( |
| 3q27 | 167105819 | 187105819 | ( |
| 4q12 | 47597526 | 67597526 | ( |
| 5q12 | 56263229 | 76263229 | ( |
| 5q32 | 135232609 | 155232609 | ( |
| 6p25 | 765305 | 12968512 | ( |
| 6q23-25 | 137282655 | 162104889 | ( |
| 7p21 | 18028844 | 27780107 | ( |
| 7q32 | 121695999 | 141695999 | ( |
| 10p11 | 18725142 | 31684305 | ( |
| 11q14 | 117235383 | 128161860 | ( |
| 13q14 | 42330695 | 62330695 | ( |
| 16p11-12 | 23004204 | 26125659 | ( |
| 19q13 | 50237821 | 70237821 | ( |
| 20q13 | 28737193 | 48737193 | ( |
Figure 1.Workflow diagram. The flow of data from each step is schematically depicted. The genome-wide correlation matrix is computed for all GO terms and saved for subsequent analysis with different data sets. Genes overlapping with QTL regions and their associated GO annotations are obtained from the UCSC Genome Informatics DAS/1 server and the Entrez Gene database, respectively. For each data set, a study-specific correlation matrix is blocked from the genome-wide correlation matrix. Genes from each QTL are combined to form a study-wide gene list and each term is then tested for over-representation using Fisher's exact test. The effective number of independent tests is estimated using Velicer's minimum average partial test, and P-values obtained are adjusted upward based on the effective number of independent tests. Genes are then scored using weights computed from principal components of the study-specific correlation matrix and the number of QTL containing genes with enriched annotations. Rectangles indicate products of data processing and cylinders indicate databases.
2 × 2 Table for Fisher's exact test for enrichment of a Gene Ontology term
| Gene in list | Gene not in list | Total | |
|---|---|---|---|
| Gene annotated with term | |||
| Gene not annotated with term | |||
| Total |
Figure 2.Distributions of positive correlation coefficients. (A) Correlation coefficients >0.2 for term pairs in which both terms belong to the same sub-ontology. (B) Correlation coefficients >0.2 for term pairs in which the terms belong to different sub-ontologies.
Significantly enriched Gene Ontology terms for Alzheimer's disease
| Term | Adjusted |
|---|---|
| MHC class I protein complex | 0.0017 |
| Antigen presentation | 0.0020 |
| MHC class I receptor activity | 0.0057 |
| Integral to membrane | 0.0080 |
| Olfactory receptor activity | 0.0081 |
| Sensory perception of smell | 0.0087 |
| Palmitoyl-CoA hydrolase activity | 0.0090 |
| Estrogen receptor activity | 0.0223 |
| Photoreceptor cell maintenance | 0.0223 |
| Endoplasmic reticulum | 0.0321 |
| Membrane | 0.0398 |
Enriched GO terms cross-referenced with occurrence of term and Alzheimer's in Entrez Gene and OMIM among human genes
| GO Term | |||||||
|---|---|---|---|---|---|---|---|
| Gene | Membrane (113 total) | Integral to membrane (39 total) | Endoplasmic reticulum | Estrogen receptor | MHC Class 1 protein complex | Sensory perception of smell | Chromosome |
| APH1A | X | X | 1p36-q31 | ||||
| X | X | X | 21q21 | ||||
| BCHE | X | 3q26 | |||||
| CASP7 | X | X | 10q25 | ||||
| X | X | 15q21 | |||||
| X | X | X | 14q32 | ||||
| CYP7B1 | X | X | 8q21 | ||||
| X | X | X | 1p33-31 | ||||
| ESR1 | X | X | |||||
| X | |||||||
| GNAS | X | X | 20q13 | ||||
| X | X | 6p21 | |||||
| X | X | 22q13 | |||||
| X | X | X | Xp11 | ||||
| ITGB1 | X | X | X | 10p11 | |||
| X | X | 1q22-23 | |||||
| OPRS1 | X | X | 9p13 | ||||
| X | X | 20p13 | |||||
| X | X | ||||||
| X | X | X | 1q31-42 | ||||
| PSENEN | X | X | 19q13 | ||||
| STX8 | X | X | 17p12 | ||||
The terms ‘antigen presentation’, ‘MHC Class I receptor activity’, ‘photoreceptor cell maintenance’, ‘olfactory’ and ‘palmitoyl-CoA hydrolase activity’ did not return any human genes containing the GO term and ‘Alzheimer's’ from Entrez Gene. Only a partial listing for genes annotated with ‘membrane’ or ‘integral to membrane’ is shown. Genes that contain ‘Alzheimer’ in their OMIM reference are indicated in bold. Chromosomal regions for genes that co-localize with QTL included in this study are indicated in .
Significantly enriched Gene Ontology terms for body mass index
| Term | Adjusted |
|---|---|
| Homophilic cell adhesion | 1.727e-23 |
| Calcium-dependent cell-cell adhesion | 1.366e-8 |
| Cell adhesion | 3.520e-8 |
| Transcription | 2.570e-6 |
| Regulation of transcription, DNA-dependent | 1.526e-6 |
| Synaptogenesis | 2.920e-5 |
| Protein binding | 2.932e-4 |
| Serine-type endopeptidase inhibitor activity | 0.0101 |
| Nucleus | 0.0133 |
| Metal ion binding | 0.0148 |
| Lipid binding | 0.0182 |
| Membrane | 0.0426 |
Enriched GO terms cross-referenced with occurrence of term body mass index or BMI in Entrez Gene and OMIM
| GO Term | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Metal ion binding | Nucleus | Regulation of transcription, DNA-dependent | Membrane | Transcription | Protein binding | Cell Adhesion | Serine-type endopeptidase inhibitor activity | Lipid binding | Chromosome |
| X | X | 17q23 | ||||||||
| X | X | 10q24-26 | ||||||||
| X | X | X | X | |||||||
| X | 8p11-12 | |||||||||
| AGT | X | 1q42-43 | ||||||||
| APOC3 | X | 11q23 | ||||||||
| X | 11p13 | |||||||||
| CYP17A1 | X | 10q24 | ||||||||
| DGAT1 | X | 8q24 | ||||||||
| DRD4 | X | 11p15 | ||||||||
| X | 6q22-23 | |||||||||
| X | X | X | X | X | X | X | ||||
| IL1RN | X | 2q14 | ||||||||
| X | ||||||||||
| X | 11p15 | |||||||||
| X | ||||||||||
| X | X | 1p31 | ||||||||
| LPL | X | 8p22 | ||||||||
| MAOA | X | Xp11 | ||||||||
| X | X | X | X | X | X | |||||
| X | X | 9q22 | ||||||||
| PPARA | X | X | X | X | X | 22q13 | ||||
| PPARD | X | X | X | 6p21 | ||||||
| X | X | X | X | X | 3p25 | |||||
| X | 7q31 | |||||||||
| X | X | 7q21-22 | ||||||||
| SORBS1 | X | X | 10q23-24 | |||||||
| TFRC | X | 3q29 | ||||||||
| X | X | |||||||||
| X | X | X | X | X | 6p21 | |||||
| TNFRSF1B | X | X | ||||||||
| X | X | 4q31 | ||||||||
| X | 11q13 | |||||||||
| X | X | X | X | X | 12q13 | |||||
The terms ‘homophilic cell adhesion’, ‘calcium-dependent cell–cell adhesion’, other more general related terms such as ‘calcium-dependent adhesion’, and ‘synaptogenesis’ did not return any hits when searched for co-occurrence with ‘body mass index’ or ‘BMI’ in Entrez Gene. Genes that contain ‘body mass index’ or ‘BMI’ in their OMIM reference are indicated in bold. Chromosomal regions for genes that co-localize with QTL included in this study are indicated in .