| Literature DB >> 17709344 |
Paul Fisher1, Cornelia Hedeler, Katherine Wolstencroft, Helen Hulme, Harry Noyes, Stephen Kemp, Robert Stevens, Andrew Brass.
Abstract
It is increasingly common to combine Microarray and Quantitative Trait Loci data to aid the search for candidate genes responsible for phenotypic variation. Workflows provide a means of systematically processing these large datasets and also represent a framework for the re-use and the explicit declaration of experimental methods. In this article, we highlight the issues facing the manual analysis of microarray and QTL data for the discovery of candidate genes underlying complex phenotypes. We show how automated approaches provide a systematic means to investigate genotype-phenotype correlations. This methodology was applied to a use case of resistance to African trypanosomiasis in the mouse. Pathways represented in the results identified Daxx as one of the candidate genes within the Tir1 QTL region. Subsequent re-sequencing in Daxx identified a deletion of an amino acid, identified in susceptible mouse strains, in the Daxx-p53 protein-binding region. This supports recent experimental evidence that apoptosis could be playing a role in the trypanosomiasis resistance phenotype. Workflows developed in this investigation, including a guide to loading and executing them with example data, are available at http://workflows.mygrid.org.uk/repository/myGrid/PaulFisher/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17709344 PMCID: PMC2018629 DOI: 10.1093/nar/gkm623
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(a) An illustration showing the prioritization of phenotype candidates, from the pathway-driven approach. All pathways are differentially expressed in the microarray data. Those pathways which contain genes from the QTL region are assigned a higher priority (pathways A and B) than those with no link to the QTL region (pathway C). Higher priority pathways are then ranked according to their involvement in the phenotypes expression, based on literature evidence. Abbreviations: CHR:Chromosome; QTL:Quantitative Trait Loci. (b) An illustration for the pathway-driven approach to genotype–phenotype correlations. The process of annotating candidate genes from microarray and QTL investigations with their biological pathways is shown. The pathways gathered from both studies are compared and those common to both are extracted as the candidate pathways for further analysis.These pathways represent a set of hypotheses, in that the candidates are the hypothetical processes which may contribute to the expression of the observed phenotype. Subsequent verification is required for each pathway by wet lab investigation and literature searches. This apporach is separated into two sections, distinguished by the dividing line between the selction of common pathways and the generation of hypotheses. The section labelled A represents the workflow side of the investigation;, whilst the section labelled B represents verification of the hypotheses through wet lab experimentation and literature mining.
Figure 2.Annotation workflow to gather genes in a QTL region, and provide information on the pathways involved with a phenotype. This workflow, shown as a sub-set of the complete workflow, requires a chromosome, and QTL start and stop positions in base pairs. The genes in this QTL region are then returned from Ensembl via a BioMart plug-in. These genes are subsequently annotated with UniProt and Entrez identifiers, start and end positions, Ensembl Transcript ids and Affymetrix probeset identifiers for the chips Mouse430_2 and Mouse430a_2. The UniProt and Entrez ids are submitted to the KEGG gene database, retrieving a list of KEGG gene ids.
A subset of the KEGG pathways found to be differentially expressed at Day 7 in the microarray data
| KEGG pathway Ids | Pathway descriptions | Genes in Tir1 QTL region | |||
|---|---|---|---|---|---|
| path:mmu00240 | Pyrimidine metabolism—Mus musculus (mouse) | Znrd1 | |||
| path:mmu04610 | Complement and coagulation cascades—Mus musculus (mouse) | C4b | C2 | Cfb | |
| path:mmu04320 | Dorso-ventral axis formation—Mus musculus (mouse) | Notch3 | Notch4 | ||
| path:mmu00620 | Pyruvate metabolism—Mus musculus (mouse) | Glo1 | |||
| path:mmu00600 | Sphingolipid metabolism—Mus musculus (mouse) | Neu1 | |||
| path:mmu04370 | VEGF signalling pathway—Mus musculus (mouse) | Mapk14 | Mapk13 | ||
| path:mmu04540 | Gap junction—Mus musculus (mouse) | Tubb5 | |||
| path:mmu00565 | Ether lipid metabolism—Mus musculus (mouse) | Agpat1 | |||
| path:mmu00564 | Glycerophospholipid metabolism—Mus musculus (mouse) | Agpat1 | |||
| path:mmu04310 | Wnt signalling pathway—Mus musculus (mouse) | Csnk2b | |||
| path:mmu00590 | Arachidonic acid metabolism—Mus musculus (mouse) | Cyp4f14 | Cyp4f15 | Cyp4f13 | Cyp4f16 |
| path:mmu04620 | Toll-like receptor signalling pathway—Mus musculus (mouse) | Mapk14 | Tnf | Mapk13 | |
| path:mmu04912 | GnRH signalling pathway—Mus musculus (mouse) | Mapk13 | Mapk14 | ||
| path:mmu04670 | Leukocyte transendothelial migration—Mus musculus (mouse) | Mapk13 | Mapk14 | ||
| path:mmu04330 | Notch signalling pathway—Mus musculus (mouse) | Notch3 | Notch4 | ||
| path:mmu04110 | Cell cycle—Mus musculus (mouse) | Cdkn1a | |||
| path:mmu00561 | Glycerolipid metabolism—Mus musculus (mouse) | Agpat1 | |||
| path:mmu04530 | Tight junction—Mus musculus (mouse) | Csnk2b | |||
| path:mmu04510 | Focal adhesion—Mus musculus (mouse) | Col11a2 | Tnxb | ||
| path:mmu04010 | MAPK signalling pathway—Mus musculus (mouse) | Mapk14 | Tnf | Daxx | Mapk13 |
| path:mmu00062 | Fatty acid elongation in mitochondria—Mus musculus (mouse) | Ppt2 | |||
| path:mmu04664 | Fc epsilon RI signalling pathway—Mus musculus (mouse) | Mapk14 | Tnf | Mapk13 | |
| path:mmu00310 | Lysine degradation—Mus musculus (mouse) | Ehmt2 | |||
| path:mmu04630 | Jak-STAT signalling pathway—Mus musculus (mouse) | Pim1 | |||
| path:mmu00230 | Purine metabolism—Mus musculus (mouse) | Pde9a | Znrd1 | ||
| path:mmu04910 | Insulin signalling pathway—Mus musculus (mouse) | Flot1 | |||
| path:mmu03320 | PPAR signalling pathway—Mus musculus (mouse) | Rxrb | Angptl4 | ||
| path:mmu00260 | Glycine, serine and threonine metabolism—Mus musculus (mouse) | Cbs | |||
| path:mmu00604 | Glycosphingolipid biosynthesis - ganglioseries—Mus musculus (mouse) | B3galt4 | |||
| path:mmu04520 | Adherens junction—Mus musculus (mouse) | Csnk2b | |||
| path:mmu04920 | Adipocytokine signalling pathway—Mus musculus (mouse) | Rxrb | Tnf | ||
The genes listed above that appear in the differentially expressed pathways are also located within the Tir1 QTL region. We have chosen to ignore the pathways containing the H2 complex genes due to the highly polymorphic nature of these genes. Pathways that do not represent metabolic processes have also been removed from this table. A complete list of the genes and pathways common to the Tir1 QTL region and Day 7 gene expression data can be found within the Supplementary Data.
Figure 3.Alignment of part of the acidic region of the Daxx gene in which an aspartate was deleted. 35/41 amino acids in this region are aspartic acid (D) or glutamic acid (E).