| Literature DB >> 19478022 |
Timothy D O'Connor1, Nicholas I Mundy.
Abstract
MOTIVATION: Mapping between genotype and phenotype is one of the primary goals of evolutionary genetics but one that has received little attention at the interspecies level. Recent developments in phylogenetics and statistical modelling have typically been used to examine molecular and phenotypic evolution separately. We have used this background to develop phylogenetic substitution models to test for associations between evolutionary rate of genotype and phenotype. We do this by creating hybrid rate matrices between genotype and phenotype.Entities:
Mesh:
Year: 2009 PMID: 19478022 PMCID: PMC2687985 DOI: 10.1093/bioinformatics/btp231
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.The likelihood surface of a simulated dataset generated under the null model. The genotype parameter is a single rate under the F81 model, used to simplify the search space for visualization. The phenotype parameter is a separate rate calculated for a binary phenotype. The z-axis is the log likelihood evaluated at that point. Branch lengths were fixed from an optimized estimation from the genotype data.
Fig. 2.A possible implementation of a 16 taxa tree with random branch lengths generated from a uniform distribution for an average tree length, total of all branch lengths, of 3.
Fig. 3.The relationship between tree length and sensitivity/FP rate in simulations. Based on 50 simulations for each tree length with 16 taxa in a balanced tree.
Results of primate data sets using second test (D versus D)
| Gene | Number of sites | Taxa | LRT | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 1135 | 27 | −32225.463 | −32223.604 | 3.717 | 0.486 | 34.406 | 0.515 | 0.020, 0.050 | |
| 354 | 11 | −14265.705 | −14265.539 | 0.332 | 0.022 | 19.501 | 0.022 | 0, 47.789 | |
| 2649 | 14 | −21839.929 | −21839.926 | 0.007 | 0.591 | 0.129 | 0.392 | 7.402, 7.404 | |
| 4245 | 16 | −23134.827 | −23129.409 | 10.836** | 0.227 | 5.116 | 0.139 | 2.485, 11.039 | |
| 555 | 16 | −8019.329 | −8019.328 | 0.002 | 0.206 | 11.840 | 0.208 | 11.882, 11.883 |
Key: D−ln(L) is the negative log likelihood for the Dependent model with weight parameters fixed to each other, D−ln(L) is the negative log likelihood for the Dependent model. LRT is the likelihood ratio test statistic or two times the difference in log likelihood with significant values signified by ** (P < 0.005 after a Bonferroni correction for multiple testing) for a χ2 distribution with one degree of freedom. W[01] is the scale factor in the null model where both weights are equal, W[i] is the weight parameter given phenotype i.