| Literature DB >> 24194607 |
Fabian Schreiber1, Mateus Patricio, Matthieu Muffato, Miguel Pignatelli, Alex Bateman.
Abstract
TreeFam (http://www.treefam.org) is a database of phylogenetic trees inferred from animal genomes. For every TreeFam family we provide homology predictions together with the evolutionary history of the genes. Here we describe an update of the TreeFam database. The TreeFam project was resurrected in 2012 and has seen two releases since. The latest release (TreeFam 9) was made available in March 2013. It has orthology predictions and gene trees for 109 species in 15,736 families covering ∼2.2 million sequences. With release 9 we made modifications to our production pipeline and redesigned our website with improved gene tree visualizations and Wikipedia integration. Furthermore, we now provide an HMM-based sequence search that places a user-provided protein sequence into a TreeFam gene tree and provides quick orthology prediction. The tool uses Mafft and RAxML for the fast insertion into a reference alignment and tree, respectively. Besides the aforementioned technical improvements, we present a new approach to visualize gene trees and alternative displays that focuses on showing homology information from a species tree point of view. From release 9 onwards, TreeFam is now hosted at the EBI.Entities:
Mesh:
Year: 2013 PMID: 24194607 PMCID: PMC3965059 DOI: 10.1093/nar/gkt1055
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.This figure shows the pruned tree for TreeFam family TF101005 (Cyclin E) showing model species only. Alongside the leaves protein domain information is plotted. The top panel allows to select/deselect various tree annotations features, e.g. bootstrap values, internal node names. The panel on the right hand side show information on mouse over for sequences and protein domains.
This table shows in the first column the tested tools Pagan, RAxML-ML and RAxML-MP, in the second column the average Robinson-Foulds distance between trees generated with each of the three tools and the gene tree generated by TreeBest used as a reference
| Program | Average RF distance | Runtime (s) |
|---|---|---|
| Pagan | 0.267 | 84.502 |
| RAxML-EPA-ML | 0.295 | 66.178 |
| RAxML-EPA-parsimony | 0.206 | 0.164 |
The last column shows the average runtime for each tool.