| Literature DB >> 18055500 |
Ann-Charlotte Berglund1, Erik Sjölund, Gabriel Ostlund, Erik L L Sonnhammer.
Abstract
The InParanoid eukaryotic ortholog database (http://InParanoid.sbc.su.se/) has been updated to version 6 and is now based on 35 species. We collected all available 'complete' eukaryotic proteomes and Escherichia coli, and calculated ortholog groups for all 595 species pairs using the InParanoid program. This resulted in 2 642 187 pairwise ortholog groups in total. The orthology-based species relations are presented in an orthophylogram. InParanoid clusters contain one or more orthologs from each of the two species. Multiple orthologs in the same species, i.e. inparalogs, result from gene duplications after the species divergence. A new InParanoid website has been developed which is optimized for speed both for users and for updating the system. The XML output format has been improved for efficient processing of the InParanoid ortholog clusters.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18055500 PMCID: PMC2238924 DOI: 10.1093/nar/gkm1020
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Orthophylogram of all 35 species in InParanoid 6. This UPGMA tree is based on the average fraction of orthologs between species. For instance, on average 91.2% of the proteins in H. sapiens and P. troglodytes are orthologous. The tree topology generally corresponds to the standard taxonomy, but a few exceptions were noted (see text).
Figure 2.Histogram of average number of inparalogs/cluster per species for all species–species comparisons in InParanoid 6. The peak around five inparalogs/cluster is entirely caused by O. sativa, rice.