| Literature DB >> 24225318 |
Sébastien Moretti1, Balazs Laurenczy, Walid H Gharib, Briséïs Castella, Arnold Kuzniar, Hannes Schabauer, Romain A Studer, Mario Valle, Nicolas Salamin, Heinz Stockinger, Marc Robinson-Rechavi.
Abstract
Selectome (http://selectome.unil.ch/) is a database of positive selection, based on a branch-site likelihood test. This model estimates the number of nonsynonymous substitutions (dN) and synonymous substitutions (dS) to evaluate the variation in selective pressure (dN/dS ratio) over branches and over sites. Since the original release of Selectome, we have benchmarked and implemented a thorough quality control procedure on multiple sequence alignments, aiming to provide minimum false-positive results. We have also improved the computational efficiency of the branch-site test implementation, allowing larger data sets and more frequent updates. Release 6 of Selectome includes all gene trees from Ensembl for Primates and Glires, as well as a large set of vertebrate gene trees. A total of 6810 gene trees have some evidence of positive selection. Finally, the web interface has been improved to be more responsive and to facilitate searches and browsing.Entities:
Mesh:
Year: 2013 PMID: 24225318 PMCID: PMC3964977 DOI: 10.1093/nar/gkt1065
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics on release 06 of Selectome
| Taxonomic group | Species number | Ensembl release | Subtrees | Sequences per subtree | ||||
|---|---|---|---|---|---|---|---|---|
| Total | Filtered | Computed | With positive selection | Median | Max | |||
| Euteleostomi | 54 | 68 | 19 940 | 15 923 | 13 695 | 6543 | 32 | 139 |
| Glires | 7 | 71 | 20 114 | 4656 | 4656 | 136 | 6 | 257 |
| Primates | 10 | 70 | 20 300 | 15 738 | 15 738 | 131 | 8 | 180 |
aPruned from larger Ensembl Compara trees, according to the taxonomic group.
bSubtrees with at least six sequences after alignment quality filtering.
cThe largest gene trees were not computed.
dMany Glires subtrees do not have six sequences before or after our filtering.
Figure 1.Selectome subtrees from Ensembl Compara gene tree. Left, the tree for human gene ENSGT00410000025651 from Ensembl release 68. Right, the subtrees selected for use in Selectome. Note that (i) as the tree is rooted in Amniota (i.e. there are no homologs detected outside Amniota), which is a subset of Euteleostomi, this node was chosen for the subtree for Euteleostomi; (ii) there are four Primate subtrees, due to gene duplications; (iii) only the Glires subtree with at least six sequences was used; (iv) some Primate or Glires subtrees can differ from the Ensembl tree because they use later Ensembl releases (Table 1).