| Literature DB >> 17982176 |
Akihiro Matsuya1, Ryuichi Sakate, Yoshihiro Kawahara, Kanako O Koyanagi, Yoshiharu Sato, Yasuyuki Fujii, Chisato Yamasaki, Takuya Habara, Hajime Nakaoka, Fusano Todokoro, Kaori Yamaguchi, Toshinori Endo, Satoshi Oota, Wojciech Makalowski, Kazuho Ikeo, Yoshiyuki Suzuki, Kousuke Hanada, Katsuyuki Hashimoto, Momoki Hirai, Hisakazu Iwama, Naruya Saitou, Aiko T Hiraki, Lihua Jin, Yayoi Kaneko, Masako Kanno, Katsuhiko Murakami, Akiko Ogura Noda, Naomi Saichi, Ryoko Sanbonmatsu, Mami Suzuki, Jun-ichi Takeda, Masayuki Tanaka, Takashi Gojobori, Tadashi Imanishi, Takeshi Itoh.
Abstract
Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called 'Evola'. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17982176 PMCID: PMC2238928 DOI: 10.1093/nar/gkm878
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.An example of manually curated gene pair from H.sapiens (red underline) and Macaca.sp (blue underline). In this case, conditions of phylogenetic topologies, outgroup species (light gray background) and bootstrap values (two circles) are sufficient (refer to the text). Thus, the pair was defined as a manually curated ortholog.
Number of orthologs provided in Evola (release 4.1, June 2007)
| Species | Genes | Human genes |
|---|---|---|
| 18 968 | – | |
| 16 368 | 15 615 | |
| 12 037 | 12 352 | |
| 15 570 | 14 574 | |
| 15 632 | 14 302 | |
| 14 730 | 13 916 | |
| 9375 | 10 181 | |
| 13 201 | 13 588 | |
| 9266 | 10 738 | |
| 12 334 | 10 468 | |
| 11 505 | 9820 | |
| 9738 | 9459 |
Numbers of genes of both human and other species are listed. Owing to lineage-specific duplication or loss, the numbers are usually different (for example, 15 570 mouse genes are orthologous to 14 574 human genes). 18 968 human genes have at least one ortholog among other 11 species.
aMacaca mulatta, Macaca fascicularis, Macaca fuscata, etc. are included.
Figure 2.Evola main page. This page is divided into left and right frames. In the left frame, tables of orthologs, download data, Gene ontology, and InterPro are listed. Three green buttons are links to show ‘Alignment’ (A), ‘d/d view’ (B) and ‘Locus maps’ (C) in the right frame.