| Literature DB >> 26578592 |
Huaiyu Mi1, Sagar Poudel2, Anushya Muruganujan2, John T Casagrande2, Paul D Thomas3.
Abstract
PANTHER (Protein Analysis THrough Evolutionary Relationships, http://pantherdb.org) is a widely used online resource for comprehensive protein evolutionary and functional classification, and includes tools for large-scale biological data analysis. Recent development has been focused in three main areas: genome coverage, functional information ('annotation') coverage and accuracy, and improved genomic data analysis tools. The latest version of PANTHER, 10.0, includes almost 5000 new protein families (for a total of over 12 000 families), each with a reference phylogenetic tree including protein-coding genes from 104 fully sequenced genomes spanning all kingdoms of life. Phylogenetic trees now include inference of horizontal transfer events in addition to speciation and gene duplication events. Functional annotations are regularly updated using the models generated by the Gene Ontology Phylogenetic Annotation Project. For the data analysis tools, PANTHER has expanded the number of different 'functional annotation sets' available for functional enrichment testing, allowing analyses to access all Gene Ontology annotations--updated monthly from the Gene Ontology database--in addition to the annotations that have been inferred through evolutionary relationships. The Prowler (data browser) has been updated to enable users to more efficiently browse the entire database, and to create custom gene lists using the multiple axes of classification in PANTHER.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578592 PMCID: PMC4702852 DOI: 10.1093/nar/gkv1194
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Additional genomes added to PANTHER 9.0 and 10.0
| Bacteria | Plants | Animals (Tetrapods) |
|---|---|---|
| Purple false brome ( | Lizard ( | |
| Soybean ( | Horse ( | |
| Black cottonwood ( | Cat ( | |
| Tomato ( | Pig ( | |
| Sorghum ( | ||
| Grape ( | ||
Figure 1.Phylogenetic distribution of genomes available in PANTHER 10. The tree is derived from information in the NCBI Taxonomy resource (17), with some multifurcating nodes resolved using the sTOL resource (18).
Increase in PANTHER version 10 coverage of protein-coding genes in major model organisms
| Organism | # genes before new families | # genes in new families | Total # genes in PANTHER 10 families | Total # in proteome | Percent coverage |
|---|---|---|---|---|---|
| Human | 18390 | 794 | 19184 | 20814 | 92% |
| Mouse | 20085 | 722 | 20807 | 22320 | 93% |
| Rat | 20622 | 749 | 21371 | 22656 | 94% |
| Chicken | 13854 | 404 | 14258 | 15696 | 91% |
| Zebrafish ( | 21836 | 639 | 22475 | 25357 | 89% |
| Fruit fly ( | 10447 | 340 | 10787 | 13690 | 79% |
| Nematode ( | 14004 | 764 | 14768 | 20490 | 72% |
| Budding yeast ( | 4370 | 384 | 4754 | 6597 | 72% |
| Fission yeast ( | 4038 | 323 | 4361 | 5101 | 85% |
| Slime mold | 8665 | 576 | 9241 | 13051 | 71% |
| Arabidopsis | 19396 | 3858 | 23254 | 26684 | 87% |
| 2695 | 628 | 3323 | 4260 | 78% | |
| Total (104 genomes) | 920552 | 105869 | 1026421 | 1424953 | 72% |
Number of new GO annotations added to PANTHER 10 by the GO Phylogenetic Annotation Project, for selected genomes
| Organism | Number of genes with new annotations | Number of annotationsa |
|---|---|---|
| Human | 2936 | 15731 |
| Mouse | 3171 | 16938 |
| Rat | 3242 | 17351 |
| Chicken | 2157 | 11855 |
| Zebrafish ( | 3505 | 19003 |
| Fruit fly ( | 1358 | 7024 |
| Nematode ( | 1840 | 9445 |
| Budding yeast ( | 718 | 3561 |
| Fission yeast ( | 700 | 3548 |
| Slime mold | 999 | 4923 |
| Arabidopsis | 2263 | 10644 |
| 298 | 1055 |
aAn annotation is the combination of a gene and a GO term (counting only those with the most specific term in the ontology graph)
Annotation sets available for PANTHER gene list analysis tools
| Annotation gene set | Description |
|---|---|
| PANTHER Pathway | Signaling and metabolic pathways manually curated by experts ( |
| PANTHER GO-slim Molecular Function | Phylogenetically inferred annotations, to a reduced set of GO molecular function terms (GO-slim). |
| PANTHER GO-slim Biological Process | Phylogenetically inferred annotations, to a reduced set of GO biological process terms (GO-slim). |
| PANTHER GO-slim Cellular Component | Phylogenetically inferred annotations, to a reduced set of GO cellular component terms (GO-slim). |
| PANTHER Protein Class | PANTHER/X ontology terms to classify protein families and subfamilies. These classes are grouping terms commonly used by biologists for families and subfamilies, that are sometimes but not always related to molecular function. |
| GO molecular function experimental only | GO molecular function annotations, from the GO database, based on experimental evidence published in scientific publications. These annotations are usually manually curated by GO curators. |
| GO biological process experimental only | GO biological process annotations, from the GO database, based on experimental evidence published in scientific publications. These annotations are usually manually curated by GO curators. |
| GO cellular component experimental only | GO cellular component annotations, from the GO database, based on experimental evidence published in scientific publications. These annotations are usually manually curated by GO curators. |
| GO molecular function complete | Complete GO molecular function annotations including both manually curated (above) and electronic annotations. Electronic annotations are generated by a variety of different computer algorithms. |
| GO biological process complete | Complete GO biological process annotations including both manually curated (above) and electronic annotations. Electronic annotations are generated by a variety of different computer algorithms. |
| GO cellular component complete | Complete GO cellular component annotations including both manually curated (above) and electronic annotations. Electronic annotations are generated by a variety of different computer algorithms. |
Figure 2.Screenshot of the settings summary of the gene list analysis tool. Settings can be easily changed from this summary table, and results rapidly regenerated using the new settings.
Figure 3.New Prowler for browsing the different classification axes, and retrieving genes from class intersections. Here, a user first selected a species (Homo sapiens) and then the biological process cell differentiation, and pressing the button on the right will retrieve from the database all 161 human genes annotated as involved in cell differentiation.