| Literature DB >> 27899595 |
Huaiyu Mi1, Xiaosong Huang2, Anushya Muruganujan2, Haiming Tang2, Caitlin Mills2, Diane Kang2, Paul D Thomas3.
Abstract
The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past year, major improvements include a large expansion of classification information available in PANTHER, as well as significant enhancements to the analysis tools. Protein subfamily functional classifications have more than doubled due to progress of the Gene Ontology Phylogenetic Annotation Project. For human genes (as well as a few other organisms), PANTHER now also supports enrichment analysis using pathway classifications from the Reactome resource. The gene list enrichment tools include a new 'hierarchical view' of results, enabling users to leverage the structure of the classifications/ontologies; the tools also allow users to upload genetic variant data directly, rather than requiring prior conversion to a gene list. The updated coding single-nucleotide polymorphisms (SNP) scoring tool uses an improved algorithm. The hidden Markov model (HMM) search tools now use HMMER3, dramatically reducing search times and improving accuracy of E-value statistics. Finally, the PANTHER Tree-Attribute Viewer has been implemented in JavaScript, with new views for exploring protein sequence evolution.Entities:
Mesh:
Year: 2016 PMID: 27899595 PMCID: PMC5210595 DOI: 10.1093/nar/gkw1138
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Coverage of version 11.1 PANTHER GO-slim functional annotations for selected genomes
| Organism | Total number of genes in genome | Total number of genes annotated | Total number of annotations | Total number of genes with new annotations from GO Phylo-project | Total number of new annotations from GO Phylo-project |
|---|---|---|---|---|---|
| Human | 20 972 | 13 387 | 57 271 | 5925 | 28 996 |
| Mouse | 22 322 | 14 331 | 61 828 | 6804 | 33 318 |
| Rat | 23 781 | 15 536 | 67 112 | 7642 | 37 378 |
| Pig | 21 398 | 13 469 | 58 475 | 6266 | 31 080 |
| Dog | 19 692 | 12 715 | 54 307 | 5727 | 27 629 |
| Chicken | 15 789 | 10 056 | 43 605 | 4114 | 20 524 |
| Zebrafish | 27 187 | 17 113 | 73 161 | 6904 | 33 578 |
| Fruit fly ( | 14 217 | 6942 | 26 408 | 3128 | 13 452 |
| Nematode ( | 20 472 | 8308 | 31 072 | 3781 | 16 052 |
| Slime mold ( | 13 044 | 5133 | 19 582 | 2267 | 10 121 |
| Budding yeast ( | 6728 | 3372 | 13 364 | 1796 | 8363 |
| Fission yeast ( | 5140 | 3110 | 12 306 | 1610 | 7533 |
| 9329 | 3489 | 13 455 | 1780 | 8146 | |
| 27 352 | 12 130 | 44 243 | 6119 | 24 204 | |
| Rice | 62 904 | 17 873 | 61 811 | 9307 | 34 673 |
| Tomato | 34 783 | 13 630 | 48 464 | 6820 | 26 737 |
| 4262 | 1990 | 6267 | 705 | 2249 | |
| 4199 | 1702 | 5452 | 546 | 1733 |
The total number of genes with at least one functional annotation (across molecular function, biological process and cellular component aspects) is listed, followed by the number of genes/annotations added by the GO Phylogenetic Annotation project. Detailed statistics, such as coverage of each aspect, for all genomes in PANTHER can be found at http://pantherdb.org/panther/summaryStats.jsp.
Coverage of non-synonymous variants scored by PANTHER PSEP, compared to the older subPSEC method
| Varibench (non-pathologic) | variBench (pathologic) | swissVar (non-pathologic) | swissVar (pathologic) | |
|---|---|---|---|---|
| Total number of variants | 23 683 | 19 335 | 41 165 | 30 074 |
| Variants scored by PSEP (current version) | 18 492 | 18 655 | 38 307 | 28 814 |
| Variants scored by PSEC (previous version) | 13 910 | 15 710 | 27 340 | 24 405 |
Coverage is improved for all categories of variants, for two standard variant benchmarking sets, VariBench (17) and SwissVar (18).
Figure 1.Reactome pathways in the PANTHER gene list analysis tool. Users can select Reactome pathways from a drop-down list of annotation sets. They can also upload a custom reference list for calculating overrepresentation by clicking on the ‘Change’ button. The Reactome pathways are accessible to both statistical test types in PANTHER: the overrepresentation tool (shown here) that takes a user-uploaded gene list as input, and the enrichment tool (not shown), that takes a user-uploaded list of genes, each of which is associated with a value, such as log-fold-change in expression or association P-value.
Figure 2.Hierarchical view of GO (molecular function) in gene list analysis result page. Each block of related function classes (the first block starts with AMP-activated protein kinase activity, the second with acid-sensing ion channel activity) is arranged with the most specific class at the top, with less specific classes indented below it.
Figure 3.Tree Viewer, showing the tree panel on the left, and multiple sequence alignment panel on the right with the ‘Evolutionary History’ view selected. This view shows in red text the amino acids that have changed in the tree branch immediately leading to each sequence. (A) The standard ‘phylogenetic view’ of the tree, showing only the sequences of the leaves (extant genes) of the tree. (B) The new ‘hierarchical view’ of the tree, showing the sequences of both internal nodes (inferred ancestral sequences) and leaves.